2025-08-14T21:16:56.4273078Z Current runner version: '2.328.0' 2025-08-14T21:16:56.4277432Z Runner name: 'i-0819c8fa835cec089' 2025-08-14T21:16:56.4278027Z Runner group name: 'default' 2025-08-14T21:16:56.4278670Z Machine name: 'ip-10-0-18-145' 2025-08-14T21:16:56.4280608Z ##[group]GITHUB_TOKEN Permissions 2025-08-14T21:16:56.4282463Z Contents: read 2025-08-14T21:16:56.4282976Z Metadata: read 2025-08-14T21:16:56.4283386Z ##[endgroup] 2025-08-14T21:16:56.4285395Z Secret source: Actions 2025-08-14T21:16:56.4286011Z Prepare workflow directory 2025-08-14T21:16:56.4657769Z Prepare all required actions 2025-08-14T21:16:56.4688964Z Getting action download info 2025-08-14T21:16:56.8285225Z Download action repository 'pytorch/test-infra@main' (SHA:83f58f391e939c10dcb8cb6d745e4cefa3b98a84) 2025-08-14T21:16:59.1882066Z Download action repository 'pytorch/pytorch@main' (SHA:3be70dc30e893b552fc0f23ca06cd8f7949b6d08) 2025-08-14T21:17:15.5477732Z Download action repository 'actions/setup-python@a26af69be951a213d495a4c3e4e4022e16d87065' (SHA:a26af69be951a213d495a4c3e4e4022e16d87065) 2025-08-14T21:17:15.9185791Z Download action repository 'aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722' (SHA:ececac1a45f3b08a01d2dd070d28d111c5fe6722) 2025-08-14T21:17:16.1742188Z Download action repository 'aws-actions/amazon-ecr-login@062b18b96a7aff071d4dc91bc00c4c1a7945b076' (SHA:062b18b96a7aff071d4dc91bc00c4c1a7945b076) 2025-08-14T21:17:16.3831959Z Download action repository 'seemethere/upload-artifact-s3@baba72d0712b404f646cebe0730933554ebce96a' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2025-08-14T21:17:16.6672666Z Getting action download info 2025-08-14T21:17:16.7743048Z Download action repository 'actions/checkout@v4' (SHA:08eba0b27e820071cde6df949e0beb9ba4906955) 2025-08-14T21:17:17.0961568Z Getting action download info 2025-08-14T21:17:17.1985158Z Download action repository 'nick-fields/retry@v3.0.0' (SHA:7152eba30c6575329ac0576536151aca5a72780e) 2025-08-14T21:17:17.4250872Z Getting action download info 2025-08-14T21:17:17.5369613Z Download action repository 'nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482' (SHA:3e91a01664abd3c5cd539100d10d33b9c5b68482) 2025-08-14T21:17:17.7232292Z Getting action download info 2025-08-14T21:17:17.8463236Z Uses: pytorch/pytorch/.github/workflows/_linux-test.yml@refs/heads/main (1fc683cf17c8c673044538d10266c00f92987be2) 2025-08-14T21:17:17.8466267Z ##[group] Inputs 2025-08-14T21:17:17.8466542Z build-environment: linux-jammy-py3.9-gcc11-build 2025-08-14T21:17:17.8468123Z test-matrix: {"include": [{"config": "cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_huggingface", "shard": 1, "num_shards": 1, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "inductor_torchbench_cpu_smoketest_perf", "shard": 1, "num_shards": 1, "runner": "linux.24xl.spr-metal"}]} 2025-08-14T21:17:17.8469906Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:17:17.8470419Z sync-tag: 2025-08-14T21:17:17.8470949Z timeout-minutes: 240 2025-08-14T21:17:17.8471108Z use-gha: 2025-08-14T21:17:17.8471264Z dashboard-tag: 2025-08-14T21:17:17.8471430Z s3-bucket: gha-artifacts 2025-08-14T21:17:17.8471609Z aws-role-to-assume: 2025-08-14T21:17:17.8471942Z disable-monitor: false 2025-08-14T21:17:17.8472135Z monitor-log-interval: 5 2025-08-14T21:17:17.8472544Z monitor-data-collect-interval: 1 2025-08-14T21:17:17.8472824Z ##[endgroup] 2025-08-14T21:17:17.8473168Z Complete job name: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:17:17.8925753Z A job started hook has been configured by the self-hosted runner administrator 2025-08-14T21:17:17.9000319Z ##[group]Run '/home/ec2-user/runner-scripts/before_job.sh' 2025-08-14T21:17:17.9006954Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:17:17.9007368Z ##[endgroup] 2025-08-14T21:17:18.8013676Z Runner Type: linux.8xlarge.amx 2025-08-14T21:17:18.8014126Z Instance Type: m7i-flex.8xlarge 2025-08-14T21:17:18.8014353Z AMI Name: unknown 2025-08-14T21:17:18.8052249Z AMI ID: ami-05ffe3c48a9991133 2025-08-14T21:17:22.5842392Z ##[group]Run pytorch/test-infra/.github/actions/setup-ssh@main 2025-08-14T21:17:22.5842857Z with: 2025-08-14T21:17:22.5843453Z github-secret: *** 2025-08-14T21:17:22.5843949Z instructions: All testing is done inside the container, to start an interactive session run: docker exec -it $(docker container ps --format '{{.ID}}') bash 2025-08-14T21:17:22.5844455Z activate-with-label: false 2025-08-14T21:17:22.5844688Z label: with-ssh 2025-08-14T21:17:22.5844886Z remove-existing-keys: true 2025-08-14T21:17:22.5845114Z fail-silently: true 2025-08-14T21:17:22.5845322Z env: 2025-08-14T21:17:22.5845500Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:17:22.5845741Z ##[endgroup] 2025-08-14T21:17:22.6962579Z Please see https://github.com/pytorch/pytorch/wiki/Debugging-using-with-ssh-for-Github-Actions for more info. 2025-08-14T21:17:22.6964593Z Not on pull request and ciflow reference could not be extracted, skipping adding ssh keys 2025-08-14T21:17:22.7201280Z ##[group]Run pytorch/pytorch/.github/actions/checkout-pytorch@main 2025-08-14T21:17:22.7201543Z with: 2025-08-14T21:17:22.7201698Z no-sudo: true 2025-08-14T21:17:22.7201861Z submodules: recursive 2025-08-14T21:17:22.7202041Z fetch-depth: 0 2025-08-14T21:17:22.7202198Z env: 2025-08-14T21:17:22.7202342Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:17:22.7202518Z ##[endgroup] 2025-08-14T21:17:22.7271148Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-08-14T21:17:22.7271697Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-08-14T21:17:22.7278820Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:17:22.7279052Z env: 2025-08-14T21:17:22.7279231Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:17:22.7279443Z ##[endgroup] 2025-08-14T21:17:22.7361160Z ##[group]Run # Use all available CPUs for fetching 2025-08-14T21:17:22.7361448Z # Use all available CPUs for fetching 2025-08-14T21:17:22.7361657Z cd "${GITHUB_WORKSPACE}" 2025-08-14T21:17:22.7361870Z git config --global fetch.parallel 0 2025-08-14T21:17:22.7362103Z git config --global submodule.fetchJobs 0 2025-08-14T21:17:22.7362315Z  2025-08-14T21:17:22.7362619Z # Clean workspace. The default checkout action should also do this, but 2025-08-14T21:17:22.7362911Z # do it here as well just in case 2025-08-14T21:17:22.7363110Z if [[ -d .git ]]; then 2025-08-14T21:17:22.7363296Z  if [ -z "${NO_SUDO}" ]; then 2025-08-14T21:17:22.7363480Z  sudo git clean -ffdx 2025-08-14T21:17:22.7363660Z  else 2025-08-14T21:17:22.7363817Z  git clean -ffdx 2025-08-14T21:17:22.7363978Z  fi 2025-08-14T21:17:22.7364121Z fi 2025-08-14T21:17:22.7367984Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:17:22.7368206Z env: 2025-08-14T21:17:22.7368353Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:17:22.7368518Z NO_SUDO: true 2025-08-14T21:17:22.7368657Z ##[endgroup] 2025-08-14T21:17:22.7472899Z ##[group]Run actions/checkout@v4 2025-08-14T21:17:22.7473091Z with: 2025-08-14T21:17:22.7473250Z ref: 1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:17:22.7473603Z fetch-depth: 0 2025-08-14T21:17:22.7473748Z submodules: recursive 2025-08-14T21:17:22.7473913Z show-progress: false 2025-08-14T21:17:22.7474081Z repository: pytorch/pytorch 2025-08-14T21:17:22.7474327Z token: *** 2025-08-14T21:17:22.7474466Z ssh-strict: true 2025-08-14T21:17:22.7474610Z ssh-user: git 2025-08-14T21:17:22.7474763Z persist-credentials: true 2025-08-14T21:17:22.7474925Z clean: true 2025-08-14T21:17:22.7475079Z sparse-checkout-cone-mode: true 2025-08-14T21:17:22.7475258Z fetch-tags: false 2025-08-14T21:17:22.7475393Z lfs: false 2025-08-14T21:17:22.7475540Z set-safe-directory: true 2025-08-14T21:17:22.7475712Z env: 2025-08-14T21:17:22.7475845Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:17:22.7476007Z ##[endgroup] 2025-08-14T21:17:22.8332738Z Syncing repository: pytorch/pytorch 2025-08-14T21:17:22.8333653Z ##[group]Getting Git version info 2025-08-14T21:17:22.8333946Z Working directory is '/home/ec2-user/actions-runner/_work/pytorch/pytorch' 2025-08-14T21:17:22.8334359Z [command]/usr/bin/git version 2025-08-14T21:17:22.8571527Z git version 2.47.1 2025-08-14T21:17:22.8600958Z ##[endgroup] 2025-08-14T21:17:22.8609211Z Copying '/home/ec2-user/.gitconfig' to '/home/ec2-user/actions-runner/_work/_temp/63472596-9dd7-4d53-8b7b-04d9db958d6b/.gitconfig' 2025-08-14T21:17:22.8640683Z Temporarily overriding HOME='/home/ec2-user/actions-runner/_work/_temp/63472596-9dd7-4d53-8b7b-04d9db958d6b' before making global git config changes 2025-08-14T21:17:22.8641504Z Adding repository directory to the temporary git global config as a safe directory 2025-08-14T21:17:22.8656314Z [command]/usr/bin/git config --global --add safe.directory /home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-08-14T21:17:22.8699676Z Deleting the contents of '/home/ec2-user/actions-runner/_work/pytorch/pytorch' 2025-08-14T21:17:22.8704542Z ##[group]Initializing the repository 2025-08-14T21:17:22.8712606Z [command]/usr/bin/git init /home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-08-14T21:17:22.8767040Z hint: Using 'master' as the name for the initial branch. This default branch name 2025-08-14T21:17:22.8768909Z hint: is subject to change. To configure the initial branch name to use in all 2025-08-14T21:17:22.8769267Z hint: of your new repositories, which will suppress this warning, call: 2025-08-14T21:17:22.8769835Z hint: 2025-08-14T21:17:22.8770044Z hint: git config --global init.defaultBranch 2025-08-14T21:17:22.8770249Z hint: 2025-08-14T21:17:22.8770459Z hint: Names commonly chosen instead of 'master' are 'main', 'trunk' and 2025-08-14T21:17:22.8770795Z hint: 'development'. The just-created branch can be renamed via this command: 2025-08-14T21:17:22.8771036Z hint: 2025-08-14T21:17:22.8771182Z hint: git branch -m 2025-08-14T21:17:22.8783586Z Initialized empty Git repository in /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/ 2025-08-14T21:17:22.8793636Z [command]/usr/bin/git remote add origin https://github.com/pytorch/pytorch 2025-08-14T21:17:22.8845028Z ##[endgroup] 2025-08-14T21:17:22.8849573Z ##[group]Disabling automatic garbage collection 2025-08-14T21:17:22.8852875Z [command]/usr/bin/git config --local gc.auto 0 2025-08-14T21:17:22.8868107Z ##[endgroup] 2025-08-14T21:17:22.8868406Z ##[group]Setting up auth 2025-08-14T21:17:22.8873723Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-08-14T21:17:22.8899661Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-08-14T21:17:22.9219800Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-08-14T21:17:22.9255306Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-08-14T21:17:22.9609158Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-08-14T21:17:22.9678888Z ##[endgroup] 2025-08-14T21:17:22.9679438Z ##[group]Fetching the repository 2025-08-14T21:17:22.9683754Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --no-recurse-submodules origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2025-08-14T21:18:05.9487642Z From https://github.com/pytorch/pytorch 2025-08-14T21:18:05.9489613Z * [new branch] 2.6.0.dev20241004+ -> origin/2.6.0.dev20241004+ 2025-08-14T21:18:05.9493798Z * [new branch] 5addvllmbuild -> origin/5addvllmbuild 2025-08-14T21:18:05.9498223Z * [new branch] AaronWang04_addmmfusion_perftest -> origin/AaronWang04_addmmfusion_perftest 2025-08-14T21:18:05.9499996Z * [new branch] HDCharles-2.6.0-release-notes -> origin/HDCharles-2.6.0-release-notes 2025-08-14T21:18:05.9500598Z * [new branch] JackCaoG/dynamo_make_fx_non_core_aten_ops -> origin/JackCaoG/dynamo_make_fx_non_core_aten_ops 2025-08-14T21:18:05.9504932Z * [new branch] PR-AOTInductorNoneBug -> origin/PR-AOTInductorNoneBug 2025-08-14T21:18:05.9507088Z * [new branch] PR-AOTInductorNoneBugFix -> origin/PR-AOTInductorNoneBugFix 2025-08-14T21:18:05.9507634Z * [new branch] PR-FixConfigsIssue -> origin/PR-FixConfigsIssue 2025-08-14T21:18:05.9511257Z * [new branch] PR-NoneBugFix-viable -> origin/PR-NoneBugFix-viable 2025-08-14T21:18:05.9515198Z * [new branch] PR-ResetToZero -> origin/PR-ResetToZero 2025-08-14T21:18:05.9519076Z * [new branch] Update-Flash-Packaging -> origin/Update-Flash-Packaging 2025-08-14T21:18:05.9519822Z * [new branch] add-missing-args-normalization -> origin/add-missing-args-normalization 2025-08-14T21:18:05.9520217Z * [new branch] add-user-guide-structure -> origin/add-user-guide-structure 2025-08-14T21:18:05.9520540Z * [new branch] addVllmPin -> origin/addVllmPin 2025-08-14T21:18:05.9520846Z * [new branch] add_windows_testing_back -> origin/add_windows_testing_back 2025-08-14T21:18:05.9521230Z * [new branch] addbuildvllm -> origin/addbuildvllm 2025-08-14T21:18:05.9523586Z * [new branch] addmm-heuristic -> origin/addmm-heuristic 2025-08-14T21:18:05.9527548Z * [new branch] addsimde -> origin/addsimde 2025-08-14T21:18:05.9534990Z * [new branch] addvllpinnedfile -> origin/addvllpinnedfile 2025-08-14T21:18:05.9539099Z * [new branch] adi/acl_upgrade -> origin/adi/acl_upgrade 2025-08-14T21:18:05.9539627Z * [new branch] adi/skip_slow_tests -> origin/adi/skip_slow_tests 2025-08-14T21:18:05.9540437Z * [new branch] adi/test -> origin/adi/test 2025-08-14T21:18:05.9540855Z * [new branch] adi/test_bgemm -> origin/adi/test_bgemm 2025-08-14T21:18:05.9541164Z * [new branch] adi/test_fusions -> origin/adi/test_fusions 2025-08-14T21:18:05.9541467Z * [new branch] adi/test_onednn_v3.9 -> origin/adi/test_onednn_v3.9 2025-08-14T21:18:05.9541784Z * [new branch] adi/test_presve_change -> origin/adi/test_presve_change 2025-08-14T21:18:05.9542085Z * [new branch] adi/test_timm -> origin/adi/test_timm 2025-08-14T21:18:05.9542400Z * [new branch] adi/testpresve_change -> origin/adi/testpresve_change 2025-08-14T21:18:05.9542815Z * [new branch] aditew01/test/vec_bf16 -> origin/aditew01/test/vec_bf16 2025-08-14T21:18:05.9543152Z * [new branch] ah-globalfeedback-hook -> origin/ah-globalfeedback-hook 2025-08-14T21:18:05.9543484Z * [new branch] albanD-patch-1 -> origin/albanD-patch-1 2025-08-14T21:18:05.9543776Z * [new branch] alt-disable -> origin/alt-disable 2025-08-14T21:18:05.9544460Z * [new branch] angelayi/aoti_additional_files -> origin/angelayi/aoti_additional_files 2025-08-14T21:18:05.9544821Z * [new branch] angelayi/aoti_inductor_fx -> origin/angelayi/aoti_inductor_fx 2025-08-14T21:18:05.9545195Z * [new branch] angelayi/assert_tensor_metadata_device -> origin/angelayi/assert_tensor_metadata_device 2025-08-14T21:18:05.9545569Z * [new branch] angelayi/benchmark -> origin/angelayi/benchmark 2025-08-14T21:18:05.9545884Z * [new branch] angelayi/benchmark2 -> origin/angelayi/benchmark2 2025-08-14T21:18:05.9546254Z * [new branch] angelayi/change_pytree_serialization -> origin/angelayi/change_pytree_serialization 2025-08-14T21:18:05.9546612Z * [new branch] angelayi/cpp_loader -> origin/angelayi/cpp_loader 2025-08-14T21:18:05.9546940Z * [new branch] angelayi/custom_op_subgraph -> origin/angelayi/custom_op_subgraph 2025-08-14T21:18:05.9547262Z * [new branch] angelayi/customop -> origin/angelayi/customop 2025-08-14T21:18:05.9547553Z * [new branch] angelayi/del_lib -> origin/angelayi/del_lib 2025-08-14T21:18:05.9547827Z * [new branch] angelayi/docs -> origin/angelayi/docs 2025-08-14T21:18:05.9548104Z * [new branch] angelayi/docs2 -> origin/angelayi/docs2 2025-08-14T21:18:05.9548393Z * [new branch] angelayi/fix_pt2 -> origin/angelayi/fix_pt2 2025-08-14T21:18:05.9548741Z * [new branch] angelayi/logging.bak -> origin/angelayi/logging.bak 2025-08-14T21:18:05.9549050Z * [new branch] angelayi/logging2 -> origin/angelayi/logging2 2025-08-14T21:18:05.9549353Z * [new branch] angelayi/no_so_weight -> origin/angelayi/no_so_weight 2025-08-14T21:18:05.9549654Z * [new branch] angelayi/pytree -> origin/angelayi/pytree 2025-08-14T21:18:05.9549949Z * [new branch] angelayi/save_error -> origin/angelayi/save_error 2025-08-14T21:18:05.9550249Z * [new branch] angelayi/scan_layers -> origin/angelayi/scan_layers 2025-08-14T21:18:05.9550558Z * [new branch] angelayi/symint_input -> origin/angelayi/symint_input 2025-08-14T21:18:05.9550893Z * [new branch] angelayi/tensor_nn_module_meta -> origin/angelayi/tensor_nn_module_meta 2025-08-14T21:18:05.9551219Z * [new branch] angelayi/torch_size -> origin/angelayi/torch_size 2025-08-14T21:18:05.9551516Z * [new branch] aoti-cuda-alloc -> origin/aoti-cuda-alloc 2025-08-14T21:18:05.9551815Z * [new branch] aoti_weight_sharing -> origin/aoti_weight_sharing 2025-08-14T21:18:05.9552118Z * [new branch] arsh/symint_mm_ind_decomp -> origin/arsh/symint_mm_ind_decomp 2025-08-14T21:18:05.9552468Z * [new branch] atalman-inductor-perf-cu124 -> origin/atalman-inductor-perf-cu124 2025-08-14T21:18:05.9552842Z * [new branch] atalman-inductor-perf-cu124.1 -> origin/atalman-inductor-perf-cu124.1 2025-08-14T21:18:05.9553182Z * [new branch] atalman-patch-1 -> origin/atalman-patch-1 2025-08-14T21:18:05.9553463Z * [new branch] atalman-patch-2 -> origin/atalman-patch-2 2025-08-14T21:18:05.9553745Z * [new branch] atalman-patch-3 -> origin/atalman-patch-3 2025-08-14T21:18:05.9554026Z * [new branch] atalman-patch-6 -> origin/atalman-patch-6 2025-08-14T21:18:05.9554303Z * [new branch] atalman-patch-7 -> origin/atalman-patch-7 2025-08-14T21:18:05.9554628Z * [new branch] atalman-patch-8 -> origin/atalman-patch-8 2025-08-14T21:18:05.9554916Z * [new branch] atalman_inductor_2.3.0 -> origin/atalman_inductor_2.3.0 2025-08-14T21:18:05.9555267Z * [new branch] atalman_inductor_2.3.1 -> origin/atalman_inductor_2.3.1 2025-08-14T21:18:05.9555572Z * [new branch] atalman_inductor_2.4.0 -> origin/atalman_inductor_2.4.0 2025-08-14T21:18:05.9555874Z * [new branch] atalman_inductor_2.4.x -> origin/atalman_inductor_2.4.x 2025-08-14T21:18:05.9556245Z * [new branch] autoupdate-transformers-pin-via-pr -> origin/autoupdate-transformers-pin-via-pr 2025-08-14T21:18:05.9556603Z * [new branch] backupvllm -> origin/backupvllm 2025-08-14T21:18:05.9556882Z * [new branch] base/1.5 -> origin/base/1.5 2025-08-14T21:18:05.9557205Z * [new branch] batching_sdpa_efficient_attention -> origin/batching_sdpa_efficient_attention 2025-08-14T21:18:05.9557555Z * [new branch] benchmark-updates -> origin/benchmark-updates 2025-08-14T21:18:05.9557867Z * [new branch] benchmarking-script -> origin/benchmarking-script 2025-08-14T21:18:05.9558290Z * [new branch] benjaminglass1/mark-large-tensor-tests-serial -> origin/benjaminglass1/mark-large-tensor-tests-serial 2025-08-14T21:18:05.9558699Z * [new branch] bertmaher/pinbump26 -> origin/bertmaher/pinbump26 2025-08-14T21:18:05.9558999Z * [new branch] bertrand/cutlass -> origin/bertrand/cutlass 2025-08-14T21:18:05.9559280Z * [new branch] bf/cg-log -> origin/bf/cg-log 2025-08-14T21:18:05.9559558Z * [new branch] bf/cg-remove-check -> origin/bf/cg-remove-check 2025-08-14T21:18:05.9559881Z * [new branch] bf/cg-skip-1-kernel -> origin/bf/cg-skip-1-kernel 2025-08-14T21:18:05.9560222Z * [new branch] bf/cudagraph -> origin/bf/cudagraph 2025-08-14T21:18:05.9560580Z * [new branch] bf/cudagraph-disable-input-mutation -> origin/bf/cudagraph-disable-input-mutation 2025-08-14T21:18:05.9561115Z * [new branch] bf/cudagraph-enable-input-mutation-support-benchmark -> origin/bf/cudagraph-enable-input-mutation-support-benchmark 2025-08-14T21:18:05.9561588Z * [new branch] bf/cudagraph-partition -> origin/bf/cudagraph-partition 2025-08-14T21:18:05.9561931Z * [new branch] bf/default-recompile-reason -> origin/bf/default-recompile-reason 2025-08-14T21:18:05.9562275Z * [new branch] bf/donated-buffer-bench -> origin/bf/donated-buffer-bench 2025-08-14T21:18:05.9562598Z * [new branch] bf/improve-kernel-bench -> origin/bf/improve-kernel-bench 2025-08-14T21:18:05.9562907Z * [new branch] bf/kernel-benchmark -> origin/bf/kernel-benchmark 2025-08-14T21:18:05.9563210Z * [new branch] bf/partition-doc -> origin/bf/partition-doc 2025-08-14T21:18:05.9563520Z * [new branch] bf/partition-move-cpu -> origin/bf/partition-move-cpu 2025-08-14T21:18:05.9563833Z * [new branch] bf/partition-turn-on -> origin/bf/partition-turn-on 2025-08-14T21:18:05.9564160Z * [new branch] bf/remove-check-55b0c39d -> origin/bf/remove-check-55b0c39d 2025-08-14T21:18:05.9564522Z * [new branch] bf/skip-asserts -> origin/bf/skip-asserts 2025-08-14T21:18:05.9564800Z * [new branch] bf16adamw -> origin/bf16adamw 2025-08-14T21:18:05.9565105Z * [new branch] bisect_perf_hf_T5_3acc6eac492 -> origin/bisect_perf_hf_T5_3acc6eac492 2025-08-14T21:18:05.9565451Z * [new branch] bisect_perf_hf_T5_3fcf66f61fb -> origin/bisect_perf_hf_T5_3fcf66f61fb 2025-08-14T21:18:05.9565784Z * [new branch] bisect_perf_hf_T5_4009d154129 -> origin/bisect_perf_hf_T5_4009d154129 2025-08-14T21:18:05.9566110Z * [new branch] bisect_perf_hf_T5_40d0740e73d -> origin/bisect_perf_hf_T5_40d0740e73d 2025-08-14T21:18:05.9566429Z * [new branch] bisect_perf_hf_T5_5268754e -> origin/bisect_perf_hf_T5_5268754e 2025-08-14T21:18:05.9566792Z * [new branch] bisect_perf_hf_T5_7d89a8d385c -> origin/bisect_perf_hf_T5_7d89a8d385c 2025-08-14T21:18:05.9567125Z * [new branch] bisect_perf_hf_T5_b7a25c1ee7c -> origin/bisect_perf_hf_T5_b7a25c1ee7c 2025-08-14T21:18:05.9567452Z * [new branch] bisect_perf_hf_T5_c25b201583f -> origin/bisect_perf_hf_T5_c25b201583f 2025-08-14T21:18:05.9567821Z * [new branch] bisect_perf_hf_T5_c93e57efac0 -> origin/bisect_perf_hf_T5_c93e57efac0 2025-08-14T21:18:05.9568158Z * [new branch] bisect_perf_hf_T5_ca9813ea149 -> origin/bisect_perf_hf_T5_ca9813ea149 2025-08-14T21:18:05.9568485Z * [new branch] bisect_perf_hf_T5_d65f194a -> origin/bisect_perf_hf_T5_d65f194a 2025-08-14T21:18:05.9568807Z * [new branch] bisect_perf_hf_T5_da94ab0b -> origin/bisect_perf_hf_T5_da94ab0b 2025-08-14T21:18:05.9569131Z * [new branch] bisect_perf_hf_T5_da94ab0b_new -> origin/bisect_perf_hf_T5_da94ab0b_new 2025-08-14T21:18:05.9569477Z * [new branch] bisect_perf_hf_T5_db4e8a1d8a8 -> origin/bisect_perf_hf_T5_db4e8a1d8a8 2025-08-14T21:18:05.9569809Z * [new branch] bisect_perf_hf_T5_e0d97e936a2 -> origin/bisect_perf_hf_T5_e0d97e936a2 2025-08-14T21:18:05.9570139Z * [new branch] bisect_perf_hf_T5_f23621ec563 -> origin/bisect_perf_hf_T5_f23621ec563 2025-08-14T21:18:05.9570461Z * [new branch] bowbao/bench_updates_stage -> origin/bowbao/bench_updates_stage 2025-08-14T21:18:05.9570785Z * [new branch] bowbao/dort_rewriter -> origin/bowbao/dort_rewriter 2025-08-14T21:18:05.9571132Z * [new branch] bowbao/wip_prs -> origin/bowbao/wip_prs 2025-08-14T21:18:05.9571463Z * [new branch] bowenbao/partial_min_max_reduce -> origin/bowenbao/partial_min_max_reduce 2025-08-14T21:18:05.9571807Z * [new branch] brister/always_wrapper_ir -> origin/brister/always_wrapper_ir 2025-08-14T21:18:05.9572136Z * [new branch] brister/flatten_contig -> origin/brister/flatten_contig 2025-08-14T21:18:05.9572459Z * [new branch] brister/test_block_ptr_same -> origin/brister/test_block_ptr_same 2025-08-14T21:18:05.9572825Z * [new branch] brister/tiled_reduction_no_numel_check -> origin/brister/tiled_reduction_no_numel_check 2025-08-14T21:18:05.9573163Z * [new branch] c57382a49 -> origin/c57382a49 2025-08-14T21:18:05.9573425Z * [new branch] ca_0431d47eaa -> origin/ca_0431d47eaa 2025-08-14T21:18:05.9573698Z * [new branch] ca_fix_0431d47eaa -> origin/ca_fix_0431d47eaa 2025-08-14T21:18:05.9574227Z * [new branch] camyll/revert-94bc900da97ad7f3c35b3b819bb53b23c74b581a-for-release-2.8 -> origin/camyll/revert-94bc900da97ad7f3c35b3b819bb53b23c74b581a-for-release-2.8 2025-08-14T21:18:05.9574824Z * [new branch] camyll/test_precommit_hooks_lintrunner -> origin/camyll/test_precommit_hooks_lintrunner 2025-08-14T21:18:05.9575269Z * [new branch] camyllh/cherrypick-151547-for-release28 -> origin/camyllh/cherrypick-151547-for-release28 2025-08-14T21:18:05.9575673Z * [new branch] camyllh/test_setup_hooks_push -> origin/camyllh/test_setup_hooks_push 2025-08-14T21:18:05.9576056Z * [new branch] cherry-pick-149654-by-pytorch_bot_bot_ -> origin/cherry-pick-149654-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9576505Z * [new branch] cherry-pick-151939-by-pytorch_bot_bot_ -> origin/cherry-pick-151939-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9576922Z * [new branch] cherry-pick-154174-by-pytorch_bot_bot_ -> origin/cherry-pick-154174-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9577336Z * [new branch] cherry-pick-155896-by-pytorch_bot_bot_ -> origin/cherry-pick-155896-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9577746Z * [new branch] cherry-pick-156260-by-pytorch_bot_bot_ -> origin/cherry-pick-156260-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9578217Z * [new branch] cherry-pick-156719-by-pytorch_bot_bot_ -> origin/cherry-pick-156719-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9578636Z * [new branch] cherry-pick-156876-by-pytorch_bot_bot_ -> origin/cherry-pick-156876-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9579057Z * [new branch] cherry-pick-156888-by-pytorch_bot_bot_ -> origin/cherry-pick-156888-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9579479Z * [new branch] cherry-pick-157014-by-pytorch_bot_bot_ -> origin/cherry-pick-157014-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9579890Z * [new branch] cherry-pick-157179-by-pytorch_bot_bot_ -> origin/cherry-pick-157179-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9580310Z * [new branch] cherry-pick-157453-by-pytorch_bot_bot_ -> origin/cherry-pick-157453-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9580726Z * [new branch] cherry-pick-157513-by-pytorch_bot_bot_ -> origin/cherry-pick-157513-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9581152Z * [new branch] cherry-pick-157558-by-pytorch_bot_bot_ -> origin/cherry-pick-157558-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9581565Z * [new branch] cherry-pick-157598-by-pytorch_bot_bot_ -> origin/cherry-pick-157598-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9581992Z * [new branch] cherry-pick-157600-by-pytorch_bot_bot_ -> origin/cherry-pick-157600-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9582413Z * [new branch] cherry-pick-157630-by-pytorch_bot_bot_ -> origin/cherry-pick-157630-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9582863Z * [new branch] cherry-pick-157695-by-pytorch_bot_bot_ -> origin/cherry-pick-157695-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9583278Z * [new branch] cherry-pick-157732-by-pytorch_bot_bot_ -> origin/cherry-pick-157732-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9583696Z * [new branch] cherry-pick-157733-by-pytorch_bot_bot_ -> origin/cherry-pick-157733-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9584131Z * [new branch] cherry-pick-157985-by-pytorch_bot_bot_ -> origin/cherry-pick-157985-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9584771Z * [new branch] cherry-pick-157993-by-pytorch_bot_bot_ -> origin/cherry-pick-157993-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9585197Z * [new branch] cherry-pick-158064-by-pytorch_bot_bot_ -> origin/cherry-pick-158064-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9585617Z * [new branch] cherry-pick-158152-by-pytorch_bot_bot_ -> origin/cherry-pick-158152-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9586036Z * [new branch] cherry-pick-158295-by-pytorch_bot_bot_ -> origin/cherry-pick-158295-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9586457Z * [new branch] cherry-pick-158301-by-pytorch_bot_bot_ -> origin/cherry-pick-158301-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9586864Z * [new branch] cherry-pick-158537-by-pytorch_bot_bot_ -> origin/cherry-pick-158537-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9587288Z * [new branch] cherry-pick-158572-by-pytorch_bot_bot_ -> origin/cherry-pick-158572-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9587658Z * [new branch] cherry-pick-158595 -> origin/cherry-pick-158595 2025-08-14T21:18:05.9588158Z * [new branch] cherry-pick-159181-by-pytorch_bot_bot_ -> origin/cherry-pick-159181-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9588608Z * [new branch] cherry-pick-159969-by-pytorch_bot_bot_ -> origin/cherry-pick-159969-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9589050Z * [new branch] cherry-pick-160586-by-pytorch_bot_bot_ -> origin/cherry-pick-160586-by-pytorch_bot_bot_ 2025-08-14T21:18:05.9589420Z * [new branch] cherry-pick-PR-158746 -> origin/cherry-pick-PR-158746 2025-08-14T21:18:05.9589885Z * [new branch] cherrypick-e4e2701429c17078c3c475382a8b1fa4c8a8cefc -> origin/cherrypick-e4e2701429c17078c3c475382a8b1fa4c8a8cefc 2025-08-14T21:18:05.9590416Z * [new branch] chilli/flex_vllm -> origin/chilli/flex_vllm 2025-08-14T21:18:05.9590735Z * [new branch] ckluk2-compileThread-1 -> origin/ckluk2-compileThread-1 2025-08-14T21:18:05.9591061Z * [new branch] ckluk2-compileThread-2 -> origin/ckluk2-compileThread-2 2025-08-14T21:18:05.9591393Z * [new branch] ckluk2-compileThread-64 -> origin/ckluk2-compileThread-64 2025-08-14T21:18:05.9591714Z * [new branch] ckluk2-test-1 -> origin/ckluk2-test-1 2025-08-14T21:18:05.9591999Z * [new branch] cleantest1 -> origin/cleantest1 2025-08-14T21:18:05.9592269Z * [new branch] codex-testing -> origin/codex-testing 2025-08-14T21:18:05.9592950Z * [new branch] codex/create-test-for-tensor-memory-leak-in-cudagraph -> origin/codex/create-test-for-tensor-memory-leak-in-cudagraph 2025-08-14T21:18:05.9593626Z * [new branch] codex/fix-issue-121219-in-pytorch -> origin/codex/fix-issue-121219-in-pytorch 2025-08-14T21:18:05.9594515Z * [new branch] codex/fix-issue-160415-in-pytorch -> origin/codex/fix-issue-160415-in-pytorch 2025-08-14T21:18:05.9595087Z * [new branch] codex/fix-noqengine-quantized-engine-support -> origin/codex/fix-noqengine-quantized-engine-support 2025-08-14T21:18:05.9595605Z * [new branch] codex/fix-pin_memory-error-handling -> origin/codex/fix-pin_memory-error-handling 2025-08-14T21:18:05.9596036Z * [new branch] codex/propose-fix-for-issue-160332 -> origin/codex/propose-fix-for-issue-160332 2025-08-14T21:18:05.9596796Z * [new branch] codex/refactor-lintrunner-config-to-use-uv-run -> origin/codex/refactor-lintrunner-config-to-use-uv-run 2025-08-14T21:18:05.9597349Z * [new branch] codex/verify-torch-output-and-log-results -> origin/codex/verify-torch-output-and-log-results 2025-08-14T21:18:05.9597981Z * [new branch] compile_fsdp2_disable_stream_and_event -> origin/compile_fsdp2_disable_stream_and_event 2025-08-14T21:18:05.9598911Z * [new branch] comply-with-setuptools -> origin/comply-with-setuptools 2025-08-14T21:18:05.9599493Z * [new branch] context_test -> origin/context_test 2025-08-14T21:18:05.9600188Z * [new branch] copilot/fix-157446 -> origin/copilot/fix-157446 2025-08-14T21:18:05.9600790Z * [new branch] copilot/fix-159257 -> origin/copilot/fix-159257 2025-08-14T21:18:05.9601358Z * [new branch] copy_graph -> origin/copy_graph 2025-08-14T21:18:05.9603044Z * [new branch] cpio/fix_new_ami_tests -> origin/cpio/fix_new_ami_tests 2025-08-14T21:18:05.9603606Z * [new branch] csl/3_proc_sm -> origin/csl/3_proc_sm 2025-08-14T21:18:05.9604070Z * [new branch] csl/add_file_merge_conflict_csv -> origin/csl/add_file_merge_conflict_csv 2025-08-14T21:18:05.9604917Z * [new branch] csl/always_produce_xml -> origin/csl/always_produce_xml 2025-08-14T21:18:05.9605474Z * [new branch] csl/build_test_more_procs -> origin/csl/build_test_more_procs 2025-08-14T21:18:05.9606241Z * [new branch] csl/build_test_more_procs2 -> origin/csl/build_test_more_procs2 2025-08-14T21:18:05.9606785Z * [new branch] csl/disable_flaky_cpp_test -> origin/csl/disable_flaky_cpp_test 2025-08-14T21:18:05.9607171Z * [new branch] csl/disable_periodic_test -> origin/csl/disable_periodic_test 2025-08-14T21:18:05.9607852Z * [new branch] csl/executorch_docker_fail -> origin/csl/executorch_docker_fail 2025-08-14T21:18:05.9608451Z * [new branch] csl/fix_check_alerts -> origin/csl/fix_check_alerts 2025-08-14T21:18:05.9608984Z * [new branch] csl/katex -> origin/csl/katex 2025-08-14T21:18:05.9609511Z * [new branch] csl/larger_runner -> origin/csl/larger_runner 2025-08-14T21:18:05.9610187Z * [new branch] csl/lintrunner_changed_files_removed -> origin/csl/lintrunner_changed_files_removed 2025-08-14T21:18:05.9610922Z * [new branch] csl/lintrunner_changed_files_removed_test -> origin/csl/lintrunner_changed_files_removed_test 2025-08-14T21:18:05.9611417Z * [new branch] csl/lintrunner_stuff -> origin/csl/lintrunner_stuff 2025-08-14T21:18:05.9611958Z * [new branch] csl/mps_sharding -> origin/csl/mps_sharding 2025-08-14T21:18:05.9612586Z * [new branch] csl/multistage_docker -> origin/csl/multistage_docker 2025-08-14T21:18:05.9613178Z * [new branch] csl/no_keep_goin_rocm -> origin/csl/no_keep_goin_rocm 2025-08-14T21:18:05.9613849Z * [new branch] csl/not_600_timeout -> origin/csl/not_600_timeout 2025-08-14T21:18:05.9614431Z * [new branch] csl/remove_unused_docker_images -> origin/csl/remove_unused_docker_images 2025-08-14T21:18:05.9614976Z * [new branch] csl/revert_open -> origin/csl/revert_open 2025-08-14T21:18:05.9615653Z * [new branch] csl/rocm_upload_artifacts_while_running -> origin/csl/rocm_upload_artifacts_while_running 2025-08-14T21:18:05.9616274Z * [new branch] csl/skip_build -> origin/csl/skip_build 2025-08-14T21:18:05.9616810Z * [new branch] csl/td_dynamo -> origin/csl/td_dynamo 2025-08-14T21:18:05.9617472Z * [new branch] csl/test_cuda_build_large_runner -> origin/csl/test_cuda_build_large_runner 2025-08-14T21:18:05.9618153Z * [new branch] csl/unused_docker -> origin/csl/unused_docker 2025-08-14T21:18:05.9618693Z * [new branch] csl/win_sccache -> origin/csl/win_sccache 2025-08-14T21:18:05.9619624Z * [new branch] cublasltrelax2 -> origin/cublasltrelax2 2025-08-14T21:18:05.9619967Z * [new branch] cublasrelax2 -> origin/cublasrelax2 2025-08-14T21:18:05.9620759Z * [new branch] cudnnsdparefactor -> origin/cudnnsdparefactor 2025-08-14T21:18:05.9621360Z * [new branch] custom_lowering_dict -> origin/custom_lowering_dict 2025-08-14T21:18:05.9621909Z * [new branch] czhuge_muon_dev -> origin/czhuge_muon_dev 2025-08-14T21:18:05.9623219Z * [new branch] d4l3k/delete_hook -> origin/d4l3k/delete_hook 2025-08-14T21:18:05.9623531Z * [new branch] d4l3k/dist_queue -> origin/d4l3k/dist_queue 2025-08-14T21:18:05.9624285Z * [new branch] d4l3k/wait_stream -> origin/d4l3k/wait_stream 2025-08-14T21:18:05.9624955Z * [new branch] dcp-safetensor-test-fix -> origin/dcp-safetensor-test-fix 2025-08-14T21:18:05.9625609Z * [new branch] dcp_zoc -> origin/dcp_zoc 2025-08-14T21:18:05.9626276Z * [new branch] delete-quant-docs -> origin/delete-quant-docs 2025-08-14T21:18:05.9629692Z * [new branch] dependabot/pip/dot-ci/docker/protobuf-5.29.5 -> origin/dependabot/pip/dot-ci/docker/protobuf-5.29.5 2025-08-14T21:18:05.9630178Z * [new branch] desertfire/test_cpp_wrapper -> origin/desertfire/test_cpp_wrapper 2025-08-14T21:18:05.9630569Z * [new branch] desertfire/triton-cpu-for-aarch64 -> origin/desertfire/triton-cpu-for-aarch64 2025-08-14T21:18:05.9630960Z * [new branch] dev/joona/MPSNDArrayAdd -> origin/dev/joona/MPSNDArrayAdd 2025-08-14T21:18:05.9635190Z * [new branch] dev/joona/Unranked -> origin/dev/joona/Unranked 2025-08-14T21:18:05.9635700Z * [new branch] dev/joona/cat -> origin/dev/joona/cat 2025-08-14T21:18:05.9636154Z * [new branch] dev/joona/cat_remove_graph -> origin/dev/joona/cat_remove_graph 2025-08-14T21:18:05.9636512Z * [new branch] dev/joona/embeddingbag -> origin/dev/joona/embeddingbag 2025-08-14T21:18:05.9637027Z * [new branch] dev/joona/getTensorsString -> origin/dev/joona/getTensorsString 2025-08-14T21:18:05.9637458Z * [new branch] dev/joona/maxpool2dwithindices_errmsg -> origin/dev/joona/maxpool2dwithindices_errmsg 2025-08-14T21:18:05.9637861Z * [new branch] dev/joona/mps_linear_macos14 -> origin/dev/joona/mps_linear_macos14 2025-08-14T21:18:05.9638317Z * [new branch] dev/joona/sdpa -> origin/dev/joona/sdpa 2025-08-14T21:18:05.9638788Z * [new branch] dev/joona/synchronize_benchmark -> origin/dev/joona/synchronize_benchmark 2025-08-14T21:18:05.9639151Z * [new branch] dev/joona/topk_newapi -> origin/dev/joona/topk_newapi 2025-08-14T21:18:05.9642638Z * [new branch] dev/joona/type_inf -> origin/dev/joona/type_inf 2025-08-14T21:18:05.9643167Z * [new branch] dev/joona/upsize3d -> origin/dev/joona/upsize3d 2025-08-14T21:18:05.9643950Z * [new branch] disable -> origin/disable 2025-08-14T21:18:05.9644393Z * [new branch] divyanshk-log-api-usage-datapipes-1 -> origin/divyanshk-log-api-usage-datapipes-1 2025-08-14T21:18:05.9644773Z * [new branch] e2e-baseline -> origin/e2e-baseline 2025-08-14T21:18:05.9645090Z * [new branch] embg/test_inductor_ci_128B -> origin/embg/test_inductor_ci_128B 2025-08-14T21:18:05.9645413Z * [new branch] embg/test_inductor_ci_base -> origin/embg/test_inductor_ci_base 2025-08-14T21:18:05.9646098Z * [new branch] embg/test_inductor_ci_control -> origin/embg/test_inductor_ci_control 2025-08-14T21:18:05.9646470Z * [new branch] embg/triton_l2_prefetch_128B -> origin/embg/triton_l2_prefetch_128B 2025-08-14T21:18:05.9647011Z * [new branch] embg/triton_l2_prefetch_256B -> origin/embg/triton_l2_prefetch_256B 2025-08-14T21:18:05.9647363Z * [new branch] enable-b200-benchmark -> origin/enable-b200-benchmark 2025-08-14T21:18:05.9647692Z * [new branch] eqy-patch-1 -> origin/eqy-patch-1 2025-08-14T21:18:05.9647983Z * [new branch] eqy-patch-10 -> origin/eqy-patch-10 2025-08-14T21:18:05.9648269Z * [new branch] eqy-patch-2 -> origin/eqy-patch-2 2025-08-14T21:18:05.9648719Z * [new branch] example-convert-torch.nn -> origin/example-convert-torch.nn 2025-08-14T21:18:05.9649280Z * [new branch] exclamaforte/amd-ma -> origin/exclamaforte/amd-ma 2025-08-14T21:18:05.9649893Z * [new branch] exclamaforte/bump-transformer-version -> origin/exclamaforte/bump-transformer-version 2025-08-14T21:18:05.9650496Z * [new branch] exclamaforte/combo-kernels-perf-run -> origin/exclamaforte/combo-kernels-perf-run 2025-08-14T21:18:05.9651158Z * [new branch] exclamaforte/debug-autotuner-profile -> origin/exclamaforte/debug-autotuner-profile 2025-08-14T21:18:05.9651659Z * [new branch] exclamaforte/do_bench_refactor -> origin/exclamaforte/do_bench_refactor 2025-08-14T21:18:05.9653260Z * [new branch] exclamaforte/enable-mem-dep-fusion -> origin/exclamaforte/enable-mem-dep-fusion 2025-08-14T21:18:05.9653926Z * [new branch] exclamaforte/fix-exhaustive-autotuning -> origin/exclamaforte/fix-exhaustive-autotuning 2025-08-14T21:18:05.9654515Z * [new branch] exclamaforte/fix-trace-parsing-fx-svg -> origin/exclamaforte/fix-trace-parsing-fx-svg 2025-08-14T21:18:05.9655018Z * [new branch] exclamaforte/force-pointwise-cat-perf-run -> origin/exclamaforte/force-pointwise-cat-perf-run 2025-08-14T21:18:05.9655447Z * [new branch] exclamaforte/fusion-data -> origin/exclamaforte/fusion-data 2025-08-14T21:18:05.9655937Z * [new branch] exclamaforte/gemm-benchmark-run -> origin/exclamaforte/gemm-benchmark-run 2025-08-14T21:18:05.9656527Z * [new branch] exclamaforte/gemm-export-model -> origin/exclamaforte/gemm-export-model 2025-08-14T21:18:05.9657085Z * [new branch] exclamaforte/gemm-model -> origin/exclamaforte/gemm-model 2025-08-14T21:18:05.9657775Z * [new branch] exclamaforte/gemm-model-all-data-collection -> origin/exclamaforte/gemm-model-all-data-collection 2025-08-14T21:18:05.9658358Z * [new branch] exclamaforte/gemm-to-amd -> origin/exclamaforte/gemm-to-amd 2025-08-14T21:18:05.9659125Z * [new branch] exclamaforte/just-gemm-model -> origin/exclamaforte/just-gemm-model 2025-08-14T21:18:05.9659717Z * [new branch] exclamaforte/just-gemm-model-no-refactor -> origin/exclamaforte/just-gemm-model-no-refactor 2025-08-14T21:18:05.9660280Z * [new branch] exclamaforte/memory-counter -> origin/exclamaforte/memory-counter 2025-08-14T21:18:05.9660930Z * [new branch] exclamaforte/scheduler-refactor -> origin/exclamaforte/scheduler-refactor 2025-08-14T21:18:05.9661648Z * [new branch] exclamaforte/test_cpp_wrapper_mode -> origin/exclamaforte/test_cpp_wrapper_mode 2025-08-14T21:18:05.9662205Z * [new branch] exclamaforte/update-autotune-configs -> origin/exclamaforte/update-autotune-configs 2025-08-14T21:18:05.9662819Z * [new branch] exclamaforte/update-autotune-configs-2 -> origin/exclamaforte/update-autotune-configs-2 2025-08-14T21:18:05.9663294Z * [new branch] exclamaforte/update-pandas-numpy-ci -> origin/exclamaforte/update-pandas-numpy-ci 2025-08-14T21:18:05.9664721Z * [new branch] exclamforte/gemm-model-final -> origin/exclamforte/gemm-model-final 2025-08-14T21:18:05.9665468Z * [new branch] exec -> origin/exec 2025-08-14T21:18:05.9666144Z * [new branch] experimental-mosaic -> origin/experimental-mosaic 2025-08-14T21:18:05.9666908Z * [new branch] export-D58091437 -> origin/export-D58091437 2025-08-14T21:18:05.9667529Z * [new branch] export-D61047529 -> origin/export-D61047529 2025-08-14T21:18:05.9668216Z * [new branch] export-D68846308 -> origin/export-D68846308 2025-08-14T21:18:05.9668969Z * [new branch] export-D70112642 -> origin/export-D70112642 2025-08-14T21:18:05.9669595Z * [new branch] export-D71412006 -> origin/export-D71412006 2025-08-14T21:18:05.9670275Z * [new branch] export-D72483950 -> origin/export-D72483950 2025-08-14T21:18:05.9670955Z * [new branch] export-D73042989 -> origin/export-D73042989 2025-08-14T21:18:05.9671647Z * [new branch] export-D73287751 -> origin/export-D73287751 2025-08-14T21:18:05.9672221Z * [new branch] export-D75183591 -> origin/export-D75183591 2025-08-14T21:18:05.9672896Z * [new branch] export-D75605373 -> origin/export-D75605373 2025-08-14T21:18:05.9673601Z * [new branch] export-D75617432 -> origin/export-D75617432 2025-08-14T21:18:05.9674214Z * [new branch] export-D75659965 -> origin/export-D75659965 2025-08-14T21:18:05.9674865Z * [new branch] export-D76080931 -> origin/export-D76080931 2025-08-14T21:18:05.9675558Z * [new branch] export-D76463347 -> origin/export-D76463347 2025-08-14T21:18:05.9676194Z * [new branch] export-D76797250 -> origin/export-D76797250 2025-08-14T21:18:05.9676814Z * [new branch] export-D76885271 -> origin/export-D76885271 2025-08-14T21:18:05.9677410Z * [new branch] export-D76885620 -> origin/export-D76885620 2025-08-14T21:18:05.9678006Z * [new branch] export-D76936623 -> origin/export-D76936623 2025-08-14T21:18:05.9678685Z * [new branch] export-D76958268 -> origin/export-D76958268 2025-08-14T21:18:05.9679299Z * [new branch] export-D78047846 -> origin/export-D78047846 2025-08-14T21:18:05.9679912Z * [new branch] export-D78308105 -> origin/export-D78308105 2025-08-14T21:18:05.9680525Z * [new branch] export-D78363609 -> origin/export-D78363609 2025-08-14T21:18:05.9681132Z * [new branch] export-D78375400 -> origin/export-D78375400 2025-08-14T21:18:05.9681786Z * [new branch] export-D78431075 -> origin/export-D78431075 2025-08-14T21:18:05.9682394Z * [new branch] export-D78431305 -> origin/export-D78431305 2025-08-14T21:18:05.9684002Z * [new branch] export-D78458745 -> origin/export-D78458745 2025-08-14T21:18:05.9684307Z * [new branch] export-D78524147 -> origin/export-D78524147 2025-08-14T21:18:05.9684750Z * [new branch] export-D78580107 -> origin/export-D78580107 2025-08-14T21:18:05.9685261Z * [new branch] export-D78588406 -> origin/export-D78588406 2025-08-14T21:18:05.9686679Z * [new branch] export-D78691422 -> origin/export-D78691422 2025-08-14T21:18:05.9687131Z * [new branch] export-D78758466 -> origin/export-D78758466 2025-08-14T21:18:05.9687528Z * [new branch] export-D78822171 -> origin/export-D78822171 2025-08-14T21:18:05.9687930Z * [new branch] export-D78822351 -> origin/export-D78822351 2025-08-14T21:18:05.9688494Z * [new branch] export-D78822507 -> origin/export-D78822507 2025-08-14T21:18:05.9689386Z * [new branch] export-D78826994 -> origin/export-D78826994 2025-08-14T21:18:05.9689755Z * [new branch] export-D78894142 -> origin/export-D78894142 2025-08-14T21:18:05.9690373Z * [new branch] export-D78894324 -> origin/export-D78894324 2025-08-14T21:18:05.9690977Z * [new branch] export-D78907485 -> origin/export-D78907485 2025-08-14T21:18:05.9691642Z * [new branch] export-D78929245 -> origin/export-D78929245 2025-08-14T21:18:05.9692191Z * [new branch] export-D78934925 -> origin/export-D78934925 2025-08-14T21:18:05.9694069Z * [new branch] export-D78953203 -> origin/export-D78953203 2025-08-14T21:18:05.9694437Z * [new branch] export-D78953229 -> origin/export-D78953229 2025-08-14T21:18:05.9694729Z * [new branch] export-D78957093 -> origin/export-D78957093 2025-08-14T21:18:05.9695028Z * [new branch] export-D78957389 -> origin/export-D78957389 2025-08-14T21:18:05.9695319Z * [new branch] export-D78957974 -> origin/export-D78957974 2025-08-14T21:18:05.9696035Z * [new branch] export-D78979812 -> origin/export-D78979812 2025-08-14T21:18:05.9697156Z * [new branch] export-D78996107 -> origin/export-D78996107 2025-08-14T21:18:05.9697869Z * [new branch] export-D79026433 -> origin/export-D79026433 2025-08-14T21:18:05.9698256Z * [new branch] export-D79230339 -> origin/export-D79230339 2025-08-14T21:18:05.9699210Z * [new branch] export-D79319835 -> origin/export-D79319835 2025-08-14T21:18:05.9699703Z * [new branch] export-D79328456 -> origin/export-D79328456 2025-08-14T21:18:05.9700398Z * [new branch] export-D79534608 -> origin/export-D79534608 2025-08-14T21:18:05.9701064Z * [new branch] export-D79647167 -> origin/export-D79647167 2025-08-14T21:18:05.9701791Z * [new branch] export-D79751098 -> origin/export-D79751098 2025-08-14T21:18:05.9702628Z * [new branch] export-D79785974 -> origin/export-D79785974 2025-08-14T21:18:05.9703236Z * [new branch] export-D80025417 -> origin/export-D80025417 2025-08-14T21:18:05.9704035Z * [new branch] export-D80120333 -> origin/export-D80120333 2025-08-14T21:18:05.9704689Z * [new branch] export-D80214882 -> origin/export-D80214882 2025-08-14T21:18:05.9705629Z * [new branch] exported-model-train-idempotent -> origin/exported-model-train-idempotent 2025-08-14T21:18:05.9706408Z * [new branch] ezyang/wip-aot-descriptors -> origin/ezyang/wip-aot-descriptors 2025-08-14T21:18:05.9707060Z * [new branch] fa_u8_brgemm -> origin/fa_u8_brgemm 2025-08-14T21:18:05.9708506Z * [new branch] fastmath_baseline -> origin/fastmath_baseline 2025-08-14T21:18:05.9708836Z * [new branch] fbcode/warm -> origin/fbcode/warm 2025-08-14T21:18:05.9710191Z * [new branch] fca -> origin/fca 2025-08-14T21:18:05.9710503Z * [new branch] fca2_ca5984c -> origin/fca2_ca5984c 2025-08-14T21:18:05.9711353Z * [new branch] fca5 -> origin/fca5 2025-08-14T21:18:05.9712765Z * [new branch] feature/function-numa-binding -> origin/feature/function-numa-binding 2025-08-14T21:18:05.9713166Z * [new branch] fengyuan/external-proj -> origin/fengyuan/external-proj 2025-08-14T21:18:05.9713875Z * [new branch] fengyuan/out-of-tree-xpu-ops-improve-test -> origin/fengyuan/out-of-tree-xpu-ops-improve-test 2025-08-14T21:18:05.9714659Z * [new branch] fengyuan/out-of-tree-xpu-ops-remove-dtype -> origin/fengyuan/out-of-tree-xpu-ops-remove-dtype 2025-08-14T21:18:05.9715270Z * [new branch] fengyuan/test-xpu -> origin/fengyuan/test-xpu 2025-08-14T21:18:05.9715891Z * [new branch] ffast_math_baseline -> origin/ffast_math_baseline 2025-08-14T21:18:05.9716561Z * [new branch] ffast_math_target -> origin/ffast_math_target 2025-08-14T21:18:05.9717764Z * [new branch] findhao/base_commit -> origin/findhao/base_commit 2025-08-14T21:18:05.9718082Z * [new branch] findhao/base_commit1 -> origin/findhao/base_commit1 2025-08-14T21:18:05.9718694Z * [new branch] findhao/fix-indirect-access -> origin/findhao/fix-indirect-access 2025-08-14T21:18:05.9719251Z * [new branch] findhao/multistream2 -> origin/findhao/multistream2 2025-08-14T21:18:05.9719873Z * [new branch] findhao/multistream5 -> origin/findhao/multistream5 2025-08-14T21:18:05.9720461Z * [new branch] findhao/multistream6 -> origin/findhao/multistream6 2025-08-14T21:18:05.9720972Z * [new branch] findhao/operatorbench3 -> origin/findhao/operatorbench3 2025-08-14T21:18:05.9722804Z * [new branch] findhao/operatorbench5 -> origin/findhao/operatorbench5 2025-08-14T21:18:05.9723188Z * [new branch] findhao/tritonparse -> origin/findhao/tritonparse 2025-08-14T21:18:05.9723533Z * [new branch] fix -> origin/fix 2025-08-14T21:18:05.9724868Z * [new branch] fix-ck-gemm-template-format -> origin/fix-ck-gemm-template-format 2025-08-14T21:18:05.9725455Z * [new branch] fix-config-ignore -> origin/fix-config-ignore 2025-08-14T21:18:05.9725882Z * [new branch] fix-dict-guard -> origin/fix-dict-guard 2025-08-14T21:18:05.9726222Z * [new branch] fix-distributed-warning -> origin/fix-distributed-warning 2025-08-14T21:18:05.9726895Z * [new branch] fix-inductor-periodic-0528 -> origin/fix-inductor-periodic-0528 2025-08-14T21:18:05.9727566Z * [new branch] fix-rlease-feature-template -> origin/fix-rlease-feature-template 2025-08-14T21:18:05.9728060Z * [new branch] fix_153389 -> origin/fix_153389 2025-08-14T21:18:05.9728729Z * [new branch] fixes-triage -> origin/fixes-triage 2025-08-14T21:18:05.9729544Z * [new branch] flash_decoding_cpu -> origin/flash_decoding_cpu 2025-08-14T21:18:05.9730034Z * [new branch] flex-flash -> origin/flex-flash 2025-08-14T21:18:05.9730641Z * [new branch] flex-lowering -> origin/flex-lowering 2025-08-14T21:18:05.9731260Z * [new branch] flex-warning -> origin/flex-warning 2025-08-14T21:18:05.9731954Z * [new branch] flex_attention_functorch_grad -> origin/flex_attention_functorch_grad 2025-08-14T21:18:05.9732580Z * [new branch] flex_flash -> origin/flex_flash 2025-08-14T21:18:05.9734262Z * [new branch] fmassa/fix_memeff_sharding_rule -> origin/fmassa/fix_memeff_sharding_rule 2025-08-14T21:18:05.9734704Z * [new branch] fmassa/try_fix_ac_tag_propagation -> origin/fmassa/try_fix_ac_tag_propagation 2025-08-14T21:18:05.9735259Z * [new branch] fsdp2_trace_rules -> origin/fsdp2_trace_rules 2025-08-14T21:18:05.9735549Z * [new branch] fsdpv2_3d -> origin/fsdpv2_3d 2025-08-14T21:18:05.9736190Z * [new branch] fsdpv2_3d_m1 -> origin/fsdpv2_3d_m1 2025-08-14T21:18:05.9739339Z * [new branch] fx_cpp -> origin/fx_cpp 2025-08-14T21:18:05.9739666Z * [new branch] fy/fix-win -> origin/fy/fix-win 2025-08-14T21:18:05.9739986Z * [new branch] gh/AlnisM/1/base -> origin/gh/AlnisM/1/base 2025-08-14T21:18:05.9740625Z * [new branch] gh/AlnisM/1/head -> origin/gh/AlnisM/1/head 2025-08-14T21:18:05.9741273Z * [new branch] gh/CaoE/2/base -> origin/gh/CaoE/2/base 2025-08-14T21:18:05.9741795Z * [new branch] gh/CaoE/2/head -> origin/gh/CaoE/2/head 2025-08-14T21:18:05.9742434Z * [new branch] gh/CaoE/2/orig -> origin/gh/CaoE/2/orig 2025-08-14T21:18:05.9744030Z * [new branch] gh/ColinPeppler/72/base -> origin/gh/ColinPeppler/72/base 2025-08-14T21:18:05.9744469Z * [new branch] gh/ColinPeppler/72/head -> origin/gh/ColinPeppler/72/head 2025-08-14T21:18:05.9745065Z * [new branch] gh/ColinPeppler/72/orig -> origin/gh/ColinPeppler/72/orig 2025-08-14T21:18:05.9746458Z * [new branch] gh/ColinPeppler/77/base -> origin/gh/ColinPeppler/77/base 2025-08-14T21:18:05.9746817Z * [new branch] gh/ColinPeppler/77/head -> origin/gh/ColinPeppler/77/head 2025-08-14T21:18:05.9747434Z * [new branch] gh/ColinPeppler/77/orig -> origin/gh/ColinPeppler/77/orig 2025-08-14T21:18:05.9749058Z * [new branch] gh/ColinPeppler/78/base -> origin/gh/ColinPeppler/78/base 2025-08-14T21:18:05.9749448Z * [new branch] gh/ColinPeppler/78/head -> origin/gh/ColinPeppler/78/head 2025-08-14T21:18:05.9749974Z * [new branch] gh/ColinPeppler/78/orig -> origin/gh/ColinPeppler/78/orig 2025-08-14T21:18:05.9750879Z * [new branch] gh/EikanWang/67/base -> origin/gh/EikanWang/67/base 2025-08-14T21:18:05.9751344Z * [new branch] gh/EikanWang/67/head -> origin/gh/EikanWang/67/head 2025-08-14T21:18:05.9752470Z * [new branch] gh/EikanWang/80/base -> origin/gh/EikanWang/80/base 2025-08-14T21:18:05.9752866Z * [new branch] gh/EikanWang/80/head -> origin/gh/EikanWang/80/head 2025-08-14T21:18:05.9753392Z * [new branch] gh/EikanWang/80/orig -> origin/gh/EikanWang/80/orig 2025-08-14T21:18:05.9757702Z * [new branch] gh/EikanWang/81/base -> origin/gh/EikanWang/81/base 2025-08-14T21:18:05.9758238Z * [new branch] gh/EikanWang/81/head -> origin/gh/EikanWang/81/head 2025-08-14T21:18:05.9759078Z * [new branch] gh/EikanWang/81/orig -> origin/gh/EikanWang/81/orig 2025-08-14T21:18:05.9759610Z * [new branch] gh/Gasoonjia/1/base -> origin/gh/Gasoonjia/1/base 2025-08-14T21:18:05.9759922Z * [new branch] gh/Gasoonjia/1/head -> origin/gh/Gasoonjia/1/head 2025-08-14T21:18:05.9760229Z * [new branch] gh/H-Huang/131/base -> origin/gh/H-Huang/131/base 2025-08-14T21:18:05.9760527Z * [new branch] gh/H-Huang/131/head -> origin/gh/H-Huang/131/head 2025-08-14T21:18:05.9760992Z * [new branch] gh/H-Huang/131/orig -> origin/gh/H-Huang/131/orig 2025-08-14T21:18:05.9761409Z * [new branch] gh/H-Huang/132/base -> origin/gh/H-Huang/132/base 2025-08-14T21:18:05.9761828Z * [new branch] gh/H-Huang/132/head -> origin/gh/H-Huang/132/head 2025-08-14T21:18:05.9762221Z * [new branch] gh/H-Huang/132/orig -> origin/gh/H-Huang/132/orig 2025-08-14T21:18:05.9766286Z * [new branch] gh/H-Huang/180/base -> origin/gh/H-Huang/180/base 2025-08-14T21:18:05.9766809Z * [new branch] gh/H-Huang/180/head -> origin/gh/H-Huang/180/head 2025-08-14T21:18:05.9767242Z * [new branch] gh/H-Huang/180/orig -> origin/gh/H-Huang/180/orig 2025-08-14T21:18:05.9767550Z * [new branch] gh/H-Huang/182/base -> origin/gh/H-Huang/182/base 2025-08-14T21:18:05.9767848Z * [new branch] gh/H-Huang/182/head -> origin/gh/H-Huang/182/head 2025-08-14T21:18:05.9768133Z * [new branch] gh/H-Huang/182/orig -> origin/gh/H-Huang/182/orig 2025-08-14T21:18:05.9768578Z * [new branch] gh/H-Huang/183/base -> origin/gh/H-Huang/183/base 2025-08-14T21:18:05.9768880Z * [new branch] gh/H-Huang/183/head -> origin/gh/H-Huang/183/head 2025-08-14T21:18:05.9769176Z * [new branch] gh/H-Huang/183/orig -> origin/gh/H-Huang/183/orig 2025-08-14T21:18:05.9769502Z * [new branch] gh/H-Huang/187/base -> origin/gh/H-Huang/187/base 2025-08-14T21:18:05.9770063Z * [new branch] gh/H-Huang/187/head -> origin/gh/H-Huang/187/head 2025-08-14T21:18:05.9770644Z * [new branch] gh/H-Huang/187/orig -> origin/gh/H-Huang/187/orig 2025-08-14T21:18:05.9774278Z * [new branch] gh/H-Huang/192/base -> origin/gh/H-Huang/192/base 2025-08-14T21:18:05.9774785Z * [new branch] gh/H-Huang/192/head -> origin/gh/H-Huang/192/head 2025-08-14T21:18:05.9775217Z * [new branch] gh/H-Huang/192/orig -> origin/gh/H-Huang/192/orig 2025-08-14T21:18:05.9776028Z * [new branch] gh/H-Huang/195/base -> origin/gh/H-Huang/195/base 2025-08-14T21:18:05.9776393Z * [new branch] gh/H-Huang/195/head -> origin/gh/H-Huang/195/head 2025-08-14T21:18:05.9776697Z * [new branch] gh/H-Huang/195/orig -> origin/gh/H-Huang/195/orig 2025-08-14T21:18:05.9777001Z * [new branch] gh/H-Huang/196/base -> origin/gh/H-Huang/196/base 2025-08-14T21:18:05.9777298Z * [new branch] gh/H-Huang/196/head -> origin/gh/H-Huang/196/head 2025-08-14T21:18:05.9777590Z * [new branch] gh/H-Huang/196/orig -> origin/gh/H-Huang/196/orig 2025-08-14T21:18:05.9778332Z * [new branch] gh/H-Huang/197/base -> origin/gh/H-Huang/197/base 2025-08-14T21:18:05.9779039Z * [new branch] gh/H-Huang/197/head -> origin/gh/H-Huang/197/head 2025-08-14T21:18:05.9779703Z * [new branch] gh/H-Huang/197/orig -> origin/gh/H-Huang/197/orig 2025-08-14T21:18:05.9781316Z * [new branch] gh/H-Huang/198/base -> origin/gh/H-Huang/198/base 2025-08-14T21:18:05.9781637Z * [new branch] gh/H-Huang/198/head -> origin/gh/H-Huang/198/head 2025-08-14T21:18:05.9781959Z * [new branch] gh/H-Huang/198/orig -> origin/gh/H-Huang/198/orig 2025-08-14T21:18:05.9782808Z * [new branch] gh/H-Huang/199/base -> origin/gh/H-Huang/199/base 2025-08-14T21:18:05.9783413Z * [new branch] gh/H-Huang/199/head -> origin/gh/H-Huang/199/head 2025-08-14T21:18:05.9784014Z * [new branch] gh/H-Huang/199/orig -> origin/gh/H-Huang/199/orig 2025-08-14T21:18:05.9785657Z * [new branch] gh/H-Huang/200/base -> origin/gh/H-Huang/200/base 2025-08-14T21:18:05.9785964Z * [new branch] gh/H-Huang/200/head -> origin/gh/H-Huang/200/head 2025-08-14T21:18:05.9786288Z * [new branch] gh/H-Huang/200/orig -> origin/gh/H-Huang/200/orig 2025-08-14T21:18:05.9787843Z * [new branch] gh/H-Huang/201/base -> origin/gh/H-Huang/201/base 2025-08-14T21:18:05.9788348Z * [new branch] gh/H-Huang/201/head -> origin/gh/H-Huang/201/head 2025-08-14T21:18:05.9788783Z * [new branch] gh/H-Huang/201/orig -> origin/gh/H-Huang/201/orig 2025-08-14T21:18:05.9789111Z * [new branch] gh/H-Huang/202/base -> origin/gh/H-Huang/202/base 2025-08-14T21:18:05.9789788Z * [new branch] gh/H-Huang/202/head -> origin/gh/H-Huang/202/head 2025-08-14T21:18:05.9790292Z * [new branch] gh/H-Huang/202/orig -> origin/gh/H-Huang/202/orig 2025-08-14T21:18:05.9792591Z * [new branch] gh/H-Huang/203/base -> origin/gh/H-Huang/203/base 2025-08-14T21:18:05.9793109Z * [new branch] gh/H-Huang/203/head -> origin/gh/H-Huang/203/head 2025-08-14T21:18:05.9794127Z * [new branch] gh/H-Huang/203/orig -> origin/gh/H-Huang/203/orig 2025-08-14T21:18:05.9794507Z * [new branch] gh/H-Huang/204/base -> origin/gh/H-Huang/204/base 2025-08-14T21:18:05.9794807Z * [new branch] gh/H-Huang/204/head -> origin/gh/H-Huang/204/head 2025-08-14T21:18:05.9795425Z * [new branch] gh/H-Huang/204/orig -> origin/gh/H-Huang/204/orig 2025-08-14T21:18:05.9795760Z * [new branch] gh/H-Huang/205/base -> origin/gh/H-Huang/205/base 2025-08-14T21:18:05.9796394Z * [new branch] gh/H-Huang/205/head -> origin/gh/H-Huang/205/head 2025-08-14T21:18:05.9796895Z * [new branch] gh/H-Huang/205/orig -> origin/gh/H-Huang/205/orig 2025-08-14T21:18:05.9798323Z * [new branch] gh/H-Huang/206/base -> origin/gh/H-Huang/206/base 2025-08-14T21:18:05.9798747Z * [new branch] gh/H-Huang/206/head -> origin/gh/H-Huang/206/head 2025-08-14T21:18:05.9799176Z * [new branch] gh/H-Huang/206/orig -> origin/gh/H-Huang/206/orig 2025-08-14T21:18:05.9799826Z * [new branch] gh/H-Huang/207/base -> origin/gh/H-Huang/207/base 2025-08-14T21:18:05.9800489Z * [new branch] gh/H-Huang/207/head -> origin/gh/H-Huang/207/head 2025-08-14T21:18:05.9801096Z * [new branch] gh/H-Huang/207/orig -> origin/gh/H-Huang/207/orig 2025-08-14T21:18:05.9803384Z * [new branch] gh/H-Huang/208/base -> origin/gh/H-Huang/208/base 2025-08-14T21:18:05.9803887Z * [new branch] gh/H-Huang/208/head -> origin/gh/H-Huang/208/head 2025-08-14T21:18:05.9804304Z * [new branch] gh/H-Huang/208/orig -> origin/gh/H-Huang/208/orig 2025-08-14T21:18:05.9804604Z * [new branch] gh/H-Huang/209/base -> origin/gh/H-Huang/209/base 2025-08-14T21:18:05.9804900Z * [new branch] gh/H-Huang/209/head -> origin/gh/H-Huang/209/head 2025-08-14T21:18:05.9805266Z * [new branch] gh/H-Huang/209/orig -> origin/gh/H-Huang/209/orig 2025-08-14T21:18:05.9809156Z * [new branch] gh/IvanKobzarev/107/base -> origin/gh/IvanKobzarev/107/base 2025-08-14T21:18:05.9809694Z * [new branch] gh/IvanKobzarev/107/head -> origin/gh/IvanKobzarev/107/head 2025-08-14T21:18:05.9810278Z * [new branch] gh/IvanKobzarev/107/orig -> origin/gh/IvanKobzarev/107/orig 2025-08-14T21:18:05.9810610Z * [new branch] gh/IvanKobzarev/110/base -> origin/gh/IvanKobzarev/110/base 2025-08-14T21:18:05.9810924Z * [new branch] gh/IvanKobzarev/110/head -> origin/gh/IvanKobzarev/110/head 2025-08-14T21:18:05.9811245Z * [new branch] gh/IvanKobzarev/110/orig -> origin/gh/IvanKobzarev/110/orig 2025-08-14T21:18:05.9811704Z * [new branch] gh/IvanKobzarev/111/base -> origin/gh/IvanKobzarev/111/base 2025-08-14T21:18:05.9812038Z * [new branch] gh/IvanKobzarev/111/head -> origin/gh/IvanKobzarev/111/head 2025-08-14T21:18:05.9812357Z * [new branch] gh/IvanKobzarev/111/orig -> origin/gh/IvanKobzarev/111/orig 2025-08-14T21:18:05.9813800Z * [new branch] gh/IvanKobzarev/112/base -> origin/gh/IvanKobzarev/112/base 2025-08-14T21:18:05.9814345Z * [new branch] gh/IvanKobzarev/112/head -> origin/gh/IvanKobzarev/112/head 2025-08-14T21:18:05.9814829Z * [new branch] gh/IvanKobzarev/112/orig -> origin/gh/IvanKobzarev/112/orig 2025-08-14T21:18:05.9815604Z * [new branch] gh/IvanKobzarev/115/base -> origin/gh/IvanKobzarev/115/base 2025-08-14T21:18:05.9816165Z * [new branch] gh/IvanKobzarev/115/head -> origin/gh/IvanKobzarev/115/head 2025-08-14T21:18:05.9816880Z * [new branch] gh/IvanKobzarev/115/orig -> origin/gh/IvanKobzarev/115/orig 2025-08-14T21:18:05.9820211Z * [new branch] gh/IvanKobzarev/116/base -> origin/gh/IvanKobzarev/116/base 2025-08-14T21:18:05.9820743Z * [new branch] gh/IvanKobzarev/116/head -> origin/gh/IvanKobzarev/116/head 2025-08-14T21:18:05.9821077Z * [new branch] gh/IvanKobzarev/116/orig -> origin/gh/IvanKobzarev/116/orig 2025-08-14T21:18:05.9821404Z * [new branch] gh/IvanKobzarev/118/base -> origin/gh/IvanKobzarev/118/base 2025-08-14T21:18:05.9821735Z * [new branch] gh/IvanKobzarev/118/head -> origin/gh/IvanKobzarev/118/head 2025-08-14T21:18:05.9822053Z * [new branch] gh/IvanKobzarev/118/orig -> origin/gh/IvanKobzarev/118/orig 2025-08-14T21:18:05.9822411Z * [new branch] gh/IvanKobzarev/124/base -> origin/gh/IvanKobzarev/124/base 2025-08-14T21:18:05.9823068Z * [new branch] gh/IvanKobzarev/124/head -> origin/gh/IvanKobzarev/124/head 2025-08-14T21:18:05.9823688Z * [new branch] gh/IvanKobzarev/124/orig -> origin/gh/IvanKobzarev/124/orig 2025-08-14T21:18:05.9824851Z * [new branch] gh/IvanKobzarev/126/base -> origin/gh/IvanKobzarev/126/base 2025-08-14T21:18:05.9825275Z * [new branch] gh/IvanKobzarev/126/head -> origin/gh/IvanKobzarev/126/head 2025-08-14T21:18:05.9827615Z * [new branch] gh/IvanKobzarev/126/orig -> origin/gh/IvanKobzarev/126/orig 2025-08-14T21:18:05.9828160Z * [new branch] gh/IvanKobzarev/127/base -> origin/gh/IvanKobzarev/127/base 2025-08-14T21:18:05.9828987Z * [new branch] gh/IvanKobzarev/127/head -> origin/gh/IvanKobzarev/127/head 2025-08-14T21:18:05.9829380Z * [new branch] gh/IvanKobzarev/127/orig -> origin/gh/IvanKobzarev/127/orig 2025-08-14T21:18:05.9829720Z * [new branch] gh/IvanKobzarev/128/base -> origin/gh/IvanKobzarev/128/base 2025-08-14T21:18:05.9830576Z * [new branch] gh/IvanKobzarev/128/head -> origin/gh/IvanKobzarev/128/head 2025-08-14T21:18:05.9830967Z * [new branch] gh/IvanKobzarev/128/orig -> origin/gh/IvanKobzarev/128/orig 2025-08-14T21:18:05.9831428Z * [new branch] gh/IvanKobzarev/129/base -> origin/gh/IvanKobzarev/129/base 2025-08-14T21:18:05.9832124Z * [new branch] gh/IvanKobzarev/129/head -> origin/gh/IvanKobzarev/129/head 2025-08-14T21:18:05.9832549Z * [new branch] gh/IvanKobzarev/129/orig -> origin/gh/IvanKobzarev/129/orig 2025-08-14T21:18:05.9834757Z * [new branch] gh/IvanKobzarev/130/base -> origin/gh/IvanKobzarev/130/base 2025-08-14T21:18:05.9835303Z * [new branch] gh/IvanKobzarev/130/head -> origin/gh/IvanKobzarev/130/head 2025-08-14T21:18:05.9835765Z * [new branch] gh/IvanKobzarev/130/orig -> origin/gh/IvanKobzarev/130/orig 2025-08-14T21:18:05.9836220Z * [new branch] gh/IvanKobzarev/131/base -> origin/gh/IvanKobzarev/131/base 2025-08-14T21:18:05.9836549Z * [new branch] gh/IvanKobzarev/131/head -> origin/gh/IvanKobzarev/131/head 2025-08-14T21:18:05.9837042Z * [new branch] gh/IvanKobzarev/131/orig -> origin/gh/IvanKobzarev/131/orig 2025-08-14T21:18:05.9840737Z * [new branch] gh/IvanKobzarev/132/base -> origin/gh/IvanKobzarev/132/base 2025-08-14T21:18:05.9841294Z * [new branch] gh/IvanKobzarev/132/head -> origin/gh/IvanKobzarev/132/head 2025-08-14T21:18:05.9841756Z * [new branch] gh/IvanKobzarev/132/orig -> origin/gh/IvanKobzarev/132/orig 2025-08-14T21:18:05.9842108Z * [new branch] gh/IvanKobzarev/133/base -> origin/gh/IvanKobzarev/133/base 2025-08-14T21:18:05.9842433Z * [new branch] gh/IvanKobzarev/133/head -> origin/gh/IvanKobzarev/133/head 2025-08-14T21:18:05.9842744Z * [new branch] gh/IvanKobzarev/133/orig -> origin/gh/IvanKobzarev/133/orig 2025-08-14T21:18:05.9846489Z * [new branch] gh/IvanKobzarev/134/base -> origin/gh/IvanKobzarev/134/base 2025-08-14T21:18:05.9850658Z * [new branch] gh/IvanKobzarev/134/head -> origin/gh/IvanKobzarev/134/head 2025-08-14T21:18:05.9852052Z * [new branch] gh/IvanKobzarev/134/orig -> origin/gh/IvanKobzarev/134/orig 2025-08-14T21:18:05.9852403Z * [new branch] gh/IvanKobzarev/135/base -> origin/gh/IvanKobzarev/135/base 2025-08-14T21:18:05.9852734Z * [new branch] gh/IvanKobzarev/135/head -> origin/gh/IvanKobzarev/135/head 2025-08-14T21:18:05.9853061Z * [new branch] gh/IvanKobzarev/135/orig -> origin/gh/IvanKobzarev/135/orig 2025-08-14T21:18:05.9853394Z * [new branch] gh/NikhilAPatel/1/base -> origin/gh/NikhilAPatel/1/base 2025-08-14T21:18:05.9853718Z * [new branch] gh/NikhilAPatel/1/head -> origin/gh/NikhilAPatel/1/head 2025-08-14T21:18:05.9854042Z * [new branch] gh/NikhilAPatel/16/base -> origin/gh/NikhilAPatel/16/base 2025-08-14T21:18:05.9854364Z * [new branch] gh/NikhilAPatel/16/head -> origin/gh/NikhilAPatel/16/head 2025-08-14T21:18:05.9854687Z * [new branch] gh/NikhilAPatel/16/orig -> origin/gh/NikhilAPatel/16/orig 2025-08-14T21:18:05.9854996Z * [new branch] gh/NikhilAPatel/18/base -> origin/gh/NikhilAPatel/18/base 2025-08-14T21:18:05.9855330Z * [new branch] gh/NikhilAPatel/18/head -> origin/gh/NikhilAPatel/18/head 2025-08-14T21:18:05.9855731Z * [new branch] gh/NikhilAPatel/18/orig -> origin/gh/NikhilAPatel/18/orig 2025-08-14T21:18:05.9856072Z * [new branch] gh/NikhilAPatel/19/base -> origin/gh/NikhilAPatel/19/base 2025-08-14T21:18:05.9856452Z * [new branch] gh/NikhilAPatel/19/head -> origin/gh/NikhilAPatel/19/head 2025-08-14T21:18:05.9859445Z * [new branch] gh/NikhilAPatel/19/orig -> origin/gh/NikhilAPatel/19/orig 2025-08-14T21:18:05.9859848Z * [new branch] gh/NikhilAPatel/2/base -> origin/gh/NikhilAPatel/2/base 2025-08-14T21:18:05.9860213Z * [new branch] gh/NikhilAPatel/2/head -> origin/gh/NikhilAPatel/2/head 2025-08-14T21:18:05.9860525Z * [new branch] gh/NikhilAPatel/4/base -> origin/gh/NikhilAPatel/4/base 2025-08-14T21:18:05.9860846Z * [new branch] gh/NikhilAPatel/4/head -> origin/gh/NikhilAPatel/4/head 2025-08-14T21:18:05.9861164Z * [new branch] gh/NikhilAPatel/8/base -> origin/gh/NikhilAPatel/8/base 2025-08-14T21:18:05.9861622Z * [new branch] gh/NikhilAPatel/8/head -> origin/gh/NikhilAPatel/8/head 2025-08-14T21:18:05.9861930Z * [new branch] gh/NikhilAPatel/8/orig -> origin/gh/NikhilAPatel/8/orig 2025-08-14T21:18:05.9862240Z * [new branch] gh/NikhilAPatel/9/base -> origin/gh/NikhilAPatel/9/base 2025-08-14T21:18:05.9862552Z * [new branch] gh/NikhilAPatel/9/head -> origin/gh/NikhilAPatel/9/head 2025-08-14T21:18:05.9862854Z * [new branch] gh/NikhilAPatel/9/orig -> origin/gh/NikhilAPatel/9/orig 2025-08-14T21:18:05.9863180Z * [new branch] gh/PaliC/1/base -> origin/gh/PaliC/1/base 2025-08-14T21:18:05.9863467Z * [new branch] gh/PaliC/1/head -> origin/gh/PaliC/1/head 2025-08-14T21:18:05.9863744Z * [new branch] gh/PaliC/1/orig -> origin/gh/PaliC/1/orig 2025-08-14T21:18:05.9864570Z * [new branch] gh/PaliC/12/base -> origin/gh/PaliC/12/base 2025-08-14T21:18:05.9865184Z * [new branch] gh/PaliC/12/head -> origin/gh/PaliC/12/head 2025-08-14T21:18:05.9867869Z * [new branch] gh/PaliC/12/orig -> origin/gh/PaliC/12/orig 2025-08-14T21:18:05.9868391Z * [new branch] gh/PaliC/13/base -> origin/gh/PaliC/13/base 2025-08-14T21:18:05.9869208Z * [new branch] gh/PaliC/13/head -> origin/gh/PaliC/13/head 2025-08-14T21:18:05.9869563Z * [new branch] gh/PaliC/13/orig -> origin/gh/PaliC/13/orig 2025-08-14T21:18:05.9869992Z * [new branch] gh/PaliC/14/base -> origin/gh/PaliC/14/base 2025-08-14T21:18:05.9870801Z * [new branch] gh/PaliC/14/head -> origin/gh/PaliC/14/head 2025-08-14T21:18:05.9871160Z * [new branch] gh/PaliC/14/orig -> origin/gh/PaliC/14/orig 2025-08-14T21:18:05.9871455Z * [new branch] gh/PaliC/15/base -> origin/gh/PaliC/15/base 2025-08-14T21:18:05.9871917Z * [new branch] gh/PaliC/15/head -> origin/gh/PaliC/15/head 2025-08-14T21:18:05.9872747Z * [new branch] gh/PaliC/15/orig -> origin/gh/PaliC/15/orig 2025-08-14T21:18:05.9873236Z * [new branch] gh/PaliC/16/base -> origin/gh/PaliC/16/base 2025-08-14T21:18:05.9874115Z * [new branch] gh/PaliC/16/head -> origin/gh/PaliC/16/head 2025-08-14T21:18:05.9874918Z * [new branch] gh/PaliC/16/orig -> origin/gh/PaliC/16/orig 2025-08-14T21:18:05.9875317Z * [new branch] gh/PaliC/17/base -> origin/gh/PaliC/17/base 2025-08-14T21:18:05.9875883Z * [new branch] gh/PaliC/17/head -> origin/gh/PaliC/17/head 2025-08-14T21:18:05.9876571Z * [new branch] gh/PaliC/17/orig -> origin/gh/PaliC/17/orig 2025-08-14T21:18:05.9878821Z * [new branch] gh/PaliC/18/base -> origin/gh/PaliC/18/base 2025-08-14T21:18:05.9879194Z * [new branch] gh/PaliC/18/head -> origin/gh/PaliC/18/head 2025-08-14T21:18:05.9879490Z * [new branch] gh/PaliC/18/orig -> origin/gh/PaliC/18/orig 2025-08-14T21:18:05.9879930Z * [new branch] gh/PaliC/19/base -> origin/gh/PaliC/19/base 2025-08-14T21:18:05.9880234Z * [new branch] gh/PaliC/19/head -> origin/gh/PaliC/19/head 2025-08-14T21:18:05.9880676Z * [new branch] gh/PaliC/19/orig -> origin/gh/PaliC/19/orig 2025-08-14T21:18:05.9881740Z * [new branch] gh/PaliC/2/base -> origin/gh/PaliC/2/base 2025-08-14T21:18:05.9882082Z * [new branch] gh/PaliC/2/head -> origin/gh/PaliC/2/head 2025-08-14T21:18:05.9882670Z * [new branch] gh/PaliC/2/orig -> origin/gh/PaliC/2/orig 2025-08-14T21:18:05.9884265Z * [new branch] gh/PaliC/20/base -> origin/gh/PaliC/20/base 2025-08-14T21:18:05.9884928Z * [new branch] gh/PaliC/20/head -> origin/gh/PaliC/20/head 2025-08-14T21:18:05.9885227Z * [new branch] gh/PaliC/20/orig -> origin/gh/PaliC/20/orig 2025-08-14T21:18:05.9885815Z * [new branch] gh/PaliC/21/base -> origin/gh/PaliC/21/base 2025-08-14T21:18:05.9886441Z * [new branch] gh/PaliC/21/head -> origin/gh/PaliC/21/head 2025-08-14T21:18:05.9886981Z * [new branch] gh/PaliC/21/orig -> origin/gh/PaliC/21/orig 2025-08-14T21:18:05.9888806Z * [new branch] gh/PaliC/22/base -> origin/gh/PaliC/22/base 2025-08-14T21:18:05.9889320Z * [new branch] gh/PaliC/22/head -> origin/gh/PaliC/22/head 2025-08-14T21:18:05.9889749Z * [new branch] gh/PaliC/22/orig -> origin/gh/PaliC/22/orig 2025-08-14T21:18:05.9890549Z * [new branch] gh/PaliC/23/base -> origin/gh/PaliC/23/base 2025-08-14T21:18:05.9890917Z * [new branch] gh/PaliC/23/head -> origin/gh/PaliC/23/head 2025-08-14T21:18:05.9891391Z * [new branch] gh/PaliC/23/orig -> origin/gh/PaliC/23/orig 2025-08-14T21:18:05.9891816Z * [new branch] gh/PaliC/24/base -> origin/gh/PaliC/24/base 2025-08-14T21:18:05.9892456Z * [new branch] gh/PaliC/24/head -> origin/gh/PaliC/24/head 2025-08-14T21:18:05.9892966Z * [new branch] gh/PaliC/24/orig -> origin/gh/PaliC/24/orig 2025-08-14T21:18:05.9895609Z * [new branch] gh/PaulZhang12/17/base -> origin/gh/PaulZhang12/17/base 2025-08-14T21:18:05.9896188Z * [new branch] gh/PaulZhang12/17/head -> origin/gh/PaulZhang12/17/head 2025-08-14T21:18:05.9896532Z * [new branch] gh/PaulZhang12/18/base -> origin/gh/PaulZhang12/18/base 2025-08-14T21:18:05.9896863Z * [new branch] gh/PaulZhang12/18/head -> origin/gh/PaulZhang12/18/head 2025-08-14T21:18:05.9897187Z * [new branch] gh/PaulZhang12/18/orig -> origin/gh/PaulZhang12/18/orig 2025-08-14T21:18:05.9898507Z * [new branch] gh/PaulZhang12/19/base -> origin/gh/PaulZhang12/19/base 2025-08-14T21:18:05.9898890Z * [new branch] gh/PaulZhang12/19/head -> origin/gh/PaulZhang12/19/head 2025-08-14T21:18:05.9899242Z * [new branch] gh/PaulZhang12/19/orig -> origin/gh/PaulZhang12/19/orig 2025-08-14T21:18:05.9900515Z * [new branch] gh/PaulZhang12/20/base -> origin/gh/PaulZhang12/20/base 2025-08-14T21:18:05.9900850Z * [new branch] gh/PaulZhang12/20/head -> origin/gh/PaulZhang12/20/head 2025-08-14T21:18:05.9901465Z * [new branch] gh/PaulZhang12/20/orig -> origin/gh/PaulZhang12/20/orig 2025-08-14T21:18:05.9902664Z * [new branch] gh/PaulZhang12/21/base -> origin/gh/PaulZhang12/21/base 2025-08-14T21:18:05.9902984Z * [new branch] gh/PaulZhang12/21/head -> origin/gh/PaulZhang12/21/head 2025-08-14T21:18:05.9903669Z * [new branch] gh/PaulZhang12/21/orig -> origin/gh/PaulZhang12/21/orig 2025-08-14T21:18:05.9905057Z * [new branch] gh/PaulZhang12/22/base -> origin/gh/PaulZhang12/22/base 2025-08-14T21:18:05.9908244Z * [new branch] gh/PaulZhang12/22/head -> origin/gh/PaulZhang12/22/head 2025-08-14T21:18:05.9908791Z * [new branch] gh/PaulZhang12/22/orig -> origin/gh/PaulZhang12/22/orig 2025-08-14T21:18:05.9909243Z * [new branch] gh/SamGinzburg/11/base -> origin/gh/SamGinzburg/11/base 2025-08-14T21:18:05.9910012Z * [new branch] gh/SamGinzburg/11/head -> origin/gh/SamGinzburg/11/head 2025-08-14T21:18:05.9910424Z * [new branch] gh/Sidharth123-cpu/24/base -> origin/gh/Sidharth123-cpu/24/base 2025-08-14T21:18:05.9910784Z * [new branch] gh/Sidharth123-cpu/25/base -> origin/gh/Sidharth123-cpu/25/base 2025-08-14T21:18:05.9911300Z * [new branch] gh/Sidharth123-cpu/26/base -> origin/gh/Sidharth123-cpu/26/base 2025-08-14T21:18:05.9911643Z * [new branch] gh/Sidharth123-cpu/27/base -> origin/gh/Sidharth123-cpu/27/base 2025-08-14T21:18:05.9912111Z * [new branch] gh/Sidharth123-cpu/42/base -> origin/gh/Sidharth123-cpu/42/base 2025-08-14T21:18:05.9912807Z * [new branch] gh/Sidharth123-cpu/42/head -> origin/gh/Sidharth123-cpu/42/head 2025-08-14T21:18:05.9913310Z * [new branch] gh/Sidharth123-cpu/42/orig -> origin/gh/Sidharth123-cpu/42/orig 2025-08-14T21:18:05.9915092Z * [new branch] gh/Sidharth123-cpu/43/base -> origin/gh/Sidharth123-cpu/43/base 2025-08-14T21:18:05.9915652Z * [new branch] gh/Sidharth123-cpu/43/head -> origin/gh/Sidharth123-cpu/43/head 2025-08-14T21:18:05.9916131Z * [new branch] gh/Sidharth123-cpu/43/orig -> origin/gh/Sidharth123-cpu/43/orig 2025-08-14T21:18:05.9916486Z * [new branch] gh/Sidharth123-cpu/44/base -> origin/gh/Sidharth123-cpu/44/base 2025-08-14T21:18:05.9917350Z * [new branch] gh/Sidharth123-cpu/44/head -> origin/gh/Sidharth123-cpu/44/head 2025-08-14T21:18:05.9918046Z * [new branch] gh/Sidharth123-cpu/44/orig -> origin/gh/Sidharth123-cpu/44/orig 2025-08-14T21:18:05.9918717Z * [new branch] gh/Sidharth123-cpu/45/base -> origin/gh/Sidharth123-cpu/45/base 2025-08-14T21:18:05.9919286Z * [new branch] gh/Sidharth123-cpu/45/head -> origin/gh/Sidharth123-cpu/45/head 2025-08-14T21:18:05.9920154Z * [new branch] gh/Sidharth123-cpu/45/orig -> origin/gh/Sidharth123-cpu/45/orig 2025-08-14T21:18:05.9921228Z * [new branch] gh/StrongerXi/1/base -> origin/gh/StrongerXi/1/base 2025-08-14T21:18:05.9921559Z * [new branch] gh/StrongerXi/1/head -> origin/gh/StrongerXi/1/head 2025-08-14T21:18:05.9923545Z * [new branch] gh/StrongerXi/103/base -> origin/gh/StrongerXi/103/base 2025-08-14T21:18:05.9924100Z * [new branch] gh/StrongerXi/103/head -> origin/gh/StrongerXi/103/head 2025-08-14T21:18:05.9924553Z * [new branch] gh/StrongerXi/103/orig -> origin/gh/StrongerXi/103/orig 2025-08-14T21:18:05.9924871Z * [new branch] gh/StrongerXi/133/base -> origin/gh/StrongerXi/133/base 2025-08-14T21:18:05.9925190Z * [new branch] gh/StrongerXi/133/head -> origin/gh/StrongerXi/133/head 2025-08-14T21:18:05.9926021Z * [new branch] gh/StrongerXi/133/orig -> origin/gh/StrongerXi/133/orig 2025-08-14T21:18:05.9926758Z * [new branch] gh/StrongerXi/134/base -> origin/gh/StrongerXi/134/base 2025-08-14T21:18:05.9927305Z * [new branch] gh/StrongerXi/134/head -> origin/gh/StrongerXi/134/head 2025-08-14T21:18:05.9927992Z * [new branch] gh/StrongerXi/134/orig -> origin/gh/StrongerXi/134/orig 2025-08-14T21:18:05.9928940Z * [new branch] gh/StrongerXi/135/base -> origin/gh/StrongerXi/135/base 2025-08-14T21:18:05.9929449Z * [new branch] gh/StrongerXi/135/head -> origin/gh/StrongerXi/135/head 2025-08-14T21:18:05.9930058Z * [new branch] gh/StrongerXi/135/orig -> origin/gh/StrongerXi/135/orig 2025-08-14T21:18:05.9931203Z * [new branch] gh/StrongerXi/136/base -> origin/gh/StrongerXi/136/base 2025-08-14T21:18:05.9931523Z * [new branch] gh/StrongerXi/136/head -> origin/gh/StrongerXi/136/head 2025-08-14T21:18:05.9931985Z * [new branch] gh/StrongerXi/136/orig -> origin/gh/StrongerXi/136/orig 2025-08-14T21:18:05.9933421Z * [new branch] gh/StrongerXi/137/base -> origin/gh/StrongerXi/137/base 2025-08-14T21:18:05.9933863Z * [new branch] gh/StrongerXi/137/head -> origin/gh/StrongerXi/137/head 2025-08-14T21:18:05.9934261Z * [new branch] gh/StrongerXi/137/orig -> origin/gh/StrongerXi/137/orig 2025-08-14T21:18:05.9935060Z * [new branch] gh/StrongerXi/138/base -> origin/gh/StrongerXi/138/base 2025-08-14T21:18:05.9935483Z * [new branch] gh/StrongerXi/138/head -> origin/gh/StrongerXi/138/head 2025-08-14T21:18:05.9935963Z * [new branch] gh/StrongerXi/138/orig -> origin/gh/StrongerXi/138/orig 2025-08-14T21:18:05.9937599Z * [new branch] gh/StrongerXi/71/base -> origin/gh/StrongerXi/71/base 2025-08-14T21:18:05.9937975Z * [new branch] gh/StrongerXi/71/head -> origin/gh/StrongerXi/71/head 2025-08-14T21:18:05.9938346Z * [new branch] gh/StrongerXi/72/base -> origin/gh/StrongerXi/72/base 2025-08-14T21:18:05.9938861Z * [new branch] gh/StrongerXi/72/head -> origin/gh/StrongerXi/72/head 2025-08-14T21:18:05.9940362Z * [new branch] gh/XilunWu/131/base -> origin/gh/XilunWu/131/base 2025-08-14T21:18:05.9940862Z * [new branch] gh/XilunWu/131/head -> origin/gh/XilunWu/131/head 2025-08-14T21:18:05.9941422Z * [new branch] gh/XilunWu/131/orig -> origin/gh/XilunWu/131/orig 2025-08-14T21:18:05.9942651Z * [new branch] gh/XilunWu/133/base -> origin/gh/XilunWu/133/base 2025-08-14T21:18:05.9942960Z * [new branch] gh/XilunWu/133/head -> origin/gh/XilunWu/133/head 2025-08-14T21:18:05.9943614Z * [new branch] gh/XilunWu/133/orig -> origin/gh/XilunWu/133/orig 2025-08-14T21:18:05.9944752Z * [new branch] gh/XilunWu/136/base -> origin/gh/XilunWu/136/base 2025-08-14T21:18:05.9945348Z * [new branch] gh/XilunWu/136/head -> origin/gh/XilunWu/136/head 2025-08-14T21:18:05.9945681Z * [new branch] gh/XilunWu/136/orig -> origin/gh/XilunWu/136/orig 2025-08-14T21:18:05.9947740Z * [new branch] gh/XilunWu/139/base -> origin/gh/XilunWu/139/base 2025-08-14T21:18:05.9948110Z * [new branch] gh/XilunWu/139/head -> origin/gh/XilunWu/139/head 2025-08-14T21:18:05.9948444Z * [new branch] gh/XilunWu/139/orig -> origin/gh/XilunWu/139/orig 2025-08-14T21:18:05.9948785Z * [new branch] gh/XilunWu/143/base -> origin/gh/XilunWu/143/base 2025-08-14T21:18:05.9949429Z * [new branch] gh/XilunWu/143/head -> origin/gh/XilunWu/143/head 2025-08-14T21:18:05.9949964Z * [new branch] gh/XilunWu/143/orig -> origin/gh/XilunWu/143/orig 2025-08-14T21:18:05.9951282Z * [new branch] gh/XilunWu/144/base -> origin/gh/XilunWu/144/base 2025-08-14T21:18:05.9951666Z * [new branch] gh/XilunWu/144/head -> origin/gh/XilunWu/144/head 2025-08-14T21:18:05.9952241Z * [new branch] gh/XilunWu/144/orig -> origin/gh/XilunWu/144/orig 2025-08-14T21:18:05.9953429Z * [new branch] gh/XilunWu/145/base -> origin/gh/XilunWu/145/base 2025-08-14T21:18:05.9953725Z * [new branch] gh/XilunWu/145/head -> origin/gh/XilunWu/145/head 2025-08-14T21:18:05.9954299Z * [new branch] gh/XilunWu/145/orig -> origin/gh/XilunWu/145/orig 2025-08-14T21:18:05.9955172Z * [new branch] gh/XilunWu/146/base -> origin/gh/XilunWu/146/base 2025-08-14T21:18:05.9955575Z * [new branch] gh/XilunWu/146/head -> origin/gh/XilunWu/146/head 2025-08-14T21:18:05.9956327Z * [new branch] gh/XilunWu/146/orig -> origin/gh/XilunWu/146/orig 2025-08-14T21:18:05.9957048Z * [new branch] gh/XilunWu/147/base -> origin/gh/XilunWu/147/base 2025-08-14T21:18:05.9957747Z * [new branch] gh/XilunWu/147/head -> origin/gh/XilunWu/147/head 2025-08-14T21:18:05.9958439Z * [new branch] gh/XilunWu/147/orig -> origin/gh/XilunWu/147/orig 2025-08-14T21:18:05.9959162Z * [new branch] gh/XilunWu/148/base -> origin/gh/XilunWu/148/base 2025-08-14T21:18:05.9959672Z * [new branch] gh/XilunWu/148/head -> origin/gh/XilunWu/148/head 2025-08-14T21:18:05.9960364Z * [new branch] gh/XilunWu/148/orig -> origin/gh/XilunWu/148/orig 2025-08-14T21:18:05.9961399Z * [new branch] gh/XilunWu/149/base -> origin/gh/XilunWu/149/base 2025-08-14T21:18:05.9961952Z * [new branch] gh/XilunWu/149/head -> origin/gh/XilunWu/149/head 2025-08-14T21:18:05.9962420Z * [new branch] gh/XilunWu/149/orig -> origin/gh/XilunWu/149/orig 2025-08-14T21:18:05.9963173Z * [new branch] gh/XilunWu/150/base -> origin/gh/XilunWu/150/base 2025-08-14T21:18:05.9963768Z * [new branch] gh/XilunWu/150/head -> origin/gh/XilunWu/150/head 2025-08-14T21:18:05.9964338Z * [new branch] gh/XilunWu/150/orig -> origin/gh/XilunWu/150/orig 2025-08-14T21:18:05.9965465Z * [new branch] gh/XilunWu/151/base -> origin/gh/XilunWu/151/base 2025-08-14T21:18:05.9965796Z * [new branch] gh/XilunWu/151/head -> origin/gh/XilunWu/151/head 2025-08-14T21:18:05.9966515Z * [new branch] gh/XilunWu/151/orig -> origin/gh/XilunWu/151/orig 2025-08-14T21:18:05.9967666Z * [new branch] gh/XilunWu/152/base -> origin/gh/XilunWu/152/base 2025-08-14T21:18:05.9967973Z * [new branch] gh/XilunWu/152/head -> origin/gh/XilunWu/152/head 2025-08-14T21:18:05.9968463Z * [new branch] gh/XilunWu/152/orig -> origin/gh/XilunWu/152/orig 2025-08-14T21:18:05.9969672Z * [new branch] gh/XilunWu/153/base -> origin/gh/XilunWu/153/base 2025-08-14T21:18:05.9969977Z * [new branch] gh/XilunWu/153/head -> origin/gh/XilunWu/153/head 2025-08-14T21:18:05.9970564Z * [new branch] gh/XilunWu/153/orig -> origin/gh/XilunWu/153/orig 2025-08-14T21:18:05.9971650Z * [new branch] gh/XilunWu/154/base -> origin/gh/XilunWu/154/base 2025-08-14T21:18:05.9972109Z * [new branch] gh/XilunWu/154/head -> origin/gh/XilunWu/154/head 2025-08-14T21:18:05.9972756Z * [new branch] gh/XilunWu/154/orig -> origin/gh/XilunWu/154/orig 2025-08-14T21:18:05.9974179Z * [new branch] gh/XilunWu/156/base -> origin/gh/XilunWu/156/base 2025-08-14T21:18:05.9974631Z * [new branch] gh/XilunWu/156/head -> origin/gh/XilunWu/156/head 2025-08-14T21:18:05.9975329Z * [new branch] gh/XilunWu/156/orig -> origin/gh/XilunWu/156/orig 2025-08-14T21:18:05.9976415Z * [new branch] gh/XilunWu/157/base -> origin/gh/XilunWu/157/base 2025-08-14T21:18:05.9976842Z * [new branch] gh/XilunWu/157/head -> origin/gh/XilunWu/157/head 2025-08-14T21:18:05.9977423Z * [new branch] gh/XilunWu/157/orig -> origin/gh/XilunWu/157/orig 2025-08-14T21:18:05.9978531Z * [new branch] gh/XilunWu/158/base -> origin/gh/XilunWu/158/base 2025-08-14T21:18:05.9979046Z * [new branch] gh/XilunWu/158/head -> origin/gh/XilunWu/158/head 2025-08-14T21:18:05.9979671Z * [new branch] gh/XilunWu/158/orig -> origin/gh/XilunWu/158/orig 2025-08-14T21:18:05.9980915Z * [new branch] gh/XilunWu/159/base -> origin/gh/XilunWu/159/base 2025-08-14T21:18:05.9981407Z * [new branch] gh/XilunWu/159/head -> origin/gh/XilunWu/159/head 2025-08-14T21:18:05.9982044Z * [new branch] gh/XilunWu/159/orig -> origin/gh/XilunWu/159/orig 2025-08-14T21:18:05.9983086Z * [new branch] gh/XilunWu/160/base -> origin/gh/XilunWu/160/base 2025-08-14T21:18:05.9983572Z * [new branch] gh/XilunWu/160/head -> origin/gh/XilunWu/160/head 2025-08-14T21:18:05.9984342Z * [new branch] gh/XilunWu/160/orig -> origin/gh/XilunWu/160/orig 2025-08-14T21:18:05.9988419Z * [new branch] gh/XilunWu/161/base -> origin/gh/XilunWu/161/base 2025-08-14T21:18:05.9989246Z * [new branch] gh/XilunWu/161/head -> origin/gh/XilunWu/161/head 2025-08-14T21:18:05.9990089Z * [new branch] gh/XilunWu/161/orig -> origin/gh/XilunWu/161/orig 2025-08-14T21:18:05.9991275Z * [new branch] gh/XilunWu/162/base -> origin/gh/XilunWu/162/base 2025-08-14T21:18:05.9991907Z * [new branch] gh/XilunWu/162/head -> origin/gh/XilunWu/162/head 2025-08-14T21:18:05.9992547Z * [new branch] gh/XilunWu/162/orig -> origin/gh/XilunWu/162/orig 2025-08-14T21:18:05.9993710Z * [new branch] gh/XilunWu/163/base -> origin/gh/XilunWu/163/base 2025-08-14T21:18:05.9994145Z * [new branch] gh/XilunWu/163/head -> origin/gh/XilunWu/163/head 2025-08-14T21:18:05.9995143Z * [new branch] gh/XilunWu/163/orig -> origin/gh/XilunWu/163/orig 2025-08-14T21:18:05.9996230Z * [new branch] gh/XuehaiPan/14/base -> origin/gh/XuehaiPan/14/base 2025-08-14T21:18:05.9996697Z * [new branch] gh/XuehaiPan/14/head -> origin/gh/XuehaiPan/14/head 2025-08-14T21:18:05.9997302Z * [new branch] gh/XuehaiPan/14/orig -> origin/gh/XuehaiPan/14/orig 2025-08-14T21:18:05.9998496Z * [new branch] gh/XuehaiPan/179/base -> origin/gh/XuehaiPan/179/base 2025-08-14T21:18:05.9998920Z * [new branch] gh/XuehaiPan/179/head -> origin/gh/XuehaiPan/179/head 2025-08-14T21:18:05.9999600Z * [new branch] gh/XuehaiPan/179/orig -> origin/gh/XuehaiPan/179/orig 2025-08-14T21:18:06.0000708Z * [new branch] gh/XuehaiPan/189/base -> origin/gh/XuehaiPan/189/base 2025-08-14T21:18:06.0001110Z * [new branch] gh/XuehaiPan/189/head -> origin/gh/XuehaiPan/189/head 2025-08-14T21:18:06.0001583Z * [new branch] gh/XuehaiPan/189/orig -> origin/gh/XuehaiPan/189/orig 2025-08-14T21:18:06.0003366Z * [new branch] gh/XuehaiPan/227/base -> origin/gh/XuehaiPan/227/base 2025-08-14T21:18:06.0003746Z * [new branch] gh/XuehaiPan/227/head -> origin/gh/XuehaiPan/227/head 2025-08-14T21:18:06.0004076Z * [new branch] gh/XuehaiPan/227/orig -> origin/gh/XuehaiPan/227/orig 2025-08-14T21:18:06.0004718Z * [new branch] gh/XuehaiPan/231/base -> origin/gh/XuehaiPan/231/base 2025-08-14T21:18:06.0005256Z * [new branch] gh/XuehaiPan/231/head -> origin/gh/XuehaiPan/231/head 2025-08-14T21:18:06.0005896Z * [new branch] gh/XuehaiPan/231/orig -> origin/gh/XuehaiPan/231/orig 2025-08-14T21:18:06.0006848Z * [new branch] gh/XuehaiPan/232/base -> origin/gh/XuehaiPan/232/base 2025-08-14T21:18:06.0007277Z * [new branch] gh/XuehaiPan/232/head -> origin/gh/XuehaiPan/232/head 2025-08-14T21:18:06.0007958Z * [new branch] gh/XuehaiPan/232/orig -> origin/gh/XuehaiPan/232/orig 2025-08-14T21:18:06.0009187Z * [new branch] gh/XuehaiPan/249/base -> origin/gh/XuehaiPan/249/base 2025-08-14T21:18:06.0009508Z * [new branch] gh/XuehaiPan/249/head -> origin/gh/XuehaiPan/249/head 2025-08-14T21:18:06.0010175Z * [new branch] gh/XuehaiPan/249/orig -> origin/gh/XuehaiPan/249/orig 2025-08-14T21:18:06.0011004Z * [new branch] gh/XuehaiPan/253/base -> origin/gh/XuehaiPan/253/base 2025-08-14T21:18:06.0011495Z * [new branch] gh/XuehaiPan/253/head -> origin/gh/XuehaiPan/253/head 2025-08-14T21:18:06.0012167Z * [new branch] gh/XuehaiPan/253/orig -> origin/gh/XuehaiPan/253/orig 2025-08-14T21:18:06.0013210Z * [new branch] gh/XuehaiPan/254/base -> origin/gh/XuehaiPan/254/base 2025-08-14T21:18:06.0013528Z * [new branch] gh/XuehaiPan/254/head -> origin/gh/XuehaiPan/254/head 2025-08-14T21:18:06.0014202Z * [new branch] gh/XuehaiPan/254/orig -> origin/gh/XuehaiPan/254/orig 2025-08-14T21:18:06.0015062Z * [new branch] gh/XuehaiPan/255/base -> origin/gh/XuehaiPan/255/base 2025-08-14T21:18:06.0029346Z * [new branch] gh/XuehaiPan/255/head -> origin/gh/XuehaiPan/255/head 2025-08-14T21:18:06.0029738Z * [new branch] gh/XuehaiPan/255/orig -> origin/gh/XuehaiPan/255/orig 2025-08-14T21:18:06.0030058Z * [new branch] gh/XuehaiPan/257/base -> origin/gh/XuehaiPan/257/base 2025-08-14T21:18:06.0030363Z * [new branch] gh/XuehaiPan/257/head -> origin/gh/XuehaiPan/257/head 2025-08-14T21:18:06.0030690Z * [new branch] gh/XuehaiPan/257/orig -> origin/gh/XuehaiPan/257/orig 2025-08-14T21:18:06.0031000Z * [new branch] gh/XuehaiPan/271/base -> origin/gh/XuehaiPan/271/base 2025-08-14T21:18:06.0031310Z * [new branch] gh/XuehaiPan/271/head -> origin/gh/XuehaiPan/271/head 2025-08-14T21:18:06.0031616Z * [new branch] gh/XuehaiPan/271/orig -> origin/gh/XuehaiPan/271/orig 2025-08-14T21:18:06.0031920Z * [new branch] gh/XuehaiPan/283/base -> origin/gh/XuehaiPan/283/base 2025-08-14T21:18:06.0032223Z * [new branch] gh/XuehaiPan/283/head -> origin/gh/XuehaiPan/283/head 2025-08-14T21:18:06.0032520Z * [new branch] gh/XuehaiPan/283/orig -> origin/gh/XuehaiPan/283/orig 2025-08-14T21:18:06.0032826Z * [new branch] gh/XuehaiPan/290/base -> origin/gh/XuehaiPan/290/base 2025-08-14T21:18:06.0033254Z * [new branch] gh/XuehaiPan/290/head -> origin/gh/XuehaiPan/290/head 2025-08-14T21:18:06.0033570Z * [new branch] gh/XuehaiPan/290/orig -> origin/gh/XuehaiPan/290/orig 2025-08-14T21:18:06.0033867Z * [new branch] gh/XuehaiPan/328/base -> origin/gh/XuehaiPan/328/base 2025-08-14T21:18:06.0034169Z * [new branch] gh/XuehaiPan/328/head -> origin/gh/XuehaiPan/328/head 2025-08-14T21:18:06.0034483Z * [new branch] gh/XuehaiPan/328/orig -> origin/gh/XuehaiPan/328/orig 2025-08-14T21:18:06.0034790Z * [new branch] gh/XuehaiPan/339/base -> origin/gh/XuehaiPan/339/base 2025-08-14T21:18:06.0035090Z * [new branch] gh/XuehaiPan/339/head -> origin/gh/XuehaiPan/339/head 2025-08-14T21:18:06.0035392Z * [new branch] gh/XuehaiPan/339/orig -> origin/gh/XuehaiPan/339/orig 2025-08-14T21:18:06.0035693Z * [new branch] gh/XuehaiPan/343/base -> origin/gh/XuehaiPan/343/base 2025-08-14T21:18:06.0035996Z * [new branch] gh/XuehaiPan/343/head -> origin/gh/XuehaiPan/343/head 2025-08-14T21:18:06.0036292Z * [new branch] gh/XuehaiPan/343/orig -> origin/gh/XuehaiPan/343/orig 2025-08-14T21:18:06.0036603Z * [new branch] gh/XuehaiPan/344/base -> origin/gh/XuehaiPan/344/base 2025-08-14T21:18:06.0036903Z * [new branch] gh/XuehaiPan/344/head -> origin/gh/XuehaiPan/344/head 2025-08-14T21:18:06.0037210Z * [new branch] gh/XuehaiPan/344/orig -> origin/gh/XuehaiPan/344/orig 2025-08-14T21:18:06.0037512Z * [new branch] gh/XuehaiPan/345/base -> origin/gh/XuehaiPan/345/base 2025-08-14T21:18:06.0037814Z * [new branch] gh/XuehaiPan/345/head -> origin/gh/XuehaiPan/345/head 2025-08-14T21:18:06.0038117Z * [new branch] gh/XuehaiPan/345/orig -> origin/gh/XuehaiPan/345/orig 2025-08-14T21:18:06.0038415Z * [new branch] gh/XuehaiPan/346/base -> origin/gh/XuehaiPan/346/base 2025-08-14T21:18:06.0038726Z * [new branch] gh/XuehaiPan/346/head -> origin/gh/XuehaiPan/346/head 2025-08-14T21:18:06.0039028Z * [new branch] gh/XuehaiPan/346/orig -> origin/gh/XuehaiPan/346/orig 2025-08-14T21:18:06.0039871Z * [new branch] gh/XuehaiPan/347/base -> origin/gh/XuehaiPan/347/base 2025-08-14T21:18:06.0040412Z * [new branch] gh/XuehaiPan/347/head -> origin/gh/XuehaiPan/347/head 2025-08-14T21:18:06.0040733Z * [new branch] gh/XuehaiPan/347/orig -> origin/gh/XuehaiPan/347/orig 2025-08-14T21:18:06.0041040Z * [new branch] gh/XuehaiPan/348/base -> origin/gh/XuehaiPan/348/base 2025-08-14T21:18:06.0041338Z * [new branch] gh/XuehaiPan/348/head -> origin/gh/XuehaiPan/348/head 2025-08-14T21:18:06.0041693Z * [new branch] gh/XuehaiPan/348/orig -> origin/gh/XuehaiPan/348/orig 2025-08-14T21:18:06.0042154Z * [new branch] gh/XuehaiPan/350/base -> origin/gh/XuehaiPan/350/base 2025-08-14T21:18:06.0042591Z * [new branch] gh/XuehaiPan/350/head -> origin/gh/XuehaiPan/350/head 2025-08-14T21:18:06.0043037Z * [new branch] gh/XuehaiPan/350/orig -> origin/gh/XuehaiPan/350/orig 2025-08-14T21:18:06.0043360Z * [new branch] gh/XuehaiPan/352/base -> origin/gh/XuehaiPan/352/base 2025-08-14T21:18:06.0043679Z * [new branch] gh/XuehaiPan/352/head -> origin/gh/XuehaiPan/352/head 2025-08-14T21:18:06.0043988Z * [new branch] gh/XuehaiPan/352/orig -> origin/gh/XuehaiPan/352/orig 2025-08-14T21:18:06.0048299Z * [new branch] gh/XuehaiPan/356/base -> origin/gh/XuehaiPan/356/base 2025-08-14T21:18:06.0048834Z * [new branch] gh/XuehaiPan/356/head -> origin/gh/XuehaiPan/356/head 2025-08-14T21:18:06.0049287Z * [new branch] gh/XuehaiPan/356/orig -> origin/gh/XuehaiPan/356/orig 2025-08-14T21:18:06.0049759Z * [new branch] gh/XuehaiPan/357/base -> origin/gh/XuehaiPan/357/base 2025-08-14T21:18:06.0050094Z * [new branch] gh/XuehaiPan/357/head -> origin/gh/XuehaiPan/357/head 2025-08-14T21:18:06.0050397Z * [new branch] gh/XuehaiPan/357/orig -> origin/gh/XuehaiPan/357/orig 2025-08-14T21:18:06.0050704Z * [new branch] gh/XuehaiPan/358/base -> origin/gh/XuehaiPan/358/base 2025-08-14T21:18:06.0051018Z * [new branch] gh/XuehaiPan/358/head -> origin/gh/XuehaiPan/358/head 2025-08-14T21:18:06.0051321Z * [new branch] gh/XuehaiPan/358/orig -> origin/gh/XuehaiPan/358/orig 2025-08-14T21:18:06.0051614Z * [new branch] gh/XuehaiPan/359/base -> origin/gh/XuehaiPan/359/base 2025-08-14T21:18:06.0051913Z * [new branch] gh/XuehaiPan/359/head -> origin/gh/XuehaiPan/359/head 2025-08-14T21:18:06.0052215Z * [new branch] gh/XuehaiPan/359/orig -> origin/gh/XuehaiPan/359/orig 2025-08-14T21:18:06.0052815Z * [new branch] gh/XuehaiPan/360/base -> origin/gh/XuehaiPan/360/base 2025-08-14T21:18:06.0053133Z * [new branch] gh/XuehaiPan/360/head -> origin/gh/XuehaiPan/360/head 2025-08-14T21:18:06.0053442Z * [new branch] gh/XuehaiPan/360/orig -> origin/gh/XuehaiPan/360/orig 2025-08-14T21:18:06.0053878Z * [new branch] gh/XuehaiPan/365/base -> origin/gh/XuehaiPan/365/base 2025-08-14T21:18:06.0054320Z * [new branch] gh/XuehaiPan/365/head -> origin/gh/XuehaiPan/365/head 2025-08-14T21:18:06.0054949Z * [new branch] gh/XuehaiPan/365/orig -> origin/gh/XuehaiPan/365/orig 2025-08-14T21:18:06.0058799Z * [new branch] gh/XuehaiPan/366/base -> origin/gh/XuehaiPan/366/base 2025-08-14T21:18:06.0059322Z * [new branch] gh/XuehaiPan/366/head -> origin/gh/XuehaiPan/366/head 2025-08-14T21:18:06.0060212Z * [new branch] gh/XuehaiPan/368/base -> origin/gh/XuehaiPan/368/base 2025-08-14T21:18:06.0060763Z * [new branch] gh/XuehaiPan/368/head -> origin/gh/XuehaiPan/368/head 2025-08-14T21:18:06.0061554Z * [new branch] gh/XuehaiPan/368/orig -> origin/gh/XuehaiPan/368/orig 2025-08-14T21:18:06.0061928Z * [new branch] gh/XuehaiPan/369/base -> origin/gh/XuehaiPan/369/base 2025-08-14T21:18:06.0062399Z * [new branch] gh/XuehaiPan/369/head -> origin/gh/XuehaiPan/369/head 2025-08-14T21:18:06.0062714Z * [new branch] gh/XuehaiPan/369/orig -> origin/gh/XuehaiPan/369/orig 2025-08-14T21:18:06.0063025Z * [new branch] gh/XuehaiPan/370/base -> origin/gh/XuehaiPan/370/base 2025-08-14T21:18:06.0063325Z * [new branch] gh/XuehaiPan/370/head -> origin/gh/XuehaiPan/370/head 2025-08-14T21:18:06.0063630Z * [new branch] gh/XuehaiPan/370/orig -> origin/gh/XuehaiPan/370/orig 2025-08-14T21:18:06.0063941Z * [new branch] gh/XuehaiPan/371/base -> origin/gh/XuehaiPan/371/base 2025-08-14T21:18:06.0064362Z * [new branch] gh/XuehaiPan/371/head -> origin/gh/XuehaiPan/371/head 2025-08-14T21:18:06.0064671Z * [new branch] gh/XuehaiPan/371/orig -> origin/gh/XuehaiPan/371/orig 2025-08-14T21:18:06.0064975Z * [new branch] gh/XuehaiPan/372/base -> origin/gh/XuehaiPan/372/base 2025-08-14T21:18:06.0065285Z * [new branch] gh/XuehaiPan/372/head -> origin/gh/XuehaiPan/372/head 2025-08-14T21:18:06.0065764Z * [new branch] gh/XuehaiPan/372/orig -> origin/gh/XuehaiPan/372/orig 2025-08-14T21:18:06.0066190Z * [new branch] gh/XuehaiPan/373/base -> origin/gh/XuehaiPan/373/base 2025-08-14T21:18:06.0066622Z * [new branch] gh/XuehaiPan/373/head -> origin/gh/XuehaiPan/373/head 2025-08-14T21:18:06.0067058Z * [new branch] gh/XuehaiPan/373/orig -> origin/gh/XuehaiPan/373/orig 2025-08-14T21:18:06.0071133Z * [new branch] gh/XuehaiPan/374/base -> origin/gh/XuehaiPan/374/base 2025-08-14T21:18:06.0071673Z * [new branch] gh/XuehaiPan/374/head -> origin/gh/XuehaiPan/374/head 2025-08-14T21:18:06.0072120Z * [new branch] gh/XuehaiPan/374/orig -> origin/gh/XuehaiPan/374/orig 2025-08-14T21:18:06.0072452Z * [new branch] gh/XuehaiPan/375/base -> origin/gh/XuehaiPan/375/base 2025-08-14T21:18:06.0072769Z * [new branch] gh/XuehaiPan/375/head -> origin/gh/XuehaiPan/375/head 2025-08-14T21:18:06.0073073Z * [new branch] gh/XuehaiPan/375/orig -> origin/gh/XuehaiPan/375/orig 2025-08-14T21:18:06.0073370Z * [new branch] gh/XuehaiPan/376/base -> origin/gh/XuehaiPan/376/base 2025-08-14T21:18:06.0073676Z * [new branch] gh/XuehaiPan/376/head -> origin/gh/XuehaiPan/376/head 2025-08-14T21:18:06.0074132Z * [new branch] gh/XuehaiPan/376/orig -> origin/gh/XuehaiPan/376/orig 2025-08-14T21:18:06.0074446Z * [new branch] gh/XuehaiPan/377/base -> origin/gh/XuehaiPan/377/base 2025-08-14T21:18:06.0075046Z * [new branch] gh/XuehaiPan/377/head -> origin/gh/XuehaiPan/377/head 2025-08-14T21:18:06.0075441Z * [new branch] gh/XuehaiPan/377/orig -> origin/gh/XuehaiPan/377/orig 2025-08-14T21:18:06.0076640Z * [new branch] gh/XuehaiPan/378/base -> origin/gh/XuehaiPan/378/base 2025-08-14T21:18:06.0077415Z * [new branch] gh/XuehaiPan/378/head -> origin/gh/XuehaiPan/378/head 2025-08-14T21:18:06.0077940Z * [new branch] gh/XuehaiPan/378/orig -> origin/gh/XuehaiPan/378/orig 2025-08-14T21:18:06.0078356Z * [new branch] gh/XuehaiPan/379/base -> origin/gh/XuehaiPan/379/base 2025-08-14T21:18:06.0079197Z * [new branch] gh/XuehaiPan/379/head -> origin/gh/XuehaiPan/379/head 2025-08-14T21:18:06.0079633Z * [new branch] gh/XuehaiPan/379/orig -> origin/gh/XuehaiPan/379/orig 2025-08-14T21:18:06.0081253Z * [new branch] gh/ZhiweiYan-96/39/base -> origin/gh/ZhiweiYan-96/39/base 2025-08-14T21:18:06.0081605Z * [new branch] gh/ZhiweiYan-96/39/head -> origin/gh/ZhiweiYan-96/39/head 2025-08-14T21:18:06.0081937Z * [new branch] gh/ZhiweiYan-96/39/orig -> origin/gh/ZhiweiYan-96/39/orig 2025-08-14T21:18:06.0082783Z * [new branch] gh/ZhiweiYan-96/44/base -> origin/gh/ZhiweiYan-96/44/base 2025-08-14T21:18:06.0083304Z * [new branch] gh/ZhiweiYan-96/44/head -> origin/gh/ZhiweiYan-96/44/head 2025-08-14T21:18:06.0084915Z * [new branch] gh/ZhiweiYan-96/45/base -> origin/gh/ZhiweiYan-96/45/base 2025-08-14T21:18:06.0085362Z * [new branch] gh/ZhiweiYan-96/45/head -> origin/gh/ZhiweiYan-96/45/head 2025-08-14T21:18:06.0085807Z * [new branch] gh/ZhiweiYan-96/49/base -> origin/gh/ZhiweiYan-96/49/base 2025-08-14T21:18:06.0086420Z * [new branch] gh/ZhiweiYan-96/49/head -> origin/gh/ZhiweiYan-96/49/head 2025-08-14T21:18:06.0090244Z * [new branch] gh/ZhiweiYan-96/62/base -> origin/gh/ZhiweiYan-96/62/base 2025-08-14T21:18:06.0090774Z * [new branch] gh/ZhiweiYan-96/62/head -> origin/gh/ZhiweiYan-96/62/head 2025-08-14T21:18:06.0091229Z * [new branch] gh/ZhiweiYan-96/64/base -> origin/gh/ZhiweiYan-96/64/base 2025-08-14T21:18:06.0091551Z * [new branch] gh/ZhiweiYan-96/64/head -> origin/gh/ZhiweiYan-96/64/head 2025-08-14T21:18:06.0091871Z * [new branch] gh/ZhiweiYan-96/64/orig -> origin/gh/ZhiweiYan-96/64/orig 2025-08-14T21:18:06.0092183Z * [new branch] gh/ZhiweiYan-96/65/base -> origin/gh/ZhiweiYan-96/65/base 2025-08-14T21:18:06.0092486Z * [new branch] gh/ZhiweiYan-96/65/head -> origin/gh/ZhiweiYan-96/65/head 2025-08-14T21:18:06.0092988Z * [new branch] gh/ZhiweiYan-96/65/orig -> origin/gh/ZhiweiYan-96/65/orig 2025-08-14T21:18:06.0093352Z * [new branch] gh/ZhiweiYan-96/66/base -> origin/gh/ZhiweiYan-96/66/base 2025-08-14T21:18:06.0093852Z * [new branch] gh/ZhiweiYan-96/66/head -> origin/gh/ZhiweiYan-96/66/head 2025-08-14T21:18:06.0095653Z * [new branch] gh/ZhiweiYan-96/67/base -> origin/gh/ZhiweiYan-96/67/base 2025-08-14T21:18:06.0096212Z * [new branch] gh/ZhiweiYan-96/67/head -> origin/gh/ZhiweiYan-96/67/head 2025-08-14T21:18:06.0096663Z * [new branch] gh/ZhiweiYan-96/68/base -> origin/gh/ZhiweiYan-96/68/base 2025-08-14T21:18:06.0097275Z * [new branch] gh/ZhiweiYan-96/68/head -> origin/gh/ZhiweiYan-96/68/head 2025-08-14T21:18:06.0097657Z * [new branch] gh/ZhiweiYan-96/68/orig -> origin/gh/ZhiweiYan-96/68/orig 2025-08-14T21:18:06.0098340Z * [new branch] gh/aakhundov/1/base -> origin/gh/aakhundov/1/base 2025-08-14T21:18:06.0099031Z * [new branch] gh/aakhundov/1/head -> origin/gh/aakhundov/1/head 2025-08-14T21:18:06.0099747Z * [new branch] gh/aakhundov/2/base -> origin/gh/aakhundov/2/base 2025-08-14T21:18:06.0100345Z * [new branch] gh/aakhundov/2/head -> origin/gh/aakhundov/2/head 2025-08-14T21:18:06.0101493Z * [new branch] gh/aditew01/openblas -> origin/gh/aditew01/openblas 2025-08-14T21:18:06.0101822Z * [new branch] gh/aditew01/sbgemm -> origin/gh/aditew01/sbgemm 2025-08-14T21:18:06.0102463Z * [new branch] gh/aditew01/vecbf16 -> origin/gh/aditew01/vecbf16 2025-08-14T21:18:06.0106635Z * [new branch] gh/alexbrauckmann/paddedtensor_faketensor_init -> origin/gh/alexbrauckmann/paddedtensor_faketensor_init 2025-08-14T21:18:06.0107276Z * [new branch] gh/alexbrauckmann/paddedtensor_init -> origin/gh/alexbrauckmann/paddedtensor_init 2025-08-14T21:18:06.0111002Z * [new branch] gh/alexbrauckmann/paddedtensor_meta_init -> origin/gh/alexbrauckmann/paddedtensor_meta_init 2025-08-14T21:18:06.0111624Z * [new branch] gh/alexsamardzic/7/base -> origin/gh/alexsamardzic/7/base 2025-08-14T21:18:06.0111985Z * [new branch] gh/alexsamardzic/7/head -> origin/gh/alexsamardzic/7/head 2025-08-14T21:18:06.0112525Z * [new branch] gh/alexsamardzic/7/orig -> origin/gh/alexsamardzic/7/orig 2025-08-14T21:18:06.0112855Z * [new branch] gh/alexsamardzic/8/base -> origin/gh/alexsamardzic/8/base 2025-08-14T21:18:06.0113173Z * [new branch] gh/alexsamardzic/8/head -> origin/gh/alexsamardzic/8/head 2025-08-14T21:18:06.0113480Z * [new branch] gh/alexsamardzic/8/orig -> origin/gh/alexsamardzic/8/orig 2025-08-14T21:18:06.0113798Z * [new branch] gh/amjames/18/base -> origin/gh/amjames/18/base 2025-08-14T21:18:06.0114112Z * [new branch] gh/amjames/18/head -> origin/gh/amjames/18/head 2025-08-14T21:18:06.0114406Z * [new branch] gh/amjames/18/orig -> origin/gh/amjames/18/orig 2025-08-14T21:18:06.0114702Z * [new branch] gh/andrewor14/35/base -> origin/gh/andrewor14/35/base 2025-08-14T21:18:06.0115020Z * [new branch] gh/andrewor14/35/head -> origin/gh/andrewor14/35/head 2025-08-14T21:18:06.0115327Z * [new branch] gh/andrewor14/35/orig -> origin/gh/andrewor14/35/orig 2025-08-14T21:18:06.0115633Z * [new branch] gh/andrewor14/50/base -> origin/gh/andrewor14/50/base 2025-08-14T21:18:06.0116088Z * [new branch] gh/andrewor14/50/head -> origin/gh/andrewor14/50/head 2025-08-14T21:18:06.0116969Z * [new branch] gh/andrewor14/50/orig -> origin/gh/andrewor14/50/orig 2025-08-14T21:18:06.0117480Z * [new branch] gh/andyanwang/1/base -> origin/gh/andyanwang/1/base 2025-08-14T21:18:06.0117967Z * [new branch] gh/andyanwang/1/head -> origin/gh/andyanwang/1/head 2025-08-14T21:18:06.0118499Z * [new branch] gh/andyanwang/1/orig -> origin/gh/andyanwang/1/orig 2025-08-14T21:18:06.0120024Z * [new branch] gh/andyanwang/13/base -> origin/gh/andyanwang/13/base 2025-08-14T21:18:06.0120362Z * [new branch] gh/andyanwang/13/head -> origin/gh/andyanwang/13/head 2025-08-14T21:18:06.0120792Z * [new branch] gh/andyanwang/13/orig -> origin/gh/andyanwang/13/orig 2025-08-14T21:18:06.0124826Z * [new branch] gh/andyanwang/2/base -> origin/gh/andyanwang/2/base 2025-08-14T21:18:06.0125363Z * [new branch] gh/andyanwang/2/head -> origin/gh/andyanwang/2/head 2025-08-14T21:18:06.0126225Z * [new branch] gh/andyanwang/2/orig -> origin/gh/andyanwang/2/orig 2025-08-14T21:18:06.0126613Z * [new branch] gh/andyanwang/28/base -> origin/gh/andyanwang/28/base 2025-08-14T21:18:06.0126945Z * [new branch] gh/andyanwang/28/head -> origin/gh/andyanwang/28/head 2025-08-14T21:18:06.0127248Z * [new branch] gh/andyanwang/28/orig -> origin/gh/andyanwang/28/orig 2025-08-14T21:18:06.0127560Z * [new branch] gh/andyanwang/3/base -> origin/gh/andyanwang/3/base 2025-08-14T21:18:06.0127870Z * [new branch] gh/andyanwang/3/head -> origin/gh/andyanwang/3/head 2025-08-14T21:18:06.0128183Z * [new branch] gh/andyanwang/3/orig -> origin/gh/andyanwang/3/orig 2025-08-14T21:18:06.0128484Z * [new branch] gh/andyanwang/30/base -> origin/gh/andyanwang/30/base 2025-08-14T21:18:06.0128859Z * [new branch] gh/andyanwang/30/orig -> origin/gh/andyanwang/30/orig 2025-08-14T21:18:06.0130441Z * [new branch] gh/andyanwang/31/base -> origin/gh/andyanwang/31/base 2025-08-14T21:18:06.0130907Z * [new branch] gh/andyanwang/31/orig -> origin/gh/andyanwang/31/orig 2025-08-14T21:18:06.0134090Z * [new branch] gh/andyanwang/32/base -> origin/gh/andyanwang/32/base 2025-08-14T21:18:06.0134459Z * [new branch] gh/andyanwang/32/head -> origin/gh/andyanwang/32/head 2025-08-14T21:18:06.0134769Z * [new branch] gh/andyanwang/32/orig -> origin/gh/andyanwang/32/orig 2025-08-14T21:18:06.0135219Z * [new branch] gh/andyanwang/33/base -> origin/gh/andyanwang/33/base 2025-08-14T21:18:06.0135968Z * [new branch] gh/andyanwang/33/head -> origin/gh/andyanwang/33/head 2025-08-14T21:18:06.0136340Z * [new branch] gh/andyanwang/33/orig -> origin/gh/andyanwang/33/orig 2025-08-14T21:18:06.0136649Z * [new branch] gh/andyanwang/34/base -> origin/gh/andyanwang/34/base 2025-08-14T21:18:06.0136995Z * [new branch] gh/andyanwang/34/head -> origin/gh/andyanwang/34/head 2025-08-14T21:18:06.0137563Z * [new branch] gh/andyanwang/34/orig -> origin/gh/andyanwang/34/orig 2025-08-14T21:18:06.0140002Z * [new branch] gh/andyanwang/35/base -> origin/gh/andyanwang/35/base 2025-08-14T21:18:06.0140525Z * [new branch] gh/andyanwang/35/head -> origin/gh/andyanwang/35/head 2025-08-14T21:18:06.0141398Z * [new branch] gh/andyanwang/35/orig -> origin/gh/andyanwang/35/orig 2025-08-14T21:18:06.0141780Z * [new branch] gh/andyanwang/36/base -> origin/gh/andyanwang/36/base 2025-08-14T21:18:06.0142102Z * [new branch] gh/andyanwang/36/head -> origin/gh/andyanwang/36/head 2025-08-14T21:18:06.0142606Z * [new branch] gh/andyanwang/36/orig -> origin/gh/andyanwang/36/orig 2025-08-14T21:18:06.0144723Z * [new branch] gh/andyanwang/37/base -> origin/gh/andyanwang/37/base 2025-08-14T21:18:06.0145103Z * [new branch] gh/andyanwang/37/head -> origin/gh/andyanwang/37/head 2025-08-14T21:18:06.0145561Z * [new branch] gh/andyanwang/37/orig -> origin/gh/andyanwang/37/orig 2025-08-14T21:18:06.0146192Z * [new branch] gh/andyanwang/38/base -> origin/gh/andyanwang/38/base 2025-08-14T21:18:06.0146704Z * [new branch] gh/andyanwang/38/head -> origin/gh/andyanwang/38/head 2025-08-14T21:18:06.0147152Z * [new branch] gh/andyanwang/38/orig -> origin/gh/andyanwang/38/orig 2025-08-14T21:18:06.0147805Z * [new branch] gh/andyanwang/39/base -> origin/gh/andyanwang/39/base 2025-08-14T21:18:06.0148583Z * [new branch] gh/andyanwang/39/head -> origin/gh/andyanwang/39/head 2025-08-14T21:18:06.0149078Z * [new branch] gh/andyanwang/39/orig -> origin/gh/andyanwang/39/orig 2025-08-14T21:18:06.0150712Z * [new branch] gh/andyanwang/4/base -> origin/gh/andyanwang/4/base 2025-08-14T21:18:06.0151108Z * [new branch] gh/andyanwang/4/head -> origin/gh/andyanwang/4/head 2025-08-14T21:18:06.0151560Z * [new branch] gh/andyanwang/4/orig -> origin/gh/andyanwang/4/orig 2025-08-14T21:18:06.0152429Z * [new branch] gh/andyanwang/40/base -> origin/gh/andyanwang/40/base 2025-08-14T21:18:06.0152835Z * [new branch] gh/andyanwang/40/head -> origin/gh/andyanwang/40/head 2025-08-14T21:18:06.0153483Z * [new branch] gh/andyanwang/40/orig -> origin/gh/andyanwang/40/orig 2025-08-14T21:18:06.0155032Z * [new branch] gh/angelayi/102/base -> origin/gh/angelayi/102/base 2025-08-14T21:18:06.0155564Z * [new branch] gh/angelayi/102/head -> origin/gh/angelayi/102/head 2025-08-14T21:18:06.0155997Z * [new branch] gh/angelayi/102/orig -> origin/gh/angelayi/102/orig 2025-08-14T21:18:06.0156558Z * [new branch] gh/angelayi/103/base -> origin/gh/angelayi/103/base 2025-08-14T21:18:06.0157232Z * [new branch] gh/angelayi/103/head -> origin/gh/angelayi/103/head 2025-08-14T21:18:06.0157798Z * [new branch] gh/angelayi/103/orig -> origin/gh/angelayi/103/orig 2025-08-14T21:18:06.0160158Z * [new branch] gh/angelayi/104/base -> origin/gh/angelayi/104/base 2025-08-14T21:18:06.0160678Z * [new branch] gh/angelayi/104/head -> origin/gh/angelayi/104/head 2025-08-14T21:18:06.0161549Z * [new branch] gh/angelayi/104/orig -> origin/gh/angelayi/104/orig 2025-08-14T21:18:06.0161909Z * [new branch] gh/angelayi/105/base -> origin/gh/angelayi/105/base 2025-08-14T21:18:06.0162223Z * [new branch] gh/angelayi/105/head -> origin/gh/angelayi/105/head 2025-08-14T21:18:06.0162527Z * [new branch] gh/angelayi/105/orig -> origin/gh/angelayi/105/orig 2025-08-14T21:18:06.0162875Z * [new branch] gh/angelayi/106/base -> origin/gh/angelayi/106/base 2025-08-14T21:18:06.0163325Z * [new branch] gh/angelayi/106/head -> origin/gh/angelayi/106/head 2025-08-14T21:18:06.0163975Z * [new branch] gh/angelayi/106/orig -> origin/gh/angelayi/106/orig 2025-08-14T21:18:06.0165152Z * [new branch] gh/angelayi/107/base -> origin/gh/angelayi/107/base 2025-08-14T21:18:06.0165573Z * [new branch] gh/angelayi/107/head -> origin/gh/angelayi/107/head 2025-08-14T21:18:06.0167216Z * [new branch] gh/angelayi/108/base -> origin/gh/angelayi/108/base 2025-08-14T21:18:06.0167749Z * [new branch] gh/angelayi/108/head -> origin/gh/angelayi/108/head 2025-08-14T21:18:06.0168181Z * [new branch] gh/angelayi/108/orig -> origin/gh/angelayi/108/orig 2025-08-14T21:18:06.0168505Z * [new branch] gh/angelayi/109/base -> origin/gh/angelayi/109/base 2025-08-14T21:18:06.0168984Z * [new branch] gh/angelayi/109/head -> origin/gh/angelayi/109/head 2025-08-14T21:18:06.0169703Z * [new branch] gh/angelayi/109/orig -> origin/gh/angelayi/109/orig 2025-08-14T21:18:06.0170338Z * [new branch] gh/angelayi/110/base -> origin/gh/angelayi/110/base 2025-08-14T21:18:06.0170925Z * [new branch] gh/angelayi/110/head -> origin/gh/angelayi/110/head 2025-08-14T21:18:06.0171592Z * [new branch] gh/angelayi/110/orig -> origin/gh/angelayi/110/orig 2025-08-14T21:18:06.0174454Z * [new branch] gh/angelayi/97/base -> origin/gh/angelayi/97/base 2025-08-14T21:18:06.0174824Z * [new branch] gh/angelayi/97/head -> origin/gh/angelayi/97/head 2025-08-14T21:18:06.0175133Z * [new branch] gh/angelayi/97/orig -> origin/gh/angelayi/97/orig 2025-08-14T21:18:06.0175439Z * [new branch] gh/ani300/1/base -> origin/gh/ani300/1/base 2025-08-14T21:18:06.0175737Z * [new branch] gh/ani300/1/head -> origin/gh/ani300/1/head 2025-08-14T21:18:06.0176032Z * [new branch] gh/ani300/1/orig -> origin/gh/ani300/1/orig 2025-08-14T21:18:06.0177267Z * [new branch] gh/anijain2305/753/base -> origin/gh/anijain2305/753/base 2025-08-14T21:18:06.0177795Z * [new branch] gh/anijain2305/753/head -> origin/gh/anijain2305/753/head 2025-08-14T21:18:06.0178472Z * [new branch] gh/anijain2305/753/orig -> origin/gh/anijain2305/753/orig 2025-08-14T21:18:06.0179695Z * [new branch] gh/anijain2305/766/base -> origin/gh/anijain2305/766/base 2025-08-14T21:18:06.0180438Z * [new branch] gh/anijain2305/766/head -> origin/gh/anijain2305/766/head 2025-08-14T21:18:06.0180952Z * [new branch] gh/anijain2305/766/orig -> origin/gh/anijain2305/766/orig 2025-08-14T21:18:06.0181538Z * [new branch] gh/anijain2305/790/base -> origin/gh/anijain2305/790/base 2025-08-14T21:18:06.0182160Z * [new branch] gh/anijain2305/790/head -> origin/gh/anijain2305/790/head 2025-08-14T21:18:06.0182759Z * [new branch] gh/anijain2305/790/orig -> origin/gh/anijain2305/790/orig 2025-08-14T21:18:06.0183830Z * [new branch] gh/anijain2305/792/base -> origin/gh/anijain2305/792/base 2025-08-14T21:18:06.0184145Z * [new branch] gh/anijain2305/792/head -> origin/gh/anijain2305/792/head 2025-08-14T21:18:06.0185004Z * [new branch] gh/anijain2305/792/orig -> origin/gh/anijain2305/792/orig 2025-08-14T21:18:06.0187444Z * [new branch] gh/anijain2305/803/base -> origin/gh/anijain2305/803/base 2025-08-14T21:18:06.0187828Z * [new branch] gh/anijain2305/803/head -> origin/gh/anijain2305/803/head 2025-08-14T21:18:06.0188142Z * [new branch] gh/anijain2305/803/orig -> origin/gh/anijain2305/803/orig 2025-08-14T21:18:06.0188590Z * [new branch] gh/anijain2305/804/base -> origin/gh/anijain2305/804/base 2025-08-14T21:18:06.0188944Z * [new branch] gh/anijain2305/804/head -> origin/gh/anijain2305/804/head 2025-08-14T21:18:06.0189285Z * [new branch] gh/anijain2305/804/orig -> origin/gh/anijain2305/804/orig 2025-08-14T21:18:06.0190052Z * [new branch] gh/anijain2305/805/base -> origin/gh/anijain2305/805/base 2025-08-14T21:18:06.0192966Z * [new branch] gh/anijain2305/805/head -> origin/gh/anijain2305/805/head 2025-08-14T21:18:06.0193480Z * [new branch] gh/anijain2305/805/orig -> origin/gh/anijain2305/805/orig 2025-08-14T21:18:06.0194283Z * [new branch] gh/anijain2305/810/base -> origin/gh/anijain2305/810/base 2025-08-14T21:18:06.0194820Z * [new branch] gh/anijain2305/810/head -> origin/gh/anijain2305/810/head 2025-08-14T21:18:06.0195278Z * [new branch] gh/anijain2305/810/orig -> origin/gh/anijain2305/810/orig 2025-08-14T21:18:06.0195783Z * [new branch] gh/anijain2305/811/base -> origin/gh/anijain2305/811/base 2025-08-14T21:18:06.0196110Z * [new branch] gh/anijain2305/811/head -> origin/gh/anijain2305/811/head 2025-08-14T21:18:06.0196422Z * [new branch] gh/anijain2305/811/orig -> origin/gh/anijain2305/811/orig 2025-08-14T21:18:06.0196730Z * [new branch] gh/anijain2305/812/base -> origin/gh/anijain2305/812/base 2025-08-14T21:18:06.0197057Z * [new branch] gh/anijain2305/812/head -> origin/gh/anijain2305/812/head 2025-08-14T21:18:06.0197408Z * [new branch] gh/anijain2305/812/orig -> origin/gh/anijain2305/812/orig 2025-08-14T21:18:06.0199441Z * [new branch] gh/anijain2305/813/base -> origin/gh/anijain2305/813/base 2025-08-14T21:18:06.0199938Z * [new branch] gh/anijain2305/813/head -> origin/gh/anijain2305/813/head 2025-08-14T21:18:06.0200382Z * [new branch] gh/anijain2305/813/orig -> origin/gh/anijain2305/813/orig 2025-08-14T21:18:06.0201133Z * [new branch] gh/anijain2305/814/base -> origin/gh/anijain2305/814/base 2025-08-14T21:18:06.0201515Z * [new branch] gh/anijain2305/814/head -> origin/gh/anijain2305/814/head 2025-08-14T21:18:06.0201960Z * [new branch] gh/anijain2305/814/orig -> origin/gh/anijain2305/814/orig 2025-08-14T21:18:06.0202590Z * [new branch] gh/anijain2305/815/base -> origin/gh/anijain2305/815/base 2025-08-14T21:18:06.0203132Z * [new branch] gh/anijain2305/815/head -> origin/gh/anijain2305/815/head 2025-08-14T21:18:06.0203713Z * [new branch] gh/anijain2305/815/orig -> origin/gh/anijain2305/815/orig 2025-08-14T21:18:06.0205203Z * [new branch] gh/anijain2305/816/base -> origin/gh/anijain2305/816/base 2025-08-14T21:18:06.0205706Z * [new branch] gh/anijain2305/816/head -> origin/gh/anijain2305/816/head 2025-08-14T21:18:06.0206120Z * [new branch] gh/anijain2305/817/base -> origin/gh/anijain2305/817/base 2025-08-14T21:18:06.0206454Z * [new branch] gh/anijain2305/817/head -> origin/gh/anijain2305/817/head 2025-08-14T21:18:06.0207142Z * [new branch] gh/anijain2305/817/orig -> origin/gh/anijain2305/817/orig 2025-08-14T21:18:06.0209274Z * [new branch] gh/anijain2305/818/base -> origin/gh/anijain2305/818/base 2025-08-14T21:18:06.0209808Z * [new branch] gh/anijain2305/818/head -> origin/gh/anijain2305/818/head 2025-08-14T21:18:06.0209971Z * [new branch] gh/anijain2305/818/orig -> origin/gh/anijain2305/818/orig 2025-08-14T21:18:06.0210434Z * [new branch] gh/anijain2305/819/base -> origin/gh/anijain2305/819/base 2025-08-14T21:18:06.0211342Z * [new branch] gh/anijain2305/819/head -> origin/gh/anijain2305/819/head 2025-08-14T21:18:06.0211697Z * [new branch] gh/anijain2305/819/orig -> origin/gh/anijain2305/819/orig 2025-08-14T21:18:06.0213925Z * [new branch] gh/anijain2305/820/base -> origin/gh/anijain2305/820/base 2025-08-14T21:18:06.0214100Z * [new branch] gh/anijain2305/820/head -> origin/gh/anijain2305/820/head 2025-08-14T21:18:06.0214229Z * [new branch] gh/anijain2305/820/orig -> origin/gh/anijain2305/820/orig 2025-08-14T21:18:06.0215613Z * [new branch] gh/anijain2305/821/base -> origin/gh/anijain2305/821/base 2025-08-14T21:18:06.0215785Z * [new branch] gh/anijain2305/821/head -> origin/gh/anijain2305/821/head 2025-08-14T21:18:06.0216026Z * [new branch] gh/anijain2305/821/orig -> origin/gh/anijain2305/821/orig 2025-08-14T21:18:06.0217090Z * [new branch] gh/anijain2305/822/base -> origin/gh/anijain2305/822/base 2025-08-14T21:18:06.0217404Z * [new branch] gh/anijain2305/822/head -> origin/gh/anijain2305/822/head 2025-08-14T21:18:06.0218457Z * [new branch] gh/anijain2305/822/orig -> origin/gh/anijain2305/822/orig 2025-08-14T21:18:06.0218951Z * [new branch] gh/anijain2305/823/base -> origin/gh/anijain2305/823/base 2025-08-14T21:18:06.0220220Z * [new branch] gh/anijain2305/823/head -> origin/gh/anijain2305/823/head 2025-08-14T21:18:06.0220500Z * [new branch] gh/anijain2305/823/orig -> origin/gh/anijain2305/823/orig 2025-08-14T21:18:06.0221088Z * [new branch] gh/anijain2305/824/base -> origin/gh/anijain2305/824/base 2025-08-14T21:18:06.0221826Z * [new branch] gh/anijain2305/824/head -> origin/gh/anijain2305/824/head 2025-08-14T21:18:06.0222263Z * [new branch] gh/anijain2305/824/orig -> origin/gh/anijain2305/824/orig 2025-08-14T21:18:06.0223679Z * [new branch] gh/anijain2305/825/base -> origin/gh/anijain2305/825/base 2025-08-14T21:18:06.0223864Z * [new branch] gh/anijain2305/825/head -> origin/gh/anijain2305/825/head 2025-08-14T21:18:06.0224811Z * [new branch] gh/anijain2305/825/orig -> origin/gh/anijain2305/825/orig 2025-08-14T21:18:06.0225762Z * [new branch] gh/anijain2305/826/base -> origin/gh/anijain2305/826/base 2025-08-14T21:18:06.0226019Z * [new branch] gh/anijain2305/826/head -> origin/gh/anijain2305/826/head 2025-08-14T21:18:06.0226979Z * [new branch] gh/anijain2305/826/orig -> origin/gh/anijain2305/826/orig 2025-08-14T21:18:06.0227797Z * [new branch] gh/anijain2305/827/base -> origin/gh/anijain2305/827/base 2025-08-14T21:18:06.0227998Z * [new branch] gh/anijain2305/827/head -> origin/gh/anijain2305/827/head 2025-08-14T21:18:06.0228931Z * [new branch] gh/anijain2305/827/orig -> origin/gh/anijain2305/827/orig 2025-08-14T21:18:06.0229770Z * [new branch] gh/anijain2305/828/base -> origin/gh/anijain2305/828/base 2025-08-14T21:18:06.0230227Z * [new branch] gh/anijain2305/828/head -> origin/gh/anijain2305/828/head 2025-08-14T21:18:06.0230921Z * [new branch] gh/anijain2305/828/orig -> origin/gh/anijain2305/828/orig 2025-08-14T21:18:06.0232175Z * [new branch] gh/anijain2305/829/base -> origin/gh/anijain2305/829/base 2025-08-14T21:18:06.0232380Z * [new branch] gh/anijain2305/829/head -> origin/gh/anijain2305/829/head 2025-08-14T21:18:06.0233391Z * [new branch] gh/anijain2305/829/orig -> origin/gh/anijain2305/829/orig 2025-08-14T21:18:06.0234277Z * [new branch] gh/anijain2305/830/base -> origin/gh/anijain2305/830/base 2025-08-14T21:18:06.0234524Z * [new branch] gh/anijain2305/830/head -> origin/gh/anijain2305/830/head 2025-08-14T21:18:06.0235426Z * [new branch] gh/anijain2305/830/orig -> origin/gh/anijain2305/830/orig 2025-08-14T21:18:06.0236415Z * [new branch] gh/anijain2305/831/base -> origin/gh/anijain2305/831/base 2025-08-14T21:18:06.0236767Z * [new branch] gh/anijain2305/831/head -> origin/gh/anijain2305/831/head 2025-08-14T21:18:06.0237624Z * [new branch] gh/anijain2305/831/orig -> origin/gh/anijain2305/831/orig 2025-08-14T21:18:06.0238479Z * [new branch] gh/anijain2305/832/base -> origin/gh/anijain2305/832/base 2025-08-14T21:18:06.0238768Z * [new branch] gh/anijain2305/832/head -> origin/gh/anijain2305/832/head 2025-08-14T21:18:06.0239630Z * [new branch] gh/anijain2305/832/orig -> origin/gh/anijain2305/832/orig 2025-08-14T21:18:06.0240464Z * [new branch] gh/anijain2305/833/base -> origin/gh/anijain2305/833/base 2025-08-14T21:18:06.0241405Z * [new branch] gh/anijain2305/833/head -> origin/gh/anijain2305/833/head 2025-08-14T21:18:06.0241819Z * [new branch] gh/anijain2305/833/orig -> origin/gh/anijain2305/833/orig 2025-08-14T21:18:06.0243008Z * [new branch] gh/anijain2305/834/base -> origin/gh/anijain2305/834/base 2025-08-14T21:18:06.0243253Z * [new branch] gh/anijain2305/834/head -> origin/gh/anijain2305/834/head 2025-08-14T21:18:06.0245025Z * [new branch] gh/anijain2305/834/orig -> origin/gh/anijain2305/834/orig 2025-08-14T21:18:06.0245201Z * [new branch] gh/anijain2305/835/base -> origin/gh/anijain2305/835/base 2025-08-14T21:18:06.0245497Z * [new branch] gh/anijain2305/835/head -> origin/gh/anijain2305/835/head 2025-08-14T21:18:06.0246006Z * [new branch] gh/anijain2305/835/orig -> origin/gh/anijain2305/835/orig 2025-08-14T21:18:06.0246988Z * [new branch] gh/anijain2305/836/base -> origin/gh/anijain2305/836/base 2025-08-14T21:18:06.0247358Z * [new branch] gh/anijain2305/836/head -> origin/gh/anijain2305/836/head 2025-08-14T21:18:06.0248814Z * [new branch] gh/anijain2305/836/orig -> origin/gh/anijain2305/836/orig 2025-08-14T21:18:06.0249023Z * [new branch] gh/anijain2305/837/base -> origin/gh/anijain2305/837/base 2025-08-14T21:18:06.0249550Z * [new branch] gh/anijain2305/837/head -> origin/gh/anijain2305/837/head 2025-08-14T21:18:06.0250129Z * [new branch] gh/anijain2305/837/orig -> origin/gh/anijain2305/837/orig 2025-08-14T21:18:06.0251464Z * [new branch] gh/anijain2305/838/base -> origin/gh/anijain2305/838/base 2025-08-14T21:18:06.0251618Z * [new branch] gh/anijain2305/838/head -> origin/gh/anijain2305/838/head 2025-08-14T21:18:06.0252152Z * [new branch] gh/anijain2305/838/orig -> origin/gh/anijain2305/838/orig 2025-08-14T21:18:06.0255128Z * [new branch] gh/anijain2305/839/base -> origin/gh/anijain2305/839/base 2025-08-14T21:18:06.0255307Z * [new branch] gh/anijain2305/839/head -> origin/gh/anijain2305/839/head 2025-08-14T21:18:06.0255441Z * [new branch] gh/anijain2305/839/orig -> origin/gh/anijain2305/839/orig 2025-08-14T21:18:06.0255588Z * [new branch] gh/anijain2305/840/base -> origin/gh/anijain2305/840/base 2025-08-14T21:18:06.0255934Z * [new branch] gh/anijain2305/840/head -> origin/gh/anijain2305/840/head 2025-08-14T21:18:06.0256085Z * [new branch] gh/anijain2305/840/orig -> origin/gh/anijain2305/840/orig 2025-08-14T21:18:06.0257534Z * [new branch] gh/anijain2305/841/base -> origin/gh/anijain2305/841/base 2025-08-14T21:18:06.0257696Z * [new branch] gh/anijain2305/841/head -> origin/gh/anijain2305/841/head 2025-08-14T21:18:06.0259461Z * [new branch] gh/anijain2305/841/orig -> origin/gh/anijain2305/841/orig 2025-08-14T21:18:06.0259634Z * [new branch] gh/anijain2305/842/base -> origin/gh/anijain2305/842/base 2025-08-14T21:18:06.0259775Z * [new branch] gh/anijain2305/842/head -> origin/gh/anijain2305/842/head 2025-08-14T21:18:06.0260612Z * [new branch] gh/anijain2305/842/orig -> origin/gh/anijain2305/842/orig 2025-08-14T21:18:06.0261450Z * [new branch] gh/anijain2305/843/base -> origin/gh/anijain2305/843/base 2025-08-14T21:18:06.0261821Z * [new branch] gh/anijain2305/843/head -> origin/gh/anijain2305/843/head 2025-08-14T21:18:06.0263370Z * [new branch] gh/anijain2305/843/orig -> origin/gh/anijain2305/843/orig 2025-08-14T21:18:06.0263556Z * [new branch] gh/anijain2305/844/base -> origin/gh/anijain2305/844/base 2025-08-14T21:18:06.0264106Z * [new branch] gh/anijain2305/844/head -> origin/gh/anijain2305/844/head 2025-08-14T21:18:06.0264535Z * [new branch] gh/anijain2305/844/orig -> origin/gh/anijain2305/844/orig 2025-08-14T21:18:06.0267842Z * [new branch] gh/anijain2305/845/base -> origin/gh/anijain2305/845/base 2025-08-14T21:18:06.0268014Z * [new branch] gh/anijain2305/845/head -> origin/gh/anijain2305/845/head 2025-08-14T21:18:06.0268308Z * [new branch] gh/anijain2305/845/orig -> origin/gh/anijain2305/845/orig 2025-08-14T21:18:06.0268449Z * [new branch] gh/anijain2305/846/base -> origin/gh/anijain2305/846/base 2025-08-14T21:18:06.0268580Z * [new branch] gh/anijain2305/846/head -> origin/gh/anijain2305/846/head 2025-08-14T21:18:06.0268759Z * [new branch] gh/anijain2305/846/orig -> origin/gh/anijain2305/846/orig 2025-08-14T21:18:06.0272330Z * [new branch] gh/anijain2305/847/base -> origin/gh/anijain2305/847/base 2025-08-14T21:18:06.0272499Z * [new branch] gh/anijain2305/847/head -> origin/gh/anijain2305/847/head 2025-08-14T21:18:06.0272637Z * [new branch] gh/anijain2305/847/orig -> origin/gh/anijain2305/847/orig 2025-08-14T21:18:06.0272763Z * [new branch] gh/anijain2305/848/base -> origin/gh/anijain2305/848/base 2025-08-14T21:18:06.0272912Z * [new branch] gh/anijain2305/848/head -> origin/gh/anijain2305/848/head 2025-08-14T21:18:06.0273078Z * [new branch] gh/anijain2305/848/orig -> origin/gh/anijain2305/848/orig 2025-08-14T21:18:06.0276591Z * [new branch] gh/anjali411/216/base -> origin/gh/anjali411/216/base 2025-08-14T21:18:06.0276914Z * [new branch] gh/anjali411/216/head -> origin/gh/anjali411/216/head 2025-08-14T21:18:06.0277077Z * [new branch] gh/anjali411/216/orig -> origin/gh/anjali411/216/orig 2025-08-14T21:18:06.0277230Z * [new branch] gh/ankitageorge/10/base -> origin/gh/ankitageorge/10/base 2025-08-14T21:18:06.0277492Z * [new branch] gh/ankitageorge/10/head -> origin/gh/ankitageorge/10/head 2025-08-14T21:18:06.0278028Z * [new branch] gh/ankitageorge/10/orig -> origin/gh/ankitageorge/10/orig 2025-08-14T21:18:06.0278880Z * [new branch] gh/ankitageorge/12/base -> origin/gh/ankitageorge/12/base 2025-08-14T21:18:06.0279283Z * [new branch] gh/ankitageorge/12/head -> origin/gh/ankitageorge/12/head 2025-08-14T21:18:06.0280611Z * [new branch] gh/ankitageorge/12/orig -> origin/gh/ankitageorge/12/orig 2025-08-14T21:18:06.0280909Z * [new branch] gh/ankitageorge/13/base -> origin/gh/ankitageorge/13/base 2025-08-14T21:18:06.0281515Z * [new branch] gh/ankitageorge/13/head -> origin/gh/ankitageorge/13/head 2025-08-14T21:18:06.0282546Z * [new branch] gh/ankitageorge/13/orig -> origin/gh/ankitageorge/13/orig 2025-08-14T21:18:06.0284897Z * [new branch] gh/ankitageorge/14/base -> origin/gh/ankitageorge/14/base 2025-08-14T21:18:06.0285078Z * [new branch] gh/ankitageorge/14/head -> origin/gh/ankitageorge/14/head 2025-08-14T21:18:06.0285222Z * [new branch] gh/ankitageorge/14/orig -> origin/gh/ankitageorge/14/orig 2025-08-14T21:18:06.0285601Z * [new branch] gh/ankitageorge/15/base -> origin/gh/ankitageorge/15/base 2025-08-14T21:18:06.0286439Z * [new branch] gh/ankitageorge/15/head -> origin/gh/ankitageorge/15/head 2025-08-14T21:18:06.0287346Z * [new branch] gh/ankitageorge/15/orig -> origin/gh/ankitageorge/15/orig 2025-08-14T21:18:06.0291084Z * [new branch] gh/ankitageorge/16/base -> origin/gh/ankitageorge/16/base 2025-08-14T21:18:06.0291282Z * [new branch] gh/ankitageorge/16/head -> origin/gh/ankitageorge/16/head 2025-08-14T21:18:06.0291418Z * [new branch] gh/ankitageorge/16/orig -> origin/gh/ankitageorge/16/orig 2025-08-14T21:18:06.0291555Z * [new branch] gh/ankitageorge/17/base -> origin/gh/ankitageorge/17/base 2025-08-14T21:18:06.0291686Z * [new branch] gh/ankitageorge/17/head -> origin/gh/ankitageorge/17/head 2025-08-14T21:18:06.0291986Z * [new branch] gh/ankitageorge/17/orig -> origin/gh/ankitageorge/17/orig 2025-08-14T21:18:06.0292683Z * [new branch] gh/ankitageorge/18/base -> origin/gh/ankitageorge/18/base 2025-08-14T21:18:06.0293406Z * [new branch] gh/ankitageorge/18/head -> origin/gh/ankitageorge/18/head 2025-08-14T21:18:06.0293874Z * [new branch] gh/ankitageorge/18/orig -> origin/gh/ankitageorge/18/orig 2025-08-14T21:18:06.0295831Z * [new branch] gh/ankitageorge/19/base -> origin/gh/ankitageorge/19/base 2025-08-14T21:18:06.0296165Z * [new branch] gh/ankitageorge/19/head -> origin/gh/ankitageorge/19/head 2025-08-14T21:18:06.0296315Z * [new branch] gh/ankitageorge/19/orig -> origin/gh/ankitageorge/19/orig 2025-08-14T21:18:06.0297520Z * [new branch] gh/ankitageorge/20/base -> origin/gh/ankitageorge/20/base 2025-08-14T21:18:06.0297926Z * [new branch] gh/ankitageorge/20/head -> origin/gh/ankitageorge/20/head 2025-08-14T21:18:06.0298830Z * [new branch] gh/ankitageorge/20/orig -> origin/gh/ankitageorge/20/orig 2025-08-14T21:18:06.0299711Z * [new branch] gh/ankitageorge/21/base -> origin/gh/ankitageorge/21/base 2025-08-14T21:18:06.0299983Z * [new branch] gh/ankitageorge/21/head -> origin/gh/ankitageorge/21/head 2025-08-14T21:18:06.0300944Z * [new branch] gh/ankitageorge/21/orig -> origin/gh/ankitageorge/21/orig 2025-08-14T21:18:06.0302136Z * [new branch] gh/anshul-si/1/base -> origin/gh/anshul-si/1/base 2025-08-14T21:18:06.0302431Z * [new branch] gh/anshul-si/1/head -> origin/gh/anshul-si/1/head 2025-08-14T21:18:06.0303791Z * [new branch] gh/anshul-si/10/base -> origin/gh/anshul-si/10/base 2025-08-14T21:18:06.0304137Z * [new branch] gh/anshul-si/10/head -> origin/gh/anshul-si/10/head 2025-08-14T21:18:06.0307124Z * [new branch] gh/anshul-si/10/orig -> origin/gh/anshul-si/10/orig 2025-08-14T21:18:06.0307720Z * [new branch] gh/anshul-si/11/base -> origin/gh/anshul-si/11/base 2025-08-14T21:18:06.0312393Z * [new branch] gh/anshul-si/11/head -> origin/gh/anshul-si/11/head 2025-08-14T21:18:06.0312557Z * [new branch] gh/anshul-si/11/orig -> origin/gh/anshul-si/11/orig 2025-08-14T21:18:06.0312687Z * [new branch] gh/anshul-si/12/base -> origin/gh/anshul-si/12/base 2025-08-14T21:18:06.0313016Z * [new branch] gh/anshul-si/12/head -> origin/gh/anshul-si/12/head 2025-08-14T21:18:06.0313140Z * [new branch] gh/anshul-si/12/orig -> origin/gh/anshul-si/12/orig 2025-08-14T21:18:06.0313272Z * [new branch] gh/anshul-si/13/base -> origin/gh/anshul-si/13/base 2025-08-14T21:18:06.0313574Z * [new branch] gh/anshul-si/13/head -> origin/gh/anshul-si/13/head 2025-08-14T21:18:06.0313719Z * [new branch] gh/anshul-si/13/orig -> origin/gh/anshul-si/13/orig 2025-08-14T21:18:06.0313868Z * [new branch] gh/anshul-si/14/base -> origin/gh/anshul-si/14/base 2025-08-14T21:18:06.0314103Z * [new branch] gh/anshul-si/14/head -> origin/gh/anshul-si/14/head 2025-08-14T21:18:06.0314542Z * [new branch] gh/anshul-si/14/orig -> origin/gh/anshul-si/14/orig 2025-08-14T21:18:06.0314699Z * [new branch] gh/anshul-si/15/base -> origin/gh/anshul-si/15/base 2025-08-14T21:18:06.0314849Z * [new branch] gh/anshul-si/15/head -> origin/gh/anshul-si/15/head 2025-08-14T21:18:06.0316700Z * [new branch] gh/anshul-si/15/orig -> origin/gh/anshul-si/15/orig 2025-08-14T21:18:06.0316925Z * [new branch] gh/anshul-si/16/base -> origin/gh/anshul-si/16/base 2025-08-14T21:18:06.0317062Z * [new branch] gh/anshul-si/16/head -> origin/gh/anshul-si/16/head 2025-08-14T21:18:06.0317404Z * [new branch] gh/anshul-si/16/orig -> origin/gh/anshul-si/16/orig 2025-08-14T21:18:06.0317583Z * [new branch] gh/anshul-si/17/base -> origin/gh/anshul-si/17/base 2025-08-14T21:18:06.0317840Z * [new branch] gh/anshul-si/17/head -> origin/gh/anshul-si/17/head 2025-08-14T21:18:06.0318824Z * [new branch] gh/anshul-si/17/orig -> origin/gh/anshul-si/17/orig 2025-08-14T21:18:06.0323660Z * [new branch] gh/anshul-si/18/base -> origin/gh/anshul-si/18/base 2025-08-14T21:18:06.0323835Z * [new branch] gh/anshul-si/18/head -> origin/gh/anshul-si/18/head 2025-08-14T21:18:06.0323967Z * [new branch] gh/anshul-si/18/orig -> origin/gh/anshul-si/18/orig 2025-08-14T21:18:06.0324087Z * [new branch] gh/anshul-si/19/base -> origin/gh/anshul-si/19/base 2025-08-14T21:18:06.0324211Z * [new branch] gh/anshul-si/19/head -> origin/gh/anshul-si/19/head 2025-08-14T21:18:06.0324369Z * [new branch] gh/anshul-si/19/orig -> origin/gh/anshul-si/19/orig 2025-08-14T21:18:06.0325165Z * [new branch] gh/anshul-si/2/base -> origin/gh/anshul-si/2/base 2025-08-14T21:18:06.0325301Z * [new branch] gh/anshul-si/2/head -> origin/gh/anshul-si/2/head 2025-08-14T21:18:06.0325425Z * [new branch] gh/anshul-si/20/base -> origin/gh/anshul-si/20/base 2025-08-14T21:18:06.0325680Z * [new branch] gh/anshul-si/20/head -> origin/gh/anshul-si/20/head 2025-08-14T21:18:06.0325824Z * [new branch] gh/anshul-si/20/orig -> origin/gh/anshul-si/20/orig 2025-08-14T21:18:06.0328938Z * [new branch] gh/anshul-si/21/base -> origin/gh/anshul-si/21/base 2025-08-14T21:18:06.0329075Z * [new branch] gh/anshul-si/21/head -> origin/gh/anshul-si/21/head 2025-08-14T21:18:06.0329292Z * [new branch] gh/anshul-si/21/orig -> origin/gh/anshul-si/21/orig 2025-08-14T21:18:06.0329435Z * [new branch] gh/anshul-si/22/base -> origin/gh/anshul-si/22/base 2025-08-14T21:18:06.0329638Z * [new branch] gh/anshul-si/22/head -> origin/gh/anshul-si/22/head 2025-08-14T21:18:06.0329775Z * [new branch] gh/anshul-si/22/orig -> origin/gh/anshul-si/22/orig 2025-08-14T21:18:06.0334718Z * [new branch] gh/anshul-si/23/base -> origin/gh/anshul-si/23/base 2025-08-14T21:18:06.0335038Z * [new branch] gh/anshul-si/23/head -> origin/gh/anshul-si/23/head 2025-08-14T21:18:06.0335168Z * [new branch] gh/anshul-si/23/orig -> origin/gh/anshul-si/23/orig 2025-08-14T21:18:06.0335299Z * [new branch] gh/anshul-si/24/base -> origin/gh/anshul-si/24/base 2025-08-14T21:18:06.0335422Z * [new branch] gh/anshul-si/24/head -> origin/gh/anshul-si/24/head 2025-08-14T21:18:06.0335543Z * [new branch] gh/anshul-si/24/orig -> origin/gh/anshul-si/24/orig 2025-08-14T21:18:06.0335693Z * [new branch] gh/anshul-si/25/base -> origin/gh/anshul-si/25/base 2025-08-14T21:18:06.0336580Z * [new branch] gh/anshul-si/25/head -> origin/gh/anshul-si/25/head 2025-08-14T21:18:06.0336810Z * [new branch] gh/anshul-si/25/orig -> origin/gh/anshul-si/25/orig 2025-08-14T21:18:06.0336938Z * [new branch] gh/anshul-si/26/base -> origin/gh/anshul-si/26/base 2025-08-14T21:18:06.0337075Z * [new branch] gh/anshul-si/26/head -> origin/gh/anshul-si/26/head 2025-08-14T21:18:06.0337203Z * [new branch] gh/anshul-si/26/orig -> origin/gh/anshul-si/26/orig 2025-08-14T21:18:06.0337850Z * [new branch] gh/anshul-si/27/base -> origin/gh/anshul-si/27/base 2025-08-14T21:18:06.0338252Z * [new branch] gh/anshul-si/27/head -> origin/gh/anshul-si/27/head 2025-08-14T21:18:06.0339349Z * [new branch] gh/anshul-si/27/orig -> origin/gh/anshul-si/27/orig 2025-08-14T21:18:06.0340365Z * [new branch] gh/anshul-si/3/base -> origin/gh/anshul-si/3/base 2025-08-14T21:18:06.0341943Z * [new branch] gh/anshul-si/3/head -> origin/gh/anshul-si/3/head 2025-08-14T21:18:06.0342280Z * [new branch] gh/anshul-si/4/base -> origin/gh/anshul-si/4/base 2025-08-14T21:18:06.0342426Z * [new branch] gh/anshul-si/4/head -> origin/gh/anshul-si/4/head 2025-08-14T21:18:06.0342548Z * [new branch] gh/anshul-si/5/base -> origin/gh/anshul-si/5/base 2025-08-14T21:18:06.0343975Z * [new branch] gh/anshul-si/5/head -> origin/gh/anshul-si/5/head 2025-08-14T21:18:06.0344380Z * [new branch] gh/anshul-si/6/base -> origin/gh/anshul-si/6/base 2025-08-14T21:18:06.0344652Z * [new branch] gh/anshul-si/6/head -> origin/gh/anshul-si/6/head 2025-08-14T21:18:06.0344904Z * [new branch] gh/anshul-si/6/orig -> origin/gh/anshul-si/6/orig 2025-08-14T21:18:06.0348411Z * [new branch] gh/anshul-si/7/base -> origin/gh/anshul-si/7/base 2025-08-14T21:18:06.0348582Z * [new branch] gh/anshul-si/7/head -> origin/gh/anshul-si/7/head 2025-08-14T21:18:06.0348709Z * [new branch] gh/anshul-si/7/orig -> origin/gh/anshul-si/7/orig 2025-08-14T21:18:06.0348845Z * [new branch] gh/anshul-si/8/base -> origin/gh/anshul-si/8/base 2025-08-14T21:18:06.0348975Z * [new branch] gh/anshul-si/8/head -> origin/gh/anshul-si/8/head 2025-08-14T21:18:06.0349426Z * [new branch] gh/anshul-si/8/orig -> origin/gh/anshul-si/8/orig 2025-08-14T21:18:06.0352508Z * [new branch] gh/anshul-si/9/base -> origin/gh/anshul-si/9/base 2025-08-14T21:18:06.0352667Z * [new branch] gh/anshul-si/9/head -> origin/gh/anshul-si/9/head 2025-08-14T21:18:06.0352804Z * [new branch] gh/anshul-si/9/orig -> origin/gh/anshul-si/9/orig 2025-08-14T21:18:06.0352947Z * [new branch] gh/aorenste/132/base -> origin/gh/aorenste/132/base 2025-08-14T21:18:06.0353349Z * [new branch] gh/aorenste/132/head -> origin/gh/aorenste/132/head 2025-08-14T21:18:06.0356986Z * [new branch] gh/aorenste/235/base -> origin/gh/aorenste/235/base 2025-08-14T21:18:06.0357470Z * [new branch] gh/aorenste/235/head -> origin/gh/aorenste/235/head 2025-08-14T21:18:06.0357687Z * [new branch] gh/aorenste/235/orig -> origin/gh/aorenste/235/orig 2025-08-14T21:18:06.0358367Z * [new branch] gh/aorenste/236/base -> origin/gh/aorenste/236/base 2025-08-14T21:18:06.0358536Z * [new branch] gh/aorenste/236/head -> origin/gh/aorenste/236/head 2025-08-14T21:18:06.0358662Z * [new branch] gh/aorenste/236/orig -> origin/gh/aorenste/236/orig 2025-08-14T21:18:06.0359091Z * [new branch] gh/aorenste/237/base -> origin/gh/aorenste/237/base 2025-08-14T21:18:06.0362082Z * [new branch] gh/aorenste/237/head -> origin/gh/aorenste/237/head 2025-08-14T21:18:06.0362403Z * [new branch] gh/aorenste/237/orig -> origin/gh/aorenste/237/orig 2025-08-14T21:18:06.0362634Z * [new branch] gh/aorenste/238/base -> origin/gh/aorenste/238/base 2025-08-14T21:18:06.0362788Z * [new branch] gh/aorenste/238/head -> origin/gh/aorenste/238/head 2025-08-14T21:18:06.0362991Z * [new branch] gh/aorenste/238/orig -> origin/gh/aorenste/238/orig 2025-08-14T21:18:06.0363809Z * [new branch] gh/bdhirsh/650/base -> origin/gh/bdhirsh/650/base 2025-08-14T21:18:06.0364259Z * [new branch] gh/bdhirsh/650/head -> origin/gh/bdhirsh/650/head 2025-08-14T21:18:06.0365836Z * [new branch] gh/bdhirsh/650/orig -> origin/gh/bdhirsh/650/orig 2025-08-14T21:18:06.0366159Z * [new branch] gh/bdhirsh/656/base -> origin/gh/bdhirsh/656/base 2025-08-14T21:18:06.0366529Z * [new branch] gh/bdhirsh/656/head -> origin/gh/bdhirsh/656/head 2025-08-14T21:18:06.0367466Z * [new branch] gh/bdhirsh/657/base -> origin/gh/bdhirsh/657/base 2025-08-14T21:18:06.0367829Z * [new branch] gh/bdhirsh/657/head -> origin/gh/bdhirsh/657/head 2025-08-14T21:18:06.0368827Z * [new branch] gh/bdhirsh/659/base -> origin/gh/bdhirsh/659/base 2025-08-14T21:18:06.0369131Z * [new branch] gh/bdhirsh/659/head -> origin/gh/bdhirsh/659/head 2025-08-14T21:18:06.0371468Z * [new branch] gh/bdhirsh/659/orig -> origin/gh/bdhirsh/659/orig 2025-08-14T21:18:06.0371629Z * [new branch] gh/bdhirsh/663/base -> origin/gh/bdhirsh/663/base 2025-08-14T21:18:06.0371762Z * [new branch] gh/bdhirsh/663/head -> origin/gh/bdhirsh/663/head 2025-08-14T21:18:06.0371907Z * [new branch] gh/bdhirsh/663/orig -> origin/gh/bdhirsh/663/orig 2025-08-14T21:18:06.0375540Z * [new branch] gh/bdhirsh/665/base -> origin/gh/bdhirsh/665/base 2025-08-14T21:18:06.0375698Z * [new branch] gh/bdhirsh/665/head -> origin/gh/bdhirsh/665/head 2025-08-14T21:18:06.0375846Z * [new branch] gh/bdhirsh/665/orig -> origin/gh/bdhirsh/665/orig 2025-08-14T21:18:06.0375975Z * [new branch] gh/bdhirsh/666/base -> origin/gh/bdhirsh/666/base 2025-08-14T21:18:06.0376098Z * [new branch] gh/bdhirsh/666/head -> origin/gh/bdhirsh/666/head 2025-08-14T21:18:06.0376227Z * [new branch] gh/bdhirsh/666/orig -> origin/gh/bdhirsh/666/orig 2025-08-14T21:18:06.0377526Z * [new branch] gh/benjaminglass1/79/base -> origin/gh/benjaminglass1/79/base 2025-08-14T21:18:06.0377779Z * [new branch] gh/benjaminglass1/79/head -> origin/gh/benjaminglass1/79/head 2025-08-14T21:18:06.0378371Z * [new branch] gh/benjaminglass1/79/orig -> origin/gh/benjaminglass1/79/orig 2025-08-14T21:18:06.0379455Z * [new branch] gh/benjaminglass1/86/base -> origin/gh/benjaminglass1/86/base 2025-08-14T21:18:06.0379682Z * [new branch] gh/benjaminglass1/86/head -> origin/gh/benjaminglass1/86/head 2025-08-14T21:18:06.0380681Z * [new branch] gh/benjaminglass1/86/orig -> origin/gh/benjaminglass1/86/orig 2025-08-14T21:18:06.0381265Z * [new branch] gh/benjaminglass1/89/base -> origin/gh/benjaminglass1/89/base 2025-08-14T21:18:06.0381835Z * [new branch] gh/benjaminglass1/89/head -> origin/gh/benjaminglass1/89/head 2025-08-14T21:18:06.0382717Z * [new branch] gh/benjaminglass1/89/orig -> origin/gh/benjaminglass1/89/orig 2025-08-14T21:18:06.0383273Z * [new branch] gh/benjaminglass1/91/base -> origin/gh/benjaminglass1/91/base 2025-08-14T21:18:06.0384261Z * [new branch] gh/benjaminglass1/91/head -> origin/gh/benjaminglass1/91/head 2025-08-14T21:18:06.0384539Z * [new branch] gh/benjaminglass1/91/orig -> origin/gh/benjaminglass1/91/orig 2025-08-14T21:18:06.0385846Z * [new branch] gh/benjaminglass1/93/base -> origin/gh/benjaminglass1/93/base 2025-08-14T21:18:06.0386159Z * [new branch] gh/benjaminglass1/93/head -> origin/gh/benjaminglass1/93/head 2025-08-14T21:18:06.0388025Z * [new branch] gh/benjaminglass1/93/orig -> origin/gh/benjaminglass1/93/orig 2025-08-14T21:18:06.0388207Z * [new branch] gh/benjaminglass1/94/base -> origin/gh/benjaminglass1/94/base 2025-08-14T21:18:06.0388500Z * [new branch] gh/benjaminglass1/94/head -> origin/gh/benjaminglass1/94/head 2025-08-14T21:18:06.0388991Z * [new branch] gh/benjaminglass1/94/orig -> origin/gh/benjaminglass1/94/orig 2025-08-14T21:18:06.0390718Z * [new branch] gh/benjaminglass1/95/base -> origin/gh/benjaminglass1/95/base 2025-08-14T21:18:06.0391098Z * [new branch] gh/benjaminglass1/95/head -> origin/gh/benjaminglass1/95/head 2025-08-14T21:18:06.0391376Z * [new branch] gh/benjaminglass1/95/orig -> origin/gh/benjaminglass1/95/orig 2025-08-14T21:18:06.0391847Z * [new branch] gh/benjaminglass1/96/base -> origin/gh/benjaminglass1/96/base 2025-08-14T21:18:06.0393101Z * [new branch] gh/benjaminglass1/96/head -> origin/gh/benjaminglass1/96/head 2025-08-14T21:18:06.0393367Z * [new branch] gh/benjaminglass1/96/orig -> origin/gh/benjaminglass1/96/orig 2025-08-14T21:18:06.0395237Z * [new branch] gh/benjaminglass1/97/base -> origin/gh/benjaminglass1/97/base 2025-08-14T21:18:06.0395578Z * [new branch] gh/benjaminglass1/97/head -> origin/gh/benjaminglass1/97/head 2025-08-14T21:18:06.0395833Z * [new branch] gh/benjaminglass1/97/orig -> origin/gh/benjaminglass1/97/orig 2025-08-14T21:18:06.0396237Z * [new branch] gh/benjaminglass1/98/base -> origin/gh/benjaminglass1/98/base 2025-08-14T21:18:06.0397034Z * [new branch] gh/benjaminglass1/98/head -> origin/gh/benjaminglass1/98/head 2025-08-14T21:18:06.0397476Z * [new branch] gh/benjaminglass1/98/orig -> origin/gh/benjaminglass1/98/orig 2025-08-14T21:18:06.0401366Z * [new branch] gh/bobrenjc93/478/base -> origin/gh/bobrenjc93/478/base 2025-08-14T21:18:06.0401708Z * [new branch] gh/bobrenjc93/478/head -> origin/gh/bobrenjc93/478/head 2025-08-14T21:18:06.0401876Z * [new branch] gh/bobrenjc93/478/orig -> origin/gh/bobrenjc93/478/orig 2025-08-14T21:18:06.0402090Z * [new branch] gh/bobrenjc93/514/base -> origin/gh/bobrenjc93/514/base 2025-08-14T21:18:06.0402744Z * [new branch] gh/bobrenjc93/514/head -> origin/gh/bobrenjc93/514/head 2025-08-14T21:18:06.0402921Z * [new branch] gh/bobrenjc93/514/orig -> origin/gh/bobrenjc93/514/orig 2025-08-14T21:18:06.0403197Z * [new branch] gh/bobrenjc93/521/base -> origin/gh/bobrenjc93/521/base 2025-08-14T21:18:06.0403339Z * [new branch] gh/bobrenjc93/521/head -> origin/gh/bobrenjc93/521/head 2025-08-14T21:18:06.0404313Z * [new branch] gh/bobrenjc93/521/orig -> origin/gh/bobrenjc93/521/orig 2025-08-14T21:18:06.0404705Z * [new branch] gh/bobrenjc93/522/base -> origin/gh/bobrenjc93/522/base 2025-08-14T21:18:06.0407033Z * [new branch] gh/bobrenjc93/522/head -> origin/gh/bobrenjc93/522/head 2025-08-14T21:18:06.0407203Z * [new branch] gh/bobrenjc93/522/orig -> origin/gh/bobrenjc93/522/orig 2025-08-14T21:18:06.0407345Z * [new branch] gh/bobrenjc93/525/base -> origin/gh/bobrenjc93/525/base 2025-08-14T21:18:06.0407521Z * [new branch] gh/bobrenjc93/525/head -> origin/gh/bobrenjc93/525/head 2025-08-14T21:18:06.0408140Z * [new branch] gh/bobrenjc93/525/orig -> origin/gh/bobrenjc93/525/orig 2025-08-14T21:18:06.0411759Z * [new branch] gh/bobrenjc93/526/base -> origin/gh/bobrenjc93/526/base 2025-08-14T21:18:06.0411923Z * [new branch] gh/bobrenjc93/526/head -> origin/gh/bobrenjc93/526/head 2025-08-14T21:18:06.0412079Z * [new branch] gh/bobrenjc93/526/orig -> origin/gh/bobrenjc93/526/orig 2025-08-14T21:18:06.0412208Z * [new branch] gh/bobrenjc93/527/base -> origin/gh/bobrenjc93/527/base 2025-08-14T21:18:06.0412334Z * [new branch] gh/bobrenjc93/527/head -> origin/gh/bobrenjc93/527/head 2025-08-14T21:18:06.0412470Z * [new branch] gh/bobrenjc93/527/orig -> origin/gh/bobrenjc93/527/orig 2025-08-14T21:18:06.0412920Z * [new branch] gh/bobrenjc93/528/base -> origin/gh/bobrenjc93/528/base 2025-08-14T21:18:06.0413974Z * [new branch] gh/bobrenjc93/528/head -> origin/gh/bobrenjc93/528/head 2025-08-14T21:18:06.0414135Z * [new branch] gh/bobrenjc93/528/orig -> origin/gh/bobrenjc93/528/orig 2025-08-14T21:18:06.0416446Z * [new branch] gh/bobrenjc93/529/base -> origin/gh/bobrenjc93/529/base 2025-08-14T21:18:06.0416776Z * [new branch] gh/bobrenjc93/529/head -> origin/gh/bobrenjc93/529/head 2025-08-14T21:18:06.0417042Z * [new branch] gh/bobrenjc93/529/orig -> origin/gh/bobrenjc93/529/orig 2025-08-14T21:18:06.0417203Z * [new branch] gh/bobrenjc93/534/base -> origin/gh/bobrenjc93/534/base 2025-08-14T21:18:06.0417824Z * [new branch] gh/bobrenjc93/534/head -> origin/gh/bobrenjc93/534/head 2025-08-14T21:18:06.0419021Z * [new branch] gh/bobrenjc93/534/orig -> origin/gh/bobrenjc93/534/orig 2025-08-14T21:18:06.0419221Z * [new branch] gh/bobrenjc93/535/base -> origin/gh/bobrenjc93/535/base 2025-08-14T21:18:06.0419826Z * [new branch] gh/bobrenjc93/535/head -> origin/gh/bobrenjc93/535/head 2025-08-14T21:18:06.0420400Z * [new branch] gh/bobrenjc93/535/orig -> origin/gh/bobrenjc93/535/orig 2025-08-14T21:18:06.0421524Z * [new branch] gh/bobrenjc93/536/base -> origin/gh/bobrenjc93/536/base 2025-08-14T21:18:06.0421829Z * [new branch] gh/bobrenjc93/536/head -> origin/gh/bobrenjc93/536/head 2025-08-14T21:18:06.0422641Z * [new branch] gh/bobrenjc93/536/orig -> origin/gh/bobrenjc93/536/orig 2025-08-14T21:18:06.0423501Z * [new branch] gh/bobrenjc93/537/base -> origin/gh/bobrenjc93/537/base 2025-08-14T21:18:06.0423894Z * [new branch] gh/bobrenjc93/537/head -> origin/gh/bobrenjc93/537/head 2025-08-14T21:18:06.0424734Z * [new branch] gh/bobrenjc93/537/orig -> origin/gh/bobrenjc93/537/orig 2025-08-14T21:18:06.0427528Z * [new branch] gh/bobrenjc93/538/base -> origin/gh/bobrenjc93/538/base 2025-08-14T21:18:06.0427942Z * [new branch] gh/bobrenjc93/538/head -> origin/gh/bobrenjc93/538/head 2025-08-14T21:18:06.0428345Z * [new branch] gh/bobrenjc93/538/orig -> origin/gh/bobrenjc93/538/orig 2025-08-14T21:18:06.0428736Z * [new branch] gh/bobrenjc93/539/base -> origin/gh/bobrenjc93/539/base 2025-08-14T21:18:06.0429669Z * [new branch] gh/bobrenjc93/539/head -> origin/gh/bobrenjc93/539/head 2025-08-14T21:18:06.0430037Z * [new branch] gh/bobrenjc93/539/orig -> origin/gh/bobrenjc93/539/orig 2025-08-14T21:18:06.0430359Z * [new branch] gh/bobrenjc93/540/base -> origin/gh/bobrenjc93/540/base 2025-08-14T21:18:06.0430803Z * [new branch] gh/bobrenjc93/540/head -> origin/gh/bobrenjc93/540/head 2025-08-14T21:18:06.0431184Z * [new branch] gh/bobrenjc93/540/orig -> origin/gh/bobrenjc93/540/orig 2025-08-14T21:18:06.0435021Z * [new branch] gh/bobrenjc93/541/base -> origin/gh/bobrenjc93/541/base 2025-08-14T21:18:06.0435400Z * [new branch] gh/bobrenjc93/541/head -> origin/gh/bobrenjc93/541/head 2025-08-14T21:18:06.0435729Z * [new branch] gh/bobrenjc93/541/orig -> origin/gh/bobrenjc93/541/orig 2025-08-14T21:18:06.0436048Z * [new branch] gh/bobrenjc93/542/base -> origin/gh/bobrenjc93/542/base 2025-08-14T21:18:06.0436380Z * [new branch] gh/bobrenjc93/542/head -> origin/gh/bobrenjc93/542/head 2025-08-14T21:18:06.0436693Z * [new branch] gh/bobrenjc93/542/orig -> origin/gh/bobrenjc93/542/orig 2025-08-14T21:18:06.0436999Z * [new branch] gh/bobrenjc93/543/base -> origin/gh/bobrenjc93/543/base 2025-08-14T21:18:06.0437493Z * [new branch] gh/bobrenjc93/543/head -> origin/gh/bobrenjc93/543/head 2025-08-14T21:18:06.0437928Z * [new branch] gh/bobrenjc93/543/orig -> origin/gh/bobrenjc93/543/orig 2025-08-14T21:18:06.0438555Z * [new branch] gh/bobrenjc93/544/base -> origin/gh/bobrenjc93/544/base 2025-08-14T21:18:06.0438937Z * [new branch] gh/bobrenjc93/544/head -> origin/gh/bobrenjc93/544/head 2025-08-14T21:18:06.0439590Z * [new branch] gh/bobrenjc93/544/orig -> origin/gh/bobrenjc93/544/orig 2025-08-14T21:18:06.0440450Z * [new branch] gh/bobrenjc93/545/base -> origin/gh/bobrenjc93/545/base 2025-08-14T21:18:06.0440999Z * [new branch] gh/bobrenjc93/545/head -> origin/gh/bobrenjc93/545/head 2025-08-14T21:18:06.0443005Z * [new branch] gh/bobrenjc93/545/orig -> origin/gh/bobrenjc93/545/orig 2025-08-14T21:18:06.0443535Z * [new branch] gh/bobrenjc93/546/base -> origin/gh/bobrenjc93/546/base 2025-08-14T21:18:06.0443972Z * [new branch] gh/bobrenjc93/546/head -> origin/gh/bobrenjc93/546/head 2025-08-14T21:18:06.0444422Z * [new branch] gh/bobrenjc93/546/orig -> origin/gh/bobrenjc93/546/orig 2025-08-14T21:18:06.0445409Z * [new branch] gh/bobrenjc93/547/base -> origin/gh/bobrenjc93/547/base 2025-08-14T21:18:06.0445789Z * [new branch] gh/bobrenjc93/547/head -> origin/gh/bobrenjc93/547/head 2025-08-14T21:18:06.0446407Z * [new branch] gh/bobrenjc93/547/orig -> origin/gh/bobrenjc93/547/orig 2025-08-14T21:18:06.0447911Z * [new branch] gh/bobrenjc93/548/base -> origin/gh/bobrenjc93/548/base 2025-08-14T21:18:06.0448439Z * [new branch] gh/bobrenjc93/548/head -> origin/gh/bobrenjc93/548/head 2025-08-14T21:18:06.0448889Z * [new branch] gh/bobrenjc93/548/orig -> origin/gh/bobrenjc93/548/orig 2025-08-14T21:18:06.0449211Z * [new branch] gh/bobrenjc93/549/base -> origin/gh/bobrenjc93/549/base 2025-08-14T21:18:06.0450186Z * [new branch] gh/bobrenjc93/549/head -> origin/gh/bobrenjc93/549/head 2025-08-14T21:18:06.0450665Z * [new branch] gh/bobrenjc93/549/orig -> origin/gh/bobrenjc93/549/orig 2025-08-14T21:18:06.0452846Z * [new branch] gh/briancoutinho/2/base -> origin/gh/briancoutinho/2/base 2025-08-14T21:18:06.0453385Z * [new branch] gh/briancoutinho/2/head -> origin/gh/briancoutinho/2/head 2025-08-14T21:18:06.0453821Z * [new branch] gh/c00w/23/base -> origin/gh/c00w/23/base 2025-08-14T21:18:06.0454769Z * [new branch] gh/c00w/23/head -> origin/gh/c00w/23/head 2025-08-14T21:18:06.0455418Z * [new branch] gh/c00w/38/base -> origin/gh/c00w/38/base 2025-08-14T21:18:06.0455888Z * [new branch] gh/c00w/38/head -> origin/gh/c00w/38/head 2025-08-14T21:18:06.0456478Z * [new branch] gh/c00w/38/orig -> origin/gh/c00w/38/orig 2025-08-14T21:18:06.0458190Z * [new branch] gh/c00w/48/base -> origin/gh/c00w/48/base 2025-08-14T21:18:06.0458592Z * [new branch] gh/c00w/48/head -> origin/gh/c00w/48/head 2025-08-14T21:18:06.0458953Z * [new branch] gh/c00w/48/orig -> origin/gh/c00w/48/orig 2025-08-14T21:18:06.0460213Z * [new branch] gh/c00w/50/base -> origin/gh/c00w/50/base 2025-08-14T21:18:06.0460544Z * [new branch] gh/c00w/50/head -> origin/gh/c00w/50/head 2025-08-14T21:18:06.0461486Z * [new branch] gh/c00w/50/orig -> origin/gh/c00w/50/orig 2025-08-14T21:18:06.0463051Z * [new branch] gh/c00w/51/base -> origin/gh/c00w/51/base 2025-08-14T21:18:06.0463538Z * [new branch] gh/c00w/51/head -> origin/gh/c00w/51/head 2025-08-14T21:18:06.0464513Z * [new branch] gh/c00w/51/orig -> origin/gh/c00w/51/orig 2025-08-14T21:18:06.0465365Z * [new branch] gh/c00w/52/base -> origin/gh/c00w/52/base 2025-08-14T21:18:06.0465985Z * [new branch] gh/c00w/52/head -> origin/gh/c00w/52/head 2025-08-14T21:18:06.0466582Z * [new branch] gh/c00w/52/orig -> origin/gh/c00w/52/orig 2025-08-14T21:18:06.0470230Z * [new branch] gh/c00w/53/base -> origin/gh/c00w/53/base 2025-08-14T21:18:06.0470659Z * [new branch] gh/c00w/53/head -> origin/gh/c00w/53/head 2025-08-14T21:18:06.0470951Z * [new branch] gh/c00w/53/orig -> origin/gh/c00w/53/orig 2025-08-14T21:18:06.0471227Z * [new branch] gh/c00w/54/base -> origin/gh/c00w/54/base 2025-08-14T21:18:06.0471497Z * [new branch] gh/c00w/54/head -> origin/gh/c00w/54/head 2025-08-14T21:18:06.0471765Z * [new branch] gh/c00w/54/orig -> origin/gh/c00w/54/orig 2025-08-14T21:18:06.0472229Z * [new branch] gh/chenmillie/1/base -> origin/gh/chenmillie/1/base 2025-08-14T21:18:06.0472675Z * [new branch] gh/chenmillie/1/head -> origin/gh/chenmillie/1/head 2025-08-14T21:18:06.0473006Z * [new branch] gh/chenmillie/1/orig -> origin/gh/chenmillie/1/orig 2025-08-14T21:18:06.0476802Z * [new branch] gh/clee2000/1/base -> origin/gh/clee2000/1/base 2025-08-14T21:18:06.0477303Z * [new branch] gh/clee2000/1/head -> origin/gh/clee2000/1/head 2025-08-14T21:18:06.0477749Z * [new branch] gh/clee2000/1/orig -> origin/gh/clee2000/1/orig 2025-08-14T21:18:06.0478239Z * [new branch] gh/coconutruben/1/base -> origin/gh/coconutruben/1/base 2025-08-14T21:18:06.0478620Z * [new branch] gh/coconutruben/1/head -> origin/gh/coconutruben/1/head 2025-08-14T21:18:06.0478943Z * [new branch] gh/coconutruben/11/base -> origin/gh/coconutruben/11/base 2025-08-14T21:18:06.0479270Z * [new branch] gh/coconutruben/11/head -> origin/gh/coconutruben/11/head 2025-08-14T21:18:06.0479736Z * [new branch] gh/coconutruben/11/orig -> origin/gh/coconutruben/11/orig 2025-08-14T21:18:06.0481920Z * [new branch] gh/coconutruben/12/base -> origin/gh/coconutruben/12/base 2025-08-14T21:18:06.0482451Z * [new branch] gh/coconutruben/12/head -> origin/gh/coconutruben/12/head 2025-08-14T21:18:06.0483453Z * [new branch] gh/coconutruben/12/orig -> origin/gh/coconutruben/12/orig 2025-08-14T21:18:06.0483927Z * [new branch] gh/coconutruben/13/base -> origin/gh/coconutruben/13/base 2025-08-14T21:18:06.0484345Z * [new branch] gh/coconutruben/13/head -> origin/gh/coconutruben/13/head 2025-08-14T21:18:06.0485027Z * [new branch] gh/coconutruben/13/orig -> origin/gh/coconutruben/13/orig 2025-08-14T21:18:06.0489635Z * [new branch] gh/coconutruben/14/base -> origin/gh/coconutruben/14/base 2025-08-14T21:18:06.0493825Z * [new branch] gh/coconutruben/14/head -> origin/gh/coconutruben/14/head 2025-08-14T21:18:06.0498041Z * [new branch] gh/coconutruben/14/orig -> origin/gh/coconutruben/14/orig 2025-08-14T21:18:06.0501473Z * [new branch] gh/coconutruben/15/base -> origin/gh/coconutruben/15/base 2025-08-14T21:18:06.0503432Z * [new branch] gh/coconutruben/15/head -> origin/gh/coconutruben/15/head 2025-08-14T21:18:06.0503975Z * [new branch] gh/coconutruben/15/orig -> origin/gh/coconutruben/15/orig 2025-08-14T21:18:06.0504363Z * [new branch] gh/coconutruben/16/base -> origin/gh/coconutruben/16/base 2025-08-14T21:18:06.0504688Z * [new branch] gh/coconutruben/16/head -> origin/gh/coconutruben/16/head 2025-08-14T21:18:06.0504994Z * [new branch] gh/coconutruben/16/orig -> origin/gh/coconutruben/16/orig 2025-08-14T21:18:06.0505308Z * [new branch] gh/coconutruben/17/base -> origin/gh/coconutruben/17/base 2025-08-14T21:18:06.0505791Z * [new branch] gh/coconutruben/17/head -> origin/gh/coconutruben/17/head 2025-08-14T21:18:06.0506116Z * [new branch] gh/coconutruben/17/orig -> origin/gh/coconutruben/17/orig 2025-08-14T21:18:06.0506425Z * [new branch] gh/coconutruben/18/base -> origin/gh/coconutruben/18/base 2025-08-14T21:18:06.0506746Z * [new branch] gh/coconutruben/18/head -> origin/gh/coconutruben/18/head 2025-08-14T21:18:06.0507060Z * [new branch] gh/coconutruben/18/orig -> origin/gh/coconutruben/18/orig 2025-08-14T21:18:06.0507365Z * [new branch] gh/coconutruben/19/base -> origin/gh/coconutruben/19/base 2025-08-14T21:18:06.0507678Z * [new branch] gh/coconutruben/19/head -> origin/gh/coconutruben/19/head 2025-08-14T21:18:06.0507989Z * [new branch] gh/coconutruben/19/orig -> origin/gh/coconutruben/19/orig 2025-08-14T21:18:06.0508302Z * [new branch] gh/coconutruben/20/base -> origin/gh/coconutruben/20/base 2025-08-14T21:18:06.0508610Z * [new branch] gh/coconutruben/20/head -> origin/gh/coconutruben/20/head 2025-08-14T21:18:06.0508919Z * [new branch] gh/coconutruben/20/orig -> origin/gh/coconutruben/20/orig 2025-08-14T21:18:06.0509237Z * [new branch] gh/coconutruben/21/base -> origin/gh/coconutruben/21/base 2025-08-14T21:18:06.0509551Z * [new branch] gh/coconutruben/21/head -> origin/gh/coconutruben/21/head 2025-08-14T21:18:06.0509855Z * [new branch] gh/coconutruben/21/orig -> origin/gh/coconutruben/21/orig 2025-08-14T21:18:06.0510166Z * [new branch] gh/coconutruben/22/base -> origin/gh/coconutruben/22/base 2025-08-14T21:18:06.0510477Z * [new branch] gh/coconutruben/22/head -> origin/gh/coconutruben/22/head 2025-08-14T21:18:06.0510784Z * [new branch] gh/coconutruben/22/orig -> origin/gh/coconutruben/22/orig 2025-08-14T21:18:06.0511094Z * [new branch] gh/coconutruben/23/base -> origin/gh/coconutruben/23/base 2025-08-14T21:18:06.0511408Z * [new branch] gh/coconutruben/23/head -> origin/gh/coconutruben/23/head 2025-08-14T21:18:06.0512148Z * [new branch] gh/coconutruben/23/orig -> origin/gh/coconutruben/23/orig 2025-08-14T21:18:06.0512464Z * [new branch] gh/coconutruben/24/base -> origin/gh/coconutruben/24/base 2025-08-14T21:18:06.0512929Z * [new branch] gh/coconutruben/24/head -> origin/gh/coconutruben/24/head 2025-08-14T21:18:06.0517218Z * [new branch] gh/coconutruben/24/orig -> origin/gh/coconutruben/24/orig 2025-08-14T21:18:06.0520997Z * [new branch] gh/coconutruben/25/base -> origin/gh/coconutruben/25/base 2025-08-14T21:18:06.0524720Z * [new branch] gh/coconutruben/25/head -> origin/gh/coconutruben/25/head 2025-08-14T21:18:06.0528902Z * [new branch] gh/coconutruben/25/orig -> origin/gh/coconutruben/25/orig 2025-08-14T21:18:06.0533260Z * [new branch] gh/coconutruben/26/base -> origin/gh/coconutruben/26/base 2025-08-14T21:18:06.0533763Z * [new branch] gh/coconutruben/26/head -> origin/gh/coconutruben/26/head 2025-08-14T21:18:06.0534092Z * [new branch] gh/coconutruben/26/orig -> origin/gh/coconutruben/26/orig 2025-08-14T21:18:06.0534428Z * [new branch] gh/coconutruben/27/base -> origin/gh/coconutruben/27/base 2025-08-14T21:18:06.0534743Z * [new branch] gh/coconutruben/27/head -> origin/gh/coconutruben/27/head 2025-08-14T21:18:06.0535058Z * [new branch] gh/coconutruben/27/orig -> origin/gh/coconutruben/27/orig 2025-08-14T21:18:06.0535387Z * [new branch] gh/codingwithsurya/10/base -> origin/gh/codingwithsurya/10/base 2025-08-14T21:18:06.0535732Z * [new branch] gh/codingwithsurya/10/head -> origin/gh/codingwithsurya/10/head 2025-08-14T21:18:06.0536198Z * [new branch] gh/codingwithsurya/10/orig -> origin/gh/codingwithsurya/10/orig 2025-08-14T21:18:06.0536539Z * [new branch] gh/codingwithsurya/11/base -> origin/gh/codingwithsurya/11/base 2025-08-14T21:18:06.0536867Z * [new branch] gh/codingwithsurya/11/head -> origin/gh/codingwithsurya/11/head 2025-08-14T21:18:06.0537205Z * [new branch] gh/codingwithsurya/11/orig -> origin/gh/codingwithsurya/11/orig 2025-08-14T21:18:06.0537541Z * [new branch] gh/codingwithsurya/12/base -> origin/gh/codingwithsurya/12/base 2025-08-14T21:18:06.0537873Z * [new branch] gh/codingwithsurya/12/head -> origin/gh/codingwithsurya/12/head 2025-08-14T21:18:06.0538200Z * [new branch] gh/codingwithsurya/12/orig -> origin/gh/codingwithsurya/12/orig 2025-08-14T21:18:06.0538536Z * [new branch] gh/codingwithsurya/13/base -> origin/gh/codingwithsurya/13/base 2025-08-14T21:18:06.0538874Z * [new branch] gh/codingwithsurya/13/head -> origin/gh/codingwithsurya/13/head 2025-08-14T21:18:06.0539205Z * [new branch] gh/codingwithsurya/13/orig -> origin/gh/codingwithsurya/13/orig 2025-08-14T21:18:06.0539529Z * [new branch] gh/codingwithsurya/14/base -> origin/gh/codingwithsurya/14/base 2025-08-14T21:18:06.0539860Z * [new branch] gh/codingwithsurya/14/head -> origin/gh/codingwithsurya/14/head 2025-08-14T21:18:06.0540194Z * [new branch] gh/codingwithsurya/14/orig -> origin/gh/codingwithsurya/14/orig 2025-08-14T21:18:06.0540518Z * [new branch] gh/codingwithsurya/15/base -> origin/gh/codingwithsurya/15/base 2025-08-14T21:18:06.0540848Z * [new branch] gh/codingwithsurya/15/head -> origin/gh/codingwithsurya/15/head 2025-08-14T21:18:06.0541177Z * [new branch] gh/codingwithsurya/15/orig -> origin/gh/codingwithsurya/15/orig 2025-08-14T21:18:06.0541521Z * [new branch] gh/codingwithsurya/16/base -> origin/gh/codingwithsurya/16/base 2025-08-14T21:18:06.0541846Z * [new branch] gh/codingwithsurya/16/head -> origin/gh/codingwithsurya/16/head 2025-08-14T21:18:06.0542178Z * [new branch] gh/codingwithsurya/16/orig -> origin/gh/codingwithsurya/16/orig 2025-08-14T21:18:06.0542509Z * [new branch] gh/codingwithsurya/17/base -> origin/gh/codingwithsurya/17/base 2025-08-14T21:18:06.0542878Z * [new branch] gh/codingwithsurya/17/head -> origin/gh/codingwithsurya/17/head 2025-08-14T21:18:06.0543198Z * [new branch] gh/codingwithsurya/17/orig -> origin/gh/codingwithsurya/17/orig 2025-08-14T21:18:06.0543522Z * [new branch] gh/codingwithsurya/18/base -> origin/gh/codingwithsurya/18/base 2025-08-14T21:18:06.0543848Z * [new branch] gh/codingwithsurya/18/head -> origin/gh/codingwithsurya/18/head 2025-08-14T21:18:06.0544170Z * [new branch] gh/codingwithsurya/18/orig -> origin/gh/codingwithsurya/18/orig 2025-08-14T21:18:06.0544620Z * [new branch] gh/codingwithsurya/19/base -> origin/gh/codingwithsurya/19/base 2025-08-14T21:18:06.0544954Z * [new branch] gh/codingwithsurya/19/head -> origin/gh/codingwithsurya/19/head 2025-08-14T21:18:06.0548902Z * [new branch] gh/codingwithsurya/19/orig -> origin/gh/codingwithsurya/19/orig 2025-08-14T21:18:06.0552919Z * [new branch] gh/codingwithsurya/20/base -> origin/gh/codingwithsurya/20/base 2025-08-14T21:18:06.0556784Z * [new branch] gh/codingwithsurya/20/head -> origin/gh/codingwithsurya/20/head 2025-08-14T21:18:06.0560985Z * [new branch] gh/codingwithsurya/20/orig -> origin/gh/codingwithsurya/20/orig 2025-08-14T21:18:06.0565064Z * [new branch] gh/codingwithsurya/21/base -> origin/gh/codingwithsurya/21/base 2025-08-14T21:18:06.0566671Z * [new branch] gh/codingwithsurya/21/head -> origin/gh/codingwithsurya/21/head 2025-08-14T21:18:06.0567187Z * [new branch] gh/codingwithsurya/21/orig -> origin/gh/codingwithsurya/21/orig 2025-08-14T21:18:06.0567547Z * [new branch] gh/codingwithsurya/8/base -> origin/gh/codingwithsurya/8/base 2025-08-14T21:18:06.0567884Z * [new branch] gh/codingwithsurya/8/head -> origin/gh/codingwithsurya/8/head 2025-08-14T21:18:06.0568221Z * [new branch] gh/codingwithsurya/8/orig -> origin/gh/codingwithsurya/8/orig 2025-08-14T21:18:06.0568570Z * [new branch] gh/codingwithsurya/9/base -> origin/gh/codingwithsurya/9/base 2025-08-14T21:18:06.0568900Z * [new branch] gh/codingwithsurya/9/head -> origin/gh/codingwithsurya/9/head 2025-08-14T21:18:06.0569215Z * [new branch] gh/codingwithsurya/9/orig -> origin/gh/codingwithsurya/9/orig 2025-08-14T21:18:06.0569539Z * [new branch] gh/colinchan15/1/base -> origin/gh/colinchan15/1/base 2025-08-14T21:18:06.0569846Z * [new branch] gh/colinchan15/1/head -> origin/gh/colinchan15/1/head 2025-08-14T21:18:06.0570158Z * [new branch] gh/colinchan15/2/base -> origin/gh/colinchan15/2/base 2025-08-14T21:18:06.0570455Z * [new branch] gh/colinchan15/2/head -> origin/gh/colinchan15/2/head 2025-08-14T21:18:06.0570758Z * [new branch] gh/colinchan15/3/base -> origin/gh/colinchan15/3/base 2025-08-14T21:18:06.0571066Z * [new branch] gh/colinchan15/3/head -> origin/gh/colinchan15/3/head 2025-08-14T21:18:06.0571360Z * [new branch] gh/colinchan15/4/base -> origin/gh/colinchan15/4/base 2025-08-14T21:18:06.0571662Z * [new branch] gh/colinchan15/4/head -> origin/gh/colinchan15/4/head 2025-08-14T21:18:06.0571960Z * [new branch] gh/colinchan15/5/base -> origin/gh/colinchan15/5/base 2025-08-14T21:18:06.0572258Z * [new branch] gh/colinchan15/5/head -> origin/gh/colinchan15/5/head 2025-08-14T21:18:06.0572556Z * [new branch] gh/colinchan15/6/base -> origin/gh/colinchan15/6/base 2025-08-14T21:18:06.0572859Z * [new branch] gh/colinchan15/6/head -> origin/gh/colinchan15/6/head 2025-08-14T21:18:06.0573177Z * [new branch] gh/davidberard98/351/base -> origin/gh/davidberard98/351/base 2025-08-14T21:18:06.0573500Z * [new branch] gh/davidberard98/351/head -> origin/gh/davidberard98/351/head 2025-08-14T21:18:06.0573887Z * [new branch] gh/davidberard98/351/orig -> origin/gh/davidberard98/351/orig 2025-08-14T21:18:06.0574215Z * [new branch] gh/davidberard98/353/base -> origin/gh/davidberard98/353/base 2025-08-14T21:18:06.0574537Z * [new branch] gh/davidberard98/353/head -> origin/gh/davidberard98/353/head 2025-08-14T21:18:06.0574859Z * [new branch] gh/davidberard98/353/orig -> origin/gh/davidberard98/353/orig 2025-08-14T21:18:06.0575174Z * [new branch] gh/davidberard98/356/base -> origin/gh/davidberard98/356/base 2025-08-14T21:18:06.0575504Z * [new branch] gh/davidberard98/356/head -> origin/gh/davidberard98/356/head 2025-08-14T21:18:06.0575833Z * [new branch] gh/davidberard98/356/orig -> origin/gh/davidberard98/356/orig 2025-08-14T21:18:06.0576149Z * [new branch] gh/davidberard98/382/base -> origin/gh/davidberard98/382/base 2025-08-14T21:18:06.0576486Z * [new branch] gh/davidberard98/382/head -> origin/gh/davidberard98/382/head 2025-08-14T21:18:06.0576806Z * [new branch] gh/davidberard98/382/orig -> origin/gh/davidberard98/382/orig 2025-08-14T21:18:06.0577124Z * [new branch] gh/davidberard98/386/base -> origin/gh/davidberard98/386/base 2025-08-14T21:18:06.0577432Z * [new branch] gh/davidberard98/386/head -> origin/gh/davidberard98/386/head 2025-08-14T21:18:06.0577751Z * [new branch] gh/davidberard98/386/orig -> origin/gh/davidberard98/386/orig 2025-08-14T21:18:06.0578104Z * [new branch] gh/davidberard98/389/base -> origin/gh/davidberard98/389/base 2025-08-14T21:18:06.0578426Z * [new branch] gh/davidberard98/389/head -> origin/gh/davidberard98/389/head 2025-08-14T21:18:06.0578736Z * [new branch] gh/davidberard98/389/orig -> origin/gh/davidberard98/389/orig 2025-08-14T21:18:06.0579054Z * [new branch] gh/davidberard98/390/base -> origin/gh/davidberard98/390/base 2025-08-14T21:18:06.0579374Z * [new branch] gh/davidberard98/390/head -> origin/gh/davidberard98/390/head 2025-08-14T21:18:06.0579692Z * [new branch] gh/davidberard98/390/orig -> origin/gh/davidberard98/390/orig 2025-08-14T21:18:06.0580003Z * [new branch] gh/davidberard98/391/base -> origin/gh/davidberard98/391/base 2025-08-14T21:18:06.0580337Z * [new branch] gh/davidberard98/391/head -> origin/gh/davidberard98/391/head 2025-08-14T21:18:06.0580660Z * [new branch] gh/davidberard98/391/orig -> origin/gh/davidberard98/391/orig 2025-08-14T21:18:06.0580981Z * [new branch] gh/davidberard98/392/base -> origin/gh/davidberard98/392/base 2025-08-14T21:18:06.0581291Z * [new branch] gh/davidberard98/392/head -> origin/gh/davidberard98/392/head 2025-08-14T21:18:06.0581606Z * [new branch] gh/davidberard98/392/orig -> origin/gh/davidberard98/392/orig 2025-08-14T21:18:06.0581929Z * [new branch] gh/davidberard98/393/base -> origin/gh/davidberard98/393/base 2025-08-14T21:18:06.0582243Z * [new branch] gh/davidberard98/393/head -> origin/gh/davidberard98/393/head 2025-08-14T21:18:06.0582561Z * [new branch] gh/davidberard98/393/orig -> origin/gh/davidberard98/393/orig 2025-08-14T21:18:06.0582882Z * [new branch] gh/davidberard98/394/base -> origin/gh/davidberard98/394/base 2025-08-14T21:18:06.0583200Z * [new branch] gh/davidberard98/394/head -> origin/gh/davidberard98/394/head 2025-08-14T21:18:06.0583527Z * [new branch] gh/davidberard98/394/orig -> origin/gh/davidberard98/394/orig 2025-08-14T21:18:06.0584429Z * [new branch] gh/davidberard98/395/base -> origin/gh/davidberard98/395/base 2025-08-14T21:18:06.0585122Z * [new branch] gh/davidberard98/395/head -> origin/gh/davidberard98/395/head 2025-08-14T21:18:06.0585887Z * [new branch] gh/davidberard98/395/orig -> origin/gh/davidberard98/395/orig 2025-08-14T21:18:06.0587021Z * [new branch] gh/davidberard98/396/base -> origin/gh/davidberard98/396/base 2025-08-14T21:18:06.0587350Z * [new branch] gh/davidberard98/396/head -> origin/gh/davidberard98/396/head 2025-08-14T21:18:06.0587972Z * [new branch] gh/davidberard98/396/orig -> origin/gh/davidberard98/396/orig 2025-08-14T21:18:06.0589610Z * [new branch] gh/davidberard98/397/base -> origin/gh/davidberard98/397/base 2025-08-14T21:18:06.0589973Z * [new branch] gh/davidberard98/397/head -> origin/gh/davidberard98/397/head 2025-08-14T21:18:06.0590443Z * [new branch] gh/davidberard98/397/orig -> origin/gh/davidberard98/397/orig 2025-08-14T21:18:06.0591011Z * [new branch] gh/davidberard98/398/base -> origin/gh/davidberard98/398/base 2025-08-14T21:18:06.0591546Z * [new branch] gh/davidberard98/398/head -> origin/gh/davidberard98/398/head 2025-08-14T21:18:06.0592109Z * [new branch] gh/davidberard98/398/orig -> origin/gh/davidberard98/398/orig 2025-08-14T21:18:06.0593415Z * [new branch] gh/desertfire/570/base -> origin/gh/desertfire/570/base 2025-08-14T21:18:06.0593978Z * [new branch] gh/desertfire/570/head -> origin/gh/desertfire/570/head 2025-08-14T21:18:06.0595649Z * [new branch] gh/desertfire/570/orig -> origin/gh/desertfire/570/orig 2025-08-14T21:18:06.0596028Z * [new branch] gh/desertfire/572/base -> origin/gh/desertfire/572/base 2025-08-14T21:18:06.0596774Z * [new branch] gh/desertfire/572/head -> origin/gh/desertfire/572/head 2025-08-14T21:18:06.0597121Z * [new branch] gh/desertfire/572/orig -> origin/gh/desertfire/572/orig 2025-08-14T21:18:06.0597857Z * [new branch] gh/desertfire/589/base -> origin/gh/desertfire/589/base 2025-08-14T21:18:06.0598423Z * [new branch] gh/desertfire/589/head -> origin/gh/desertfire/589/head 2025-08-14T21:18:06.0599099Z * [new branch] gh/desertfire/589/orig -> origin/gh/desertfire/589/orig 2025-08-14T21:18:06.0600669Z * [new branch] gh/desertfire/590/base -> origin/gh/desertfire/590/base 2025-08-14T21:18:06.0601195Z * [new branch] gh/desertfire/590/head -> origin/gh/desertfire/590/head 2025-08-14T21:18:06.0601647Z * [new branch] gh/desertfire/590/orig -> origin/gh/desertfire/590/orig 2025-08-14T21:18:06.0602138Z * [new branch] gh/desertfire/591/base -> origin/gh/desertfire/591/base 2025-08-14T21:18:06.0602783Z * [new branch] gh/desertfire/591/head -> origin/gh/desertfire/591/head 2025-08-14T21:18:06.0603497Z * [new branch] gh/desertfire/591/orig -> origin/gh/desertfire/591/orig 2025-08-14T21:18:06.0607206Z * [new branch] gh/desertfire/592/base -> origin/gh/desertfire/592/base 2025-08-14T21:18:06.0607597Z * [new branch] gh/desertfire/592/head -> origin/gh/desertfire/592/head 2025-08-14T21:18:06.0607916Z * [new branch] gh/desertfire/592/orig -> origin/gh/desertfire/592/orig 2025-08-14T21:18:06.0608224Z * [new branch] gh/desertfire/593/base -> origin/gh/desertfire/593/base 2025-08-14T21:18:06.0608539Z * [new branch] gh/desertfire/593/head -> origin/gh/desertfire/593/head 2025-08-14T21:18:06.0608856Z * [new branch] gh/desertfire/593/orig -> origin/gh/desertfire/593/orig 2025-08-14T21:18:06.0609165Z * [new branch] gh/desertfire/594/base -> origin/gh/desertfire/594/base 2025-08-14T21:18:06.0609666Z * [new branch] gh/desertfire/594/head -> origin/gh/desertfire/594/head 2025-08-14T21:18:06.0610003Z * [new branch] gh/desertfire/594/orig -> origin/gh/desertfire/594/orig 2025-08-14T21:18:06.0611699Z * [new branch] gh/desertfire/595/base -> origin/gh/desertfire/595/base 2025-08-14T21:18:06.0612242Z * [new branch] gh/desertfire/595/head -> origin/gh/desertfire/595/head 2025-08-14T21:18:06.0612564Z * [new branch] gh/desertfire/595/orig -> origin/gh/desertfire/595/orig 2025-08-14T21:18:06.0613068Z * [new branch] gh/desertfire/596/base -> origin/gh/desertfire/596/base 2025-08-14T21:18:06.0613581Z * [new branch] gh/desertfire/596/head -> origin/gh/desertfire/596/head 2025-08-14T21:18:06.0614230Z * [new branch] gh/desertfire/596/orig -> origin/gh/desertfire/596/orig 2025-08-14T21:18:06.0615184Z * [new branch] gh/desertfire/597/base -> origin/gh/desertfire/597/base 2025-08-14T21:18:06.0615669Z * [new branch] gh/desertfire/597/head -> origin/gh/desertfire/597/head 2025-08-14T21:18:06.0616331Z * [new branch] gh/desertfire/597/orig -> origin/gh/desertfire/597/orig 2025-08-14T21:18:06.0617679Z * [new branch] gh/dharakk/1/base -> origin/gh/dharakk/1/base 2025-08-14T21:18:06.0618375Z * [new branch] gh/dharakk/1/head -> origin/gh/dharakk/1/head 2025-08-14T21:18:06.0619016Z * [new branch] gh/dharakk/4/base -> origin/gh/dharakk/4/base 2025-08-14T21:18:06.0619662Z * [new branch] gh/dharakk/4/head -> origin/gh/dharakk/4/head 2025-08-14T21:18:06.0620299Z * [new branch] gh/dharakk/4/orig -> origin/gh/dharakk/4/orig 2025-08-14T21:18:06.0621676Z * [new branch] gh/drisspg/140/base -> origin/gh/drisspg/140/base 2025-08-14T21:18:06.0622006Z * [new branch] gh/drisspg/140/head -> origin/gh/drisspg/140/head 2025-08-14T21:18:06.0622496Z * [new branch] gh/drisspg/140/orig -> origin/gh/drisspg/140/orig 2025-08-14T21:18:06.0623543Z * [new branch] gh/drisspg/149/base -> origin/gh/drisspg/149/base 2025-08-14T21:18:06.0623977Z * [new branch] gh/drisspg/149/head -> origin/gh/drisspg/149/head 2025-08-14T21:18:06.0624645Z * [new branch] gh/drisspg/149/orig -> origin/gh/drisspg/149/orig 2025-08-14T21:18:06.0627157Z * [new branch] gh/drisspg/150/base -> origin/gh/drisspg/150/base 2025-08-14T21:18:06.0627457Z * [new branch] gh/drisspg/150/head -> origin/gh/drisspg/150/head 2025-08-14T21:18:06.0627748Z * [new branch] gh/drisspg/150/orig -> origin/gh/drisspg/150/orig 2025-08-14T21:18:06.0628041Z * [new branch] gh/drisspg/151/base -> origin/gh/drisspg/151/base 2025-08-14T21:18:06.0630101Z * [new branch] gh/drisspg/151/head -> origin/gh/drisspg/151/head 2025-08-14T21:18:06.0630491Z * [new branch] gh/drisspg/151/orig -> origin/gh/drisspg/151/orig 2025-08-14T21:18:06.0630804Z * [new branch] gh/drisspg/158/base -> origin/gh/drisspg/158/base 2025-08-14T21:18:06.0631157Z * [new branch] gh/drisspg/158/head -> origin/gh/drisspg/158/head 2025-08-14T21:18:06.0633473Z * [new branch] gh/drisspg/158/orig -> origin/gh/drisspg/158/orig 2025-08-14T21:18:06.0633851Z * [new branch] gh/drisspg/159/base -> origin/gh/drisspg/159/base 2025-08-14T21:18:06.0634169Z * [new branch] gh/drisspg/159/head -> origin/gh/drisspg/159/head 2025-08-14T21:18:06.0634517Z * [new branch] gh/drisspg/159/orig -> origin/gh/drisspg/159/orig 2025-08-14T21:18:06.0639094Z * [new branch] gh/drisspg/166/base -> origin/gh/drisspg/166/base 2025-08-14T21:18:06.0642820Z * [new branch] gh/drisspg/166/head -> origin/gh/drisspg/166/head 2025-08-14T21:18:06.0646494Z * [new branch] gh/drisspg/166/orig -> origin/gh/drisspg/166/orig 2025-08-14T21:18:06.0650179Z * [new branch] gh/drisspg/168/base -> origin/gh/drisspg/168/base 2025-08-14T21:18:06.0651710Z * [new branch] gh/drisspg/168/head -> origin/gh/drisspg/168/head 2025-08-14T21:18:06.0652161Z * [new branch] gh/drisspg/168/orig -> origin/gh/drisspg/168/orig 2025-08-14T21:18:06.0652452Z * [new branch] gh/drisspg/169/base -> origin/gh/drisspg/169/base 2025-08-14T21:18:06.0652746Z * [new branch] gh/drisspg/169/head -> origin/gh/drisspg/169/head 2025-08-14T21:18:06.0653035Z * [new branch] gh/drisspg/169/orig -> origin/gh/drisspg/169/orig 2025-08-14T21:18:06.0653334Z * [new branch] gh/drisspg/170/base -> origin/gh/drisspg/170/base 2025-08-14T21:18:06.0653614Z * [new branch] gh/drisspg/170/head -> origin/gh/drisspg/170/head 2025-08-14T21:18:06.0653901Z * [new branch] gh/drisspg/170/orig -> origin/gh/drisspg/170/orig 2025-08-14T21:18:06.0654185Z * [new branch] gh/drisspg/171/base -> origin/gh/drisspg/171/base 2025-08-14T21:18:06.0654468Z * [new branch] gh/drisspg/171/head -> origin/gh/drisspg/171/head 2025-08-14T21:18:06.0654759Z * [new branch] gh/drisspg/171/orig -> origin/gh/drisspg/171/orig 2025-08-14T21:18:06.0655044Z * [new branch] gh/drisspg/172/base -> origin/gh/drisspg/172/base 2025-08-14T21:18:06.0655330Z * [new branch] gh/drisspg/172/head -> origin/gh/drisspg/172/head 2025-08-14T21:18:06.0655608Z * [new branch] gh/drisspg/172/orig -> origin/gh/drisspg/172/orig 2025-08-14T21:18:06.0655979Z * [new branch] gh/drisspg/173/base -> origin/gh/drisspg/173/base 2025-08-14T21:18:06.0656277Z * [new branch] gh/drisspg/173/head -> origin/gh/drisspg/173/head 2025-08-14T21:18:06.0656568Z * [new branch] gh/drisspg/173/orig -> origin/gh/drisspg/173/orig 2025-08-14T21:18:06.0656851Z * [new branch] gh/drisspg/174/base -> origin/gh/drisspg/174/base 2025-08-14T21:18:06.0657146Z * [new branch] gh/drisspg/174/head -> origin/gh/drisspg/174/head 2025-08-14T21:18:06.0657437Z * [new branch] gh/drisspg/174/orig -> origin/gh/drisspg/174/orig 2025-08-14T21:18:06.0657726Z * [new branch] gh/drisspg/175/base -> origin/gh/drisspg/175/base 2025-08-14T21:18:06.0658008Z * [new branch] gh/drisspg/175/head -> origin/gh/drisspg/175/head 2025-08-14T21:18:06.0658299Z * [new branch] gh/drisspg/175/orig -> origin/gh/drisspg/175/orig 2025-08-14T21:18:06.0658598Z * [new branch] gh/drisspg/176/base -> origin/gh/drisspg/176/base 2025-08-14T21:18:06.0658886Z * [new branch] gh/drisspg/176/head -> origin/gh/drisspg/176/head 2025-08-14T21:18:06.0659167Z * [new branch] gh/drisspg/176/orig -> origin/gh/drisspg/176/orig 2025-08-14T21:18:06.0659460Z * [new branch] gh/drisspg/177/base -> origin/gh/drisspg/177/base 2025-08-14T21:18:06.0659748Z * [new branch] gh/drisspg/177/head -> origin/gh/drisspg/177/head 2025-08-14T21:18:06.0660027Z * [new branch] gh/drisspg/177/orig -> origin/gh/drisspg/177/orig 2025-08-14T21:18:06.0660315Z * [new branch] gh/drisspg/178/base -> origin/gh/drisspg/178/base 2025-08-14T21:18:06.0660603Z * [new branch] gh/drisspg/178/head -> origin/gh/drisspg/178/head 2025-08-14T21:18:06.0660895Z * [new branch] gh/drisspg/178/orig -> origin/gh/drisspg/178/orig 2025-08-14T21:18:06.0661179Z * [new branch] gh/drisspg/179/base -> origin/gh/drisspg/179/base 2025-08-14T21:18:06.0661476Z * [new branch] gh/drisspg/179/head -> origin/gh/drisspg/179/head 2025-08-14T21:18:06.0661766Z * [new branch] gh/drisspg/179/orig -> origin/gh/drisspg/179/orig 2025-08-14T21:18:06.0662094Z * [new branch] gh/drisspg/180/base -> origin/gh/drisspg/180/base 2025-08-14T21:18:06.0662378Z * [new branch] gh/drisspg/180/head -> origin/gh/drisspg/180/head 2025-08-14T21:18:06.0662673Z * [new branch] gh/drisspg/180/orig -> origin/gh/drisspg/180/orig 2025-08-14T21:18:06.0662963Z * [new branch] gh/drisspg/181/base -> origin/gh/drisspg/181/base 2025-08-14T21:18:06.0663256Z * [new branch] gh/drisspg/181/head -> origin/gh/drisspg/181/head 2025-08-14T21:18:06.0663972Z * [new branch] gh/drisspg/181/orig -> origin/gh/drisspg/181/orig 2025-08-14T21:18:06.0665015Z * [new branch] gh/drisspg/182/base -> origin/gh/drisspg/182/base 2025-08-14T21:18:06.0665626Z * [new branch] gh/drisspg/182/head -> origin/gh/drisspg/182/head 2025-08-14T21:18:06.0666192Z * [new branch] gh/drisspg/183/base -> origin/gh/drisspg/183/base 2025-08-14T21:18:06.0666801Z * [new branch] gh/drisspg/183/head -> origin/gh/drisspg/183/head 2025-08-14T21:18:06.0667470Z * [new branch] gh/drisspg/184/base -> origin/gh/drisspg/184/base 2025-08-14T21:18:06.0668000Z * [new branch] gh/drisspg/184/head -> origin/gh/drisspg/184/head 2025-08-14T21:18:06.0669126Z * [new branch] gh/drisspg/185/base -> origin/gh/drisspg/185/base 2025-08-14T21:18:06.0669539Z * [new branch] gh/drisspg/185/head -> origin/gh/drisspg/185/head 2025-08-14T21:18:06.0670931Z * [new branch] gh/dsjohns2/1/base -> origin/gh/dsjohns2/1/base 2025-08-14T21:18:06.0671254Z * [new branch] gh/dsjohns2/1/head -> origin/gh/dsjohns2/1/head 2025-08-14T21:18:06.0672599Z * [new branch] gh/eellison/784/base -> origin/gh/eellison/784/base 2025-08-14T21:18:06.0672913Z * [new branch] gh/eellison/784/head -> origin/gh/eellison/784/head 2025-08-14T21:18:06.0675802Z * [new branch] gh/eellison/784/orig -> origin/gh/eellison/784/orig 2025-08-14T21:18:06.0676109Z * [new branch] gh/eellison/785/base -> origin/gh/eellison/785/base 2025-08-14T21:18:06.0676405Z * [new branch] gh/eellison/785/head -> origin/gh/eellison/785/head 2025-08-14T21:18:06.0676692Z * [new branch] gh/eellison/785/orig -> origin/gh/eellison/785/orig 2025-08-14T21:18:06.0680379Z * [new branch] gh/eellison/789/base -> origin/gh/eellison/789/base 2025-08-14T21:18:06.0680777Z * [new branch] gh/eellison/789/head -> origin/gh/eellison/789/head 2025-08-14T21:18:06.0681087Z * [new branch] gh/eellison/789/orig -> origin/gh/eellison/789/orig 2025-08-14T21:18:06.0681377Z * [new branch] gh/eellison/800/base -> origin/gh/eellison/800/base 2025-08-14T21:18:06.0681675Z * [new branch] gh/eellison/800/head -> origin/gh/eellison/800/head 2025-08-14T21:18:06.0681981Z * [new branch] gh/eellison/800/orig -> origin/gh/eellison/800/orig 2025-08-14T21:18:06.0682275Z * [new branch] gh/eellison/801/base -> origin/gh/eellison/801/base 2025-08-14T21:18:06.0682571Z * [new branch] gh/eellison/801/head -> origin/gh/eellison/801/head 2025-08-14T21:18:06.0682865Z * [new branch] gh/eellison/801/orig -> origin/gh/eellison/801/orig 2025-08-14T21:18:06.0683163Z * [new branch] gh/eellison/802/base -> origin/gh/eellison/802/base 2025-08-14T21:18:06.0683577Z * [new branch] gh/eellison/802/head -> origin/gh/eellison/802/head 2025-08-14T21:18:06.0684092Z * [new branch] gh/eellison/802/orig -> origin/gh/eellison/802/orig 2025-08-14T21:18:06.0684866Z * [new branch] gh/eellison/805/base -> origin/gh/eellison/805/base 2025-08-14T21:18:06.0685528Z * [new branch] gh/eellison/805/head -> origin/gh/eellison/805/head 2025-08-14T21:18:06.0686099Z * [new branch] gh/eellison/805/orig -> origin/gh/eellison/805/orig 2025-08-14T21:18:06.0687726Z * [new branch] gh/eellison/808/base -> origin/gh/eellison/808/base 2025-08-14T21:18:06.0688091Z * [new branch] gh/eellison/808/head -> origin/gh/eellison/808/head 2025-08-14T21:18:06.0688590Z * [new branch] gh/eellison/808/orig -> origin/gh/eellison/808/orig 2025-08-14T21:18:06.0689069Z * [new branch] gh/eellison/809/base -> origin/gh/eellison/809/base 2025-08-14T21:18:06.0689917Z * [new branch] gh/eellison/809/head -> origin/gh/eellison/809/head 2025-08-14T21:18:06.0690312Z * [new branch] gh/eellison/809/orig -> origin/gh/eellison/809/orig 2025-08-14T21:18:06.0692572Z * [new branch] gh/eellison/810/base -> origin/gh/eellison/810/base 2025-08-14T21:18:06.0693131Z * [new branch] gh/eellison/810/head -> origin/gh/eellison/810/head 2025-08-14T21:18:06.0693930Z * [new branch] gh/eellison/810/orig -> origin/gh/eellison/810/orig 2025-08-14T21:18:06.0694300Z * [new branch] gh/eellison/811/base -> origin/gh/eellison/811/base 2025-08-14T21:18:06.0694614Z * [new branch] gh/eellison/811/head -> origin/gh/eellison/811/head 2025-08-14T21:18:06.0694917Z * [new branch] gh/eellison/811/orig -> origin/gh/eellison/811/orig 2025-08-14T21:18:06.0695454Z * [new branch] gh/eellison/812/base -> origin/gh/eellison/812/base 2025-08-14T21:18:06.0695872Z * [new branch] gh/eellison/812/head -> origin/gh/eellison/812/head 2025-08-14T21:18:06.0696322Z * [new branch] gh/eellison/812/orig -> origin/gh/eellison/812/orig 2025-08-14T21:18:06.0697797Z * [new branch] gh/eellison/813/base -> origin/gh/eellison/813/base 2025-08-14T21:18:06.0698343Z * [new branch] gh/eellison/813/head -> origin/gh/eellison/813/head 2025-08-14T21:18:06.0698777Z * [new branch] gh/eellison/813/orig -> origin/gh/eellison/813/orig 2025-08-14T21:18:06.0699341Z * [new branch] gh/etaf/132/base -> origin/gh/etaf/132/base 2025-08-14T21:18:06.0699956Z * [new branch] gh/etaf/132/head -> origin/gh/etaf/132/head 2025-08-14T21:18:06.0700630Z * [new branch] gh/etaf/132/orig -> origin/gh/etaf/132/orig 2025-08-14T21:18:06.0702858Z * [new branch] gh/etaf/138/base -> origin/gh/etaf/138/base 2025-08-14T21:18:06.0703193Z * [new branch] gh/etaf/138/head -> origin/gh/etaf/138/head 2025-08-14T21:18:06.0703490Z * [new branch] gh/etaf/138/orig -> origin/gh/etaf/138/orig 2025-08-14T21:18:06.0704010Z * [new branch] gh/etaf/140/base -> origin/gh/etaf/140/base 2025-08-14T21:18:06.0704559Z * [new branch] gh/etaf/140/head -> origin/gh/etaf/140/head 2025-08-14T21:18:06.0704970Z * [new branch] gh/etaf/140/orig -> origin/gh/etaf/140/orig 2025-08-14T21:18:06.0707778Z * [new branch] gh/etaf/143/base -> origin/gh/etaf/143/base 2025-08-14T21:18:06.0708289Z * [new branch] gh/etaf/143/head -> origin/gh/etaf/143/head 2025-08-14T21:18:06.0708672Z * [new branch] gh/etaf/143/orig -> origin/gh/etaf/143/orig 2025-08-14T21:18:06.0708968Z * [new branch] gh/etaf/147/base -> origin/gh/etaf/147/base 2025-08-14T21:18:06.0709251Z * [new branch] gh/etaf/147/head -> origin/gh/etaf/147/head 2025-08-14T21:18:06.0709533Z * [new branch] gh/etaf/148/base -> origin/gh/etaf/148/base 2025-08-14T21:18:06.0709983Z * [new branch] gh/etaf/148/head -> origin/gh/etaf/148/head 2025-08-14T21:18:06.0710713Z * [new branch] gh/etaf/148/orig -> origin/gh/etaf/148/orig 2025-08-14T21:18:06.0712179Z * [new branch] gh/etaf/149/base -> origin/gh/etaf/149/base 2025-08-14T21:18:06.0712560Z * [new branch] gh/etaf/149/head -> origin/gh/etaf/149/head 2025-08-14T21:18:06.0712856Z * [new branch] gh/etaf/149/orig -> origin/gh/etaf/149/orig 2025-08-14T21:18:06.0716417Z * [new branch] gh/etaf/150/base -> origin/gh/etaf/150/base 2025-08-14T21:18:06.0716935Z * [new branch] gh/etaf/150/head -> origin/gh/etaf/150/head 2025-08-14T21:18:06.0717352Z * [new branch] gh/etaf/150/orig -> origin/gh/etaf/150/orig 2025-08-14T21:18:06.0717650Z * [new branch] gh/etaf/151/base -> origin/gh/etaf/151/base 2025-08-14T21:18:06.0717940Z * [new branch] gh/etaf/151/head -> origin/gh/etaf/151/head 2025-08-14T21:18:06.0718228Z * [new branch] gh/etaf/151/orig -> origin/gh/etaf/151/orig 2025-08-14T21:18:06.0718505Z * [new branch] gh/etaf/152/base -> origin/gh/etaf/152/base 2025-08-14T21:18:06.0719326Z * [new branch] gh/etaf/152/head -> origin/gh/etaf/152/head 2025-08-14T21:18:06.0719769Z * [new branch] gh/etaf/152/orig -> origin/gh/etaf/152/orig 2025-08-14T21:18:06.0720286Z * [new branch] gh/etaf/153/base -> origin/gh/etaf/153/base 2025-08-14T21:18:06.0720939Z * [new branch] gh/etaf/153/head -> origin/gh/etaf/153/head 2025-08-14T21:18:06.0721461Z * [new branch] gh/etaf/153/orig -> origin/gh/etaf/153/orig 2025-08-14T21:18:06.0722208Z * [new branch] gh/etaf/154/base -> origin/gh/etaf/154/base 2025-08-14T21:18:06.0722835Z * [new branch] gh/etaf/154/head -> origin/gh/etaf/154/head 2025-08-14T21:18:06.0723468Z * [new branch] gh/etaf/154/orig -> origin/gh/etaf/154/orig 2025-08-14T21:18:06.0725049Z * [new branch] gh/etaf/155/base -> origin/gh/etaf/155/base 2025-08-14T21:18:06.0725477Z * [new branch] gh/etaf/155/head -> origin/gh/etaf/155/head 2025-08-14T21:18:06.0725892Z * [new branch] gh/etaf/155/orig -> origin/gh/etaf/155/orig 2025-08-14T21:18:06.0726793Z * [new branch] gh/ezyang/2374/base -> origin/gh/ezyang/2374/base 2025-08-14T21:18:06.0727291Z * [new branch] gh/ezyang/2374/head -> origin/gh/ezyang/2374/head 2025-08-14T21:18:06.0729318Z * [new branch] gh/ezyang/2374/orig -> origin/gh/ezyang/2374/orig 2025-08-14T21:18:06.0729837Z * [new branch] gh/ezyang/2973/base -> origin/gh/ezyang/2973/base 2025-08-14T21:18:06.0730264Z * [new branch] gh/ezyang/2973/head -> origin/gh/ezyang/2973/head 2025-08-14T21:18:06.0730618Z * [new branch] gh/ezyang/2973/orig -> origin/gh/ezyang/2973/orig 2025-08-14T21:18:06.0730924Z * [new branch] gh/ezyang/2974/base -> origin/gh/ezyang/2974/base 2025-08-14T21:18:06.0731918Z * [new branch] gh/ezyang/2974/head -> origin/gh/ezyang/2974/head 2025-08-14T21:18:06.0732342Z * [new branch] gh/ezyang/2974/orig -> origin/gh/ezyang/2974/orig 2025-08-14T21:18:06.0734406Z * [new branch] gh/ezyang/3068/base -> origin/gh/ezyang/3068/base 2025-08-14T21:18:06.0734929Z * [new branch] gh/ezyang/3068/head -> origin/gh/ezyang/3068/head 2025-08-14T21:18:06.0735369Z * [new branch] gh/ezyang/3068/orig -> origin/gh/ezyang/3068/orig 2025-08-14T21:18:06.0736098Z * [new branch] gh/ezyang/3071/base -> origin/gh/ezyang/3071/base 2025-08-14T21:18:06.0736458Z * [new branch] gh/ezyang/3071/head -> origin/gh/ezyang/3071/head 2025-08-14T21:18:06.0736907Z * [new branch] gh/ezyang/3071/orig -> origin/gh/ezyang/3071/orig 2025-08-14T21:18:06.0737424Z * [new branch] gh/ezyang/3074/base -> origin/gh/ezyang/3074/base 2025-08-14T21:18:06.0738098Z * [new branch] gh/ezyang/3074/head -> origin/gh/ezyang/3074/head 2025-08-14T21:18:06.0738577Z * [new branch] gh/ezyang/3074/orig -> origin/gh/ezyang/3074/orig 2025-08-14T21:18:06.0739755Z * [new branch] gh/ezyang/3088/base -> origin/gh/ezyang/3088/base 2025-08-14T21:18:06.0740163Z * [new branch] gh/ezyang/3088/head -> origin/gh/ezyang/3088/head 2025-08-14T21:18:06.0740617Z * [new branch] gh/ezyang/3088/orig -> origin/gh/ezyang/3088/orig 2025-08-14T21:18:06.0741682Z * [new branch] gh/ezyang/3092/base -> origin/gh/ezyang/3092/base 2025-08-14T21:18:06.0742101Z * [new branch] gh/ezyang/3092/head -> origin/gh/ezyang/3092/head 2025-08-14T21:18:06.0742746Z * [new branch] gh/ezyang/3092/orig -> origin/gh/ezyang/3092/orig 2025-08-14T21:18:06.0743771Z * [new branch] gh/ezyang/3097/base -> origin/gh/ezyang/3097/base 2025-08-14T21:18:06.0744071Z * [new branch] gh/ezyang/3097/head -> origin/gh/ezyang/3097/head 2025-08-14T21:18:06.0745097Z * [new branch] gh/ezyang/3097/orig -> origin/gh/ezyang/3097/orig 2025-08-14T21:18:06.0747848Z * [new branch] gh/ezyang/3098/base -> origin/gh/ezyang/3098/base 2025-08-14T21:18:06.0748212Z * [new branch] gh/ezyang/3098/head -> origin/gh/ezyang/3098/head 2025-08-14T21:18:06.0748518Z * [new branch] gh/ezyang/3098/orig -> origin/gh/ezyang/3098/orig 2025-08-14T21:18:06.0748808Z * [new branch] gh/ezyang/3099/base -> origin/gh/ezyang/3099/base 2025-08-14T21:18:06.0749093Z * [new branch] gh/ezyang/3099/head -> origin/gh/ezyang/3099/head 2025-08-14T21:18:06.0751544Z * [new branch] gh/ezyang/3099/orig -> origin/gh/ezyang/3099/orig 2025-08-14T21:18:06.0751921Z * [new branch] gh/ezyang/3100/base -> origin/gh/ezyang/3100/base 2025-08-14T21:18:06.0752227Z * [new branch] gh/ezyang/3100/head -> origin/gh/ezyang/3100/head 2025-08-14T21:18:06.0752568Z * [new branch] gh/ezyang/3100/orig -> origin/gh/ezyang/3100/orig 2025-08-14T21:18:06.0754842Z * [new branch] gh/ezyang/3101/base -> origin/gh/ezyang/3101/base 2025-08-14T21:18:06.0755137Z * [new branch] gh/ezyang/3101/head -> origin/gh/ezyang/3101/head 2025-08-14T21:18:06.0755493Z * [new branch] gh/ezyang/3101/orig -> origin/gh/ezyang/3101/orig 2025-08-14T21:18:06.0759826Z * [new branch] gh/ezyang/3102/base -> origin/gh/ezyang/3102/base 2025-08-14T21:18:06.0763679Z * [new branch] gh/ezyang/3102/head -> origin/gh/ezyang/3102/head 2025-08-14T21:18:06.0767466Z * [new branch] gh/ezyang/3102/orig -> origin/gh/ezyang/3102/orig 2025-08-14T21:18:06.0771257Z * [new branch] gh/ezyang/3103/base -> origin/gh/ezyang/3103/base 2025-08-14T21:18:06.0772702Z * [new branch] gh/ezyang/3103/head -> origin/gh/ezyang/3103/head 2025-08-14T21:18:06.0773030Z * [new branch] gh/ezyang/3103/orig -> origin/gh/ezyang/3103/orig 2025-08-14T21:18:06.0773349Z * [new branch] gh/ezyang/3104/base -> origin/gh/ezyang/3104/base 2025-08-14T21:18:06.0773645Z * [new branch] gh/ezyang/3104/head -> origin/gh/ezyang/3104/head 2025-08-14T21:18:06.0773937Z * [new branch] gh/ezyang/3104/orig -> origin/gh/ezyang/3104/orig 2025-08-14T21:18:06.0774225Z * [new branch] gh/ezyang/3105/base -> origin/gh/ezyang/3105/base 2025-08-14T21:18:06.0774674Z * [new branch] gh/ezyang/3105/head -> origin/gh/ezyang/3105/head 2025-08-14T21:18:06.0774973Z * [new branch] gh/ezyang/3105/orig -> origin/gh/ezyang/3105/orig 2025-08-14T21:18:06.0775267Z * [new branch] gh/ezyang/3106/base -> origin/gh/ezyang/3106/base 2025-08-14T21:18:06.0775550Z * [new branch] gh/ezyang/3106/head -> origin/gh/ezyang/3106/head 2025-08-14T21:18:06.0775843Z * [new branch] gh/ezyang/3106/orig -> origin/gh/ezyang/3106/orig 2025-08-14T21:18:06.0776141Z * [new branch] gh/ezyang/3107/base -> origin/gh/ezyang/3107/base 2025-08-14T21:18:06.0776428Z * [new branch] gh/ezyang/3107/head -> origin/gh/ezyang/3107/head 2025-08-14T21:18:06.0776708Z * [new branch] gh/ezyang/3107/orig -> origin/gh/ezyang/3107/orig 2025-08-14T21:18:06.0776995Z * [new branch] gh/ezyang/3108/base -> origin/gh/ezyang/3108/base 2025-08-14T21:18:06.0777287Z * [new branch] gh/ezyang/3108/head -> origin/gh/ezyang/3108/head 2025-08-14T21:18:06.0777568Z * [new branch] gh/ezyang/3108/orig -> origin/gh/ezyang/3108/orig 2025-08-14T21:18:06.0777859Z * [new branch] gh/ezyang/3109/base -> origin/gh/ezyang/3109/base 2025-08-14T21:18:06.0778146Z * [new branch] gh/ezyang/3109/head -> origin/gh/ezyang/3109/head 2025-08-14T21:18:06.0778440Z * [new branch] gh/ezyang/3109/orig -> origin/gh/ezyang/3109/orig 2025-08-14T21:18:06.0778778Z * [new branch] gh/ezyang/3110/base -> origin/gh/ezyang/3110/base 2025-08-14T21:18:06.0779073Z * [new branch] gh/ezyang/3110/head -> origin/gh/ezyang/3110/head 2025-08-14T21:18:06.0779361Z * [new branch] gh/ezyang/3110/orig -> origin/gh/ezyang/3110/orig 2025-08-14T21:18:06.0779646Z * [new branch] gh/ezyang/3111/base -> origin/gh/ezyang/3111/base 2025-08-14T21:18:06.0779931Z * [new branch] gh/ezyang/3111/head -> origin/gh/ezyang/3111/head 2025-08-14T21:18:06.0780219Z * [new branch] gh/ezyang/3111/orig -> origin/gh/ezyang/3111/orig 2025-08-14T21:18:06.0780507Z * [new branch] gh/ezyang/3112/base -> origin/gh/ezyang/3112/base 2025-08-14T21:18:06.0780797Z * [new branch] gh/ezyang/3112/head -> origin/gh/ezyang/3112/head 2025-08-14T21:18:06.0781079Z * [new branch] gh/ezyang/3112/orig -> origin/gh/ezyang/3112/orig 2025-08-14T21:18:06.0781375Z * [new branch] gh/ezyang/3113/base -> origin/gh/ezyang/3113/base 2025-08-14T21:18:06.0781665Z * [new branch] gh/ezyang/3113/head -> origin/gh/ezyang/3113/head 2025-08-14T21:18:06.0781948Z * [new branch] gh/ezyang/3113/orig -> origin/gh/ezyang/3113/orig 2025-08-14T21:18:06.0782241Z * [new branch] gh/ezyang/3114/base -> origin/gh/ezyang/3114/base 2025-08-14T21:18:06.0782535Z * [new branch] gh/ezyang/3114/head -> origin/gh/ezyang/3114/head 2025-08-14T21:18:06.0782828Z * [new branch] gh/ezyang/3114/orig -> origin/gh/ezyang/3114/orig 2025-08-14T21:18:06.0783133Z * [new branch] gh/ezyang/3115/base -> origin/gh/ezyang/3115/base 2025-08-14T21:18:06.0783483Z * [new branch] gh/ezyang/3115/head -> origin/gh/ezyang/3115/head 2025-08-14T21:18:06.0784445Z * [new branch] gh/ezyang/3115/orig -> origin/gh/ezyang/3115/orig 2025-08-14T21:18:06.0788127Z * [new branch] gh/ezyang/3116/base -> origin/gh/ezyang/3116/base 2025-08-14T21:18:06.0788490Z * [new branch] gh/ezyang/3116/head -> origin/gh/ezyang/3116/head 2025-08-14T21:18:06.0788788Z * [new branch] gh/ezyang/3116/orig -> origin/gh/ezyang/3116/orig 2025-08-14T21:18:06.0789270Z * [new branch] gh/ezyang/3117/base -> origin/gh/ezyang/3117/base 2025-08-14T21:18:06.0789568Z * [new branch] gh/ezyang/3117/head -> origin/gh/ezyang/3117/head 2025-08-14T21:18:06.0789863Z * [new branch] gh/ezyang/3117/orig -> origin/gh/ezyang/3117/orig 2025-08-14T21:18:06.0790151Z * [new branch] gh/ezyang/3118/base -> origin/gh/ezyang/3118/base 2025-08-14T21:18:06.0790447Z * [new branch] gh/ezyang/3118/head -> origin/gh/ezyang/3118/head 2025-08-14T21:18:06.0791306Z * [new branch] gh/ezyang/3118/orig -> origin/gh/ezyang/3118/orig 2025-08-14T21:18:06.0792048Z * [new branch] gh/ezyang/3119/base -> origin/gh/ezyang/3119/base 2025-08-14T21:18:06.0792598Z * [new branch] gh/ezyang/3119/head -> origin/gh/ezyang/3119/head 2025-08-14T21:18:06.0793208Z * [new branch] gh/ezyang/3119/orig -> origin/gh/ezyang/3119/orig 2025-08-14T21:18:06.0794196Z * [new branch] gh/ezyang/3120/base -> origin/gh/ezyang/3120/base 2025-08-14T21:18:06.0794499Z * [new branch] gh/ezyang/3120/head -> origin/gh/ezyang/3120/head 2025-08-14T21:18:06.0795848Z * [new branch] gh/ezyang/3120/orig -> origin/gh/ezyang/3120/orig 2025-08-14T21:18:06.0795982Z * [new branch] gh/ezyang/3121/base -> origin/gh/ezyang/3121/base 2025-08-14T21:18:06.0796671Z * [new branch] gh/ezyang/3121/head -> origin/gh/ezyang/3121/head 2025-08-14T21:18:06.0797242Z * [new branch] gh/ezyang/3121/orig -> origin/gh/ezyang/3121/orig 2025-08-14T21:18:06.0800711Z * [new branch] gh/ezyang/3122/base -> origin/gh/ezyang/3122/base 2025-08-14T21:18:06.0800866Z * [new branch] gh/ezyang/3122/head -> origin/gh/ezyang/3122/head 2025-08-14T21:18:06.0801002Z * [new branch] gh/ezyang/3122/orig -> origin/gh/ezyang/3122/orig 2025-08-14T21:18:06.0801140Z * [new branch] gh/ezyang/3123/base -> origin/gh/ezyang/3123/base 2025-08-14T21:18:06.0801264Z * [new branch] gh/ezyang/3123/head -> origin/gh/ezyang/3123/head 2025-08-14T21:18:06.0801647Z * [new branch] gh/ezyang/3123/orig -> origin/gh/ezyang/3123/orig 2025-08-14T21:18:06.0802089Z * [new branch] gh/ezyang/3124/base -> origin/gh/ezyang/3124/base 2025-08-14T21:18:06.0804520Z * [new branch] gh/ezyang/3124/head -> origin/gh/ezyang/3124/head 2025-08-14T21:18:06.0804692Z * [new branch] gh/ezyang/3124/orig -> origin/gh/ezyang/3124/orig 2025-08-14T21:18:06.0804824Z * [new branch] gh/ezyang/3125/base -> origin/gh/ezyang/3125/base 2025-08-14T21:18:06.0804944Z * [new branch] gh/ezyang/3125/head -> origin/gh/ezyang/3125/head 2025-08-14T21:18:06.0805323Z * [new branch] gh/ezyang/3125/orig -> origin/gh/ezyang/3125/orig 2025-08-14T21:18:06.0807612Z * [new branch] gh/ezyang/3126/base -> origin/gh/ezyang/3126/base 2025-08-14T21:18:06.0807923Z * [new branch] gh/ezyang/3126/head -> origin/gh/ezyang/3126/head 2025-08-14T21:18:06.0808059Z * [new branch] gh/ezyang/3126/orig -> origin/gh/ezyang/3126/orig 2025-08-14T21:18:06.0808201Z * [new branch] gh/ezyang/3127/base -> origin/gh/ezyang/3127/base 2025-08-14T21:18:06.0808950Z * [new branch] gh/ezyang/3127/head -> origin/gh/ezyang/3127/head 2025-08-14T21:18:06.0809378Z * [new branch] gh/ezyang/3127/orig -> origin/gh/ezyang/3127/orig 2025-08-14T21:18:06.0811715Z * [new branch] gh/ezyang/3128/base -> origin/gh/ezyang/3128/base 2025-08-14T21:18:06.0812019Z * [new branch] gh/ezyang/3128/head -> origin/gh/ezyang/3128/head 2025-08-14T21:18:06.0812198Z * [new branch] gh/ezyang/3128/orig -> origin/gh/ezyang/3128/orig 2025-08-14T21:18:06.0812575Z * [new branch] gh/ezyang/3129/base -> origin/gh/ezyang/3129/base 2025-08-14T21:18:06.0812716Z * [new branch] gh/ezyang/3129/head -> origin/gh/ezyang/3129/head 2025-08-14T21:18:06.0814056Z * [new branch] gh/ezyang/3129/orig -> origin/gh/ezyang/3129/orig 2025-08-14T21:18:06.0814212Z * [new branch] gh/ezyang/3130/base -> origin/gh/ezyang/3130/base 2025-08-14T21:18:06.0816010Z * [new branch] gh/ezyang/3130/head -> origin/gh/ezyang/3130/head 2025-08-14T21:18:06.0816328Z * [new branch] gh/ezyang/3130/orig -> origin/gh/ezyang/3130/orig 2025-08-14T21:18:06.0816491Z * [new branch] gh/ezyang/3131/base -> origin/gh/ezyang/3131/base 2025-08-14T21:18:06.0816921Z * [new branch] gh/ezyang/3131/head -> origin/gh/ezyang/3131/head 2025-08-14T21:18:06.0817660Z * [new branch] gh/ezyang/3131/orig -> origin/gh/ezyang/3131/orig 2025-08-14T21:18:06.0818575Z * [new branch] gh/ezyang/3132/base -> origin/gh/ezyang/3132/base 2025-08-14T21:18:06.0819054Z * [new branch] gh/ezyang/3132/head -> origin/gh/ezyang/3132/head 2025-08-14T21:18:06.0819667Z * [new branch] gh/ezyang/3132/orig -> origin/gh/ezyang/3132/orig 2025-08-14T21:18:06.0820615Z * [new branch] gh/ezyang/3133/base -> origin/gh/ezyang/3133/base 2025-08-14T21:18:06.0820756Z * [new branch] gh/ezyang/3133/head -> origin/gh/ezyang/3133/head 2025-08-14T21:18:06.0821879Z * [new branch] gh/ezyang/3133/orig -> origin/gh/ezyang/3133/orig 2025-08-14T21:18:06.0822329Z * [new branch] gh/ezyang/3134/base -> origin/gh/ezyang/3134/base 2025-08-14T21:18:06.0822966Z * [new branch] gh/ezyang/3134/head -> origin/gh/ezyang/3134/head 2025-08-14T21:18:06.0823771Z * [new branch] gh/ezyang/3134/orig -> origin/gh/ezyang/3134/orig 2025-08-14T21:18:06.0824564Z * [new branch] gh/ezyang/3135/base -> origin/gh/ezyang/3135/base 2025-08-14T21:18:06.0825040Z * [new branch] gh/ezyang/3135/head -> origin/gh/ezyang/3135/head 2025-08-14T21:18:06.0825740Z * [new branch] gh/ezyang/3135/orig -> origin/gh/ezyang/3135/orig 2025-08-14T21:18:06.0826648Z * [new branch] gh/ezyang/3136/base -> origin/gh/ezyang/3136/base 2025-08-14T21:18:06.0826885Z * [new branch] gh/ezyang/3136/head -> origin/gh/ezyang/3136/head 2025-08-14T21:18:06.0827874Z * [new branch] gh/ezyang/3136/orig -> origin/gh/ezyang/3136/orig 2025-08-14T21:18:06.0828916Z * [new branch] gh/fadara01/1/base -> origin/gh/fadara01/1/base 2025-08-14T21:18:06.0829215Z * [new branch] gh/fadara01/1/head -> origin/gh/fadara01/1/head 2025-08-14T21:18:06.0830135Z * [new branch] gh/fadara01/1/orig -> origin/gh/fadara01/1/orig 2025-08-14T21:18:06.0831373Z * [new branch] gh/fduwjj/168/base -> origin/gh/fduwjj/168/base 2025-08-14T21:18:06.0831784Z * [new branch] gh/fduwjj/168/head -> origin/gh/fduwjj/168/head 2025-08-14T21:18:06.0832693Z * [new branch] gh/fduwjj/168/orig -> origin/gh/fduwjj/168/orig 2025-08-14T21:18:06.0833586Z * [new branch] gh/fduwjj/169/base -> origin/gh/fduwjj/169/base 2025-08-14T21:18:06.0833979Z * [new branch] gh/fduwjj/169/head -> origin/gh/fduwjj/169/head 2025-08-14T21:18:06.0834912Z * [new branch] gh/fduwjj/169/orig -> origin/gh/fduwjj/169/orig 2025-08-14T21:18:06.0835972Z * [new branch] gh/fduwjj/170/base -> origin/gh/fduwjj/170/base 2025-08-14T21:18:06.0836240Z * [new branch] gh/fduwjj/170/head -> origin/gh/fduwjj/170/head 2025-08-14T21:18:06.0837238Z * [new branch] gh/fduwjj/170/orig -> origin/gh/fduwjj/170/orig 2025-08-14T21:18:06.0838160Z * [new branch] gh/fduwjj/171/base -> origin/gh/fduwjj/171/base 2025-08-14T21:18:06.0838540Z * [new branch] gh/fduwjj/171/head -> origin/gh/fduwjj/171/head 2025-08-14T21:18:06.0840701Z * [new branch] gh/fduwjj/171/orig -> origin/gh/fduwjj/171/orig 2025-08-14T21:18:06.0840934Z * [new branch] gh/fduwjj/172/base -> origin/gh/fduwjj/172/base 2025-08-14T21:18:06.0841121Z * [new branch] gh/fduwjj/172/head -> origin/gh/fduwjj/172/head 2025-08-14T21:18:06.0841595Z * [new branch] gh/fduwjj/172/orig -> origin/gh/fduwjj/172/orig 2025-08-14T21:18:06.0842559Z * [new branch] gh/fduwjj/173/base -> origin/gh/fduwjj/173/base 2025-08-14T21:18:06.0842754Z * [new branch] gh/fduwjj/173/head -> origin/gh/fduwjj/173/head 2025-08-14T21:18:06.0843982Z * [new branch] gh/fduwjj/173/orig -> origin/gh/fduwjj/173/orig 2025-08-14T21:18:06.0844175Z * [new branch] gh/fduwjj/174/base -> origin/gh/fduwjj/174/base 2025-08-14T21:18:06.0844968Z * [new branch] gh/fduwjj/174/head -> origin/gh/fduwjj/174/head 2025-08-14T21:18:06.0845420Z * [new branch] gh/fduwjj/174/orig -> origin/gh/fduwjj/174/orig 2025-08-14T21:18:06.0846663Z * [new branch] gh/fduwjj/175/base -> origin/gh/fduwjj/175/base 2025-08-14T21:18:06.0847345Z * [new branch] gh/fduwjj/175/head -> origin/gh/fduwjj/175/head 2025-08-14T21:18:06.0847978Z * [new branch] gh/fduwjj/175/orig -> origin/gh/fduwjj/175/orig 2025-08-14T21:18:06.0849181Z * [new branch] gh/fduwjj/176/base -> origin/gh/fduwjj/176/base 2025-08-14T21:18:06.0849371Z * [new branch] gh/fduwjj/176/head -> origin/gh/fduwjj/176/head 2025-08-14T21:18:06.0849907Z * [new branch] gh/fduwjj/176/orig -> origin/gh/fduwjj/176/orig 2025-08-14T21:18:06.0850960Z * [new branch] gh/fduwjj/177/base -> origin/gh/fduwjj/177/base 2025-08-14T21:18:06.0851350Z * [new branch] gh/fduwjj/177/head -> origin/gh/fduwjj/177/head 2025-08-14T21:18:06.0852222Z * [new branch] gh/fduwjj/177/orig -> origin/gh/fduwjj/177/orig 2025-08-14T21:18:06.0853166Z * [new branch] gh/fduwjj/178/base -> origin/gh/fduwjj/178/base 2025-08-14T21:18:06.0853389Z * [new branch] gh/fduwjj/178/head -> origin/gh/fduwjj/178/head 2025-08-14T21:18:06.0854330Z * [new branch] gh/fduwjj/178/orig -> origin/gh/fduwjj/178/orig 2025-08-14T21:18:06.0855148Z * [new branch] gh/fduwjj/179/base -> origin/gh/fduwjj/179/base 2025-08-14T21:18:06.0855429Z * [new branch] gh/fduwjj/179/head -> origin/gh/fduwjj/179/head 2025-08-14T21:18:06.0856366Z * [new branch] gh/fduwjj/179/orig -> origin/gh/fduwjj/179/orig 2025-08-14T21:18:06.0857226Z * [new branch] gh/fduwjj/180/base -> origin/gh/fduwjj/180/base 2025-08-14T21:18:06.0857560Z * [new branch] gh/fduwjj/180/head -> origin/gh/fduwjj/180/head 2025-08-14T21:18:06.0858503Z * [new branch] gh/fduwjj/180/orig -> origin/gh/fduwjj/180/orig 2025-08-14T21:18:06.0859307Z * [new branch] gh/fduwjj/181/base -> origin/gh/fduwjj/181/base 2025-08-14T21:18:06.0859542Z * [new branch] gh/fduwjj/181/head -> origin/gh/fduwjj/181/head 2025-08-14T21:18:06.0860553Z * [new branch] gh/fduwjj/181/orig -> origin/gh/fduwjj/181/orig 2025-08-14T21:18:06.0861612Z * [new branch] gh/fegin/306/base -> origin/gh/fegin/306/base 2025-08-14T21:18:06.0861908Z * [new branch] gh/fegin/306/head -> origin/gh/fegin/306/head 2025-08-14T21:18:06.0862865Z * [new branch] gh/fegin/306/orig -> origin/gh/fegin/306/orig 2025-08-14T21:18:06.0863416Z * [new branch] gh/fegin/307/base -> origin/gh/fegin/307/base 2025-08-14T21:18:06.0864089Z * [new branch] gh/fegin/307/head -> origin/gh/fegin/307/head 2025-08-14T21:18:06.0864550Z * [new branch] gh/fegin/307/orig -> origin/gh/fegin/307/orig 2025-08-14T21:18:06.0865977Z * [new branch] gh/fffrog/114/base -> origin/gh/fffrog/114/base 2025-08-14T21:18:06.0866204Z * [new branch] gh/fffrog/114/head -> origin/gh/fffrog/114/head 2025-08-14T21:18:06.0867280Z * [new branch] gh/fffrog/114/orig -> origin/gh/fffrog/114/orig 2025-08-14T21:18:06.0870144Z * [new branch] gh/fffrog/117/base -> origin/gh/fffrog/117/base 2025-08-14T21:18:06.0870265Z * [new branch] gh/fffrog/117/head -> origin/gh/fffrog/117/head 2025-08-14T21:18:06.0870395Z * [new branch] gh/fffrog/117/orig -> origin/gh/fffrog/117/orig 2025-08-14T21:18:06.0870509Z * [new branch] gh/fffrog/119/base -> origin/gh/fffrog/119/base 2025-08-14T21:18:06.0870636Z * [new branch] gh/fffrog/119/head -> origin/gh/fffrog/119/head 2025-08-14T21:18:06.0875027Z * [new branch] gh/fffrog/119/orig -> origin/gh/fffrog/119/orig 2025-08-14T21:18:06.0878786Z * [new branch] gh/fffrog/120/base -> origin/gh/fffrog/120/base 2025-08-14T21:18:06.0882539Z * [new branch] gh/fffrog/120/head -> origin/gh/fffrog/120/head 2025-08-14T21:18:06.0886159Z * [new branch] gh/fffrog/120/orig -> origin/gh/fffrog/120/orig 2025-08-14T21:18:06.0889831Z * [new branch] gh/fffrog/121/base -> origin/gh/fffrog/121/base 2025-08-14T21:18:06.0894045Z * [new branch] gh/fffrog/121/head -> origin/gh/fffrog/121/head 2025-08-14T21:18:06.0898120Z * [new branch] gh/fffrog/121/orig -> origin/gh/fffrog/121/orig 2025-08-14T21:18:06.0898290Z * [new branch] gh/fffrog/122/base -> origin/gh/fffrog/122/base 2025-08-14T21:18:06.0898425Z * [new branch] gh/fffrog/122/head -> origin/gh/fffrog/122/head 2025-08-14T21:18:06.0898579Z * [new branch] gh/fffrog/122/orig -> origin/gh/fffrog/122/orig 2025-08-14T21:18:06.0898696Z * [new branch] gh/fffrog/123/base -> origin/gh/fffrog/123/base 2025-08-14T21:18:06.0898925Z * [new branch] gh/fffrog/123/head -> origin/gh/fffrog/123/head 2025-08-14T21:18:06.0899078Z * [new branch] gh/fffrog/123/orig -> origin/gh/fffrog/123/orig 2025-08-14T21:18:06.0899547Z * [new branch] gh/fffrog/124/base -> origin/gh/fffrog/124/base 2025-08-14T21:18:06.0899711Z * [new branch] gh/fffrog/124/head -> origin/gh/fffrog/124/head 2025-08-14T21:18:06.0899849Z * [new branch] gh/fffrog/124/orig -> origin/gh/fffrog/124/orig 2025-08-14T21:18:06.0899975Z * [new branch] gh/fffrog/125/base -> origin/gh/fffrog/125/base 2025-08-14T21:18:06.0900093Z * [new branch] gh/fffrog/125/head -> origin/gh/fffrog/125/head 2025-08-14T21:18:06.0900209Z * [new branch] gh/fffrog/125/orig -> origin/gh/fffrog/125/orig 2025-08-14T21:18:06.0900337Z * [new branch] gh/fffrog/126/base -> origin/gh/fffrog/126/base 2025-08-14T21:18:06.0900472Z * [new branch] gh/fffrog/126/head -> origin/gh/fffrog/126/head 2025-08-14T21:18:06.0900596Z * [new branch] gh/fffrog/126/orig -> origin/gh/fffrog/126/orig 2025-08-14T21:18:06.0900707Z * [new branch] gh/fffrog/127/base -> origin/gh/fffrog/127/base 2025-08-14T21:18:06.0901008Z * [new branch] gh/fffrog/127/head -> origin/gh/fffrog/127/head 2025-08-14T21:18:06.0901129Z * [new branch] gh/fffrog/127/orig -> origin/gh/fffrog/127/orig 2025-08-14T21:18:06.0901244Z * [new branch] gh/fffrog/128/base -> origin/gh/fffrog/128/base 2025-08-14T21:18:06.0901388Z * [new branch] gh/fffrog/128/head -> origin/gh/fffrog/128/head 2025-08-14T21:18:06.0901506Z * [new branch] gh/fffrog/128/orig -> origin/gh/fffrog/128/orig 2025-08-14T21:18:06.0901622Z * [new branch] gh/fffrog/129/base -> origin/gh/fffrog/129/base 2025-08-14T21:18:06.0901750Z * [new branch] gh/fffrog/129/head -> origin/gh/fffrog/129/head 2025-08-14T21:18:06.0901865Z * [new branch] gh/fffrog/129/orig -> origin/gh/fffrog/129/orig 2025-08-14T21:18:06.0901985Z * [new branch] gh/fffrog/130/base -> origin/gh/fffrog/130/base 2025-08-14T21:18:06.0902102Z * [new branch] gh/fffrog/130/head -> origin/gh/fffrog/130/head 2025-08-14T21:18:06.0902214Z * [new branch] gh/fffrog/130/orig -> origin/gh/fffrog/130/orig 2025-08-14T21:18:06.0902332Z * [new branch] gh/fffrog/131/base -> origin/gh/fffrog/131/base 2025-08-14T21:18:06.0902445Z * [new branch] gh/fffrog/131/head -> origin/gh/fffrog/131/head 2025-08-14T21:18:06.0902557Z * [new branch] gh/fffrog/131/orig -> origin/gh/fffrog/131/orig 2025-08-14T21:18:06.0902730Z * [new branch] gh/fffrog/132/base -> origin/gh/fffrog/132/base 2025-08-14T21:18:06.0902849Z * [new branch] gh/fffrog/132/head -> origin/gh/fffrog/132/head 2025-08-14T21:18:06.0902971Z * [new branch] gh/fffrog/132/orig -> origin/gh/fffrog/132/orig 2025-08-14T21:18:06.0904110Z * [new branch] gh/fffrog/133/base -> origin/gh/fffrog/133/base 2025-08-14T21:18:06.0904445Z * [new branch] gh/fffrog/133/head -> origin/gh/fffrog/133/head 2025-08-14T21:18:06.0905673Z * [new branch] gh/fffrog/133/orig -> origin/gh/fffrog/133/orig 2025-08-14T21:18:06.0905825Z * [new branch] gh/fffrog/134/base -> origin/gh/fffrog/134/base 2025-08-14T21:18:06.0907812Z * [new branch] gh/fffrog/134/head -> origin/gh/fffrog/134/head 2025-08-14T21:18:06.0908105Z * [new branch] gh/fffrog/134/orig -> origin/gh/fffrog/134/orig 2025-08-14T21:18:06.0908256Z * [new branch] gh/fffrog/135/base -> origin/gh/fffrog/135/base 2025-08-14T21:18:06.0908530Z * [new branch] gh/fffrog/135/head -> origin/gh/fffrog/135/head 2025-08-14T21:18:06.0909362Z * [new branch] gh/fffrog/135/orig -> origin/gh/fffrog/135/orig 2025-08-14T21:18:06.0911644Z * [new branch] gh/fffrog/136/base -> origin/gh/fffrog/136/base 2025-08-14T21:18:06.0911958Z * [new branch] gh/fffrog/136/head -> origin/gh/fffrog/136/head 2025-08-14T21:18:06.0912105Z * [new branch] gh/fffrog/136/orig -> origin/gh/fffrog/136/orig 2025-08-14T21:18:06.0912320Z * [new branch] gh/fffrog/137/base -> origin/gh/fffrog/137/base 2025-08-14T21:18:06.0917192Z * [new branch] gh/fffrog/137/head -> origin/gh/fffrog/137/head 2025-08-14T21:18:06.0917486Z * [new branch] gh/fffrog/137/orig -> origin/gh/fffrog/137/orig 2025-08-14T21:18:06.0917662Z * [new branch] gh/fffrog/138/base -> origin/gh/fffrog/138/base 2025-08-14T21:18:06.0917779Z * [new branch] gh/fffrog/138/head -> origin/gh/fffrog/138/head 2025-08-14T21:18:06.0917896Z * [new branch] gh/fffrog/138/orig -> origin/gh/fffrog/138/orig 2025-08-14T21:18:06.0918039Z * [new branch] gh/gmagogsfm/1/base -> origin/gh/gmagogsfm/1/base 2025-08-14T21:18:06.0918295Z * [new branch] gh/gmagogsfm/1/head -> origin/gh/gmagogsfm/1/head 2025-08-14T21:18:06.0918566Z * [new branch] gh/gmagogsfm/1/orig -> origin/gh/gmagogsfm/1/orig 2025-08-14T21:18:06.0919025Z * [new branch] gh/gmagogsfm/2/base -> origin/gh/gmagogsfm/2/base 2025-08-14T21:18:06.0919178Z * [new branch] gh/gmagogsfm/2/head -> origin/gh/gmagogsfm/2/head 2025-08-14T21:18:06.0919352Z * [new branch] gh/gmagogsfm/2/orig -> origin/gh/gmagogsfm/2/orig 2025-08-14T21:18:06.0920603Z * [new branch] gh/gmagogsfm/3/base -> origin/gh/gmagogsfm/3/base 2025-08-14T21:18:06.0920944Z * [new branch] gh/gmagogsfm/3/head -> origin/gh/gmagogsfm/3/head 2025-08-14T21:18:06.0921085Z * [new branch] gh/gmagogsfm/3/orig -> origin/gh/gmagogsfm/3/orig 2025-08-14T21:18:06.0922562Z * [new branch] gh/gmagogsfm/4/base -> origin/gh/gmagogsfm/4/base 2025-08-14T21:18:06.0922877Z * [new branch] gh/gmagogsfm/4/head -> origin/gh/gmagogsfm/4/head 2025-08-14T21:18:06.0923126Z * [new branch] gh/gmagogsfm/4/orig -> origin/gh/gmagogsfm/4/orig 2025-08-14T21:18:06.0926610Z * [new branch] gh/guangyey/130/base -> origin/gh/guangyey/130/base 2025-08-14T21:18:06.0926908Z * [new branch] gh/guangyey/130/head -> origin/gh/guangyey/130/head 2025-08-14T21:18:06.0927050Z * [new branch] gh/guangyey/130/orig -> origin/gh/guangyey/130/orig 2025-08-14T21:18:06.0927319Z * [new branch] gh/guangyey/133/base -> origin/gh/guangyey/133/base 2025-08-14T21:18:06.0927592Z * [new branch] gh/guangyey/133/head -> origin/gh/guangyey/133/head 2025-08-14T21:18:06.0927841Z * [new branch] gh/guangyey/133/orig -> origin/gh/guangyey/133/orig 2025-08-14T21:18:06.0929465Z * [new branch] gh/guangyey/134/base -> origin/gh/guangyey/134/base 2025-08-14T21:18:06.0929774Z * [new branch] gh/guangyey/134/head -> origin/gh/guangyey/134/head 2025-08-14T21:18:06.0929941Z * [new branch] gh/guangyey/134/orig -> origin/gh/guangyey/134/orig 2025-08-14T21:18:06.0931321Z * [new branch] gh/guangyey/135/base -> origin/gh/guangyey/135/base 2025-08-14T21:18:06.0931610Z * [new branch] gh/guangyey/135/head -> origin/gh/guangyey/135/head 2025-08-14T21:18:06.0931970Z * [new branch] gh/guangyey/135/orig -> origin/gh/guangyey/135/orig 2025-08-14T21:18:06.0934987Z * [new branch] gh/guangyey/139/base -> origin/gh/guangyey/139/base 2025-08-14T21:18:06.0935158Z * [new branch] gh/guangyey/139/head -> origin/gh/guangyey/139/head 2025-08-14T21:18:06.0935285Z * [new branch] gh/guangyey/139/orig -> origin/gh/guangyey/139/orig 2025-08-14T21:18:06.0935420Z * [new branch] gh/guangyey/140/base -> origin/gh/guangyey/140/base 2025-08-14T21:18:06.0935546Z * [new branch] gh/guangyey/140/head -> origin/gh/guangyey/140/head 2025-08-14T21:18:06.0936049Z * [new branch] gh/guangyey/140/orig -> origin/gh/guangyey/140/orig 2025-08-14T21:18:06.0937179Z * [new branch] gh/guangyey/142/base -> origin/gh/guangyey/142/base 2025-08-14T21:18:06.0937760Z * [new branch] gh/guangyey/142/head -> origin/gh/guangyey/142/head 2025-08-14T21:18:06.0938416Z * [new branch] gh/guangyey/142/orig -> origin/gh/guangyey/142/orig 2025-08-14T21:18:06.0939116Z * [new branch] gh/guangyey/145/base -> origin/gh/guangyey/145/base 2025-08-14T21:18:06.0939497Z * [new branch] gh/guangyey/145/head -> origin/gh/guangyey/145/head 2025-08-14T21:18:06.0940369Z * [new branch] gh/guangyey/145/orig -> origin/gh/guangyey/145/orig 2025-08-14T21:18:06.0941008Z * [new branch] gh/guangyey/153/base -> origin/gh/guangyey/153/base 2025-08-14T21:18:06.0941830Z * [new branch] gh/guangyey/153/head -> origin/gh/guangyey/153/head 2025-08-14T21:18:06.0942192Z * [new branch] gh/guangyey/153/orig -> origin/gh/guangyey/153/orig 2025-08-14T21:18:06.0943273Z * [new branch] gh/guangyey/158/base -> origin/gh/guangyey/158/base 2025-08-14T21:18:06.0943608Z * [new branch] gh/guangyey/158/head -> origin/gh/guangyey/158/head 2025-08-14T21:18:06.0944568Z * [new branch] gh/guangyey/158/orig -> origin/gh/guangyey/158/orig 2025-08-14T21:18:06.0945657Z * [new branch] gh/guangyey/159/base -> origin/gh/guangyey/159/base 2025-08-14T21:18:06.0945910Z * [new branch] gh/guangyey/159/head -> origin/gh/guangyey/159/head 2025-08-14T21:18:06.0946670Z * [new branch] gh/guangyey/159/orig -> origin/gh/guangyey/159/orig 2025-08-14T21:18:06.0947577Z * [new branch] gh/guangyey/163/base -> origin/gh/guangyey/163/base 2025-08-14T21:18:06.0947865Z * [new branch] gh/guangyey/163/head -> origin/gh/guangyey/163/head 2025-08-14T21:18:06.0948777Z * [new branch] gh/guangyey/163/orig -> origin/gh/guangyey/163/orig 2025-08-14T21:18:06.0949412Z * [new branch] gh/guangyey/165/base -> origin/gh/guangyey/165/base 2025-08-14T21:18:06.0949918Z * [new branch] gh/guangyey/165/head -> origin/gh/guangyey/165/head 2025-08-14T21:18:06.0950832Z * [new branch] gh/guangyey/165/orig -> origin/gh/guangyey/165/orig 2025-08-14T21:18:06.0951364Z * [new branch] gh/guangyey/168/base -> origin/gh/guangyey/168/base 2025-08-14T21:18:06.0952044Z * [new branch] gh/guangyey/168/head -> origin/gh/guangyey/168/head 2025-08-14T21:18:06.0952573Z * [new branch] gh/guangyey/168/orig -> origin/gh/guangyey/168/orig 2025-08-14T21:18:06.0953580Z * [new branch] gh/guangyey/169/base -> origin/gh/guangyey/169/base 2025-08-14T21:18:06.0953954Z * [new branch] gh/guangyey/169/head -> origin/gh/guangyey/169/head 2025-08-14T21:18:06.0954874Z * [new branch] gh/guangyey/169/orig -> origin/gh/guangyey/169/orig 2025-08-14T21:18:06.0955735Z * [new branch] gh/guangyey/170/base -> origin/gh/guangyey/170/base 2025-08-14T21:18:06.0955928Z * [new branch] gh/guangyey/170/head -> origin/gh/guangyey/170/head 2025-08-14T21:18:06.0958530Z * [new branch] gh/guangyey/170/orig -> origin/gh/guangyey/170/orig 2025-08-14T21:18:06.0958697Z * [new branch] gh/guangyey/171/base -> origin/gh/guangyey/171/base 2025-08-14T21:18:06.0958828Z * [new branch] gh/guangyey/171/head -> origin/gh/guangyey/171/head 2025-08-14T21:18:06.0958984Z * [new branch] gh/guangyey/171/orig -> origin/gh/guangyey/171/orig 2025-08-14T21:18:06.0960432Z * [new branch] gh/guangyey/172/base -> origin/gh/guangyey/172/base 2025-08-14T21:18:06.0960558Z * [new branch] gh/guangyey/172/head -> origin/gh/guangyey/172/head 2025-08-14T21:18:06.0960963Z * [new branch] gh/guangyey/172/orig -> origin/gh/guangyey/172/orig 2025-08-14T21:18:06.0964520Z * [new branch] gh/guangyey/173/base -> origin/gh/guangyey/173/base 2025-08-14T21:18:06.0964700Z * [new branch] gh/guangyey/173/head -> origin/gh/guangyey/173/head 2025-08-14T21:18:06.0964827Z * [new branch] gh/guangyey/173/orig -> origin/gh/guangyey/173/orig 2025-08-14T21:18:06.0964947Z * [new branch] gh/guangyey/174/base -> origin/gh/guangyey/174/base 2025-08-14T21:18:06.0965074Z * [new branch] gh/guangyey/174/head -> origin/gh/guangyey/174/head 2025-08-14T21:18:06.0965496Z * [new branch] gh/guangyey/174/orig -> origin/gh/guangyey/174/orig 2025-08-14T21:18:06.0966022Z * [new branch] gh/guangyey/175/base -> origin/gh/guangyey/175/base 2025-08-14T21:18:06.0966550Z * [new branch] gh/guangyey/175/head -> origin/gh/guangyey/175/head 2025-08-14T21:18:06.0967306Z * [new branch] gh/guangyey/175/orig -> origin/gh/guangyey/175/orig 2025-08-14T21:18:06.0968601Z * [new branch] gh/guangyey/176/base -> origin/gh/guangyey/176/base 2025-08-14T21:18:06.0968744Z * [new branch] gh/guangyey/176/head -> origin/gh/guangyey/176/head 2025-08-14T21:18:06.0969196Z * [new branch] gh/guangyey/176/orig -> origin/gh/guangyey/176/orig 2025-08-14T21:18:06.0971143Z * [new branch] gh/guangyey/177/base -> origin/gh/guangyey/177/base 2025-08-14T21:18:06.0971443Z * [new branch] gh/guangyey/177/head -> origin/gh/guangyey/177/head 2025-08-14T21:18:06.0971610Z * [new branch] gh/guangyey/177/orig -> origin/gh/guangyey/177/orig 2025-08-14T21:18:06.0972158Z * [new branch] gh/guangyey/178/base -> origin/gh/guangyey/178/base 2025-08-14T21:18:06.0973630Z * [new branch] gh/guangyey/178/head -> origin/gh/guangyey/178/head 2025-08-14T21:18:06.0973960Z * [new branch] gh/guangyey/178/orig -> origin/gh/guangyey/178/orig 2025-08-14T21:18:06.0974256Z * [new branch] gh/guangyey/179/base -> origin/gh/guangyey/179/base 2025-08-14T21:18:06.0976413Z * [new branch] gh/guangyey/179/head -> origin/gh/guangyey/179/head 2025-08-14T21:18:06.0976785Z * [new branch] gh/guangyey/179/orig -> origin/gh/guangyey/179/orig 2025-08-14T21:18:06.0976939Z * [new branch] gh/guangyey/180/base -> origin/gh/guangyey/180/base 2025-08-14T21:18:06.0977082Z * [new branch] gh/guangyey/180/head -> origin/gh/guangyey/180/head 2025-08-14T21:18:06.0977573Z * [new branch] gh/guangyey/180/orig -> origin/gh/guangyey/180/orig 2025-08-14T21:18:06.0981636Z * [new branch] gh/guangyey/181/base -> origin/gh/guangyey/181/base 2025-08-14T21:18:06.0981951Z * [new branch] gh/guangyey/181/head -> origin/gh/guangyey/181/head 2025-08-14T21:18:06.0982109Z * [new branch] gh/guangyey/181/orig -> origin/gh/guangyey/181/orig 2025-08-14T21:18:06.0982341Z * [new branch] gh/guangyey/182/base -> origin/gh/guangyey/182/base 2025-08-14T21:18:06.0982524Z * [new branch] gh/guangyey/182/head -> origin/gh/guangyey/182/head 2025-08-14T21:18:06.0983113Z * [new branch] gh/guangyey/182/orig -> origin/gh/guangyey/182/orig 2025-08-14T21:18:06.0983470Z * [new branch] gh/guangyey/183/base -> origin/gh/guangyey/183/base 2025-08-14T21:18:06.0986701Z * [new branch] gh/guangyey/183/head -> origin/gh/guangyey/183/head 2025-08-14T21:18:06.0990989Z * [new branch] gh/guangyey/183/orig -> origin/gh/guangyey/183/orig 2025-08-14T21:18:06.0995049Z * [new branch] gh/guangyey/184/base -> origin/gh/guangyey/184/base 2025-08-14T21:18:06.0997296Z * [new branch] gh/guangyey/184/head -> origin/gh/guangyey/184/head 2025-08-14T21:18:06.1001713Z * [new branch] gh/guangyey/184/orig -> origin/gh/guangyey/184/orig 2025-08-14T21:18:06.1005907Z * [new branch] gh/guangyey/185/base -> origin/gh/guangyey/185/base 2025-08-14T21:18:06.1009767Z * [new branch] gh/guangyey/185/head -> origin/gh/guangyey/185/head 2025-08-14T21:18:06.1012947Z * [new branch] gh/guangyey/185/orig -> origin/gh/guangyey/185/orig 2025-08-14T21:18:06.1013223Z * [new branch] gh/guangyey/79/base -> origin/gh/guangyey/79/base 2025-08-14T21:18:06.1013663Z * [new branch] gh/guangyey/79/head -> origin/gh/guangyey/79/head 2025-08-14T21:18:06.1013993Z * [new branch] gh/guangyey/79/orig -> origin/gh/guangyey/79/orig 2025-08-14T21:18:06.1014122Z * [new branch] gh/guangyey/89/base -> origin/gh/guangyey/89/base 2025-08-14T21:18:06.1014245Z * [new branch] gh/guangyey/89/head -> origin/gh/guangyey/89/head 2025-08-14T21:18:06.1014364Z * [new branch] gh/guangyey/89/orig -> origin/gh/guangyey/89/orig 2025-08-14T21:18:06.1014552Z * [new branch] gh/guilhermeleobas/107/base -> origin/gh/guilhermeleobas/107/base 2025-08-14T21:18:06.1014697Z * [new branch] gh/guilhermeleobas/107/head -> origin/gh/guilhermeleobas/107/head 2025-08-14T21:18:06.1014846Z * [new branch] gh/guilhermeleobas/107/orig -> origin/gh/guilhermeleobas/107/orig 2025-08-14T21:18:06.1014985Z * [new branch] gh/guilhermeleobas/108/base -> origin/gh/guilhermeleobas/108/base 2025-08-14T21:18:06.1015127Z * [new branch] gh/guilhermeleobas/108/head -> origin/gh/guilhermeleobas/108/head 2025-08-14T21:18:06.1015272Z * [new branch] gh/guilhermeleobas/108/orig -> origin/gh/guilhermeleobas/108/orig 2025-08-14T21:18:06.1015408Z * [new branch] gh/guilhermeleobas/124/base -> origin/gh/guilhermeleobas/124/base 2025-08-14T21:18:06.1015552Z * [new branch] gh/guilhermeleobas/124/head -> origin/gh/guilhermeleobas/124/head 2025-08-14T21:18:06.1015753Z * [new branch] gh/guilhermeleobas/124/orig -> origin/gh/guilhermeleobas/124/orig 2025-08-14T21:18:06.1015894Z * [new branch] gh/guilhermeleobas/147/base -> origin/gh/guilhermeleobas/147/base 2025-08-14T21:18:06.1016038Z * [new branch] gh/guilhermeleobas/147/head -> origin/gh/guilhermeleobas/147/head 2025-08-14T21:18:06.1016176Z * [new branch] gh/guilhermeleobas/147/orig -> origin/gh/guilhermeleobas/147/orig 2025-08-14T21:18:06.1016326Z * [new branch] gh/guilhermeleobas/150/base -> origin/gh/guilhermeleobas/150/base 2025-08-14T21:18:06.1016463Z * [new branch] gh/guilhermeleobas/150/head -> origin/gh/guilhermeleobas/150/head 2025-08-14T21:18:06.1016601Z * [new branch] gh/guilhermeleobas/150/orig -> origin/gh/guilhermeleobas/150/orig 2025-08-14T21:18:06.1016743Z * [new branch] gh/guilhermeleobas/163/base -> origin/gh/guilhermeleobas/163/base 2025-08-14T21:18:06.1016880Z * [new branch] gh/guilhermeleobas/163/head -> origin/gh/guilhermeleobas/163/head 2025-08-14T21:18:06.1017028Z * [new branch] gh/guilhermeleobas/163/orig -> origin/gh/guilhermeleobas/163/orig 2025-08-14T21:18:06.1017165Z * [new branch] gh/guilhermeleobas/164/base -> origin/gh/guilhermeleobas/164/base 2025-08-14T21:18:06.1017301Z * [new branch] gh/guilhermeleobas/164/head -> origin/gh/guilhermeleobas/164/head 2025-08-14T21:18:06.1017449Z * [new branch] gh/guilhermeleobas/164/orig -> origin/gh/guilhermeleobas/164/orig 2025-08-14T21:18:06.1017587Z * [new branch] gh/guilhermeleobas/165/base -> origin/gh/guilhermeleobas/165/base 2025-08-14T21:18:06.1017733Z * [new branch] gh/guilhermeleobas/165/head -> origin/gh/guilhermeleobas/165/head 2025-08-14T21:18:06.1017873Z * [new branch] gh/guilhermeleobas/165/orig -> origin/gh/guilhermeleobas/165/orig 2025-08-14T21:18:06.1018009Z * [new branch] gh/guilhermeleobas/166/base -> origin/gh/guilhermeleobas/166/base 2025-08-14T21:18:06.1018154Z * [new branch] gh/guilhermeleobas/166/head -> origin/gh/guilhermeleobas/166/head 2025-08-14T21:18:06.1018292Z * [new branch] gh/guilhermeleobas/166/orig -> origin/gh/guilhermeleobas/166/orig 2025-08-14T21:18:06.1018436Z * [new branch] gh/guilhermeleobas/167/base -> origin/gh/guilhermeleobas/167/base 2025-08-14T21:18:06.1018575Z * [new branch] gh/guilhermeleobas/167/head -> origin/gh/guilhermeleobas/167/head 2025-08-14T21:18:06.1018791Z * [new branch] gh/guilhermeleobas/167/orig -> origin/gh/guilhermeleobas/167/orig 2025-08-14T21:18:06.1018942Z * [new branch] gh/guilhermeleobas/168/base -> origin/gh/guilhermeleobas/168/base 2025-08-14T21:18:06.1019079Z * [new branch] gh/guilhermeleobas/168/head -> origin/gh/guilhermeleobas/168/head 2025-08-14T21:18:06.1019223Z * [new branch] gh/guilhermeleobas/168/orig -> origin/gh/guilhermeleobas/168/orig 2025-08-14T21:18:06.1019363Z * [new branch] gh/guilhermeleobas/169/base -> origin/gh/guilhermeleobas/169/base 2025-08-14T21:18:06.1019500Z * [new branch] gh/guilhermeleobas/169/head -> origin/gh/guilhermeleobas/169/head 2025-08-14T21:18:06.1019644Z * [new branch] gh/guilhermeleobas/169/orig -> origin/gh/guilhermeleobas/169/orig 2025-08-14T21:18:06.1019785Z * [new branch] gh/guilhermeleobas/170/base -> origin/gh/guilhermeleobas/170/base 2025-08-14T21:18:06.1019938Z * [new branch] gh/guilhermeleobas/170/head -> origin/gh/guilhermeleobas/170/head 2025-08-14T21:18:06.1020592Z * [new branch] gh/guilhermeleobas/170/orig -> origin/gh/guilhermeleobas/170/orig 2025-08-14T21:18:06.1022080Z * [new branch] gh/guilhermeleobas/171/base -> origin/gh/guilhermeleobas/171/base 2025-08-14T21:18:06.1022255Z * [new branch] gh/guilhermeleobas/171/head -> origin/gh/guilhermeleobas/171/head 2025-08-14T21:18:06.1022765Z * [new branch] gh/guilhermeleobas/171/orig -> origin/gh/guilhermeleobas/171/orig 2025-08-14T21:18:06.1023326Z * [new branch] gh/guilhermeleobas/173/base -> origin/gh/guilhermeleobas/173/base 2025-08-14T21:18:06.1023862Z * [new branch] gh/guilhermeleobas/173/head -> origin/gh/guilhermeleobas/173/head 2025-08-14T21:18:06.1024541Z * [new branch] gh/guilhermeleobas/173/orig -> origin/gh/guilhermeleobas/173/orig 2025-08-14T21:18:06.1027906Z * [new branch] gh/guilhermeleobas/181/base -> origin/gh/guilhermeleobas/181/base 2025-08-14T21:18:06.1028224Z * [new branch] gh/guilhermeleobas/181/head -> origin/gh/guilhermeleobas/181/head 2025-08-14T21:18:06.1028423Z * [new branch] gh/guilhermeleobas/181/orig -> origin/gh/guilhermeleobas/181/orig 2025-08-14T21:18:06.1028591Z * [new branch] gh/guilhermeleobas/182/base -> origin/gh/guilhermeleobas/182/base 2025-08-14T21:18:06.1028824Z * [new branch] gh/guilhermeleobas/182/head -> origin/gh/guilhermeleobas/182/head 2025-08-14T21:18:06.1029074Z * [new branch] gh/guilhermeleobas/182/orig -> origin/gh/guilhermeleobas/182/orig 2025-08-14T21:18:06.1032708Z * [new branch] gh/guilhermeleobas/183/base -> origin/gh/guilhermeleobas/183/base 2025-08-14T21:18:06.1033029Z * [new branch] gh/guilhermeleobas/183/head -> origin/gh/guilhermeleobas/183/head 2025-08-14T21:18:06.1033281Z * [new branch] gh/guilhermeleobas/183/orig -> origin/gh/guilhermeleobas/183/orig 2025-08-14T21:18:06.1033442Z * [new branch] gh/guilhermeleobas/184/base -> origin/gh/guilhermeleobas/184/base 2025-08-14T21:18:06.1033690Z * [new branch] gh/guilhermeleobas/184/head -> origin/gh/guilhermeleobas/184/head 2025-08-14T21:18:06.1033849Z * [new branch] gh/guilhermeleobas/184/orig -> origin/gh/guilhermeleobas/184/orig 2025-08-14T21:18:06.1034116Z * [new branch] gh/guilhermeleobas/185/base -> origin/gh/guilhermeleobas/185/base 2025-08-14T21:18:06.1034629Z * [new branch] gh/guilhermeleobas/185/head -> origin/gh/guilhermeleobas/185/head 2025-08-14T21:18:06.1035486Z * [new branch] gh/guilhermeleobas/185/orig -> origin/gh/guilhermeleobas/185/orig 2025-08-14T21:18:06.1038341Z * [new branch] gh/guilhermeleobas/188/base -> origin/gh/guilhermeleobas/188/base 2025-08-14T21:18:06.1038681Z * [new branch] gh/guilhermeleobas/188/head -> origin/gh/guilhermeleobas/188/head 2025-08-14T21:18:06.1039126Z * [new branch] gh/guilhermeleobas/188/orig -> origin/gh/guilhermeleobas/188/orig 2025-08-14T21:18:06.1039703Z * [new branch] gh/guilhermeleobas/189/base -> origin/gh/guilhermeleobas/189/base 2025-08-14T21:18:06.1039888Z * [new branch] gh/guilhermeleobas/189/head -> origin/gh/guilhermeleobas/189/head 2025-08-14T21:18:06.1040039Z * [new branch] gh/guilhermeleobas/189/orig -> origin/gh/guilhermeleobas/189/orig 2025-08-14T21:18:06.1040534Z * [new branch] gh/guilhermeleobas/190/base -> origin/gh/guilhermeleobas/190/base 2025-08-14T21:18:06.1041342Z * [new branch] gh/guilhermeleobas/190/head -> origin/gh/guilhermeleobas/190/head 2025-08-14T21:18:06.1041679Z * [new branch] gh/guilhermeleobas/190/orig -> origin/gh/guilhermeleobas/190/orig 2025-08-14T21:18:06.1043900Z * [new branch] gh/guilhermeleobas/192/base -> origin/gh/guilhermeleobas/192/base 2025-08-14T21:18:06.1044229Z * [new branch] gh/guilhermeleobas/192/head -> origin/gh/guilhermeleobas/192/head 2025-08-14T21:18:06.1044427Z * [new branch] gh/guilhermeleobas/192/orig -> origin/gh/guilhermeleobas/192/orig 2025-08-14T21:18:06.1044601Z * [new branch] gh/guilhermeleobas/193/base -> origin/gh/guilhermeleobas/193/base 2025-08-14T21:18:06.1045172Z * [new branch] gh/guilhermeleobas/193/head -> origin/gh/guilhermeleobas/193/head 2025-08-14T21:18:06.1047821Z * [new branch] gh/guilhermeleobas/193/orig -> origin/gh/guilhermeleobas/193/orig 2025-08-14T21:18:06.1048177Z * [new branch] gh/guilhermeleobas/194/base -> origin/gh/guilhermeleobas/194/base 2025-08-14T21:18:06.1048413Z * [new branch] gh/guilhermeleobas/194/head -> origin/gh/guilhermeleobas/194/head 2025-08-14T21:18:06.1049042Z * [new branch] gh/guilhermeleobas/194/orig -> origin/gh/guilhermeleobas/194/orig 2025-08-14T21:18:06.1049238Z * [new branch] gh/guilhermeleobas/203/base -> origin/gh/guilhermeleobas/203/base 2025-08-14T21:18:06.1049416Z * [new branch] gh/guilhermeleobas/203/head -> origin/gh/guilhermeleobas/203/head 2025-08-14T21:18:06.1049911Z * [new branch] gh/guilhermeleobas/203/orig -> origin/gh/guilhermeleobas/203/orig 2025-08-14T21:18:06.1053358Z * [new branch] gh/guilhermeleobas/204/base -> origin/gh/guilhermeleobas/204/base 2025-08-14T21:18:06.1053673Z * [new branch] gh/guilhermeleobas/204/head -> origin/gh/guilhermeleobas/204/head 2025-08-14T21:18:06.1053910Z * [new branch] gh/guilhermeleobas/204/orig -> origin/gh/guilhermeleobas/204/orig 2025-08-14T21:18:06.1054166Z * [new branch] gh/guilhermeleobas/205/base -> origin/gh/guilhermeleobas/205/base 2025-08-14T21:18:06.1054333Z * [new branch] gh/guilhermeleobas/205/head -> origin/gh/guilhermeleobas/205/head 2025-08-14T21:18:06.1054488Z * [new branch] gh/guilhermeleobas/205/orig -> origin/gh/guilhermeleobas/205/orig 2025-08-14T21:18:06.1054980Z * [new branch] gh/guilhermeleobas/206/base -> origin/gh/guilhermeleobas/206/base 2025-08-14T21:18:06.1055478Z * [new branch] gh/guilhermeleobas/206/head -> origin/gh/guilhermeleobas/206/head 2025-08-14T21:18:06.1056185Z * [new branch] gh/guilhermeleobas/206/orig -> origin/gh/guilhermeleobas/206/orig 2025-08-14T21:18:06.1058473Z * [new branch] gh/guilhermeleobas/207/base -> origin/gh/guilhermeleobas/207/base 2025-08-14T21:18:06.1058667Z * [new branch] gh/guilhermeleobas/207/head -> origin/gh/guilhermeleobas/207/head 2025-08-14T21:18:06.1058826Z * [new branch] gh/guilhermeleobas/207/orig -> origin/gh/guilhermeleobas/207/orig 2025-08-14T21:18:06.1059284Z * [new branch] gh/guilhermeleobas/208/base -> origin/gh/guilhermeleobas/208/base 2025-08-14T21:18:06.1060817Z * [new branch] gh/guilhermeleobas/208/head -> origin/gh/guilhermeleobas/208/head 2025-08-14T21:18:06.1061155Z * [new branch] gh/guilhermeleobas/208/orig -> origin/gh/guilhermeleobas/208/orig 2025-08-14T21:18:06.1061855Z * [new branch] gh/guilhermeleobas/209/base -> origin/gh/guilhermeleobas/209/base 2025-08-14T21:18:06.1062799Z * [new branch] gh/guilhermeleobas/209/head -> origin/gh/guilhermeleobas/209/head 2025-08-14T21:18:06.1062948Z * [new branch] gh/guilhermeleobas/209/orig -> origin/gh/guilhermeleobas/209/orig 2025-08-14T21:18:06.1064297Z * [new branch] gh/guilhermeleobas/210/base -> origin/gh/guilhermeleobas/210/base 2025-08-14T21:18:06.1064572Z * [new branch] gh/guilhermeleobas/210/head -> origin/gh/guilhermeleobas/210/head 2025-08-14T21:18:06.1067433Z * [new branch] gh/guilhermeleobas/210/orig -> origin/gh/guilhermeleobas/210/orig 2025-08-14T21:18:06.1067614Z * [new branch] gh/guilhermeleobas/211/base -> origin/gh/guilhermeleobas/211/base 2025-08-14T21:18:06.1067775Z * [new branch] gh/guilhermeleobas/211/head -> origin/gh/guilhermeleobas/211/head 2025-08-14T21:18:06.1067923Z * [new branch] gh/guilhermeleobas/211/orig -> origin/gh/guilhermeleobas/211/orig 2025-08-14T21:18:06.1068136Z * [new branch] gh/guilhermeleobas/212/base -> origin/gh/guilhermeleobas/212/base 2025-08-14T21:18:06.1068604Z * [new branch] gh/guilhermeleobas/212/head -> origin/gh/guilhermeleobas/212/head 2025-08-14T21:18:06.1072505Z * [new branch] gh/guilhermeleobas/212/orig -> origin/gh/guilhermeleobas/212/orig 2025-08-14T21:18:06.1072980Z * [new branch] gh/guilhermeleobas/213/base -> origin/gh/guilhermeleobas/213/base 2025-08-14T21:18:06.1073271Z * [new branch] gh/guilhermeleobas/213/head -> origin/gh/guilhermeleobas/213/head 2025-08-14T21:18:06.1073863Z * [new branch] gh/guilhermeleobas/213/orig -> origin/gh/guilhermeleobas/213/orig 2025-08-14T21:18:06.1074058Z * [new branch] gh/guilhermeleobas/214/base -> origin/gh/guilhermeleobas/214/base 2025-08-14T21:18:06.1074204Z * [new branch] gh/guilhermeleobas/214/head -> origin/gh/guilhermeleobas/214/head 2025-08-14T21:18:06.1074343Z * [new branch] gh/guilhermeleobas/214/orig -> origin/gh/guilhermeleobas/214/orig 2025-08-14T21:18:06.1074530Z * [new branch] gh/guilhermeleobas/215/base -> origin/gh/guilhermeleobas/215/base 2025-08-14T21:18:06.1075078Z * [new branch] gh/guilhermeleobas/215/head -> origin/gh/guilhermeleobas/215/head 2025-08-14T21:18:06.1078887Z * [new branch] gh/guilhermeleobas/215/orig -> origin/gh/guilhermeleobas/215/orig 2025-08-14T21:18:06.1079208Z * [new branch] gh/guilhermeleobas/216/base -> origin/gh/guilhermeleobas/216/base 2025-08-14T21:18:06.1079461Z * [new branch] gh/guilhermeleobas/216/head -> origin/gh/guilhermeleobas/216/head 2025-08-14T21:18:06.1079628Z * [new branch] gh/guilhermeleobas/216/orig -> origin/gh/guilhermeleobas/216/orig 2025-08-14T21:18:06.1079765Z * [new branch] gh/guilhermeleobas/217/base -> origin/gh/guilhermeleobas/217/base 2025-08-14T21:18:06.1079920Z * [new branch] gh/guilhermeleobas/217/head -> origin/gh/guilhermeleobas/217/head 2025-08-14T21:18:06.1080062Z * [new branch] gh/guilhermeleobas/217/orig -> origin/gh/guilhermeleobas/217/orig 2025-08-14T21:18:06.1081801Z * [new branch] gh/guilhermeleobas/218/base -> origin/gh/guilhermeleobas/218/base 2025-08-14T21:18:06.1082143Z * [new branch] gh/guilhermeleobas/218/head -> origin/gh/guilhermeleobas/218/head 2025-08-14T21:18:06.1082364Z * [new branch] gh/guilhermeleobas/218/orig -> origin/gh/guilhermeleobas/218/orig 2025-08-14T21:18:06.1083864Z * [new branch] gh/guilhermeleobas/219/base -> origin/gh/guilhermeleobas/219/base 2025-08-14T21:18:06.1084185Z * [new branch] gh/guilhermeleobas/219/head -> origin/gh/guilhermeleobas/219/head 2025-08-14T21:18:06.1084734Z * [new branch] gh/guilhermeleobas/219/orig -> origin/gh/guilhermeleobas/219/orig 2025-08-14T21:18:06.1085057Z * [new branch] gh/guilhermeleobas/220/base -> origin/gh/guilhermeleobas/220/base 2025-08-14T21:18:06.1085965Z * [new branch] gh/guilhermeleobas/220/head -> origin/gh/guilhermeleobas/220/head 2025-08-14T21:18:06.1086291Z * [new branch] gh/guilhermeleobas/220/orig -> origin/gh/guilhermeleobas/220/orig 2025-08-14T21:18:06.1088785Z * [new branch] gh/guilhermeleobas/221/base -> origin/gh/guilhermeleobas/221/base 2025-08-14T21:18:06.1089106Z * [new branch] gh/guilhermeleobas/221/head -> origin/gh/guilhermeleobas/221/head 2025-08-14T21:18:06.1089271Z * [new branch] gh/guilhermeleobas/221/orig -> origin/gh/guilhermeleobas/221/orig 2025-08-14T21:18:06.1089498Z * [new branch] gh/guilhermeleobas/222/base -> origin/gh/guilhermeleobas/222/base 2025-08-14T21:18:06.1090251Z * [new branch] gh/guilhermeleobas/222/head -> origin/gh/guilhermeleobas/222/head 2025-08-14T21:18:06.1090661Z * [new branch] gh/guilhermeleobas/222/orig -> origin/gh/guilhermeleobas/222/orig 2025-08-14T21:18:06.1092482Z * [new branch] gh/guilhermeleobas/223/base -> origin/gh/guilhermeleobas/223/base 2025-08-14T21:18:06.1092814Z * [new branch] gh/guilhermeleobas/223/head -> origin/gh/guilhermeleobas/223/head 2025-08-14T21:18:06.1093160Z * [new branch] gh/guilhermeleobas/223/orig -> origin/gh/guilhermeleobas/223/orig 2025-08-14T21:18:06.1094785Z * [new branch] gh/guilhermeleobas/224/base -> origin/gh/guilhermeleobas/224/base 2025-08-14T21:18:06.1095149Z * [new branch] gh/guilhermeleobas/224/head -> origin/gh/guilhermeleobas/224/head 2025-08-14T21:18:06.1095384Z * [new branch] gh/guilhermeleobas/224/orig -> origin/gh/guilhermeleobas/224/orig 2025-08-14T21:18:06.1097071Z * [new branch] gh/guilhermeleobas/225/base -> origin/gh/guilhermeleobas/225/base 2025-08-14T21:18:06.1097390Z * [new branch] gh/guilhermeleobas/225/head -> origin/gh/guilhermeleobas/225/head 2025-08-14T21:18:06.1097554Z * [new branch] gh/guilhermeleobas/225/orig -> origin/gh/guilhermeleobas/225/orig 2025-08-14T21:18:06.1098518Z * [new branch] gh/guilhermeleobas/226/base -> origin/gh/guilhermeleobas/226/base 2025-08-14T21:18:06.1099557Z * [new branch] gh/guilhermeleobas/226/head -> origin/gh/guilhermeleobas/226/head 2025-08-14T21:18:06.1100032Z * [new branch] gh/guilhermeleobas/226/orig -> origin/gh/guilhermeleobas/226/orig 2025-08-14T21:18:06.1100562Z * [new branch] gh/guilhermeleobas/227/base -> origin/gh/guilhermeleobas/227/base 2025-08-14T21:18:06.1101788Z * [new branch] gh/guilhermeleobas/227/head -> origin/gh/guilhermeleobas/227/head 2025-08-14T21:18:06.1102110Z * [new branch] gh/guilhermeleobas/227/orig -> origin/gh/guilhermeleobas/227/orig 2025-08-14T21:18:06.1103560Z * [new branch] gh/guilhermeleobas/228/base -> origin/gh/guilhermeleobas/228/base 2025-08-14T21:18:06.1103734Z * [new branch] gh/guilhermeleobas/228/head -> origin/gh/guilhermeleobas/228/head 2025-08-14T21:18:06.1104346Z * [new branch] gh/guilhermeleobas/228/orig -> origin/gh/guilhermeleobas/228/orig 2025-08-14T21:18:06.1105696Z * [new branch] gh/guilhermeleobas/229/base -> origin/gh/guilhermeleobas/229/base 2025-08-14T21:18:06.1105877Z * [new branch] gh/guilhermeleobas/229/head -> origin/gh/guilhermeleobas/229/head 2025-08-14T21:18:06.1107741Z * [new branch] gh/guilhermeleobas/229/orig -> origin/gh/guilhermeleobas/229/orig 2025-08-14T21:18:06.1108043Z * [new branch] gh/guilhermeleobas/230/base -> origin/gh/guilhermeleobas/230/base 2025-08-14T21:18:06.1108215Z * [new branch] gh/guilhermeleobas/230/head -> origin/gh/guilhermeleobas/230/head 2025-08-14T21:18:06.1108622Z * [new branch] gh/guilhermeleobas/230/orig -> origin/gh/guilhermeleobas/230/orig 2025-08-14T21:18:06.1111827Z * [new branch] gh/guilhermeleobas/231/base -> origin/gh/guilhermeleobas/231/base 2025-08-14T21:18:06.1112160Z * [new branch] gh/guilhermeleobas/231/head -> origin/gh/guilhermeleobas/231/head 2025-08-14T21:18:06.1112330Z * [new branch] gh/guilhermeleobas/231/orig -> origin/gh/guilhermeleobas/231/orig 2025-08-14T21:18:06.1112494Z * [new branch] gh/guilhermeleobas/232/base -> origin/gh/guilhermeleobas/232/base 2025-08-14T21:18:06.1112760Z * [new branch] gh/guilhermeleobas/232/head -> origin/gh/guilhermeleobas/232/head 2025-08-14T21:18:06.1113276Z * [new branch] gh/guilhermeleobas/232/orig -> origin/gh/guilhermeleobas/232/orig 2025-08-14T21:18:06.1113788Z * [new branch] gh/guilhermeleobas/233/base -> origin/gh/guilhermeleobas/233/base 2025-08-14T21:18:06.1114114Z * [new branch] gh/guilhermeleobas/233/head -> origin/gh/guilhermeleobas/233/head 2025-08-14T21:18:06.1116329Z * [new branch] gh/guilhermeleobas/233/orig -> origin/gh/guilhermeleobas/233/orig 2025-08-14T21:18:06.1116652Z * [new branch] gh/guilhermeleobas/73/base -> origin/gh/guilhermeleobas/73/base 2025-08-14T21:18:06.1116814Z * [new branch] gh/guilhermeleobas/73/head -> origin/gh/guilhermeleobas/73/head 2025-08-14T21:18:06.1117316Z * [new branch] gh/guilhermeleobas/73/orig -> origin/gh/guilhermeleobas/73/orig 2025-08-14T21:18:06.1120862Z * [new branch] gh/henrylhtsang/103/base -> origin/gh/henrylhtsang/103/base 2025-08-14T21:18:06.1121209Z * [new branch] gh/henrylhtsang/103/head -> origin/gh/henrylhtsang/103/head 2025-08-14T21:18:06.1121368Z * [new branch] gh/henrylhtsang/103/orig -> origin/gh/henrylhtsang/103/orig 2025-08-14T21:18:06.1122071Z * [new branch] gh/henrylhtsang/108/base -> origin/gh/henrylhtsang/108/base 2025-08-14T21:18:06.1122248Z * [new branch] gh/henrylhtsang/108/head -> origin/gh/henrylhtsang/108/head 2025-08-14T21:18:06.1122501Z * [new branch] gh/henrylhtsang/108/orig -> origin/gh/henrylhtsang/108/orig 2025-08-14T21:18:06.1123746Z * [new branch] gh/henrylhtsang/118/base -> origin/gh/henrylhtsang/118/base 2025-08-14T21:18:06.1123930Z * [new branch] gh/henrylhtsang/118/head -> origin/gh/henrylhtsang/118/head 2025-08-14T21:18:06.1124344Z * [new branch] gh/henrylhtsang/118/orig -> origin/gh/henrylhtsang/118/orig 2025-08-14T21:18:06.1125937Z * [new branch] gh/henrylhtsang/123/base -> origin/gh/henrylhtsang/123/base 2025-08-14T21:18:06.1126257Z * [new branch] gh/henrylhtsang/123/head -> origin/gh/henrylhtsang/123/head 2025-08-14T21:18:06.1126657Z * [new branch] gh/henrylhtsang/123/orig -> origin/gh/henrylhtsang/123/orig 2025-08-14T21:18:06.1128239Z * [new branch] gh/henrylhtsang/124/base -> origin/gh/henrylhtsang/124/base 2025-08-14T21:18:06.1128445Z * [new branch] gh/henrylhtsang/124/head -> origin/gh/henrylhtsang/124/head 2025-08-14T21:18:06.1130631Z * [new branch] gh/henrylhtsang/124/orig -> origin/gh/henrylhtsang/124/orig 2025-08-14T21:18:06.1130956Z * [new branch] gh/henrylhtsang/125/base -> origin/gh/henrylhtsang/125/base 2025-08-14T21:18:06.1131120Z * [new branch] gh/henrylhtsang/125/head -> origin/gh/henrylhtsang/125/head 2025-08-14T21:18:06.1131310Z * [new branch] gh/henrylhtsang/125/orig -> origin/gh/henrylhtsang/125/orig 2025-08-14T21:18:06.1132680Z * [new branch] gh/henrylhtsang/126/base -> origin/gh/henrylhtsang/126/base 2025-08-14T21:18:06.1132988Z * [new branch] gh/henrylhtsang/126/head -> origin/gh/henrylhtsang/126/head 2025-08-14T21:18:06.1133295Z * [new branch] gh/henrylhtsang/126/orig -> origin/gh/henrylhtsang/126/orig 2025-08-14T21:18:06.1136054Z * [new branch] gh/henrylhtsang/127/base -> origin/gh/henrylhtsang/127/base 2025-08-14T21:18:06.1136377Z * [new branch] gh/henrylhtsang/127/head -> origin/gh/henrylhtsang/127/head 2025-08-14T21:18:06.1136537Z * [new branch] gh/henrylhtsang/127/orig -> origin/gh/henrylhtsang/127/orig 2025-08-14T21:18:06.1136772Z * [new branch] gh/henrylhtsang/128/base -> origin/gh/henrylhtsang/128/base 2025-08-14T21:18:06.1137129Z * [new branch] gh/henrylhtsang/128/head -> origin/gh/henrylhtsang/128/head 2025-08-14T21:18:06.1138031Z * [new branch] gh/henrylhtsang/128/orig -> origin/gh/henrylhtsang/128/orig 2025-08-14T21:18:06.1138806Z * [new branch] gh/henrylhtsang/129/base -> origin/gh/henrylhtsang/129/base 2025-08-14T21:18:06.1139241Z * [new branch] gh/henrylhtsang/129/head -> origin/gh/henrylhtsang/129/head 2025-08-14T21:18:06.1140142Z * [new branch] gh/henrylhtsang/129/orig -> origin/gh/henrylhtsang/129/orig 2025-08-14T21:18:06.1140857Z * [new branch] gh/henrylhtsang/130/base -> origin/gh/henrylhtsang/130/base 2025-08-14T21:18:06.1141269Z * [new branch] gh/henrylhtsang/130/head -> origin/gh/henrylhtsang/130/head 2025-08-14T21:18:06.1142294Z * [new branch] gh/henrylhtsang/131/base -> origin/gh/henrylhtsang/131/base 2025-08-14T21:18:06.1142691Z * [new branch] gh/henrylhtsang/131/head -> origin/gh/henrylhtsang/131/head 2025-08-14T21:18:06.1143346Z * [new branch] gh/henrylhtsang/131/orig -> origin/gh/henrylhtsang/131/orig 2025-08-14T21:18:06.1144452Z * [new branch] gh/henrylhtsang/132/base -> origin/gh/henrylhtsang/132/base 2025-08-14T21:18:06.1145424Z * [new branch] gh/henrylhtsang/132/head -> origin/gh/henrylhtsang/132/head 2025-08-14T21:18:06.1146032Z * [new branch] gh/henrylhtsang/132/orig -> origin/gh/henrylhtsang/132/orig 2025-08-14T21:18:06.1147143Z * [new branch] gh/henrylhtsang/133/base -> origin/gh/henrylhtsang/133/base 2025-08-14T21:18:06.1147481Z * [new branch] gh/henrylhtsang/133/head -> origin/gh/henrylhtsang/133/head 2025-08-14T21:18:06.1148406Z * [new branch] gh/henrylhtsang/133/orig -> origin/gh/henrylhtsang/133/orig 2025-08-14T21:18:06.1149285Z * [new branch] gh/henrylhtsang/134/base -> origin/gh/henrylhtsang/134/base 2025-08-14T21:18:06.1149593Z * [new branch] gh/henrylhtsang/134/head -> origin/gh/henrylhtsang/134/head 2025-08-14T21:18:06.1150527Z * [new branch] gh/henrylhtsang/134/orig -> origin/gh/henrylhtsang/134/orig 2025-08-14T21:18:06.1151349Z * [new branch] gh/henrylhtsang/135/base -> origin/gh/henrylhtsang/135/base 2025-08-14T21:18:06.1151892Z * [new branch] gh/henrylhtsang/135/head -> origin/gh/henrylhtsang/135/head 2025-08-14T21:18:06.1152672Z * [new branch] gh/henrylhtsang/135/orig -> origin/gh/henrylhtsang/135/orig 2025-08-14T21:18:06.1153868Z * [new branch] gh/henrylhtsang/136/base -> origin/gh/henrylhtsang/136/base 2025-08-14T21:18:06.1154114Z * [new branch] gh/henrylhtsang/136/head -> origin/gh/henrylhtsang/136/head 2025-08-14T21:18:06.1156004Z * [new branch] gh/henrylhtsang/136/orig -> origin/gh/henrylhtsang/136/orig 2025-08-14T21:18:06.1156217Z * [new branch] gh/henrylhtsang/137/base -> origin/gh/henrylhtsang/137/base 2025-08-14T21:18:06.1156356Z * [new branch] gh/henrylhtsang/137/head -> origin/gh/henrylhtsang/137/head 2025-08-14T21:18:06.1156842Z * [new branch] gh/henrylhtsang/137/orig -> origin/gh/henrylhtsang/137/orig 2025-08-14T21:18:06.1159958Z * [new branch] gh/henrylhtsang/138/base -> origin/gh/henrylhtsang/138/base 2025-08-14T21:18:06.1160285Z * [new branch] gh/henrylhtsang/138/head -> origin/gh/henrylhtsang/138/head 2025-08-14T21:18:06.1160434Z * [new branch] gh/henrylhtsang/138/orig -> origin/gh/henrylhtsang/138/orig 2025-08-14T21:18:06.1160575Z * [new branch] gh/henrylhtsang/139/base -> origin/gh/henrylhtsang/139/base 2025-08-14T21:18:06.1160760Z * [new branch] gh/henrylhtsang/139/head -> origin/gh/henrylhtsang/139/head 2025-08-14T21:18:06.1162035Z * [new branch] gh/henrylhtsang/139/orig -> origin/gh/henrylhtsang/139/orig 2025-08-14T21:18:06.1162553Z * [new branch] gh/henrylhtsang/140/base -> origin/gh/henrylhtsang/140/base 2025-08-14T21:18:06.1164515Z * [new branch] gh/henrylhtsang/140/head -> origin/gh/henrylhtsang/140/head 2025-08-14T21:18:06.1164834Z * [new branch] gh/henrylhtsang/140/orig -> origin/gh/henrylhtsang/140/orig 2025-08-14T21:18:06.1165013Z * [new branch] gh/henrylhtsang/141/base -> origin/gh/henrylhtsang/141/base 2025-08-14T21:18:06.1165235Z * [new branch] gh/henrylhtsang/141/head -> origin/gh/henrylhtsang/141/head 2025-08-14T21:18:06.1165900Z * [new branch] gh/henrylhtsang/141/orig -> origin/gh/henrylhtsang/141/orig 2025-08-14T21:18:06.1168269Z * [new branch] gh/henrylhtsang/142/base -> origin/gh/henrylhtsang/142/base 2025-08-14T21:18:06.1168581Z * [new branch] gh/henrylhtsang/142/head -> origin/gh/henrylhtsang/142/head 2025-08-14T21:18:06.1168919Z * [new branch] gh/henrylhtsang/142/orig -> origin/gh/henrylhtsang/142/orig 2025-08-14T21:18:06.1169072Z * [new branch] gh/henrylhtsang/143/base -> origin/gh/henrylhtsang/143/base 2025-08-14T21:18:06.1170359Z * [new branch] gh/henrylhtsang/143/head -> origin/gh/henrylhtsang/143/head 2025-08-14T21:18:06.1170677Z * [new branch] gh/henrylhtsang/143/orig -> origin/gh/henrylhtsang/143/orig 2025-08-14T21:18:06.1172573Z * [new branch] gh/henrylhtsang/144/base -> origin/gh/henrylhtsang/144/base 2025-08-14T21:18:06.1172876Z * [new branch] gh/henrylhtsang/144/head -> origin/gh/henrylhtsang/144/head 2025-08-14T21:18:06.1173114Z * [new branch] gh/henrylhtsang/144/orig -> origin/gh/henrylhtsang/144/orig 2025-08-14T21:18:06.1173265Z * [new branch] gh/henrylhtsang/145/base -> origin/gh/henrylhtsang/145/base 2025-08-14T21:18:06.1174402Z * [new branch] gh/henrylhtsang/145/head -> origin/gh/henrylhtsang/145/head 2025-08-14T21:18:06.1174861Z * [new branch] gh/henrylhtsang/145/orig -> origin/gh/henrylhtsang/145/orig 2025-08-14T21:18:06.1175478Z * [new branch] gh/henrylhtsang/146/base -> origin/gh/henrylhtsang/146/base 2025-08-14T21:18:06.1177414Z * [new branch] gh/henrylhtsang/146/head -> origin/gh/henrylhtsang/146/head 2025-08-14T21:18:06.1177764Z * [new branch] gh/henrylhtsang/146/orig -> origin/gh/henrylhtsang/146/orig 2025-08-14T21:18:06.1177921Z * [new branch] gh/huydhn/1/head -> origin/gh/huydhn/1/head 2025-08-14T21:18:06.1178294Z * [new branch] gh/huydhn/1/next -> origin/gh/huydhn/1/next 2025-08-14T21:18:06.1179376Z * [new branch] gh/huydhn/2/head -> origin/gh/huydhn/2/head 2025-08-14T21:18:06.1179586Z * [new branch] gh/huydhn/2/next -> origin/gh/huydhn/2/next 2025-08-14T21:18:06.1180525Z * [new branch] gh/huydhn/2/orig -> origin/gh/huydhn/2/orig 2025-08-14T21:18:06.1181391Z * [new branch] gh/huydhn/3/head -> origin/gh/huydhn/3/head 2025-08-14T21:18:06.1182029Z * [new branch] gh/huydhn/3/next -> origin/gh/huydhn/3/next 2025-08-14T21:18:06.1182208Z * [new branch] gh/huydhn/3/orig -> origin/gh/huydhn/3/orig 2025-08-14T21:18:06.1183619Z * [new branch] gh/huydhn/4/head -> origin/gh/huydhn/4/head 2025-08-14T21:18:06.1183874Z * [new branch] gh/huydhn/4/next -> origin/gh/huydhn/4/next 2025-08-14T21:18:06.1185160Z * [new branch] gh/huydhn/4/orig -> origin/gh/huydhn/4/orig 2025-08-14T21:18:06.1189817Z * [new branch] gh/huydhn/5/head -> origin/gh/huydhn/5/head 2025-08-14T21:18:06.1193949Z * [new branch] gh/huydhn/5/next -> origin/gh/huydhn/5/next 2025-08-14T21:18:06.1197902Z * [new branch] gh/huydhn/5/orig -> origin/gh/huydhn/5/orig 2025-08-14T21:18:06.1200296Z * [new branch] gh/huydhn/6/head -> origin/gh/huydhn/6/head 2025-08-14T21:18:06.1200548Z * [new branch] gh/huydhn/6/next -> origin/gh/huydhn/6/next 2025-08-14T21:18:06.1205220Z * [new branch] gh/huydhn/6/orig -> origin/gh/huydhn/6/orig 2025-08-14T21:18:06.1208782Z * [new branch] gh/int3/97/base -> origin/gh/int3/97/base 2025-08-14T21:18:06.1210265Z * [new branch] gh/int3/97/head -> origin/gh/int3/97/head 2025-08-14T21:18:06.1210427Z * [new branch] gh/isuruf/101/base -> origin/gh/isuruf/101/base 2025-08-14T21:18:06.1210563Z * [new branch] gh/isuruf/101/head -> origin/gh/isuruf/101/head 2025-08-14T21:18:06.1210683Z * [new branch] gh/isuruf/116/base -> origin/gh/isuruf/116/base 2025-08-14T21:18:06.1210807Z * [new branch] gh/isuruf/116/head -> origin/gh/isuruf/116/head 2025-08-14T21:18:06.1211107Z * [new branch] gh/isuruf/116/orig -> origin/gh/isuruf/116/orig 2025-08-14T21:18:06.1211237Z * [new branch] gh/isuruf/141/base -> origin/gh/isuruf/141/base 2025-08-14T21:18:06.1211354Z * [new branch] gh/isuruf/141/head -> origin/gh/isuruf/141/head 2025-08-14T21:18:06.1211474Z * [new branch] gh/isuruf/141/orig -> origin/gh/isuruf/141/orig 2025-08-14T21:18:06.1211599Z * [new branch] gh/isuruf/142/base -> origin/gh/isuruf/142/base 2025-08-14T21:18:06.1211713Z * [new branch] gh/isuruf/142/head -> origin/gh/isuruf/142/head 2025-08-14T21:18:06.1211836Z * [new branch] gh/isuruf/142/orig -> origin/gh/isuruf/142/orig 2025-08-14T21:18:06.1211955Z * [new branch] gh/isuruf/81/base -> origin/gh/isuruf/81/base 2025-08-14T21:18:06.1212071Z * [new branch] gh/isuruf/81/head -> origin/gh/isuruf/81/head 2025-08-14T21:18:06.1244081Z * [new branch] gh/isuruf/81/orig -> origin/gh/isuruf/81/orig 2025-08-14T21:18:06.1244338Z * [new branch] gh/jamesjwu/140/base -> origin/gh/jamesjwu/140/base 2025-08-14T21:18:06.1244490Z * [new branch] gh/jamesjwu/140/head -> origin/gh/jamesjwu/140/head 2025-08-14T21:18:06.1244623Z * [new branch] gh/jamesjwu/140/orig -> origin/gh/jamesjwu/140/orig 2025-08-14T21:18:06.1244745Z * [new branch] gh/jamesjwu/150/base -> origin/gh/jamesjwu/150/base 2025-08-14T21:18:06.1244877Z * [new branch] gh/jamesjwu/150/head -> origin/gh/jamesjwu/150/head 2025-08-14T21:18:06.1244996Z * [new branch] gh/jamesjwu/150/orig -> origin/gh/jamesjwu/150/orig 2025-08-14T21:18:06.1245120Z * [new branch] gh/jamesjwu/154/base -> origin/gh/jamesjwu/154/base 2025-08-14T21:18:06.1245249Z * [new branch] gh/jamesjwu/154/head -> origin/gh/jamesjwu/154/head 2025-08-14T21:18:06.1245367Z * [new branch] gh/jamesjwu/154/orig -> origin/gh/jamesjwu/154/orig 2025-08-14T21:18:06.1245493Z * [new branch] gh/jamesjwu/155/base -> origin/gh/jamesjwu/155/base 2025-08-14T21:18:06.1245609Z * [new branch] gh/jamesjwu/155/head -> origin/gh/jamesjwu/155/head 2025-08-14T21:18:06.1245850Z * [new branch] gh/jamesjwu/155/orig -> origin/gh/jamesjwu/155/orig 2025-08-14T21:18:06.1245981Z * [new branch] gh/jamesjwu/159/base -> origin/gh/jamesjwu/159/base 2025-08-14T21:18:06.1246102Z * [new branch] gh/jamesjwu/159/head -> origin/gh/jamesjwu/159/head 2025-08-14T21:18:06.1246370Z * [new branch] gh/jamesjwu/159/orig -> origin/gh/jamesjwu/159/orig 2025-08-14T21:18:06.1249773Z * [new branch] gh/jamesjwu/163/base -> origin/gh/jamesjwu/163/base 2025-08-14T21:18:06.1253743Z * [new branch] gh/jamesjwu/163/head -> origin/gh/jamesjwu/163/head 2025-08-14T21:18:06.1255459Z * [new branch] gh/jamesjwu/163/orig -> origin/gh/jamesjwu/163/orig 2025-08-14T21:18:06.1255708Z * [new branch] gh/jamesjwu/171/base -> origin/gh/jamesjwu/171/base 2025-08-14T21:18:06.1260874Z * [new branch] gh/jamesjwu/171/head -> origin/gh/jamesjwu/171/head 2025-08-14T21:18:06.1261125Z * [new branch] gh/jamesjwu/171/orig -> origin/gh/jamesjwu/171/orig 2025-08-14T21:18:06.1261330Z * [new branch] gh/jamesjwu/174/base -> origin/gh/jamesjwu/174/base 2025-08-14T21:18:06.1261540Z * [new branch] gh/jamesjwu/174/head -> origin/gh/jamesjwu/174/head 2025-08-14T21:18:06.1261663Z * [new branch] gh/jamesjwu/174/orig -> origin/gh/jamesjwu/174/orig 2025-08-14T21:18:06.1261793Z * [new branch] gh/jamesjwu/175/base -> origin/gh/jamesjwu/175/base 2025-08-14T21:18:06.1262038Z * [new branch] gh/jamesjwu/175/head -> origin/gh/jamesjwu/175/head 2025-08-14T21:18:06.1262161Z * [new branch] gh/jamesjwu/175/orig -> origin/gh/jamesjwu/175/orig 2025-08-14T21:18:06.1262286Z * [new branch] gh/jamesjwu/176/base -> origin/gh/jamesjwu/176/base 2025-08-14T21:18:06.1262402Z * [new branch] gh/jamesjwu/176/head -> origin/gh/jamesjwu/176/head 2025-08-14T21:18:06.1262531Z * [new branch] gh/jamesjwu/176/orig -> origin/gh/jamesjwu/176/orig 2025-08-14T21:18:06.1262648Z * [new branch] gh/jamesjwu/177/base -> origin/gh/jamesjwu/177/base 2025-08-14T21:18:06.1262764Z * [new branch] gh/jamesjwu/177/head -> origin/gh/jamesjwu/177/head 2025-08-14T21:18:06.1262889Z * [new branch] gh/jamesjwu/177/orig -> origin/gh/jamesjwu/177/orig 2025-08-14T21:18:06.1263007Z * [new branch] gh/jamesjwu/178/base -> origin/gh/jamesjwu/178/base 2025-08-14T21:18:06.1263126Z * [new branch] gh/jamesjwu/178/head -> origin/gh/jamesjwu/178/head 2025-08-14T21:18:06.1263252Z * [new branch] gh/jamesjwu/178/orig -> origin/gh/jamesjwu/178/orig 2025-08-14T21:18:06.1263368Z * [new branch] gh/jamesjwu/179/base -> origin/gh/jamesjwu/179/base 2025-08-14T21:18:06.1263494Z * [new branch] gh/jamesjwu/179/head -> origin/gh/jamesjwu/179/head 2025-08-14T21:18:06.1263609Z * [new branch] gh/jamesjwu/179/orig -> origin/gh/jamesjwu/179/orig 2025-08-14T21:18:06.1263725Z * [new branch] gh/jamesjwu/180/base -> origin/gh/jamesjwu/180/base 2025-08-14T21:18:06.1263846Z * [new branch] gh/jamesjwu/180/head -> origin/gh/jamesjwu/180/head 2025-08-14T21:18:06.1263962Z * [new branch] gh/jamesjwu/180/orig -> origin/gh/jamesjwu/180/orig 2025-08-14T21:18:06.1264088Z * [new branch] gh/jamesjwu/181/base -> origin/gh/jamesjwu/181/base 2025-08-14T21:18:06.1264294Z * [new branch] gh/jamesjwu/181/head -> origin/gh/jamesjwu/181/head 2025-08-14T21:18:06.1264421Z * [new branch] gh/jamesjwu/181/orig -> origin/gh/jamesjwu/181/orig 2025-08-14T21:18:06.1264545Z * [new branch] gh/jamesjwu/182/base -> origin/gh/jamesjwu/182/base 2025-08-14T21:18:06.1264713Z * [new branch] gh/jamesjwu/182/head -> origin/gh/jamesjwu/182/head 2025-08-14T21:18:06.1264839Z * [new branch] gh/jamesjwu/182/orig -> origin/gh/jamesjwu/182/orig 2025-08-14T21:18:06.1264955Z * [new branch] gh/jamesjwu/183/base -> origin/gh/jamesjwu/183/base 2025-08-14T21:18:06.1265072Z * [new branch] gh/jamesjwu/183/head -> origin/gh/jamesjwu/183/head 2025-08-14T21:18:06.1265194Z * [new branch] gh/jamesjwu/183/orig -> origin/gh/jamesjwu/183/orig 2025-08-14T21:18:06.1265315Z * [new branch] gh/jamesjwu/184/base -> origin/gh/jamesjwu/184/base 2025-08-14T21:18:06.1265431Z * [new branch] gh/jamesjwu/184/head -> origin/gh/jamesjwu/184/head 2025-08-14T21:18:06.1265557Z * [new branch] gh/jamesjwu/184/orig -> origin/gh/jamesjwu/184/orig 2025-08-14T21:18:06.1265683Z * [new branch] gh/jamesjwu/52/base -> origin/gh/jamesjwu/52/base 2025-08-14T21:18:06.1265818Z * [new branch] gh/jamesjwu/52/head -> origin/gh/jamesjwu/52/head 2025-08-14T21:18:06.1265938Z * [new branch] gh/jamesjwu/53/base -> origin/gh/jamesjwu/53/base 2025-08-14T21:18:06.1266055Z * [new branch] gh/jamesjwu/53/head -> origin/gh/jamesjwu/53/head 2025-08-14T21:18:06.1266184Z * [new branch] gh/jamesjwu/54/base -> origin/gh/jamesjwu/54/base 2025-08-14T21:18:06.1266301Z * [new branch] gh/jamesjwu/54/head -> origin/gh/jamesjwu/54/head 2025-08-14T21:18:06.1266469Z * [new branch] gh/jamesjwu/55/base -> origin/gh/jamesjwu/55/base 2025-08-14T21:18:06.1266589Z * [new branch] gh/jamesjwu/55/head -> origin/gh/jamesjwu/55/head 2025-08-14T21:18:06.1266706Z * [new branch] gh/jamesjwu/56/base -> origin/gh/jamesjwu/56/base 2025-08-14T21:18:06.1266829Z * [new branch] gh/jamesjwu/56/head -> origin/gh/jamesjwu/56/head 2025-08-14T21:18:06.1266950Z * [new branch] gh/jamesjwu/57/base -> origin/gh/jamesjwu/57/base 2025-08-14T21:18:06.1267074Z * [new branch] gh/jamesjwu/57/head -> origin/gh/jamesjwu/57/head 2025-08-14T21:18:06.1267189Z * [new branch] gh/jamesjwu/58/base -> origin/gh/jamesjwu/58/base 2025-08-14T21:18:06.1267306Z * [new branch] gh/jamesjwu/58/head -> origin/gh/jamesjwu/58/head 2025-08-14T21:18:06.1267426Z * [new branch] gh/jamesjwu/59/base -> origin/gh/jamesjwu/59/base 2025-08-14T21:18:06.1267545Z * [new branch] gh/jamesjwu/59/head -> origin/gh/jamesjwu/59/head 2025-08-14T21:18:06.1267661Z * [new branch] gh/jamesjwu/60/base -> origin/gh/jamesjwu/60/base 2025-08-14T21:18:06.1267784Z * [new branch] gh/jamesjwu/60/head -> origin/gh/jamesjwu/60/head 2025-08-14T21:18:06.1267901Z * [new branch] gh/jamesjwu/61/base -> origin/gh/jamesjwu/61/base 2025-08-14T21:18:06.1268026Z * [new branch] gh/jamesjwu/61/head -> origin/gh/jamesjwu/61/head 2025-08-14T21:18:06.1268142Z * [new branch] gh/jamesjwu/62/base -> origin/gh/jamesjwu/62/base 2025-08-14T21:18:06.1268257Z * [new branch] gh/jamesjwu/62/head -> origin/gh/jamesjwu/62/head 2025-08-14T21:18:06.1268381Z * [new branch] gh/jamesjwu/63/base -> origin/gh/jamesjwu/63/base 2025-08-14T21:18:06.1268496Z * [new branch] gh/jamesjwu/63/head -> origin/gh/jamesjwu/63/head 2025-08-14T21:18:06.1268623Z * [new branch] gh/jamesjwu/64/base -> origin/gh/jamesjwu/64/base 2025-08-14T21:18:06.1268740Z * [new branch] gh/jamesjwu/64/head -> origin/gh/jamesjwu/64/head 2025-08-14T21:18:06.1268856Z * [new branch] gh/jamesjwu/65/base -> origin/gh/jamesjwu/65/base 2025-08-14T21:18:06.1269015Z * [new branch] gh/jamesjwu/65/head -> origin/gh/jamesjwu/65/head 2025-08-14T21:18:06.1269134Z * [new branch] gh/janeyx99/165/base -> origin/gh/janeyx99/165/base 2025-08-14T21:18:06.1269252Z * [new branch] gh/janeyx99/165/head -> origin/gh/janeyx99/165/head 2025-08-14T21:18:06.1269376Z * [new branch] gh/janeyx99/165/orig -> origin/gh/janeyx99/165/orig 2025-08-14T21:18:06.1269491Z * [new branch] gh/janeyx99/201/base -> origin/gh/janeyx99/201/base 2025-08-14T21:18:06.1269615Z * [new branch] gh/janeyx99/201/head -> origin/gh/janeyx99/201/head 2025-08-14T21:18:06.1269744Z * [new branch] gh/janeyx99/201/orig -> origin/gh/janeyx99/201/orig 2025-08-14T21:18:06.1269866Z * [new branch] gh/janeyx99/225/base -> origin/gh/janeyx99/225/base 2025-08-14T21:18:06.1269983Z * [new branch] gh/janeyx99/225/head -> origin/gh/janeyx99/225/head 2025-08-14T21:18:06.1270101Z * [new branch] gh/janeyx99/225/orig -> origin/gh/janeyx99/225/orig 2025-08-14T21:18:06.1270226Z * [new branch] gh/janeyx99/256/base -> origin/gh/janeyx99/256/base 2025-08-14T21:18:06.1270342Z * [new branch] gh/janeyx99/256/head -> origin/gh/janeyx99/256/head 2025-08-14T21:18:06.1270457Z * [new branch] gh/janeyx99/256/orig -> origin/gh/janeyx99/256/orig 2025-08-14T21:18:06.1270582Z * [new branch] gh/janeyx99/268/base -> origin/gh/janeyx99/268/base 2025-08-14T21:18:06.1270729Z * [new branch] gh/janeyx99/268/head -> origin/gh/janeyx99/268/head 2025-08-14T21:18:06.1270855Z * [new branch] gh/janeyx99/268/orig -> origin/gh/janeyx99/268/orig 2025-08-14T21:18:06.1270974Z * [new branch] gh/janeyx99/269/base -> origin/gh/janeyx99/269/base 2025-08-14T21:18:06.1271093Z * [new branch] gh/janeyx99/269/head -> origin/gh/janeyx99/269/head 2025-08-14T21:18:06.1271218Z * [new branch] gh/janeyx99/269/orig -> origin/gh/janeyx99/269/orig 2025-08-14T21:18:06.1273122Z * [new branch] gh/janeyx99/274/base -> origin/gh/janeyx99/274/base 2025-08-14T21:18:06.1273422Z * [new branch] gh/janeyx99/274/head -> origin/gh/janeyx99/274/head 2025-08-14T21:18:06.1273576Z * [new branch] gh/janeyx99/274/orig -> origin/gh/janeyx99/274/orig 2025-08-14T21:18:06.1273832Z * [new branch] gh/janeyx99/276/base -> origin/gh/janeyx99/276/base 2025-08-14T21:18:06.1275183Z * [new branch] gh/janeyx99/276/head -> origin/gh/janeyx99/276/head 2025-08-14T21:18:06.1275492Z * [new branch] gh/janeyx99/276/orig -> origin/gh/janeyx99/276/orig 2025-08-14T21:18:06.1275957Z * [new branch] gh/janeyx99/277/base -> origin/gh/janeyx99/277/base 2025-08-14T21:18:06.1277244Z * [new branch] gh/janeyx99/277/head -> origin/gh/janeyx99/277/head 2025-08-14T21:18:06.1277558Z * [new branch] gh/janeyx99/277/orig -> origin/gh/janeyx99/277/orig 2025-08-14T21:18:06.1279373Z * [new branch] gh/janeyx99/278/base -> origin/gh/janeyx99/278/base 2025-08-14T21:18:06.1279675Z * [new branch] gh/janeyx99/278/head -> origin/gh/janeyx99/278/head 2025-08-14T21:18:06.1279822Z * [new branch] gh/janeyx99/278/orig -> origin/gh/janeyx99/278/orig 2025-08-14T21:18:06.1280319Z * [new branch] gh/janeyx99/279/base -> origin/gh/janeyx99/279/base 2025-08-14T21:18:06.1281221Z * [new branch] gh/janeyx99/279/head -> origin/gh/janeyx99/279/head 2025-08-14T21:18:06.1281553Z * [new branch] gh/janeyx99/279/orig -> origin/gh/janeyx99/279/orig 2025-08-14T21:18:06.1283184Z * [new branch] gh/janeyx99/280/base -> origin/gh/janeyx99/280/base 2025-08-14T21:18:06.1283658Z * [new branch] gh/janeyx99/280/head -> origin/gh/janeyx99/280/head 2025-08-14T21:18:06.1283982Z * [new branch] gh/janeyx99/280/orig -> origin/gh/janeyx99/280/orig 2025-08-14T21:18:06.1285271Z * [new branch] gh/janeyx99/281/base -> origin/gh/janeyx99/281/base 2025-08-14T21:18:06.1285450Z * [new branch] gh/janeyx99/281/head -> origin/gh/janeyx99/281/head 2025-08-14T21:18:06.1288182Z * [new branch] gh/janeyx99/281/orig -> origin/gh/janeyx99/281/orig 2025-08-14T21:18:06.1288351Z * [new branch] gh/janeyx99/282/base -> origin/gh/janeyx99/282/base 2025-08-14T21:18:06.1288485Z * [new branch] gh/janeyx99/282/head -> origin/gh/janeyx99/282/head 2025-08-14T21:18:06.1288603Z * [new branch] gh/janeyx99/282/orig -> origin/gh/janeyx99/282/orig 2025-08-14T21:18:06.1288958Z * [new branch] gh/janeyx99/283/base -> origin/gh/janeyx99/283/base 2025-08-14T21:18:06.1290146Z * [new branch] gh/janeyx99/283/head -> origin/gh/janeyx99/283/head 2025-08-14T21:18:06.1290332Z * [new branch] gh/janeyx99/283/orig -> origin/gh/janeyx99/283/orig 2025-08-14T21:18:06.1291994Z * [new branch] gh/janeyx99/284/base -> origin/gh/janeyx99/284/base 2025-08-14T21:18:06.1292295Z * [new branch] gh/janeyx99/284/head -> origin/gh/janeyx99/284/head 2025-08-14T21:18:06.1292572Z * [new branch] gh/janeyx99/284/orig -> origin/gh/janeyx99/284/orig 2025-08-14T21:18:06.1296768Z * [new branch] gh/janeyx99/285/base -> origin/gh/janeyx99/285/base 2025-08-14T21:18:06.1297116Z * [new branch] gh/janeyx99/285/head -> origin/gh/janeyx99/285/head 2025-08-14T21:18:06.1297325Z * [new branch] gh/janeyx99/285/orig -> origin/gh/janeyx99/285/orig 2025-08-14T21:18:06.1298000Z * [new branch] gh/janeyx99/286/base -> origin/gh/janeyx99/286/base 2025-08-14T21:18:06.1298181Z * [new branch] gh/janeyx99/286/head -> origin/gh/janeyx99/286/head 2025-08-14T21:18:06.1298308Z * [new branch] gh/janeyx99/286/orig -> origin/gh/janeyx99/286/orig 2025-08-14T21:18:06.1298435Z * [new branch] gh/janeyx99/287/base -> origin/gh/janeyx99/287/base 2025-08-14T21:18:06.1298598Z * [new branch] gh/janeyx99/287/head -> origin/gh/janeyx99/287/head 2025-08-14T21:18:06.1299573Z * [new branch] gh/janeyx99/287/orig -> origin/gh/janeyx99/287/orig 2025-08-14T21:18:06.1300147Z * [new branch] gh/janeyx99/288/base -> origin/gh/janeyx99/288/base 2025-08-14T21:18:06.1301026Z * [new branch] gh/janeyx99/288/head -> origin/gh/janeyx99/288/head 2025-08-14T21:18:06.1301239Z * [new branch] gh/janeyx99/288/orig -> origin/gh/janeyx99/288/orig 2025-08-14T21:18:06.1302527Z * [new branch] gh/janeyx99/289/base -> origin/gh/janeyx99/289/base 2025-08-14T21:18:06.1302811Z * [new branch] gh/janeyx99/289/head -> origin/gh/janeyx99/289/head 2025-08-14T21:18:06.1303744Z * [new branch] gh/janeyx99/289/orig -> origin/gh/janeyx99/289/orig 2025-08-14T21:18:06.1304664Z * [new branch] gh/janeyx99/290/base -> origin/gh/janeyx99/290/base 2025-08-14T21:18:06.1305019Z * [new branch] gh/janeyx99/290/head -> origin/gh/janeyx99/290/head 2025-08-14T21:18:06.1307686Z * [new branch] gh/janeyx99/290/orig -> origin/gh/janeyx99/290/orig 2025-08-14T21:18:06.1307951Z * [new branch] gh/janeyx99/291/base -> origin/gh/janeyx99/291/base 2025-08-14T21:18:06.1312533Z * [new branch] gh/janeyx99/291/head -> origin/gh/janeyx99/291/head 2025-08-14T21:18:06.1316430Z * [new branch] gh/janeyx99/291/orig -> origin/gh/janeyx99/291/orig 2025-08-14T21:18:06.1320621Z * [new branch] gh/janeyx99/292/base -> origin/gh/janeyx99/292/base 2025-08-14T21:18:06.1322779Z * [new branch] gh/janeyx99/292/head -> origin/gh/janeyx99/292/head 2025-08-14T21:18:06.1323029Z * [new branch] gh/janeyx99/292/orig -> origin/gh/janeyx99/292/orig 2025-08-14T21:18:06.1327744Z * [new branch] gh/janeyx99/293/base -> origin/gh/janeyx99/293/base 2025-08-14T21:18:06.1331396Z * [new branch] gh/janeyx99/293/head -> origin/gh/janeyx99/293/head 2025-08-14T21:18:06.1334953Z * [new branch] gh/janeyx99/293/orig -> origin/gh/janeyx99/293/orig 2025-08-14T21:18:06.1335120Z * [new branch] gh/janeyx99/294/base -> origin/gh/janeyx99/294/base 2025-08-14T21:18:06.1335253Z * [new branch] gh/janeyx99/294/head -> origin/gh/janeyx99/294/head 2025-08-14T21:18:06.1335382Z * [new branch] gh/janeyx99/294/orig -> origin/gh/janeyx99/294/orig 2025-08-14T21:18:06.1335509Z * [new branch] gh/janeyx99/295/base -> origin/gh/janeyx99/295/base 2025-08-14T21:18:06.1335638Z * [new branch] gh/janeyx99/295/head -> origin/gh/janeyx99/295/head 2025-08-14T21:18:06.1335757Z * [new branch] gh/janeyx99/295/orig -> origin/gh/janeyx99/295/orig 2025-08-14T21:18:06.1335885Z * [new branch] gh/janeyx99/296/base -> origin/gh/janeyx99/296/base 2025-08-14T21:18:06.1336002Z * [new branch] gh/janeyx99/296/head -> origin/gh/janeyx99/296/head 2025-08-14T21:18:06.1336272Z * [new branch] gh/janeyx99/296/orig -> origin/gh/janeyx99/296/orig 2025-08-14T21:18:06.1336404Z * [new branch] gh/janeyx99/297/base -> origin/gh/janeyx99/297/base 2025-08-14T21:18:06.1336525Z * [new branch] gh/janeyx99/297/head -> origin/gh/janeyx99/297/head 2025-08-14T21:18:06.1336647Z * [new branch] gh/janeyx99/297/orig -> origin/gh/janeyx99/297/orig 2025-08-14T21:18:06.1336780Z * [new branch] gh/janeyx99/298/base -> origin/gh/janeyx99/298/base 2025-08-14T21:18:06.1336900Z * [new branch] gh/janeyx99/298/head -> origin/gh/janeyx99/298/head 2025-08-14T21:18:06.1337027Z * [new branch] gh/janeyx99/298/orig -> origin/gh/janeyx99/298/orig 2025-08-14T21:18:06.1337146Z * [new branch] gh/janeyx99/299/base -> origin/gh/janeyx99/299/base 2025-08-14T21:18:06.1337264Z * [new branch] gh/janeyx99/299/head -> origin/gh/janeyx99/299/head 2025-08-14T21:18:06.1337412Z * [new branch] gh/janeyx99/299/orig -> origin/gh/janeyx99/299/orig 2025-08-14T21:18:06.1337531Z * [new branch] gh/janeyx99/300/base -> origin/gh/janeyx99/300/base 2025-08-14T21:18:06.1337659Z * [new branch] gh/janeyx99/300/head -> origin/gh/janeyx99/300/head 2025-08-14T21:18:06.1337779Z * [new branch] gh/janeyx99/300/orig -> origin/gh/janeyx99/300/orig 2025-08-14T21:18:06.1337913Z * [new branch] gh/janeyx99/88/base -> origin/gh/janeyx99/88/base 2025-08-14T21:18:06.1338043Z * [new branch] gh/janeyx99/88/head -> origin/gh/janeyx99/88/head 2025-08-14T21:18:06.1338162Z * [new branch] gh/janeyx99/88/orig -> origin/gh/janeyx99/88/orig 2025-08-14T21:18:06.1338286Z * [new branch] gh/jansel/360/base -> origin/gh/jansel/360/base 2025-08-14T21:18:06.1338414Z * [new branch] gh/jansel/360/head -> origin/gh/jansel/360/head 2025-08-14T21:18:06.1338532Z * [new branch] gh/jansel/451/base -> origin/gh/jansel/451/base 2025-08-14T21:18:06.1338651Z * [new branch] gh/jansel/451/head -> origin/gh/jansel/451/head 2025-08-14T21:18:06.1338778Z * [new branch] gh/jansel/451/orig -> origin/gh/jansel/451/orig 2025-08-14T21:18:06.1338894Z * [new branch] gh/jansel/462/base -> origin/gh/jansel/462/base 2025-08-14T21:18:06.1339054Z * [new branch] gh/jansel/462/head -> origin/gh/jansel/462/head 2025-08-14T21:18:06.1339174Z * [new branch] gh/jansel/462/orig -> origin/gh/jansel/462/orig 2025-08-14T21:18:06.1339296Z * [new branch] gh/jansel/531/base -> origin/gh/jansel/531/base 2025-08-14T21:18:06.1339417Z * [new branch] gh/jansel/531/head -> origin/gh/jansel/531/head 2025-08-14T21:18:06.1339529Z * [new branch] gh/jansel/531/orig -> origin/gh/jansel/531/orig 2025-08-14T21:18:06.1339645Z * [new branch] gh/jansel/534/base -> origin/gh/jansel/534/base 2025-08-14T21:18:06.1339766Z * [new branch] gh/jansel/534/head -> origin/gh/jansel/534/head 2025-08-14T21:18:06.1339880Z * [new branch] gh/jansel/534/orig -> origin/gh/jansel/534/orig 2025-08-14T21:18:06.1340174Z * [new branch] gh/jbschlosser/226/base -> origin/gh/jbschlosser/226/base 2025-08-14T21:18:06.1340359Z * [new branch] gh/jbschlosser/226/head -> origin/gh/jbschlosser/226/head 2025-08-14T21:18:06.1341388Z * [new branch] gh/jbschlosser/226/orig -> origin/gh/jbschlosser/226/orig 2025-08-14T21:18:06.1342252Z * [new branch] gh/jbschlosser/239/base -> origin/gh/jbschlosser/239/base 2025-08-14T21:18:06.1342497Z * [new branch] gh/jbschlosser/239/head -> origin/gh/jbschlosser/239/head 2025-08-14T21:18:06.1343615Z * [new branch] gh/jbschlosser/239/orig -> origin/gh/jbschlosser/239/orig 2025-08-14T21:18:06.1344238Z * [new branch] gh/jbschlosser/247/base -> origin/gh/jbschlosser/247/base 2025-08-14T21:18:06.1344844Z * [new branch] gh/jbschlosser/247/head -> origin/gh/jbschlosser/247/head 2025-08-14T21:18:06.1345301Z * [new branch] gh/jbschlosser/247/orig -> origin/gh/jbschlosser/247/orig 2025-08-14T21:18:06.1348939Z * [new branch] gh/jbschlosser/248/base -> origin/gh/jbschlosser/248/base 2025-08-14T21:18:06.1349216Z * [new branch] gh/jbschlosser/248/head -> origin/gh/jbschlosser/248/head 2025-08-14T21:18:06.1352864Z * [new branch] gh/jbschlosser/248/orig -> origin/gh/jbschlosser/248/orig 2025-08-14T21:18:06.1356463Z * [new branch] gh/jbschlosser/249/base -> origin/gh/jbschlosser/249/base 2025-08-14T21:18:06.1360022Z * [new branch] gh/jbschlosser/249/head -> origin/gh/jbschlosser/249/head 2025-08-14T21:18:06.1363550Z * [new branch] gh/jbschlosser/249/orig -> origin/gh/jbschlosser/249/orig 2025-08-14T21:18:06.1367091Z * [new branch] gh/jbschlosser/250/base -> origin/gh/jbschlosser/250/base 2025-08-14T21:18:06.1369273Z * [new branch] gh/jbschlosser/250/head -> origin/gh/jbschlosser/250/head 2025-08-14T21:18:06.1369576Z * [new branch] gh/jbschlosser/250/orig -> origin/gh/jbschlosser/250/orig 2025-08-14T21:18:06.1369831Z * [new branch] gh/jiayisunx/57/base -> origin/gh/jiayisunx/57/base 2025-08-14T21:18:06.1369961Z * [new branch] gh/jiayisunx/57/head -> origin/gh/jiayisunx/57/head 2025-08-14T21:18:06.1370160Z * [new branch] gh/jiayisunx/57/orig -> origin/gh/jiayisunx/57/orig 2025-08-14T21:18:06.1372903Z * [new branch] gh/jiayisunx/59/base -> origin/gh/jiayisunx/59/base 2025-08-14T21:18:06.1373145Z * [new branch] gh/jiayisunx/59/head -> origin/gh/jiayisunx/59/head 2025-08-14T21:18:06.1373338Z * [new branch] gh/jiayisunx/59/orig -> origin/gh/jiayisunx/59/orig 2025-08-14T21:18:06.1373516Z * [new branch] gh/jiayisunx/61/base -> origin/gh/jiayisunx/61/base 2025-08-14T21:18:06.1373698Z * [new branch] gh/jiayisunx/61/head -> origin/gh/jiayisunx/61/head 2025-08-14T21:18:06.1374002Z * [new branch] gh/jiayisunx/61/orig -> origin/gh/jiayisunx/61/orig 2025-08-14T21:18:06.1374123Z * [new branch] gh/jiayisunx/63/base -> origin/gh/jiayisunx/63/base 2025-08-14T21:18:06.1374250Z * [new branch] gh/jiayisunx/63/head -> origin/gh/jiayisunx/63/head 2025-08-14T21:18:06.1374368Z * [new branch] gh/jiayisunx/63/orig -> origin/gh/jiayisunx/63/orig 2025-08-14T21:18:06.1374495Z * [new branch] gh/jiayisunx/64/base -> origin/gh/jiayisunx/64/base 2025-08-14T21:18:06.1374618Z * [new branch] gh/jiayisunx/64/head -> origin/gh/jiayisunx/64/head 2025-08-14T21:18:06.1374737Z * [new branch] gh/jiayisunx/64/orig -> origin/gh/jiayisunx/64/orig 2025-08-14T21:18:06.1374862Z * [new branch] gh/jiayisunx/65/base -> origin/gh/jiayisunx/65/base 2025-08-14T21:18:06.1374978Z * [new branch] gh/jiayisunx/65/head -> origin/gh/jiayisunx/65/head 2025-08-14T21:18:06.1375104Z * [new branch] gh/jiayisunx/65/orig -> origin/gh/jiayisunx/65/orig 2025-08-14T21:18:06.1375223Z * [new branch] gh/jiayisunx/66/base -> origin/gh/jiayisunx/66/base 2025-08-14T21:18:06.1375342Z * [new branch] gh/jiayisunx/66/head -> origin/gh/jiayisunx/66/head 2025-08-14T21:18:06.1375464Z * [new branch] gh/jiayisunx/66/orig -> origin/gh/jiayisunx/66/orig 2025-08-14T21:18:06.1375580Z * [new branch] gh/jiayisunx/67/base -> origin/gh/jiayisunx/67/base 2025-08-14T21:18:06.1375734Z * [new branch] gh/jiayisunx/67/head -> origin/gh/jiayisunx/67/head 2025-08-14T21:18:06.1375860Z * [new branch] gh/jiayisunx/67/orig -> origin/gh/jiayisunx/67/orig 2025-08-14T21:18:06.1375981Z * [new branch] gh/jiayisunx/68/base -> origin/gh/jiayisunx/68/base 2025-08-14T21:18:06.1376107Z * [new branch] gh/jiayisunx/68/head -> origin/gh/jiayisunx/68/head 2025-08-14T21:18:06.1376229Z * [new branch] gh/jiayisunx/68/orig -> origin/gh/jiayisunx/68/orig 2025-08-14T21:18:06.1376375Z * [new branch] gh/jjwu@meta.com/1/base -> origin/gh/jjwu@meta.com/1/base 2025-08-14T21:18:06.1376510Z * [new branch] gh/jjwu@meta.com/1/head -> origin/gh/jjwu@meta.com/1/head 2025-08-14T21:18:06.1376663Z * [new branch] gh/justinchuby/111/base -> origin/gh/justinchuby/111/base 2025-08-14T21:18:06.1376884Z * [new branch] gh/justinchuby/111/head -> origin/gh/justinchuby/111/head 2025-08-14T21:18:06.1377062Z * [new branch] gh/justinchuby/111/orig -> origin/gh/justinchuby/111/orig 2025-08-14T21:18:06.1377194Z * [new branch] gh/kurtamohler/32/base -> origin/gh/kurtamohler/32/base 2025-08-14T21:18:06.1377329Z * [new branch] gh/kurtamohler/32/head -> origin/gh/kurtamohler/32/head 2025-08-14T21:18:06.1377463Z * [new branch] gh/kurtamohler/32/orig -> origin/gh/kurtamohler/32/orig 2025-08-14T21:18:06.1378479Z * [new branch] gh/kurtamohler/33/base -> origin/gh/kurtamohler/33/base 2025-08-14T21:18:06.1378953Z * [new branch] gh/kurtamohler/33/head -> origin/gh/kurtamohler/33/head 2025-08-14T21:18:06.1379437Z * [new branch] gh/kurtamohler/33/orig -> origin/gh/kurtamohler/33/orig 2025-08-14T21:18:06.1380443Z * [new branch] gh/kurtamohler/34/base -> origin/gh/kurtamohler/34/base 2025-08-14T21:18:06.1380784Z * [new branch] gh/kurtamohler/34/head -> origin/gh/kurtamohler/34/head 2025-08-14T21:18:06.1381684Z * [new branch] gh/kurtamohler/34/orig -> origin/gh/kurtamohler/34/orig 2025-08-14T21:18:06.1382403Z * [new branch] gh/kurtamohler/40/base -> origin/gh/kurtamohler/40/base 2025-08-14T21:18:06.1382893Z * [new branch] gh/kurtamohler/40/head -> origin/gh/kurtamohler/40/head 2025-08-14T21:18:06.1383588Z * [new branch] gh/kurtamohler/40/orig -> origin/gh/kurtamohler/40/orig 2025-08-14T21:18:06.1384786Z * [new branch] gh/kurtamohler/41/base -> origin/gh/kurtamohler/41/base 2025-08-14T21:18:06.1384934Z * [new branch] gh/kurtamohler/41/head -> origin/gh/kurtamohler/41/head 2025-08-14T21:18:06.1387969Z * [new branch] gh/kurtamohler/41/orig -> origin/gh/kurtamohler/41/orig 2025-08-14T21:18:06.1388293Z * [new branch] gh/kurtamohler/42/base -> origin/gh/kurtamohler/42/base 2025-08-14T21:18:06.1388529Z * [new branch] gh/kurtamohler/42/head -> origin/gh/kurtamohler/42/head 2025-08-14T21:18:06.1388742Z * [new branch] gh/kurtamohler/42/orig -> origin/gh/kurtamohler/42/orig 2025-08-14T21:18:06.1388889Z * [new branch] gh/kurtamohler/43/base -> origin/gh/kurtamohler/43/base 2025-08-14T21:18:06.1389113Z * [new branch] gh/kurtamohler/43/head -> origin/gh/kurtamohler/43/head 2025-08-14T21:18:06.1390010Z * [new branch] gh/kurtamohler/43/orig -> origin/gh/kurtamohler/43/orig 2025-08-14T21:18:06.1392434Z * [new branch] gh/kurtamohler/44/base -> origin/gh/kurtamohler/44/base 2025-08-14T21:18:06.1392750Z * [new branch] gh/kurtamohler/44/head -> origin/gh/kurtamohler/44/head 2025-08-14T21:18:06.1392903Z * [new branch] gh/kurtamohler/44/orig -> origin/gh/kurtamohler/44/orig 2025-08-14T21:18:06.1393298Z * [new branch] gh/kurtamohler/45/base -> origin/gh/kurtamohler/45/base 2025-08-14T21:18:06.1393861Z * [new branch] gh/kurtamohler/45/head -> origin/gh/kurtamohler/45/head 2025-08-14T21:18:06.1394032Z * [new branch] gh/kurtamohler/45/orig -> origin/gh/kurtamohler/45/orig 2025-08-14T21:18:06.1395423Z * [new branch] gh/kurtamohler/46/base -> origin/gh/kurtamohler/46/base 2025-08-14T21:18:06.1395712Z * [new branch] gh/kurtamohler/46/head -> origin/gh/kurtamohler/46/head 2025-08-14T21:18:06.1395860Z * [new branch] gh/kurtamohler/46/orig -> origin/gh/kurtamohler/46/orig 2025-08-14T21:18:06.1397440Z * [new branch] gh/kwen2501/130/base -> origin/gh/kwen2501/130/base 2025-08-14T21:18:06.1397614Z * [new branch] gh/kwen2501/130/head -> origin/gh/kwen2501/130/head 2025-08-14T21:18:06.1399097Z * [new branch] gh/kwen2501/130/orig -> origin/gh/kwen2501/130/orig 2025-08-14T21:18:06.1399455Z * [new branch] gh/kwen2501/142/base -> origin/gh/kwen2501/142/base 2025-08-14T21:18:06.1401134Z * [new branch] gh/kwen2501/142/head -> origin/gh/kwen2501/142/head 2025-08-14T21:18:06.1401436Z * [new branch] gh/kwen2501/142/orig -> origin/gh/kwen2501/142/orig 2025-08-14T21:18:06.1401783Z * [new branch] gh/kwen2501/15/base -> origin/gh/kwen2501/15/base 2025-08-14T21:18:06.1401992Z * [new branch] gh/kwen2501/15/head -> origin/gh/kwen2501/15/head 2025-08-14T21:18:06.1405619Z * [new branch] gh/kwen2501/156/base -> origin/gh/kwen2501/156/base 2025-08-14T21:18:06.1405849Z * [new branch] gh/kwen2501/156/head -> origin/gh/kwen2501/156/head 2025-08-14T21:18:06.1406032Z * [new branch] gh/kwen2501/156/orig -> origin/gh/kwen2501/156/orig 2025-08-14T21:18:06.1406161Z * [new branch] gh/kwen2501/170/base -> origin/gh/kwen2501/170/base 2025-08-14T21:18:06.1406293Z * [new branch] gh/kwen2501/170/head -> origin/gh/kwen2501/170/head 2025-08-14T21:18:06.1406440Z * [new branch] gh/kwen2501/179/base -> origin/gh/kwen2501/179/base 2025-08-14T21:18:06.1407700Z * [new branch] gh/kwen2501/179/head -> origin/gh/kwen2501/179/head 2025-08-14T21:18:06.1408009Z * [new branch] gh/kwen2501/179/orig -> origin/gh/kwen2501/179/orig 2025-08-14T21:18:06.1408464Z * [new branch] gh/kwen2501/181/base -> origin/gh/kwen2501/181/base 2025-08-14T21:18:06.1410259Z * [new branch] gh/kwen2501/181/head -> origin/gh/kwen2501/181/head 2025-08-14T21:18:06.1410550Z * [new branch] gh/kwen2501/181/orig -> origin/gh/kwen2501/181/orig 2025-08-14T21:18:06.1410701Z * [new branch] gh/kwen2501/183/base -> origin/gh/kwen2501/183/base 2025-08-14T21:18:06.1410981Z * [new branch] gh/kwen2501/183/head -> origin/gh/kwen2501/183/head 2025-08-14T21:18:06.1411869Z * [new branch] gh/kwen2501/183/orig -> origin/gh/kwen2501/183/orig 2025-08-14T21:18:06.1414666Z * [new branch] gh/kwen2501/184/base -> origin/gh/kwen2501/184/base 2025-08-14T21:18:06.1414976Z * [new branch] gh/kwen2501/184/head -> origin/gh/kwen2501/184/head 2025-08-14T21:18:06.1415198Z * [new branch] gh/kwen2501/184/orig -> origin/gh/kwen2501/184/orig 2025-08-14T21:18:06.1415339Z * [new branch] gh/kwen2501/186/base -> origin/gh/kwen2501/186/base 2025-08-14T21:18:06.1415538Z * [new branch] gh/kwen2501/186/head -> origin/gh/kwen2501/186/head 2025-08-14T21:18:06.1416173Z * [new branch] gh/kwen2501/186/orig -> origin/gh/kwen2501/186/orig 2025-08-14T21:18:06.1416510Z * [new branch] gh/kwen2501/187/base -> origin/gh/kwen2501/187/base 2025-08-14T21:18:06.1417577Z * [new branch] gh/kwen2501/187/head -> origin/gh/kwen2501/187/head 2025-08-14T21:18:06.1418142Z * [new branch] gh/kwen2501/187/orig -> origin/gh/kwen2501/187/orig 2025-08-14T21:18:06.1418582Z * [new branch] gh/kwen2501/188/base -> origin/gh/kwen2501/188/base 2025-08-14T21:18:06.1419784Z * [new branch] gh/kwen2501/188/head -> origin/gh/kwen2501/188/head 2025-08-14T21:18:06.1419959Z * [new branch] gh/kwen2501/188/orig -> origin/gh/kwen2501/188/orig 2025-08-14T21:18:06.1421455Z * [new branch] gh/kwen2501/194/base -> origin/gh/kwen2501/194/base 2025-08-14T21:18:06.1421637Z * [new branch] gh/kwen2501/194/head -> origin/gh/kwen2501/194/head 2025-08-14T21:18:06.1422573Z * [new branch] gh/kwen2501/194/orig -> origin/gh/kwen2501/194/orig 2025-08-14T21:18:06.1423555Z * [new branch] gh/kwen2501/195/base -> origin/gh/kwen2501/195/base 2025-08-14T21:18:06.1423832Z * [new branch] gh/kwen2501/195/head -> origin/gh/kwen2501/195/head 2025-08-14T21:18:06.1427424Z * [new branch] gh/kwen2501/195/orig -> origin/gh/kwen2501/195/orig 2025-08-14T21:18:06.1431329Z * [new branch] gh/kwen2501/196/base -> origin/gh/kwen2501/196/base 2025-08-14T21:18:06.1435398Z * [new branch] gh/kwen2501/196/head -> origin/gh/kwen2501/196/head 2025-08-14T21:18:06.1437109Z * [new branch] gh/kwen2501/196/orig -> origin/gh/kwen2501/196/orig 2025-08-14T21:18:06.1437362Z * [new branch] gh/kwen2501/197/base -> origin/gh/kwen2501/197/base 2025-08-14T21:18:06.1439997Z * [new branch] gh/kwen2501/197/head -> origin/gh/kwen2501/197/head 2025-08-14T21:18:06.1440147Z * [new branch] gh/kwen2501/197/orig -> origin/gh/kwen2501/197/orig 2025-08-14T21:18:06.1440282Z * [new branch] gh/kwen2501/198/base -> origin/gh/kwen2501/198/base 2025-08-14T21:18:06.1440424Z * [new branch] gh/kwen2501/198/head -> origin/gh/kwen2501/198/head 2025-08-14T21:18:06.1440541Z * [new branch] gh/kwen2501/198/orig -> origin/gh/kwen2501/198/orig 2025-08-14T21:18:06.1440664Z * [new branch] gh/kwen2501/199/base -> origin/gh/kwen2501/199/base 2025-08-14T21:18:06.1440781Z * [new branch] gh/kwen2501/199/head -> origin/gh/kwen2501/199/head 2025-08-14T21:18:06.1441042Z * [new branch] gh/kwen2501/199/orig -> origin/gh/kwen2501/199/orig 2025-08-14T21:18:06.1441165Z * [new branch] gh/kwen2501/200/base -> origin/gh/kwen2501/200/base 2025-08-14T21:18:06.1441280Z * [new branch] gh/kwen2501/200/head -> origin/gh/kwen2501/200/head 2025-08-14T21:18:06.1441410Z * [new branch] gh/kwen2501/200/orig -> origin/gh/kwen2501/200/orig 2025-08-14T21:18:06.1441526Z * [new branch] gh/kwen2501/201/base -> origin/gh/kwen2501/201/base 2025-08-14T21:18:06.1441646Z * [new branch] gh/kwen2501/201/head -> origin/gh/kwen2501/201/head 2025-08-14T21:18:06.1441772Z * [new branch] gh/kwen2501/201/orig -> origin/gh/kwen2501/201/orig 2025-08-14T21:18:06.1441888Z * [new branch] gh/kwen2501/202/base -> origin/gh/kwen2501/202/base 2025-08-14T21:18:06.1442008Z * [new branch] gh/kwen2501/202/head -> origin/gh/kwen2501/202/head 2025-08-14T21:18:06.1442131Z * [new branch] gh/kwen2501/202/orig -> origin/gh/kwen2501/202/orig 2025-08-14T21:18:06.1442244Z * [new branch] gh/kwen2501/203/base -> origin/gh/kwen2501/203/base 2025-08-14T21:18:06.1443205Z * [new branch] gh/kwen2501/203/head -> origin/gh/kwen2501/203/head 2025-08-14T21:18:06.1443416Z * [new branch] gh/kwen2501/203/orig -> origin/gh/kwen2501/203/orig 2025-08-14T21:18:06.1443645Z * [new branch] gh/laithsakka/152/base -> origin/gh/laithsakka/152/base 2025-08-14T21:18:06.1443858Z * [new branch] gh/laithsakka/152/head -> origin/gh/laithsakka/152/head 2025-08-14T21:18:06.1444008Z * [new branch] gh/laithsakka/152/orig -> origin/gh/laithsakka/152/orig 2025-08-14T21:18:06.1446873Z * [new branch] gh/laithsakka/156/base -> origin/gh/laithsakka/156/base 2025-08-14T21:18:06.1447042Z * [new branch] gh/laithsakka/156/head -> origin/gh/laithsakka/156/head 2025-08-14T21:18:06.1447210Z * [new branch] gh/laithsakka/156/orig -> origin/gh/laithsakka/156/orig 2025-08-14T21:18:06.1447352Z * [new branch] gh/laithsakka/159/base -> origin/gh/laithsakka/159/base 2025-08-14T21:18:06.1447488Z * [new branch] gh/laithsakka/159/head -> origin/gh/laithsakka/159/head 2025-08-14T21:18:06.1452087Z * [new branch] gh/laithsakka/159/orig -> origin/gh/laithsakka/159/orig 2025-08-14T21:18:06.1455748Z * [new branch] gh/laithsakka/160/base -> origin/gh/laithsakka/160/base 2025-08-14T21:18:06.1457702Z * [new branch] gh/laithsakka/160/head -> origin/gh/laithsakka/160/head 2025-08-14T21:18:06.1457849Z * [new branch] gh/laithsakka/160/orig -> origin/gh/laithsakka/160/orig 2025-08-14T21:18:06.1457987Z * [new branch] gh/laithsakka/178/base -> origin/gh/laithsakka/178/base 2025-08-14T21:18:06.1458125Z * [new branch] gh/laithsakka/178/head -> origin/gh/laithsakka/178/head 2025-08-14T21:18:06.1458251Z * [new branch] gh/laithsakka/178/orig -> origin/gh/laithsakka/178/orig 2025-08-14T21:18:06.1458379Z * [new branch] gh/laithsakka/191/base -> origin/gh/laithsakka/191/base 2025-08-14T21:18:06.1458503Z * [new branch] gh/laithsakka/191/head -> origin/gh/laithsakka/191/head 2025-08-14T21:18:06.1458639Z * [new branch] gh/laithsakka/191/orig -> origin/gh/laithsakka/191/orig 2025-08-14T21:18:06.1458772Z * [new branch] gh/laithsakka/234/base -> origin/gh/laithsakka/234/base 2025-08-14T21:18:06.1458894Z * [new branch] gh/laithsakka/234/head -> origin/gh/laithsakka/234/head 2025-08-14T21:18:06.1459026Z * [new branch] gh/laithsakka/234/orig -> origin/gh/laithsakka/234/orig 2025-08-14T21:18:06.1459296Z * [new branch] gh/laithsakka/237/base -> origin/gh/laithsakka/237/base 2025-08-14T21:18:06.1459429Z * [new branch] gh/laithsakka/237/head -> origin/gh/laithsakka/237/head 2025-08-14T21:18:06.1461226Z * [new branch] gh/laithsakka/237/orig -> origin/gh/laithsakka/237/orig 2025-08-14T21:18:06.1461368Z * [new branch] gh/laithsakka/238/base -> origin/gh/laithsakka/238/base 2025-08-14T21:18:06.1461515Z * [new branch] gh/laithsakka/238/head -> origin/gh/laithsakka/238/head 2025-08-14T21:18:06.1461720Z * [new branch] gh/laithsakka/238/orig -> origin/gh/laithsakka/238/orig 2025-08-14T21:18:06.1461949Z * [new branch] gh/laithsakka/239/base -> origin/gh/laithsakka/239/base 2025-08-14T21:18:06.1467375Z * [new branch] gh/laithsakka/239/head -> origin/gh/laithsakka/239/head 2025-08-14T21:18:06.1469378Z * [new branch] gh/laithsakka/239/orig -> origin/gh/laithsakka/239/orig 2025-08-14T21:18:06.1469641Z * [new branch] gh/laithsakka/240/base -> origin/gh/laithsakka/240/base 2025-08-14T21:18:06.1469818Z * [new branch] gh/laithsakka/240/head -> origin/gh/laithsakka/240/head 2025-08-14T21:18:06.1469948Z * [new branch] gh/laithsakka/240/orig -> origin/gh/laithsakka/240/orig 2025-08-14T21:18:06.1470082Z * [new branch] gh/laithsakka/242/base -> origin/gh/laithsakka/242/base 2025-08-14T21:18:06.1470207Z * [new branch] gh/laithsakka/242/head -> origin/gh/laithsakka/242/head 2025-08-14T21:18:06.1470484Z * [new branch] gh/laithsakka/242/orig -> origin/gh/laithsakka/242/orig 2025-08-14T21:18:06.1470613Z * [new branch] gh/laithsakka/243/base -> origin/gh/laithsakka/243/base 2025-08-14T21:18:06.1470739Z * [new branch] gh/laithsakka/243/head -> origin/gh/laithsakka/243/head 2025-08-14T21:18:06.1470876Z * [new branch] gh/laithsakka/243/orig -> origin/gh/laithsakka/243/orig 2025-08-14T21:18:06.1474335Z * [new branch] gh/laithsakka/244/base -> origin/gh/laithsakka/244/base 2025-08-14T21:18:06.1477928Z * [new branch] gh/laithsakka/244/head -> origin/gh/laithsakka/244/head 2025-08-14T21:18:06.1481607Z * [new branch] gh/laithsakka/244/orig -> origin/gh/laithsakka/244/orig 2025-08-14T21:18:06.1485411Z * [new branch] gh/laithsakka/245/base -> origin/gh/laithsakka/245/base 2025-08-14T21:18:06.1489504Z * [new branch] gh/laithsakka/245/head -> origin/gh/laithsakka/245/head 2025-08-14T21:18:06.1493075Z * [new branch] gh/laithsakka/245/orig -> origin/gh/laithsakka/245/orig 2025-08-14T21:18:06.1496559Z * [new branch] gh/laithsakka/246/base -> origin/gh/laithsakka/246/base 2025-08-14T21:18:06.1498080Z * [new branch] gh/laithsakka/246/head -> origin/gh/laithsakka/246/head 2025-08-14T21:18:06.1498351Z * [new branch] gh/laithsakka/246/orig -> origin/gh/laithsakka/246/orig 2025-08-14T21:18:06.1498489Z * [new branch] gh/laithsakka/247/base -> origin/gh/laithsakka/247/base 2025-08-14T21:18:06.1498622Z * [new branch] gh/laithsakka/247/head -> origin/gh/laithsakka/247/head 2025-08-14T21:18:06.1498748Z * [new branch] gh/laithsakka/247/orig -> origin/gh/laithsakka/247/orig 2025-08-14T21:18:06.1498872Z * [new branch] gh/laithsakka/248/base -> origin/gh/laithsakka/248/base 2025-08-14T21:18:06.1499007Z * [new branch] gh/laithsakka/248/head -> origin/gh/laithsakka/248/head 2025-08-14T21:18:06.1499128Z * [new branch] gh/laithsakka/248/orig -> origin/gh/laithsakka/248/orig 2025-08-14T21:18:06.1499252Z * [new branch] gh/laithsakka/249/base -> origin/gh/laithsakka/249/base 2025-08-14T21:18:06.1499382Z * [new branch] gh/laithsakka/249/head -> origin/gh/laithsakka/249/head 2025-08-14T21:18:06.1499676Z * [new branch] gh/laithsakka/249/orig -> origin/gh/laithsakka/249/orig 2025-08-14T21:18:06.1499808Z * [new branch] gh/laithsakka/250/base -> origin/gh/laithsakka/250/base 2025-08-14T21:18:06.1499930Z * [new branch] gh/laithsakka/250/head -> origin/gh/laithsakka/250/head 2025-08-14T21:18:06.1500054Z * [new branch] gh/laithsakka/250/orig -> origin/gh/laithsakka/250/orig 2025-08-14T21:18:06.1500183Z * [new branch] gh/laithsakka/251/base -> origin/gh/laithsakka/251/base 2025-08-14T21:18:06.1500310Z * [new branch] gh/laithsakka/251/head -> origin/gh/laithsakka/251/head 2025-08-14T21:18:06.1500439Z * [new branch] gh/laithsakka/251/orig -> origin/gh/laithsakka/251/orig 2025-08-14T21:18:06.1500562Z * [new branch] gh/laithsakka/252/base -> origin/gh/laithsakka/252/base 2025-08-14T21:18:06.1500691Z * [new branch] gh/laithsakka/252/head -> origin/gh/laithsakka/252/head 2025-08-14T21:18:06.1500821Z * [new branch] gh/laithsakka/252/orig -> origin/gh/laithsakka/252/orig 2025-08-14T21:18:06.1500943Z * [new branch] gh/laithsakka/253/base -> origin/gh/laithsakka/253/base 2025-08-14T21:18:06.1501076Z * [new branch] gh/laithsakka/253/head -> origin/gh/laithsakka/253/head 2025-08-14T21:18:06.1501199Z * [new branch] gh/laithsakka/253/orig -> origin/gh/laithsakka/253/orig 2025-08-14T21:18:06.1501371Z * [new branch] gh/laithsakka/254/base -> origin/gh/laithsakka/254/base 2025-08-14T21:18:06.1501506Z * [new branch] gh/laithsakka/254/head -> origin/gh/laithsakka/254/head 2025-08-14T21:18:06.1501639Z * [new branch] gh/laithsakka/254/orig -> origin/gh/laithsakka/254/orig 2025-08-14T21:18:06.1501768Z * [new branch] gh/laithsakka/255/base -> origin/gh/laithsakka/255/base 2025-08-14T21:18:06.1501897Z * [new branch] gh/laithsakka/255/head -> origin/gh/laithsakka/255/head 2025-08-14T21:18:06.1502020Z * [new branch] gh/laithsakka/255/orig -> origin/gh/laithsakka/255/orig 2025-08-14T21:18:06.1502151Z * [new branch] gh/laithsakka/256/base -> origin/gh/laithsakka/256/base 2025-08-14T21:18:06.1502275Z * [new branch] gh/laithsakka/256/head -> origin/gh/laithsakka/256/head 2025-08-14T21:18:06.1502406Z * [new branch] gh/laithsakka/256/orig -> origin/gh/laithsakka/256/orig 2025-08-14T21:18:06.1502534Z * [new branch] gh/laithsakka/257/base -> origin/gh/laithsakka/257/base 2025-08-14T21:18:06.1502658Z * [new branch] gh/laithsakka/257/head -> origin/gh/laithsakka/257/head 2025-08-14T21:18:06.1502788Z * [new branch] gh/laithsakka/257/orig -> origin/gh/laithsakka/257/orig 2025-08-14T21:18:06.1502915Z * [new branch] gh/laithsakka/258/base -> origin/gh/laithsakka/258/base 2025-08-14T21:18:06.1503040Z * [new branch] gh/laithsakka/258/head -> origin/gh/laithsakka/258/head 2025-08-14T21:18:06.1503171Z * [new branch] gh/laithsakka/258/orig -> origin/gh/laithsakka/258/orig 2025-08-14T21:18:06.1503292Z * [new branch] gh/laithsakka/259/base -> origin/gh/laithsakka/259/base 2025-08-14T21:18:06.1503424Z * [new branch] gh/laithsakka/259/head -> origin/gh/laithsakka/259/head 2025-08-14T21:18:06.1503548Z * [new branch] gh/laithsakka/259/orig -> origin/gh/laithsakka/259/orig 2025-08-14T21:18:06.1503672Z * [new branch] gh/laithsakka/260/base -> origin/gh/laithsakka/260/base 2025-08-14T21:18:06.1504055Z * [new branch] gh/laithsakka/260/head -> origin/gh/laithsakka/260/head 2025-08-14T21:18:06.1504616Z * [new branch] gh/laithsakka/260/orig -> origin/gh/laithsakka/260/orig 2025-08-14T21:18:06.1508130Z * [new branch] gh/laithsakka/261/base -> origin/gh/laithsakka/261/base 2025-08-14T21:18:06.1508301Z * [new branch] gh/laithsakka/261/head -> origin/gh/laithsakka/261/head 2025-08-14T21:18:06.1508436Z * [new branch] gh/laithsakka/261/orig -> origin/gh/laithsakka/261/orig 2025-08-14T21:18:06.1508567Z * [new branch] gh/laithsakka/262/base -> origin/gh/laithsakka/262/base 2025-08-14T21:18:06.1508704Z * [new branch] gh/laithsakka/262/head -> origin/gh/laithsakka/262/head 2025-08-14T21:18:06.1511113Z * [new branch] gh/laithsakka/262/orig -> origin/gh/laithsakka/262/orig 2025-08-14T21:18:06.1511305Z * [new branch] gh/laithsakka/28/base -> origin/gh/laithsakka/28/base 2025-08-14T21:18:06.1511448Z * [new branch] gh/laithsakka/29/base -> origin/gh/laithsakka/29/base 2025-08-14T21:18:06.1511607Z * [new branch] gh/laithsakka/30/base -> origin/gh/laithsakka/30/base 2025-08-14T21:18:06.1511764Z * [new branch] gh/laithsakka/30/head -> origin/gh/laithsakka/30/head 2025-08-14T21:18:06.1515841Z * [new branch] gh/laithsakka/31/base -> origin/gh/laithsakka/31/base 2025-08-14T21:18:06.1519596Z * [new branch] gh/laithsakka/31/head -> origin/gh/laithsakka/31/head 2025-08-14T21:18:06.1523288Z * [new branch] gh/laithsakka/32/base -> origin/gh/laithsakka/32/base 2025-08-14T21:18:06.1527060Z * [new branch] gh/laithsakka/32/head -> origin/gh/laithsakka/32/head 2025-08-14T21:18:06.1528669Z * [new branch] gh/lucaskabela/1/base -> origin/gh/lucaskabela/1/base 2025-08-14T21:18:06.1528853Z * [new branch] gh/lucaskabela/1/head -> origin/gh/lucaskabela/1/head 2025-08-14T21:18:06.1529056Z * [new branch] gh/lucaskabela/10/base -> origin/gh/lucaskabela/10/base 2025-08-14T21:18:06.1529270Z * [new branch] gh/lucaskabela/10/head -> origin/gh/lucaskabela/10/head 2025-08-14T21:18:06.1529398Z * [new branch] gh/lucaskabela/10/orig -> origin/gh/lucaskabela/10/orig 2025-08-14T21:18:06.1529523Z * [new branch] gh/lucaskabela/11/base -> origin/gh/lucaskabela/11/base 2025-08-14T21:18:06.1529654Z * [new branch] gh/lucaskabela/11/head -> origin/gh/lucaskabela/11/head 2025-08-14T21:18:06.1529779Z * [new branch] gh/lucaskabela/11/orig -> origin/gh/lucaskabela/11/orig 2025-08-14T21:18:06.1529912Z * [new branch] gh/lucaskabela/12/base -> origin/gh/lucaskabela/12/base 2025-08-14T21:18:06.1530034Z * [new branch] gh/lucaskabela/12/head -> origin/gh/lucaskabela/12/head 2025-08-14T21:18:06.1530157Z * [new branch] gh/lucaskabela/12/orig -> origin/gh/lucaskabela/12/orig 2025-08-14T21:18:06.1530291Z * [new branch] gh/lucaskabela/13/base -> origin/gh/lucaskabela/13/base 2025-08-14T21:18:06.1530416Z * [new branch] gh/lucaskabela/13/head -> origin/gh/lucaskabela/13/head 2025-08-14T21:18:06.1530544Z * [new branch] gh/lucaskabela/13/orig -> origin/gh/lucaskabela/13/orig 2025-08-14T21:18:06.1530668Z * [new branch] gh/lucaskabela/14/base -> origin/gh/lucaskabela/14/base 2025-08-14T21:18:06.1530790Z * [new branch] gh/lucaskabela/14/head -> origin/gh/lucaskabela/14/head 2025-08-14T21:18:06.1530923Z * [new branch] gh/lucaskabela/14/orig -> origin/gh/lucaskabela/14/orig 2025-08-14T21:18:06.1531048Z * [new branch] gh/lucaskabela/15/base -> origin/gh/lucaskabela/15/base 2025-08-14T21:18:06.1531510Z * [new branch] gh/lucaskabela/15/head -> origin/gh/lucaskabela/15/head 2025-08-14T21:18:06.1531634Z * [new branch] gh/lucaskabela/15/orig -> origin/gh/lucaskabela/15/orig 2025-08-14T21:18:06.1531757Z * [new branch] gh/lucaskabela/16/base -> origin/gh/lucaskabela/16/base 2025-08-14T21:18:06.1532015Z * [new branch] gh/lucaskabela/16/head -> origin/gh/lucaskabela/16/head 2025-08-14T21:18:06.1537255Z * [new branch] gh/lucaskabela/16/orig -> origin/gh/lucaskabela/16/orig 2025-08-14T21:18:06.1537423Z * [new branch] gh/lucaskabela/17/base -> origin/gh/lucaskabela/17/base 2025-08-14T21:18:06.1537555Z * [new branch] gh/lucaskabela/17/head -> origin/gh/lucaskabela/17/head 2025-08-14T21:18:06.1537725Z * [new branch] gh/lucaskabela/17/orig -> origin/gh/lucaskabela/17/orig 2025-08-14T21:18:06.1537864Z * [new branch] gh/lucaskabela/2/base -> origin/gh/lucaskabela/2/base 2025-08-14T21:18:06.1538004Z * [new branch] gh/lucaskabela/2/head -> origin/gh/lucaskabela/2/head 2025-08-14T21:18:06.1538132Z * [new branch] gh/lucaskabela/2/orig -> origin/gh/lucaskabela/2/orig 2025-08-14T21:18:06.1538269Z * [new branch] gh/lucaskabela/3/base -> origin/gh/lucaskabela/3/base 2025-08-14T21:18:06.1538411Z * [new branch] gh/lucaskabela/3/head -> origin/gh/lucaskabela/3/head 2025-08-14T21:18:06.1538539Z * [new branch] gh/lucaskabela/3/orig -> origin/gh/lucaskabela/3/orig 2025-08-14T21:18:06.1539990Z * [new branch] gh/lucaskabela/4/base -> origin/gh/lucaskabela/4/base 2025-08-14T21:18:06.1540126Z * [new branch] gh/lucaskabela/4/head -> origin/gh/lucaskabela/4/head 2025-08-14T21:18:06.1540708Z * [new branch] gh/lucaskabela/4/orig -> origin/gh/lucaskabela/4/orig 2025-08-14T21:18:06.1543791Z * [new branch] gh/lucaskabela/5/base -> origin/gh/lucaskabela/5/base 2025-08-14T21:18:06.1543946Z * [new branch] gh/lucaskabela/5/head -> origin/gh/lucaskabela/5/head 2025-08-14T21:18:06.1544081Z * [new branch] gh/lucaskabela/5/orig -> origin/gh/lucaskabela/5/orig 2025-08-14T21:18:06.1544298Z * [new branch] gh/lucaskabela/6/base -> origin/gh/lucaskabela/6/base 2025-08-14T21:18:06.1544471Z * [new branch] gh/lucaskabela/6/head -> origin/gh/lucaskabela/6/head 2025-08-14T21:18:06.1545048Z * [new branch] gh/lucaskabela/6/orig -> origin/gh/lucaskabela/6/orig 2025-08-14T21:18:06.1548288Z * [new branch] gh/lucaskabela/7/base -> origin/gh/lucaskabela/7/base 2025-08-14T21:18:06.1548458Z * [new branch] gh/lucaskabela/7/head -> origin/gh/lucaskabela/7/head 2025-08-14T21:18:06.1548602Z * [new branch] gh/lucaskabela/7/orig -> origin/gh/lucaskabela/7/orig 2025-08-14T21:18:06.1548737Z * [new branch] gh/lucaskabela/8/base -> origin/gh/lucaskabela/8/base 2025-08-14T21:18:06.1548861Z * [new branch] gh/lucaskabela/8/head -> origin/gh/lucaskabela/8/head 2025-08-14T21:18:06.1549022Z * [new branch] gh/lucaskabela/8/orig -> origin/gh/lucaskabela/8/orig 2025-08-14T21:18:06.1550384Z * [new branch] gh/lucaskabela/9/base -> origin/gh/lucaskabela/9/base 2025-08-14T21:18:06.1550584Z * [new branch] gh/lucaskabela/9/head -> origin/gh/lucaskabela/9/head 2025-08-14T21:18:06.1553021Z * [new branch] gh/lucaskabela/9/orig -> origin/gh/lucaskabela/9/orig 2025-08-14T21:18:06.1553164Z * [new branch] gh/lw/1/base -> origin/gh/lw/1/base 2025-08-14T21:18:06.1553279Z * [new branch] gh/lw/1/head -> origin/gh/lw/1/head 2025-08-14T21:18:06.1553398Z * [new branch] gh/lw/1/orig -> origin/gh/lw/1/orig 2025-08-14T21:18:06.1556962Z * [new branch] gh/lw/2/base -> origin/gh/lw/2/base 2025-08-14T21:18:06.1557104Z * [new branch] gh/lw/2/head -> origin/gh/lw/2/head 2025-08-14T21:18:06.1557212Z * [new branch] gh/lw/2/orig -> origin/gh/lw/2/orig 2025-08-14T21:18:06.1557495Z * [new branch] gh/lw/3/base -> origin/gh/lw/3/base 2025-08-14T21:18:06.1557600Z * [new branch] gh/lw/3/head -> origin/gh/lw/3/head 2025-08-14T21:18:06.1557713Z * [new branch] gh/lw/3/orig -> origin/gh/lw/3/orig 2025-08-14T21:18:06.1561698Z * [new branch] gh/malfet/14/base -> origin/gh/malfet/14/base 2025-08-14T21:18:06.1561852Z * [new branch] gh/malfet/330/base -> origin/gh/malfet/330/base 2025-08-14T21:18:06.1561992Z * [new branch] gh/malfet/330/head -> origin/gh/malfet/330/head 2025-08-14T21:18:06.1562108Z * [new branch] gh/malfet/330/orig -> origin/gh/malfet/330/orig 2025-08-14T21:18:06.1562225Z * [new branch] gh/malfet/396/base -> origin/gh/malfet/396/base 2025-08-14T21:18:06.1562381Z * [new branch] gh/malfet/396/head -> origin/gh/malfet/396/head 2025-08-14T21:18:06.1562950Z * [new branch] gh/malfet/396/orig -> origin/gh/malfet/396/orig 2025-08-14T21:18:06.1564314Z * [new branch] gh/malfet/397/base -> origin/gh/malfet/397/base 2025-08-14T21:18:06.1564441Z * [new branch] gh/malfet/397/head -> origin/gh/malfet/397/head 2025-08-14T21:18:06.1564990Z * [new branch] gh/malfet/397/orig -> origin/gh/malfet/397/orig 2025-08-14T21:18:06.1568148Z * [new branch] gh/malfet/398/base -> origin/gh/malfet/398/base 2025-08-14T21:18:06.1568431Z * [new branch] gh/malfet/398/head -> origin/gh/malfet/398/head 2025-08-14T21:18:06.1568563Z * [new branch] gh/malfet/398/orig -> origin/gh/malfet/398/orig 2025-08-14T21:18:06.1568683Z * [new branch] gh/malfet/399/base -> origin/gh/malfet/399/base 2025-08-14T21:18:06.1568816Z * [new branch] gh/malfet/399/head -> origin/gh/malfet/399/head 2025-08-14T21:18:06.1568976Z * [new branch] gh/malfet/399/orig -> origin/gh/malfet/399/orig 2025-08-14T21:18:06.1572849Z * [new branch] gh/malfet/414/base -> origin/gh/malfet/414/base 2025-08-14T21:18:06.1573010Z * [new branch] gh/malfet/414/head -> origin/gh/malfet/414/head 2025-08-14T21:18:06.1573132Z * [new branch] gh/malfet/414/orig -> origin/gh/malfet/414/orig 2025-08-14T21:18:06.1573255Z * [new branch] gh/malfet/417/base -> origin/gh/malfet/417/base 2025-08-14T21:18:06.1573385Z * [new branch] gh/malfet/417/head -> origin/gh/malfet/417/head 2025-08-14T21:18:06.1573502Z * [new branch] gh/malfet/417/orig -> origin/gh/malfet/417/orig 2025-08-14T21:18:06.1573982Z * [new branch] gh/malfet/418/base -> origin/gh/malfet/418/base 2025-08-14T21:18:06.1574543Z * [new branch] gh/malfet/418/head -> origin/gh/malfet/418/head 2025-08-14T21:18:06.1575177Z * [new branch] gh/malfet/418/orig -> origin/gh/malfet/418/orig 2025-08-14T21:18:06.1576519Z * [new branch] gh/malfet/422/base -> origin/gh/malfet/422/base 2025-08-14T21:18:06.1576652Z * [new branch] gh/malfet/422/head -> origin/gh/malfet/422/head 2025-08-14T21:18:06.1577131Z * [new branch] gh/malfet/422/orig -> origin/gh/malfet/422/orig 2025-08-14T21:18:06.1580635Z * [new branch] gh/malfet/438/base -> origin/gh/malfet/438/base 2025-08-14T21:18:06.1580806Z * [new branch] gh/malfet/438/head -> origin/gh/malfet/438/head 2025-08-14T21:18:06.1580925Z * [new branch] gh/malfet/438/orig -> origin/gh/malfet/438/orig 2025-08-14T21:18:06.1581050Z * [new branch] gh/malfet/439/base -> origin/gh/malfet/439/base 2025-08-14T21:18:06.1581366Z * [new branch] gh/malfet/439/head -> origin/gh/malfet/439/head 2025-08-14T21:18:06.1581492Z * [new branch] gh/malfet/439/orig -> origin/gh/malfet/439/orig 2025-08-14T21:18:06.1582363Z * [new branch] gh/malfet/440/base -> origin/gh/malfet/440/base 2025-08-14T21:18:06.1582669Z * [new branch] gh/malfet/440/head -> origin/gh/malfet/440/head 2025-08-14T21:18:06.1586688Z * [new branch] gh/malfet/440/orig -> origin/gh/malfet/440/orig 2025-08-14T21:18:06.1591332Z * [new branch] gh/malfet/441/base -> origin/gh/malfet/441/base 2025-08-14T21:18:06.1595500Z * [new branch] gh/malfet/441/head -> origin/gh/malfet/441/head 2025-08-14T21:18:06.1599525Z * [new branch] gh/malfet/441/orig -> origin/gh/malfet/441/orig 2025-08-14T21:18:06.1603569Z * [new branch] gh/malfet/442/base -> origin/gh/malfet/442/base 2025-08-14T21:18:06.1608120Z * [new branch] gh/malfet/442/head -> origin/gh/malfet/442/head 2025-08-14T21:18:06.1612167Z * [new branch] gh/malfet/442/orig -> origin/gh/malfet/442/orig 2025-08-14T21:18:06.1615760Z * [new branch] gh/malfet/443/base -> origin/gh/malfet/443/base 2025-08-14T21:18:06.1616053Z * [new branch] gh/malfet/443/head -> origin/gh/malfet/443/head 2025-08-14T21:18:06.1616196Z * [new branch] gh/malfet/443/orig -> origin/gh/malfet/443/orig 2025-08-14T21:18:06.1616501Z * [new branch] gh/malfet/444/base -> origin/gh/malfet/444/base 2025-08-14T21:18:06.1616753Z * [new branch] gh/malfet/444/head -> origin/gh/malfet/444/head 2025-08-14T21:18:06.1617327Z * [new branch] gh/malfet/444/orig -> origin/gh/malfet/444/orig 2025-08-14T21:18:06.1617485Z * [new branch] gh/malfet/445/base -> origin/gh/malfet/445/base 2025-08-14T21:18:06.1617653Z * [new branch] gh/malfet/445/head -> origin/gh/malfet/445/head 2025-08-14T21:18:06.1617784Z * [new branch] gh/malfet/445/orig -> origin/gh/malfet/445/orig 2025-08-14T21:18:06.1617903Z * [new branch] gh/malfet/446/base -> origin/gh/malfet/446/base 2025-08-14T21:18:06.1618028Z * [new branch] gh/malfet/446/head -> origin/gh/malfet/446/head 2025-08-14T21:18:06.1618144Z * [new branch] gh/malfet/446/orig -> origin/gh/malfet/446/orig 2025-08-14T21:18:06.1618267Z * [new branch] gh/malfet/447/base -> origin/gh/malfet/447/base 2025-08-14T21:18:06.1618388Z * [new branch] gh/malfet/447/head -> origin/gh/malfet/447/head 2025-08-14T21:18:06.1618500Z * [new branch] gh/malfet/448/base -> origin/gh/malfet/448/base 2025-08-14T21:18:06.1618620Z * [new branch] gh/malfet/448/head -> origin/gh/malfet/448/head 2025-08-14T21:18:06.1618737Z * [new branch] gh/malfet/449/base -> origin/gh/malfet/449/base 2025-08-14T21:18:06.1618849Z * [new branch] gh/malfet/449/head -> origin/gh/malfet/449/head 2025-08-14T21:18:06.1618967Z * [new branch] gh/malfet/450/base -> origin/gh/malfet/450/base 2025-08-14T21:18:06.1619080Z * [new branch] gh/malfet/450/head -> origin/gh/malfet/450/head 2025-08-14T21:18:06.1619191Z * [new branch] gh/malfet/451/base -> origin/gh/malfet/451/base 2025-08-14T21:18:06.1619316Z * [new branch] gh/malfet/451/head -> origin/gh/malfet/451/head 2025-08-14T21:18:06.1619429Z * [new branch] gh/malfet/452/base -> origin/gh/malfet/452/base 2025-08-14T21:18:06.1619549Z * [new branch] gh/malfet/452/head -> origin/gh/malfet/452/head 2025-08-14T21:18:06.1619663Z * [new branch] gh/malfet/452/orig -> origin/gh/malfet/452/orig 2025-08-14T21:18:06.1619955Z * [new branch] gh/malfet/453/base -> origin/gh/malfet/453/base 2025-08-14T21:18:06.1620076Z * [new branch] gh/malfet/453/head -> origin/gh/malfet/453/head 2025-08-14T21:18:06.1620188Z * [new branch] gh/malfet/453/orig -> origin/gh/malfet/453/orig 2025-08-14T21:18:06.1620307Z * [new branch] gh/malfet/454/base -> origin/gh/malfet/454/base 2025-08-14T21:18:06.1620421Z * [new branch] gh/malfet/454/head -> origin/gh/malfet/454/head 2025-08-14T21:18:06.1620541Z * [new branch] gh/malfet/454/orig -> origin/gh/malfet/454/orig 2025-08-14T21:18:06.1620663Z * [new branch] gh/malfet/455/base -> origin/gh/malfet/455/base 2025-08-14T21:18:06.1620778Z * [new branch] gh/malfet/455/head -> origin/gh/malfet/455/head 2025-08-14T21:18:06.1620900Z * [new branch] gh/malfet/455/orig -> origin/gh/malfet/455/orig 2025-08-14T21:18:06.1621019Z * [new branch] gh/malfet/456/base -> origin/gh/malfet/456/base 2025-08-14T21:18:06.1621132Z * [new branch] gh/malfet/456/head -> origin/gh/malfet/456/head 2025-08-14T21:18:06.1621250Z * [new branch] gh/malfet/456/orig -> origin/gh/malfet/456/orig 2025-08-14T21:18:06.1621371Z * [new branch] gh/malfet/457/base -> origin/gh/malfet/457/base 2025-08-14T21:18:06.1621488Z * [new branch] gh/malfet/457/head -> origin/gh/malfet/457/head 2025-08-14T21:18:06.1621650Z * [new branch] gh/malfet/457/orig -> origin/gh/malfet/457/orig 2025-08-14T21:18:06.1621768Z * [new branch] gh/malfet/458/base -> origin/gh/malfet/458/base 2025-08-14T21:18:06.1621888Z * [new branch] gh/malfet/458/head -> origin/gh/malfet/458/head 2025-08-14T21:18:06.1622002Z * [new branch] gh/malfet/458/orig -> origin/gh/malfet/458/orig 2025-08-14T21:18:06.1622126Z * [new branch] gh/malfet/459/base -> origin/gh/malfet/459/base 2025-08-14T21:18:06.1622683Z * [new branch] gh/malfet/459/head -> origin/gh/malfet/459/head 2025-08-14T21:18:06.1623300Z * [new branch] gh/malfet/459/orig -> origin/gh/malfet/459/orig 2025-08-14T21:18:06.1626015Z * [new branch] gh/malfet/460/base -> origin/gh/malfet/460/base 2025-08-14T21:18:06.1626317Z * [new branch] gh/malfet/460/head -> origin/gh/malfet/460/head 2025-08-14T21:18:06.1626481Z * [new branch] gh/malfet/460/orig -> origin/gh/malfet/460/orig 2025-08-14T21:18:06.1626654Z * [new branch] gh/malfet/461/base -> origin/gh/malfet/461/base 2025-08-14T21:18:06.1627188Z * [new branch] gh/malfet/461/head -> origin/gh/malfet/461/head 2025-08-14T21:18:06.1628796Z * [new branch] gh/malfet/461/orig -> origin/gh/malfet/461/orig 2025-08-14T21:18:06.1629118Z * [new branch] gh/malfet/462/base -> origin/gh/malfet/462/base 2025-08-14T21:18:06.1629260Z * [new branch] gh/malfet/462/head -> origin/gh/malfet/462/head 2025-08-14T21:18:06.1633033Z * [new branch] gh/malfet/462/orig -> origin/gh/malfet/462/orig 2025-08-14T21:18:06.1636430Z * [new branch] gh/malfet/463/base -> origin/gh/malfet/463/base 2025-08-14T21:18:06.1640066Z * [new branch] gh/malfet/463/head -> origin/gh/malfet/463/head 2025-08-14T21:18:06.1643115Z * [new branch] gh/malfet/463/orig -> origin/gh/malfet/463/orig 2025-08-14T21:18:06.1646280Z * [new branch] gh/malfet/464/base -> origin/gh/malfet/464/base 2025-08-14T21:18:06.1649365Z * [new branch] gh/malfet/464/head -> origin/gh/malfet/464/head 2025-08-14T21:18:06.1651321Z * [new branch] gh/malfet/464/orig -> origin/gh/malfet/464/orig 2025-08-14T21:18:06.1651626Z * [new branch] gh/malfet/465/base -> origin/gh/malfet/465/base 2025-08-14T21:18:06.1651747Z * [new branch] gh/malfet/465/head -> origin/gh/malfet/465/head 2025-08-14T21:18:06.1651875Z * [new branch] gh/malfet/465/orig -> origin/gh/malfet/465/orig 2025-08-14T21:18:06.1651991Z * [new branch] gh/malfet/466/base -> origin/gh/malfet/466/base 2025-08-14T21:18:06.1652108Z * [new branch] gh/malfet/466/head -> origin/gh/malfet/466/head 2025-08-14T21:18:06.1652239Z * [new branch] gh/malfet/466/orig -> origin/gh/malfet/466/orig 2025-08-14T21:18:06.1652354Z * [new branch] gh/malfet/467/base -> origin/gh/malfet/467/base 2025-08-14T21:18:06.1652467Z * [new branch] gh/malfet/467/head -> origin/gh/malfet/467/head 2025-08-14T21:18:06.1652587Z * [new branch] gh/malfet/467/orig -> origin/gh/malfet/467/orig 2025-08-14T21:18:06.1652704Z * [new branch] gh/malfet/468/base -> origin/gh/malfet/468/base 2025-08-14T21:18:06.1652826Z * [new branch] gh/malfet/468/head -> origin/gh/malfet/468/head 2025-08-14T21:18:06.1652938Z * [new branch] gh/malfet/468/orig -> origin/gh/malfet/468/orig 2025-08-14T21:18:06.1653052Z * [new branch] gh/malfet/469/base -> origin/gh/malfet/469/base 2025-08-14T21:18:06.1653174Z * [new branch] gh/malfet/469/head -> origin/gh/malfet/469/head 2025-08-14T21:18:06.1653346Z * [new branch] gh/malfet/469/orig -> origin/gh/malfet/469/orig 2025-08-14T21:18:06.1653471Z * [new branch] gh/malfet/470/base -> origin/gh/malfet/470/base 2025-08-14T21:18:06.1653585Z * [new branch] gh/malfet/470/head -> origin/gh/malfet/470/head 2025-08-14T21:18:06.1653697Z * [new branch] gh/malfet/470/orig -> origin/gh/malfet/470/orig 2025-08-14T21:18:06.1653820Z * [new branch] gh/malfet/471/base -> origin/gh/malfet/471/base 2025-08-14T21:18:06.1653933Z * [new branch] gh/malfet/471/head -> origin/gh/malfet/471/head 2025-08-14T21:18:06.1654052Z * [new branch] gh/malfet/471/orig -> origin/gh/malfet/471/orig 2025-08-14T21:18:06.1654163Z * [new branch] gh/malfet/472/base -> origin/gh/malfet/472/base 2025-08-14T21:18:06.1654274Z * [new branch] gh/malfet/472/head -> origin/gh/malfet/472/head 2025-08-14T21:18:06.1654395Z * [new branch] gh/malfet/472/orig -> origin/gh/malfet/472/orig 2025-08-14T21:18:06.1654509Z * [new branch] gh/malfet/473/base -> origin/gh/malfet/473/base 2025-08-14T21:18:06.1654620Z * [new branch] gh/malfet/473/head -> origin/gh/malfet/473/head 2025-08-14T21:18:06.1654892Z * [new branch] gh/malfet/473/orig -> origin/gh/malfet/473/orig 2025-08-14T21:18:06.1655112Z * [new branch] gh/malfet/474/base -> origin/gh/malfet/474/base 2025-08-14T21:18:06.1655237Z * [new branch] gh/malfet/474/head -> origin/gh/malfet/474/head 2025-08-14T21:18:06.1655433Z * [new branch] gh/malfet/474/orig -> origin/gh/malfet/474/orig 2025-08-14T21:18:06.1655692Z * [new branch] gh/malfet/475/base -> origin/gh/malfet/475/base 2025-08-14T21:18:06.1655816Z * [new branch] gh/malfet/475/head -> origin/gh/malfet/475/head 2025-08-14T21:18:06.1657266Z * [new branch] gh/malfet/475/orig -> origin/gh/malfet/475/orig 2025-08-14T21:18:06.1657418Z * [new branch] gh/malfet/476/base -> origin/gh/malfet/476/base 2025-08-14T21:18:06.1657813Z * [new branch] gh/malfet/476/head -> origin/gh/malfet/476/head 2025-08-14T21:18:06.1658413Z * [new branch] gh/malfet/476/orig -> origin/gh/malfet/476/orig 2025-08-14T21:18:06.1659547Z * [new branch] gh/malfet/477/base -> origin/gh/malfet/477/base 2025-08-14T21:18:06.1659880Z * [new branch] gh/malfet/477/head -> origin/gh/malfet/477/head 2025-08-14T21:18:06.1661402Z * [new branch] gh/malfet/477/orig -> origin/gh/malfet/477/orig 2025-08-14T21:18:06.1661562Z * [new branch] gh/malfet/478/base -> origin/gh/malfet/478/base 2025-08-14T21:18:06.1661981Z * [new branch] gh/malfet/478/head -> origin/gh/malfet/478/head 2025-08-14T21:18:06.1662599Z * [new branch] gh/malfet/478/orig -> origin/gh/malfet/478/orig 2025-08-14T21:18:06.1663465Z * [new branch] gh/malfet/479/base -> origin/gh/malfet/479/base 2025-08-14T21:18:06.1663870Z * [new branch] gh/malfet/479/head -> origin/gh/malfet/479/head 2025-08-14T21:18:06.1665119Z * [new branch] gh/malfet/479/orig -> origin/gh/malfet/479/orig 2025-08-14T21:18:06.1665344Z * [new branch] gh/malfet/480/base -> origin/gh/malfet/480/base 2025-08-14T21:18:06.1667244Z * [new branch] gh/malfet/480/head -> origin/gh/malfet/480/head 2025-08-14T21:18:06.1667402Z * [new branch] gh/malfet/480/orig -> origin/gh/malfet/480/orig 2025-08-14T21:18:06.1667523Z * [new branch] gh/malfet/481/base -> origin/gh/malfet/481/base 2025-08-14T21:18:06.1668291Z * [new branch] gh/malfet/481/head -> origin/gh/malfet/481/head 2025-08-14T21:18:06.1668695Z * [new branch] gh/malfet/481/orig -> origin/gh/malfet/481/orig 2025-08-14T21:18:06.1670024Z * [new branch] gh/malfet/482/base -> origin/gh/malfet/482/base 2025-08-14T21:18:06.1670157Z * [new branch] gh/malfet/482/head -> origin/gh/malfet/482/head 2025-08-14T21:18:06.1670573Z * [new branch] gh/malfet/482/orig -> origin/gh/malfet/482/orig 2025-08-14T21:18:06.1672332Z * [new branch] gh/malfet/483/base -> origin/gh/malfet/483/base 2025-08-14T21:18:06.1672644Z * [new branch] gh/malfet/483/head -> origin/gh/malfet/483/head 2025-08-14T21:18:06.1672782Z * [new branch] gh/malfet/483/orig -> origin/gh/malfet/483/orig 2025-08-14T21:18:06.1674081Z * [new branch] gh/malfet/484/base -> origin/gh/malfet/484/base 2025-08-14T21:18:06.1674386Z * [new branch] gh/malfet/484/head -> origin/gh/malfet/484/head 2025-08-14T21:18:06.1674589Z * [new branch] gh/malfet/484/orig -> origin/gh/malfet/484/orig 2025-08-14T21:18:06.1676100Z * [new branch] gh/malfet/485/base -> origin/gh/malfet/485/base 2025-08-14T21:18:06.1676407Z * [new branch] gh/malfet/485/head -> origin/gh/malfet/485/head 2025-08-14T21:18:06.1676710Z * [new branch] gh/malfet/485/orig -> origin/gh/malfet/485/orig 2025-08-14T21:18:06.1678112Z * [new branch] gh/malfet/486/base -> origin/gh/malfet/486/base 2025-08-14T21:18:06.1678415Z * [new branch] gh/malfet/486/head -> origin/gh/malfet/486/head 2025-08-14T21:18:06.1678685Z * [new branch] gh/malfet/486/orig -> origin/gh/malfet/486/orig 2025-08-14T21:18:06.1680701Z * [new branch] gh/malfet/487/base -> origin/gh/malfet/487/base 2025-08-14T21:18:06.1681018Z * [new branch] gh/malfet/487/head -> origin/gh/malfet/487/head 2025-08-14T21:18:06.1681162Z * [new branch] gh/malfet/487/orig -> origin/gh/malfet/487/orig 2025-08-14T21:18:06.1681929Z * [new branch] gh/malfet/488/base -> origin/gh/malfet/488/base 2025-08-14T21:18:06.1682293Z * [new branch] gh/malfet/488/head -> origin/gh/malfet/488/head 2025-08-14T21:18:06.1684559Z * [new branch] gh/malfet/488/orig -> origin/gh/malfet/488/orig 2025-08-14T21:18:06.1685004Z * [new branch] gh/malfet/489/base -> origin/gh/malfet/489/base 2025-08-14T21:18:06.1685169Z * [new branch] gh/malfet/489/head -> origin/gh/malfet/489/head 2025-08-14T21:18:06.1685308Z * [new branch] gh/malfet/489/orig -> origin/gh/malfet/489/orig 2025-08-14T21:18:06.1688981Z * [new branch] gh/malfet/490/base -> origin/gh/malfet/490/base 2025-08-14T21:18:06.1689303Z * [new branch] gh/malfet/490/head -> origin/gh/malfet/490/head 2025-08-14T21:18:06.1689530Z * [new branch] gh/malfet/490/orig -> origin/gh/malfet/490/orig 2025-08-14T21:18:06.1689672Z * [new branch] gh/malfet/64/base -> origin/gh/malfet/64/base 2025-08-14T21:18:06.1689875Z * [new branch] gh/malfet/64/head -> origin/gh/malfet/64/head 2025-08-14T21:18:06.1690592Z * [new branch] gh/manuelcandales/10/base -> origin/gh/manuelcandales/10/base 2025-08-14T21:18:06.1690767Z * [new branch] gh/manuelcandales/10/head -> origin/gh/manuelcandales/10/head 2025-08-14T21:18:06.1691210Z * [new branch] gh/manuelcandales/10/orig -> origin/gh/manuelcandales/10/orig 2025-08-14T21:18:06.1692561Z * [new branch] gh/manuelcandales/9/base -> origin/gh/manuelcandales/9/base 2025-08-14T21:18:06.1692847Z * [new branch] gh/manuelcandales/9/head -> origin/gh/manuelcandales/9/head 2025-08-14T21:18:06.1694601Z * [new branch] gh/manuelcandales/9/orig -> origin/gh/manuelcandales/9/orig 2025-08-14T21:18:06.1694948Z * [new branch] gh/markkm/1/base -> origin/gh/markkm/1/base 2025-08-14T21:18:06.1696264Z * [new branch] gh/masnesral/204/base -> origin/gh/masnesral/204/base 2025-08-14T21:18:06.1696595Z * [new branch] gh/masnesral/204/head -> origin/gh/masnesral/204/head 2025-08-14T21:18:06.1696849Z * [new branch] gh/masnesral/204/orig -> origin/gh/masnesral/204/orig 2025-08-14T21:18:06.1698490Z * [new branch] gh/masnesral/223/base -> origin/gh/masnesral/223/base 2025-08-14T21:18:06.1698645Z * [new branch] gh/masnesral/223/head -> origin/gh/masnesral/223/head 2025-08-14T21:18:06.1699152Z * [new branch] gh/masnesral/223/orig -> origin/gh/masnesral/223/orig 2025-08-14T21:18:06.1700152Z * [new branch] gh/masnesral/224/base -> origin/gh/masnesral/224/base 2025-08-14T21:18:06.1700355Z * [new branch] gh/masnesral/224/head -> origin/gh/masnesral/224/head 2025-08-14T21:18:06.1701344Z * [new branch] gh/masnesral/224/orig -> origin/gh/masnesral/224/orig 2025-08-14T21:18:06.1701814Z * [new branch] gh/masnesral/225/base -> origin/gh/masnesral/225/base 2025-08-14T21:18:06.1702657Z * [new branch] gh/masnesral/225/head -> origin/gh/masnesral/225/head 2025-08-14T21:18:06.1703045Z * [new branch] gh/masnesral/225/orig -> origin/gh/masnesral/225/orig 2025-08-14T21:18:06.1704134Z * [new branch] gh/masnesral/226/base -> origin/gh/masnesral/226/base 2025-08-14T21:18:06.1704627Z * [new branch] gh/masnesral/226/head -> origin/gh/masnesral/226/head 2025-08-14T21:18:06.1705487Z * [new branch] gh/masnesral/226/orig -> origin/gh/masnesral/226/orig 2025-08-14T21:18:06.1706433Z * [new branch] gh/masnesral/227/base -> origin/gh/masnesral/227/base 2025-08-14T21:18:06.1706716Z * [new branch] gh/masnesral/227/head -> origin/gh/masnesral/227/head 2025-08-14T21:18:06.1707725Z * [new branch] gh/masnesral/227/orig -> origin/gh/masnesral/227/orig 2025-08-14T21:18:06.1710038Z * [new branch] gh/masnesral/228/base -> origin/gh/masnesral/228/base 2025-08-14T21:18:06.1710341Z * [new branch] gh/masnesral/228/head -> origin/gh/masnesral/228/head 2025-08-14T21:18:06.1710463Z * [new branch] gh/masnesral/228/orig -> origin/gh/masnesral/228/orig 2025-08-14T21:18:06.1714556Z * [new branch] gh/masnesral/229/base -> origin/gh/masnesral/229/base 2025-08-14T21:18:06.1718055Z * [new branch] gh/masnesral/229/head -> origin/gh/masnesral/229/head 2025-08-14T21:18:06.1722571Z * [new branch] gh/masnesral/229/orig -> origin/gh/masnesral/229/orig 2025-08-14T21:18:06.1726055Z * [new branch] gh/masnesral/230/base -> origin/gh/masnesral/230/base 2025-08-14T21:18:06.1729528Z * [new branch] gh/masnesral/230/head -> origin/gh/masnesral/230/head 2025-08-14T21:18:06.1733018Z * [new branch] gh/masnesral/230/orig -> origin/gh/masnesral/230/orig 2025-08-14T21:18:06.1736504Z * [new branch] gh/masnesral/231/base -> origin/gh/masnesral/231/base 2025-08-14T21:18:06.1736657Z * [new branch] gh/masnesral/231/head -> origin/gh/masnesral/231/head 2025-08-14T21:18:06.1736827Z * [new branch] gh/masnesral/231/orig -> origin/gh/masnesral/231/orig 2025-08-14T21:18:06.1736950Z * [new branch] gh/masnesral/232/base -> origin/gh/masnesral/232/base 2025-08-14T21:18:06.1737075Z * [new branch] gh/masnesral/232/head -> origin/gh/masnesral/232/head 2025-08-14T21:18:06.1737332Z * [new branch] gh/masnesral/232/orig -> origin/gh/masnesral/232/orig 2025-08-14T21:18:06.1737460Z * [new branch] gh/masnesral/233/base -> origin/gh/masnesral/233/base 2025-08-14T21:18:06.1737589Z * [new branch] gh/masnesral/233/head -> origin/gh/masnesral/233/head 2025-08-14T21:18:06.1737708Z * [new branch] gh/masnesral/233/orig -> origin/gh/masnesral/233/orig 2025-08-14T21:18:06.1737849Z * [new branch] gh/masnesral/234/base -> origin/gh/masnesral/234/base 2025-08-14T21:18:06.1737972Z * [new branch] gh/masnesral/234/head -> origin/gh/masnesral/234/head 2025-08-14T21:18:06.1738096Z * [new branch] gh/masnesral/234/orig -> origin/gh/masnesral/234/orig 2025-08-14T21:18:06.1738220Z * [new branch] gh/masnesral/235/base -> origin/gh/masnesral/235/base 2025-08-14T21:18:06.1738344Z * [new branch] gh/masnesral/235/head -> origin/gh/masnesral/235/head 2025-08-14T21:18:06.1738469Z * [new branch] gh/masnesral/235/orig -> origin/gh/masnesral/235/orig 2025-08-14T21:18:06.1738593Z * [new branch] gh/masnesral/236/base -> origin/gh/masnesral/236/base 2025-08-14T21:18:06.1738711Z * [new branch] gh/masnesral/236/head -> origin/gh/masnesral/236/head 2025-08-14T21:18:06.1738835Z * [new branch] gh/masnesral/236/orig -> origin/gh/masnesral/236/orig 2025-08-14T21:18:06.1738968Z * [new branch] gh/masnesral/34/base -> origin/gh/masnesral/34/base 2025-08-14T21:18:06.1739099Z * [new branch] gh/mhorowitz/0/base -> origin/gh/mhorowitz/0/base 2025-08-14T21:18:06.1739223Z * [new branch] gh/mhorowitz/0/head -> origin/gh/mhorowitz/0/head 2025-08-14T21:18:06.1739338Z * [new branch] gh/mhorowitz/1/base -> origin/gh/mhorowitz/1/base 2025-08-14T21:18:06.1739456Z * [new branch] gh/mhorowitz/1/head -> origin/gh/mhorowitz/1/head 2025-08-14T21:18:06.1739574Z * [new branch] gh/mhorowitz/2/base -> origin/gh/mhorowitz/2/base 2025-08-14T21:18:06.1739691Z * [new branch] gh/mhorowitz/2/head -> origin/gh/mhorowitz/2/head 2025-08-14T21:18:06.1739809Z * [new branch] gh/mhorowitz/3/base -> origin/gh/mhorowitz/3/base 2025-08-14T21:18:06.1739924Z * [new branch] gh/mhorowitz/3/head -> origin/gh/mhorowitz/3/head 2025-08-14T21:18:06.1740105Z * [new branch] gh/mhorowitz/4/base -> origin/gh/mhorowitz/4/base 2025-08-14T21:18:06.1740224Z * [new branch] gh/mhorowitz/4/head -> origin/gh/mhorowitz/4/head 2025-08-14T21:18:06.1740338Z * [new branch] gh/mhorowitz/5/base -> origin/gh/mhorowitz/5/base 2025-08-14T21:18:06.1740462Z * [new branch] gh/mhorowitz/5/head -> origin/gh/mhorowitz/5/head 2025-08-14T21:18:06.1740575Z * [new branch] gh/mhorowitz/6/base -> origin/gh/mhorowitz/6/base 2025-08-14T21:18:06.1740692Z * [new branch] gh/mhorowitz/6/head -> origin/gh/mhorowitz/6/head 2025-08-14T21:18:06.1740853Z * [new branch] gh/mikaylagawarecki/234/base -> origin/gh/mikaylagawarecki/234/base 2025-08-14T21:18:06.1741001Z * [new branch] gh/mikaylagawarecki/234/head -> origin/gh/mikaylagawarecki/234/head 2025-08-14T21:18:06.1741150Z * [new branch] gh/mikaylagawarecki/235/base -> origin/gh/mikaylagawarecki/235/base 2025-08-14T21:18:06.1741297Z * [new branch] gh/mikaylagawarecki/235/head -> origin/gh/mikaylagawarecki/235/head 2025-08-14T21:18:06.1741923Z * [new branch] gh/mikaylagawarecki/236/base -> origin/gh/mikaylagawarecki/236/base 2025-08-14T21:18:06.1742339Z * [new branch] gh/mikaylagawarecki/236/head -> origin/gh/mikaylagawarecki/236/head 2025-08-14T21:18:06.1743444Z * [new branch] gh/mikaylagawarecki/237/base -> origin/gh/mikaylagawarecki/237/base 2025-08-14T21:18:06.1744077Z * [new branch] gh/mikaylagawarecki/237/head -> origin/gh/mikaylagawarecki/237/head 2025-08-14T21:18:06.1744610Z * [new branch] gh/mikaylagawarecki/238/base -> origin/gh/mikaylagawarecki/238/base 2025-08-14T21:18:06.1745390Z * [new branch] gh/mikaylagawarecki/238/head -> origin/gh/mikaylagawarecki/238/head 2025-08-14T21:18:06.1746664Z * [new branch] gh/mikaylagawarecki/313/base -> origin/gh/mikaylagawarecki/313/base 2025-08-14T21:18:06.1746828Z * [new branch] gh/mikaylagawarecki/313/head -> origin/gh/mikaylagawarecki/313/head 2025-08-14T21:18:06.1747238Z * [new branch] gh/mikaylagawarecki/313/orig -> origin/gh/mikaylagawarecki/313/orig 2025-08-14T21:18:06.1750790Z * [new branch] gh/mikaylagawarecki/317/base -> origin/gh/mikaylagawarecki/317/base 2025-08-14T21:18:06.1751061Z * [new branch] gh/mikaylagawarecki/317/head -> origin/gh/mikaylagawarecki/317/head 2025-08-14T21:18:06.1755177Z * [new branch] gh/mikaylagawarecki/317/orig -> origin/gh/mikaylagawarecki/317/orig 2025-08-14T21:18:06.1759094Z * [new branch] gh/mikaylagawarecki/318/base -> origin/gh/mikaylagawarecki/318/base 2025-08-14T21:18:06.1761021Z * [new branch] gh/mikaylagawarecki/318/head -> origin/gh/mikaylagawarecki/318/head 2025-08-14T21:18:06.1765241Z * [new branch] gh/mikaylagawarecki/318/orig -> origin/gh/mikaylagawarecki/318/orig 2025-08-14T21:18:06.1768770Z * [new branch] gh/mikaylagawarecki/319/base -> origin/gh/mikaylagawarecki/319/base 2025-08-14T21:18:06.1772877Z * [new branch] gh/mikaylagawarecki/319/head -> origin/gh/mikaylagawarecki/319/head 2025-08-14T21:18:06.1773147Z * [new branch] gh/mikaylagawarecki/319/orig -> origin/gh/mikaylagawarecki/319/orig 2025-08-14T21:18:06.1778146Z * [new branch] gh/mikaylagawarecki/320/base -> origin/gh/mikaylagawarecki/320/base 2025-08-14T21:18:06.1778497Z * [new branch] gh/mikaylagawarecki/320/head -> origin/gh/mikaylagawarecki/320/head 2025-08-14T21:18:06.1778671Z * [new branch] gh/mikaylagawarecki/320/orig -> origin/gh/mikaylagawarecki/320/orig 2025-08-14T21:18:06.1778817Z * [new branch] gh/mikaylagawarecki/321/base -> origin/gh/mikaylagawarecki/321/base 2025-08-14T21:18:06.1779088Z * [new branch] gh/mikaylagawarecki/321/head -> origin/gh/mikaylagawarecki/321/head 2025-08-14T21:18:06.1779379Z * [new branch] gh/mikaylagawarecki/321/orig -> origin/gh/mikaylagawarecki/321/orig 2025-08-14T21:18:06.1779529Z * [new branch] gh/mikaylagawarecki/322/base -> origin/gh/mikaylagawarecki/322/base 2025-08-14T21:18:06.1779696Z * [new branch] gh/mikaylagawarecki/322/head -> origin/gh/mikaylagawarecki/322/head 2025-08-14T21:18:06.1779841Z * [new branch] gh/mikaylagawarecki/322/orig -> origin/gh/mikaylagawarecki/322/orig 2025-08-14T21:18:06.1779996Z * [new branch] gh/mikaylagawarecki/323/base -> origin/gh/mikaylagawarecki/323/base 2025-08-14T21:18:06.1780138Z * [new branch] gh/mikaylagawarecki/323/head -> origin/gh/mikaylagawarecki/323/head 2025-08-14T21:18:06.1780281Z * [new branch] gh/mikaylagawarecki/323/orig -> origin/gh/mikaylagawarecki/323/orig 2025-08-14T21:18:06.1780431Z * [new branch] gh/mikaylagawarecki/324/base -> origin/gh/mikaylagawarecki/324/base 2025-08-14T21:18:06.1780582Z * [new branch] gh/mikaylagawarecki/324/head -> origin/gh/mikaylagawarecki/324/head 2025-08-14T21:18:06.1780730Z * [new branch] gh/mikaylagawarecki/324/orig -> origin/gh/mikaylagawarecki/324/orig 2025-08-14T21:18:06.1780870Z * [new branch] gh/mikaylagawarecki/325/base -> origin/gh/mikaylagawarecki/325/base 2025-08-14T21:18:06.1781011Z * [new branch] gh/mikaylagawarecki/325/head -> origin/gh/mikaylagawarecki/325/head 2025-08-14T21:18:06.1781236Z * [new branch] gh/mikaylagawarecki/325/orig -> origin/gh/mikaylagawarecki/325/orig 2025-08-14T21:18:06.1781378Z * [new branch] gh/mikaylagawarecki/326/base -> origin/gh/mikaylagawarecki/326/base 2025-08-14T21:18:06.1781523Z * [new branch] gh/mikaylagawarecki/326/head -> origin/gh/mikaylagawarecki/326/head 2025-08-14T21:18:06.1781662Z * [new branch] gh/mikaylagawarecki/326/orig -> origin/gh/mikaylagawarecki/326/orig 2025-08-14T21:18:06.1781804Z * [new branch] gh/mikaylagawarecki/327/base -> origin/gh/mikaylagawarecki/327/base 2025-08-14T21:18:06.1781948Z * [new branch] gh/mikaylagawarecki/327/head -> origin/gh/mikaylagawarecki/327/head 2025-08-14T21:18:06.1782086Z * [new branch] gh/mikaylagawarecki/327/orig -> origin/gh/mikaylagawarecki/327/orig 2025-08-14T21:18:06.1782229Z * [new branch] gh/mikaylagawarecki/328/base -> origin/gh/mikaylagawarecki/328/base 2025-08-14T21:18:06.1782374Z * [new branch] gh/mikaylagawarecki/328/head -> origin/gh/mikaylagawarecki/328/head 2025-08-14T21:18:06.1782516Z * [new branch] gh/mikaylagawarecki/328/orig -> origin/gh/mikaylagawarecki/328/orig 2025-08-14T21:18:06.1782661Z * [new branch] gh/mikaylagawarecki/329/base -> origin/gh/mikaylagawarecki/329/base 2025-08-14T21:18:06.1782802Z * [new branch] gh/mikaylagawarecki/329/head -> origin/gh/mikaylagawarecki/329/head 2025-08-14T21:18:06.1782950Z * [new branch] gh/mikaylagawarecki/329/orig -> origin/gh/mikaylagawarecki/329/orig 2025-08-14T21:18:06.1783088Z * [new branch] gh/mikaylagawarecki/330/base -> origin/gh/mikaylagawarecki/330/base 2025-08-14T21:18:06.1783228Z * [new branch] gh/mikaylagawarecki/330/head -> origin/gh/mikaylagawarecki/330/head 2025-08-14T21:18:06.1783375Z * [new branch] gh/mikaylagawarecki/330/orig -> origin/gh/mikaylagawarecki/330/orig 2025-08-14T21:18:06.1783513Z * [new branch] gh/mikaylagawarecki/331/base -> origin/gh/mikaylagawarecki/331/base 2025-08-14T21:18:06.1783657Z * [new branch] gh/mikaylagawarecki/331/head -> origin/gh/mikaylagawarecki/331/head 2025-08-14T21:18:06.1783806Z * [new branch] gh/mikaylagawarecki/331/orig -> origin/gh/mikaylagawarecki/331/orig 2025-08-14T21:18:06.1783951Z * [new branch] gh/mikaylagawarecki/332/base -> origin/gh/mikaylagawarecki/332/base 2025-08-14T21:18:06.1784131Z * [new branch] gh/mikaylagawarecki/332/head -> origin/gh/mikaylagawarecki/332/head 2025-08-14T21:18:06.1784383Z * [new branch] gh/mikaylagawarecki/332/orig -> origin/gh/mikaylagawarecki/332/orig 2025-08-14T21:18:06.1784528Z * [new branch] gh/mikaylagawarecki/333/base -> origin/gh/mikaylagawarecki/333/base 2025-08-14T21:18:06.1784824Z * [new branch] gh/mikaylagawarecki/333/head -> origin/gh/mikaylagawarecki/333/head 2025-08-14T21:18:06.1784974Z * [new branch] gh/mikaylagawarecki/333/orig -> origin/gh/mikaylagawarecki/333/orig 2025-08-14T21:18:06.1785122Z * [new branch] gh/mikaylagawarecki/334/base -> origin/gh/mikaylagawarecki/334/base 2025-08-14T21:18:06.1785267Z * [new branch] gh/mikaylagawarecki/334/head -> origin/gh/mikaylagawarecki/334/head 2025-08-14T21:18:06.1785566Z * [new branch] gh/mikaylagawarecki/334/orig -> origin/gh/mikaylagawarecki/334/orig 2025-08-14T21:18:06.1787811Z * [new branch] gh/mlazos/1/base -> origin/gh/mlazos/1/base 2025-08-14T21:18:06.1788102Z * [new branch] gh/mlazos/1/head -> origin/gh/mlazos/1/head 2025-08-14T21:18:06.1788234Z * [new branch] gh/mlazos/1/orig -> origin/gh/mlazos/1/orig 2025-08-14T21:18:06.1789374Z * [new branch] gh/mlazos/10/base -> origin/gh/mlazos/10/base 2025-08-14T21:18:06.1789674Z * [new branch] gh/mlazos/10/head -> origin/gh/mlazos/10/head 2025-08-14T21:18:06.1790245Z * [new branch] gh/mlazos/10/orig -> origin/gh/mlazos/10/orig 2025-08-14T21:18:06.1792100Z * [new branch] gh/mlazos/11/base -> origin/gh/mlazos/11/base 2025-08-14T21:18:06.1792408Z * [new branch] gh/mlazos/11/head -> origin/gh/mlazos/11/head 2025-08-14T21:18:06.1792575Z * [new branch] gh/mlazos/11/orig -> origin/gh/mlazos/11/orig 2025-08-14T21:18:06.1792847Z * [new branch] gh/mlazos/12/base -> origin/gh/mlazos/12/base 2025-08-14T21:18:06.1793934Z * [new branch] gh/mlazos/12/head -> origin/gh/mlazos/12/head 2025-08-14T21:18:06.1794510Z * [new branch] gh/mlazos/12/orig -> origin/gh/mlazos/12/orig 2025-08-14T21:18:06.1796191Z * [new branch] gh/mlazos/13/base -> origin/gh/mlazos/13/base 2025-08-14T21:18:06.1796342Z * [new branch] gh/mlazos/13/head -> origin/gh/mlazos/13/head 2025-08-14T21:18:06.1796468Z * [new branch] gh/mlazos/13/orig -> origin/gh/mlazos/13/orig 2025-08-14T21:18:06.1797791Z * [new branch] gh/mlazos/2/base -> origin/gh/mlazos/2/base 2025-08-14T21:18:06.1798088Z * [new branch] gh/mlazos/2/head -> origin/gh/mlazos/2/head 2025-08-14T21:18:06.1798648Z * [new branch] gh/mlazos/2/orig -> origin/gh/mlazos/2/orig 2025-08-14T21:18:06.1799026Z * [new branch] gh/mlazos/3/base -> origin/gh/mlazos/3/base 2025-08-14T21:18:06.1799870Z * [new branch] gh/mlazos/3/head -> origin/gh/mlazos/3/head 2025-08-14T21:18:06.1800219Z * [new branch] gh/mlazos/3/orig -> origin/gh/mlazos/3/orig 2025-08-14T21:18:06.1802062Z * [new branch] gh/mlazos/4/base -> origin/gh/mlazos/4/base 2025-08-14T21:18:06.1802364Z * [new branch] gh/mlazos/4/head -> origin/gh/mlazos/4/head 2025-08-14T21:18:06.1802509Z * [new branch] gh/mlazos/4/orig -> origin/gh/mlazos/4/orig 2025-08-14T21:18:06.1803928Z * [new branch] gh/mlazos/5/base -> origin/gh/mlazos/5/base 2025-08-14T21:18:06.1804209Z * [new branch] gh/mlazos/5/head -> origin/gh/mlazos/5/head 2025-08-14T21:18:06.1804438Z * [new branch] gh/mlazos/5/orig -> origin/gh/mlazos/5/orig 2025-08-14T21:18:06.1806434Z * [new branch] gh/mlazos/6/base -> origin/gh/mlazos/6/base 2025-08-14T21:18:06.1806734Z * [new branch] gh/mlazos/6/head -> origin/gh/mlazos/6/head 2025-08-14T21:18:06.1806868Z * [new branch] gh/mlazos/6/orig -> origin/gh/mlazos/6/orig 2025-08-14T21:18:06.1808075Z * [new branch] gh/mlazos/7/base -> origin/gh/mlazos/7/base 2025-08-14T21:18:06.1808245Z * [new branch] gh/mlazos/7/head -> origin/gh/mlazos/7/head 2025-08-14T21:18:06.1810636Z * [new branch] gh/mlazos/7/orig -> origin/gh/mlazos/7/orig 2025-08-14T21:18:06.1810965Z * [new branch] gh/mlazos/8/base -> origin/gh/mlazos/8/base 2025-08-14T21:18:06.1811167Z * [new branch] gh/mlazos/8/head -> origin/gh/mlazos/8/head 2025-08-14T21:18:06.1811393Z * [new branch] gh/mlazos/8/orig -> origin/gh/mlazos/8/orig 2025-08-14T21:18:06.1811633Z * [new branch] gh/mlazos/9/base -> origin/gh/mlazos/9/base 2025-08-14T21:18:06.1813175Z * [new branch] gh/mlazos/9/head -> origin/gh/mlazos/9/head 2025-08-14T21:18:06.1813475Z * [new branch] gh/mlazos/9/orig -> origin/gh/mlazos/9/orig 2025-08-14T21:18:06.1814845Z * [new branch] gh/mrmiywj/1/base -> origin/gh/mrmiywj/1/base 2025-08-14T21:18:06.1815143Z * [new branch] gh/mrmiywj/1/head -> origin/gh/mrmiywj/1/head 2025-08-14T21:18:06.1816878Z * [new branch] gh/muchulee8/62/base -> origin/gh/muchulee8/62/base 2025-08-14T21:18:06.1817315Z * [new branch] gh/muchulee8/62/head -> origin/gh/muchulee8/62/head 2025-08-14T21:18:06.1817561Z * [new branch] gh/muchulee8/62/orig -> origin/gh/muchulee8/62/orig 2025-08-14T21:18:06.1818035Z * [new branch] gh/muchulee8/63/base -> origin/gh/muchulee8/63/base 2025-08-14T21:18:06.1819285Z * [new branch] gh/muchulee8/63/head -> origin/gh/muchulee8/63/head 2025-08-14T21:18:06.1819743Z * [new branch] gh/muchulee8/63/orig -> origin/gh/muchulee8/63/orig 2025-08-14T21:18:06.1821089Z * [new branch] gh/muchulee8/64/base -> origin/gh/muchulee8/64/base 2025-08-14T21:18:06.1821333Z * [new branch] gh/muchulee8/64/head -> origin/gh/muchulee8/64/head 2025-08-14T21:18:06.1823039Z * [new branch] gh/muchulee8/64/orig -> origin/gh/muchulee8/64/orig 2025-08-14T21:18:06.1823208Z * [new branch] gh/muchulee8/65/base -> origin/gh/muchulee8/65/base 2025-08-14T21:18:06.1823504Z * [new branch] gh/muchulee8/65/head -> origin/gh/muchulee8/65/head 2025-08-14T21:18:06.1824441Z * [new branch] gh/muchulee8/65/orig -> origin/gh/muchulee8/65/orig 2025-08-14T21:18:06.1827682Z * [new branch] gh/oulgen/35/base -> origin/gh/oulgen/35/base 2025-08-14T21:18:06.1828005Z * [new branch] gh/oulgen/35/head -> origin/gh/oulgen/35/head 2025-08-14T21:18:06.1828165Z * [new branch] gh/oulgen/35/orig -> origin/gh/oulgen/35/orig 2025-08-14T21:18:06.1828292Z * [new branch] gh/oulgen/44/base -> origin/gh/oulgen/44/base 2025-08-14T21:18:06.1828480Z * [new branch] gh/oulgen/44/head -> origin/gh/oulgen/44/head 2025-08-14T21:18:06.1829052Z * [new branch] gh/oulgen/44/orig -> origin/gh/oulgen/44/orig 2025-08-14T21:18:06.1829454Z * [new branch] gh/oulgen/45/base -> origin/gh/oulgen/45/base 2025-08-14T21:18:06.1830570Z * [new branch] gh/oulgen/45/head -> origin/gh/oulgen/45/head 2025-08-14T21:18:06.1830772Z * [new branch] gh/oulgen/45/orig -> origin/gh/oulgen/45/orig 2025-08-14T21:18:06.1833038Z * [new branch] gh/oulgen/46/base -> origin/gh/oulgen/46/base 2025-08-14T21:18:06.1833501Z * [new branch] gh/oulgen/46/head -> origin/gh/oulgen/46/head 2025-08-14T21:18:06.1833646Z * [new branch] gh/oulgen/46/orig -> origin/gh/oulgen/46/orig 2025-08-14T21:18:06.1833837Z * [new branch] gh/oulgen/47/base -> origin/gh/oulgen/47/base 2025-08-14T21:18:06.1834291Z * [new branch] gh/oulgen/47/head -> origin/gh/oulgen/47/head 2025-08-14T21:18:06.1835023Z * [new branch] gh/oulgen/47/orig -> origin/gh/oulgen/47/orig 2025-08-14T21:18:06.1838009Z * [new branch] gh/pearu/108/base -> origin/gh/pearu/108/base 2025-08-14T21:18:06.1838307Z * [new branch] gh/pearu/108/head -> origin/gh/pearu/108/head 2025-08-14T21:18:06.1838444Z * [new branch] gh/pearu/108/orig -> origin/gh/pearu/108/orig 2025-08-14T21:18:06.1838646Z * [new branch] gh/pearu/56/base -> origin/gh/pearu/56/base 2025-08-14T21:18:06.1839332Z * [new branch] gh/pearu/56/head -> origin/gh/pearu/56/head 2025-08-14T21:18:06.1839970Z * [new branch] gh/pearu/56/orig -> origin/gh/pearu/56/orig 2025-08-14T21:18:06.1841977Z * [new branch] gh/pearu/97/base -> origin/gh/pearu/97/base 2025-08-14T21:18:06.1842271Z * [new branch] gh/pearu/97/head -> origin/gh/pearu/97/head 2025-08-14T21:18:06.1842400Z * [new branch] gh/pearu/97/orig -> origin/gh/pearu/97/orig 2025-08-14T21:18:06.1843636Z * [new branch] gh/qqaatw/29/base -> origin/gh/qqaatw/29/base 2025-08-14T21:18:06.1843791Z * [new branch] gh/qqaatw/29/head -> origin/gh/qqaatw/29/head 2025-08-14T21:18:06.1844181Z * [new branch] gh/qqaatw/29/orig -> origin/gh/qqaatw/29/orig 2025-08-14T21:18:06.1846428Z * [new branch] gh/raymo/cleanup-dynamo-logging -> origin/gh/raymo/cleanup-dynamo-logging 2025-08-14T21:18:06.1846777Z * [new branch] gh/raymo/refresh-script -> origin/gh/raymo/refresh-script 2025-08-14T21:18:06.1846944Z * [new branch] gh/rec/141/base -> origin/gh/rec/141/base 2025-08-14T21:18:06.1847067Z * [new branch] gh/rec/141/head -> origin/gh/rec/141/head 2025-08-14T21:18:06.1850586Z * [new branch] gh/rec/153/base -> origin/gh/rec/153/base 2025-08-14T21:18:06.1850880Z * [new branch] gh/rec/153/head -> origin/gh/rec/153/head 2025-08-14T21:18:06.1851033Z * [new branch] gh/rec/153/orig -> origin/gh/rec/153/orig 2025-08-14T21:18:06.1851138Z * [new branch] gh/rec/154/base -> origin/gh/rec/154/base 2025-08-14T21:18:06.1851239Z * [new branch] gh/rec/154/head -> origin/gh/rec/154/head 2025-08-14T21:18:06.1851475Z * [new branch] gh/rec/154/orig -> origin/gh/rec/154/orig 2025-08-14T21:18:06.1852115Z * [new branch] gh/rec/156/base -> origin/gh/rec/156/base 2025-08-14T21:18:06.1854841Z * [new branch] gh/rec/156/head -> origin/gh/rec/156/head 2025-08-14T21:18:06.1855123Z * [new branch] gh/rec/156/orig -> origin/gh/rec/156/orig 2025-08-14T21:18:06.1855251Z * [new branch] gh/rec/158/base -> origin/gh/rec/158/base 2025-08-14T21:18:06.1855360Z * [new branch] gh/rec/158/head -> origin/gh/rec/158/head 2025-08-14T21:18:06.1855611Z * [new branch] gh/rec/158/orig -> origin/gh/rec/158/orig 2025-08-14T21:18:06.1856070Z * [new branch] gh/rec/159/base -> origin/gh/rec/159/base 2025-08-14T21:18:06.1857081Z * [new branch] gh/rec/159/head -> origin/gh/rec/159/head 2025-08-14T21:18:06.1857566Z * [new branch] gh/rec/160/base -> origin/gh/rec/160/base 2025-08-14T21:18:06.1858182Z * [new branch] gh/rec/160/head -> origin/gh/rec/160/head 2025-08-14T21:18:06.1859152Z * [new branch] gh/rec/160/orig -> origin/gh/rec/160/orig 2025-08-14T21:18:06.1859793Z * [new branch] gh/rec/161/base -> origin/gh/rec/161/base 2025-08-14T21:18:06.1860277Z * [new branch] gh/rec/161/head -> origin/gh/rec/161/head 2025-08-14T21:18:06.1861091Z * [new branch] gh/rec/161/orig -> origin/gh/rec/161/orig 2025-08-14T21:18:06.1861969Z * [new branch] gh/rec/162/base -> origin/gh/rec/162/base 2025-08-14T21:18:06.1862513Z * [new branch] gh/rec/162/head -> origin/gh/rec/162/head 2025-08-14T21:18:06.1863013Z * [new branch] gh/rec/162/orig -> origin/gh/rec/162/orig 2025-08-14T21:18:06.1864006Z * [new branch] gh/rec/163/base -> origin/gh/rec/163/base 2025-08-14T21:18:06.1864358Z * [new branch] gh/rec/163/head -> origin/gh/rec/163/head 2025-08-14T21:18:06.1865273Z * [new branch] gh/rec/163/orig -> origin/gh/rec/163/orig 2025-08-14T21:18:06.1866162Z * [new branch] gh/rec/164/base -> origin/gh/rec/164/base 2025-08-14T21:18:06.1866383Z * [new branch] gh/rec/164/head -> origin/gh/rec/164/head 2025-08-14T21:18:06.1867242Z * [new branch] gh/rec/164/orig -> origin/gh/rec/164/orig 2025-08-14T21:18:06.1868423Z * [new branch] gh/robert-hardwick/1/base -> origin/gh/robert-hardwick/1/base 2025-08-14T21:18:06.1868703Z * [new branch] gh/robert-hardwick/1/head -> origin/gh/robert-hardwick/1/head 2025-08-14T21:18:06.1871043Z * [new branch] gh/robert-hardwick/1/orig -> origin/gh/robert-hardwick/1/orig 2025-08-14T21:18:06.1871278Z * [new branch] gh/robert-hardwick/2/base -> origin/gh/robert-hardwick/2/base 2025-08-14T21:18:06.1871422Z * [new branch] gh/robert-hardwick/2/head -> origin/gh/robert-hardwick/2/head 2025-08-14T21:18:06.1871562Z * [new branch] gh/robert-hardwick/2/orig -> origin/gh/robert-hardwick/2/orig 2025-08-14T21:18:06.1873682Z * [new branch] gh/robert-hardwick/3/base -> origin/gh/robert-hardwick/3/base 2025-08-14T21:18:06.1873933Z * [new branch] gh/robert-hardwick/3/head -> origin/gh/robert-hardwick/3/head 2025-08-14T21:18:06.1874077Z * [new branch] gh/robert-hardwick/3/orig -> origin/gh/robert-hardwick/3/orig 2025-08-14T21:18:06.1874228Z * [new branch] gh/robert-hardwick/4/base -> origin/gh/robert-hardwick/4/base 2025-08-14T21:18:06.1876765Z * [new branch] gh/robert-hardwick/4/head -> origin/gh/robert-hardwick/4/head 2025-08-14T21:18:06.1877042Z * [new branch] gh/robert-hardwick/4/orig -> origin/gh/robert-hardwick/4/orig 2025-08-14T21:18:06.1877177Z * [new branch] gh/rtimpe/1/base -> origin/gh/rtimpe/1/base 2025-08-14T21:18:06.1877305Z * [new branch] gh/rtimpe/1/head -> origin/gh/rtimpe/1/head 2025-08-14T21:18:06.1879827Z * [new branch] gh/rtimpe/10/base -> origin/gh/rtimpe/10/base 2025-08-14T21:18:06.1879950Z * [new branch] gh/rtimpe/10/head -> origin/gh/rtimpe/10/head 2025-08-14T21:18:06.1880067Z * [new branch] gh/rtimpe/10/orig -> origin/gh/rtimpe/10/orig 2025-08-14T21:18:06.1880594Z * [new branch] gh/rtimpe/11/base -> origin/gh/rtimpe/11/base 2025-08-14T21:18:06.1882894Z * [new branch] gh/rtimpe/11/head -> origin/gh/rtimpe/11/head 2025-08-14T21:18:06.1883162Z * [new branch] gh/rtimpe/11/orig -> origin/gh/rtimpe/11/orig 2025-08-14T21:18:06.1883292Z * [new branch] gh/rtimpe/12/base -> origin/gh/rtimpe/12/base 2025-08-14T21:18:06.1883404Z * [new branch] gh/rtimpe/12/head -> origin/gh/rtimpe/12/head 2025-08-14T21:18:06.1886082Z * [new branch] gh/rtimpe/12/orig -> origin/gh/rtimpe/12/orig 2025-08-14T21:18:06.1886208Z * [new branch] gh/rtimpe/2/base -> origin/gh/rtimpe/2/base 2025-08-14T21:18:06.1886322Z * [new branch] gh/rtimpe/2/head -> origin/gh/rtimpe/2/head 2025-08-14T21:18:06.1886439Z * [new branch] gh/rtimpe/3/base -> origin/gh/rtimpe/3/base 2025-08-14T21:18:06.1890619Z * [new branch] gh/rtimpe/3/head -> origin/gh/rtimpe/3/head 2025-08-14T21:18:06.1894665Z * [new branch] gh/rtimpe/4/base -> origin/gh/rtimpe/4/base 2025-08-14T21:18:06.1894931Z * [new branch] gh/rtimpe/4/head -> origin/gh/rtimpe/4/head 2025-08-14T21:18:06.1895161Z * [new branch] gh/rtimpe/5/base -> origin/gh/rtimpe/5/base 2025-08-14T21:18:06.1895280Z * [new branch] gh/rtimpe/5/head -> origin/gh/rtimpe/5/head 2025-08-14T21:18:06.1895546Z * [new branch] gh/rtimpe/5/orig -> origin/gh/rtimpe/5/orig 2025-08-14T21:18:06.1895982Z * [new branch] gh/rtimpe/6/base -> origin/gh/rtimpe/6/base 2025-08-14T21:18:06.1896127Z * [new branch] gh/rtimpe/6/head -> origin/gh/rtimpe/6/head 2025-08-14T21:18:06.1896251Z * [new branch] gh/rtimpe/6/orig -> origin/gh/rtimpe/6/orig 2025-08-14T21:18:06.1896365Z * [new branch] gh/rtimpe/7/base -> origin/gh/rtimpe/7/base 2025-08-14T21:18:06.1896692Z * [new branch] gh/rtimpe/7/head -> origin/gh/rtimpe/7/head 2025-08-14T21:18:06.1896816Z * [new branch] gh/rtimpe/7/orig -> origin/gh/rtimpe/7/orig 2025-08-14T21:18:06.1896930Z * [new branch] gh/rtimpe/8/base -> origin/gh/rtimpe/8/base 2025-08-14T21:18:06.1897054Z * [new branch] gh/rtimpe/8/head -> origin/gh/rtimpe/8/head 2025-08-14T21:18:06.1897170Z * [new branch] gh/rtimpe/8/orig -> origin/gh/rtimpe/8/orig 2025-08-14T21:18:06.1897884Z * [new branch] gh/rtimpe/9/base -> origin/gh/rtimpe/9/base 2025-08-14T21:18:06.1898324Z * [new branch] gh/rtimpe/9/head -> origin/gh/rtimpe/9/head 2025-08-14T21:18:06.1899474Z * [new branch] gh/rtimpe/9/orig -> origin/gh/rtimpe/9/orig 2025-08-14T21:18:06.1900657Z * [new branch] gh/ruisizhang123/1/base -> origin/gh/ruisizhang123/1/base 2025-08-14T21:18:06.1901005Z * [new branch] gh/ruisizhang123/1/head -> origin/gh/ruisizhang123/1/head 2025-08-14T21:18:06.1901902Z * [new branch] gh/ruisizhang123/1/orig -> origin/gh/ruisizhang123/1/orig 2025-08-14T21:18:06.1902799Z * [new branch] gh/ruisizhang123/4/base -> origin/gh/ruisizhang123/4/base 2025-08-14T21:18:06.1903286Z * [new branch] gh/ruisizhang123/4/head -> origin/gh/ruisizhang123/4/head 2025-08-14T21:18:06.1904072Z * [new branch] gh/ruisizhang123/4/orig -> origin/gh/ruisizhang123/4/orig 2025-08-14T21:18:06.1905315Z * [new branch] gh/ruisizhang123/5/base -> origin/gh/ruisizhang123/5/base 2025-08-14T21:18:06.1905451Z * [new branch] gh/ruisizhang123/5/head -> origin/gh/ruisizhang123/5/head 2025-08-14T21:18:06.1907817Z * [new branch] gh/ruisizhang123/5/orig -> origin/gh/ruisizhang123/5/orig 2025-08-14T21:18:06.1908000Z * [new branch] gh/ruisizhang123/6/base -> origin/gh/ruisizhang123/6/base 2025-08-14T21:18:06.1908130Z * [new branch] gh/ruisizhang123/6/head -> origin/gh/ruisizhang123/6/head 2025-08-14T21:18:06.1908268Z * [new branch] gh/ruisizhang123/6/orig -> origin/gh/ruisizhang123/6/orig 2025-08-14T21:18:06.1909653Z * [new branch] gh/ruisizhang123/7/base -> origin/gh/ruisizhang123/7/base 2025-08-14T21:18:06.1914548Z * [new branch] gh/ruisizhang123/7/head -> origin/gh/ruisizhang123/7/head 2025-08-14T21:18:06.1916320Z * [new branch] gh/ruisizhang123/7/orig -> origin/gh/ruisizhang123/7/orig 2025-08-14T21:18:06.1916575Z * [new branch] gh/ruisizhang123/8/base -> origin/gh/ruisizhang123/8/base 2025-08-14T21:18:06.1920991Z * [new branch] gh/ruisizhang123/8/head -> origin/gh/ruisizhang123/8/head 2025-08-14T21:18:06.1923206Z * [new branch] gh/ruisizhang123/8/orig -> origin/gh/ruisizhang123/8/orig 2025-08-14T21:18:06.1923466Z * [new branch] gh/sarckk/2/base -> origin/gh/sarckk/2/base 2025-08-14T21:18:06.1926528Z * [new branch] gh/sarckk/2/head -> origin/gh/sarckk/2/head 2025-08-14T21:18:06.1926755Z * [new branch] gh/sarckk/2/orig -> origin/gh/sarckk/2/orig 2025-08-14T21:18:06.1931587Z * [new branch] gh/seemethere/23/head -> origin/gh/seemethere/23/head 2025-08-14T21:18:06.1935239Z * [new branch] gh/seemethere/24/base -> origin/gh/seemethere/24/base 2025-08-14T21:18:06.1938324Z * [new branch] gh/seemethere/24/head -> origin/gh/seemethere/24/head 2025-08-14T21:18:06.1938588Z * [new branch] gh/seemethere/24/orig -> origin/gh/seemethere/24/orig 2025-08-14T21:18:06.1938797Z * [new branch] gh/seemethere/30/base -> origin/gh/seemethere/30/base 2025-08-14T21:18:06.1938936Z * [new branch] gh/seemethere/30/head -> origin/gh/seemethere/30/head 2025-08-14T21:18:06.1939270Z * [new branch] gh/seemethere/30/orig -> origin/gh/seemethere/30/orig 2025-08-14T21:18:06.1939409Z * [new branch] gh/seemethere/32/base -> origin/gh/seemethere/32/base 2025-08-14T21:18:06.1939697Z * [new branch] gh/seemethere/32/head -> origin/gh/seemethere/32/head 2025-08-14T21:18:06.1939823Z * [new branch] gh/seemethere/32/orig -> origin/gh/seemethere/32/orig 2025-08-14T21:18:06.1939971Z * [new branch] gh/seemethere/33/base -> origin/gh/seemethere/33/base 2025-08-14T21:18:06.1940099Z * [new branch] gh/seemethere/33/head -> origin/gh/seemethere/33/head 2025-08-14T21:18:06.1940218Z * [new branch] gh/seemethere/33/orig -> origin/gh/seemethere/33/orig 2025-08-14T21:18:06.1940339Z * [new branch] gh/seemethere/34/base -> origin/gh/seemethere/34/base 2025-08-14T21:18:06.1940461Z * [new branch] gh/seemethere/34/head -> origin/gh/seemethere/34/head 2025-08-14T21:18:06.1940579Z * [new branch] gh/seemethere/34/orig -> origin/gh/seemethere/34/orig 2025-08-14T21:18:06.1940699Z * [new branch] gh/seemethere/35/base -> origin/gh/seemethere/35/base 2025-08-14T21:18:06.1940817Z * [new branch] gh/seemethere/35/head -> origin/gh/seemethere/35/head 2025-08-14T21:18:06.1940942Z * [new branch] gh/seemethere/35/orig -> origin/gh/seemethere/35/orig 2025-08-14T21:18:06.1941059Z * [new branch] gh/seemethere/37/base -> origin/gh/seemethere/37/base 2025-08-14T21:18:06.1941177Z * [new branch] gh/seemethere/37/head -> origin/gh/seemethere/37/head 2025-08-14T21:18:06.1941296Z * [new branch] gh/seemethere/37/orig -> origin/gh/seemethere/37/orig 2025-08-14T21:18:06.1941413Z * [new branch] gh/seemethere/39/base -> origin/gh/seemethere/39/base 2025-08-14T21:18:06.1941532Z * [new branch] gh/seemethere/39/head -> origin/gh/seemethere/39/head 2025-08-14T21:18:06.1941653Z * [new branch] gh/seemethere/39/orig -> origin/gh/seemethere/39/orig 2025-08-14T21:18:06.1941771Z * [new branch] gh/seemethere/40/base -> origin/gh/seemethere/40/base 2025-08-14T21:18:06.1941893Z * [new branch] gh/seemethere/40/head -> origin/gh/seemethere/40/head 2025-08-14T21:18:06.1942053Z * [new branch] gh/seemethere/40/orig -> origin/gh/seemethere/40/orig 2025-08-14T21:18:06.1942172Z * [new branch] gh/seemethere/41/base -> origin/gh/seemethere/41/base 2025-08-14T21:18:06.1942293Z * [new branch] gh/seemethere/41/head -> origin/gh/seemethere/41/head 2025-08-14T21:18:06.1942409Z * [new branch] gh/seemethere/41/orig -> origin/gh/seemethere/41/orig 2025-08-14T21:18:06.1942530Z * [new branch] gh/seemethere/42/base -> origin/gh/seemethere/42/base 2025-08-14T21:18:06.1942650Z * [new branch] gh/seemethere/42/head -> origin/gh/seemethere/42/head 2025-08-14T21:18:06.1942769Z * [new branch] gh/seemethere/42/orig -> origin/gh/seemethere/42/orig 2025-08-14T21:18:06.1942893Z * [new branch] gh/seemethere/43/base -> origin/gh/seemethere/43/base 2025-08-14T21:18:06.1943017Z * [new branch] gh/seemethere/43/head -> origin/gh/seemethere/43/head 2025-08-14T21:18:06.1943139Z * [new branch] gh/seemethere/43/orig -> origin/gh/seemethere/43/orig 2025-08-14T21:18:06.1943256Z * [new branch] gh/seemethere/44/base -> origin/gh/seemethere/44/base 2025-08-14T21:18:06.1943374Z * [new branch] gh/seemethere/44/head -> origin/gh/seemethere/44/head 2025-08-14T21:18:06.1943718Z * [new branch] gh/seemethere/44/orig -> origin/gh/seemethere/44/orig 2025-08-14T21:18:06.1945204Z * [new branch] gh/seemethere/45/base -> origin/gh/seemethere/45/base 2025-08-14T21:18:06.1945471Z * [new branch] gh/seemethere/45/head -> origin/gh/seemethere/45/head 2025-08-14T21:18:06.1945610Z * [new branch] gh/seemethere/45/orig -> origin/gh/seemethere/45/orig 2025-08-14T21:18:06.1947286Z * [new branch] gh/seemethere/46/base -> origin/gh/seemethere/46/base 2025-08-14T21:18:06.1947614Z * [new branch] gh/seemethere/46/head -> origin/gh/seemethere/46/head 2025-08-14T21:18:06.1947765Z * [new branch] gh/seemethere/46/orig -> origin/gh/seemethere/46/orig 2025-08-14T21:18:06.1949283Z * [new branch] gh/seemethere/47/base -> origin/gh/seemethere/47/base 2025-08-14T21:18:06.1949580Z * [new branch] gh/seemethere/47/head -> origin/gh/seemethere/47/head 2025-08-14T21:18:06.1949724Z * [new branch] gh/seemethere/47/orig -> origin/gh/seemethere/47/orig 2025-08-14T21:18:06.1951476Z * [new branch] gh/seemethere/48/base -> origin/gh/seemethere/48/base 2025-08-14T21:18:06.1951782Z * [new branch] gh/seemethere/48/head -> origin/gh/seemethere/48/head 2025-08-14T21:18:06.1951931Z * [new branch] gh/seemethere/48/orig -> origin/gh/seemethere/48/orig 2025-08-14T21:18:06.1953492Z * [new branch] gh/seemethere/49/base -> origin/gh/seemethere/49/base 2025-08-14T21:18:06.1953818Z * [new branch] gh/seemethere/49/head -> origin/gh/seemethere/49/head 2025-08-14T21:18:06.1953988Z * [new branch] gh/seemethere/49/orig -> origin/gh/seemethere/49/orig 2025-08-14T21:18:06.1955356Z * [new branch] gh/seemethere/50/base -> origin/gh/seemethere/50/base 2025-08-14T21:18:06.1955660Z * [new branch] gh/seemethere/50/head -> origin/gh/seemethere/50/head 2025-08-14T21:18:06.1956001Z * [new branch] gh/seemethere/50/orig -> origin/gh/seemethere/50/orig 2025-08-14T21:18:06.1957499Z * [new branch] gh/seemethere/51/base -> origin/gh/seemethere/51/base 2025-08-14T21:18:06.1957807Z * [new branch] gh/seemethere/51/head -> origin/gh/seemethere/51/head 2025-08-14T21:18:06.1958165Z * [new branch] gh/seemethere/51/orig -> origin/gh/seemethere/51/orig 2025-08-14T21:18:06.1959551Z * [new branch] gh/seemethere/52/base -> origin/gh/seemethere/52/base 2025-08-14T21:18:06.1959855Z * [new branch] gh/seemethere/52/head -> origin/gh/seemethere/52/head 2025-08-14T21:18:06.1961261Z * [new branch] gh/seemethere/52/orig -> origin/gh/seemethere/52/orig 2025-08-14T21:18:06.1961715Z * [new branch] gh/seemethere/53/base -> origin/gh/seemethere/53/base 2025-08-14T21:18:06.1963326Z * [new branch] gh/seemethere/53/head -> origin/gh/seemethere/53/head 2025-08-14T21:18:06.1963665Z * [new branch] gh/seemethere/53/orig -> origin/gh/seemethere/53/orig 2025-08-14T21:18:06.1963831Z * [new branch] gh/seemethere/54/base -> origin/gh/seemethere/54/base 2025-08-14T21:18:06.1965155Z * [new branch] gh/seemethere/54/head -> origin/gh/seemethere/54/head 2025-08-14T21:18:06.1965459Z * [new branch] gh/seemethere/54/orig -> origin/gh/seemethere/54/orig 2025-08-14T21:18:06.1965819Z * [new branch] gh/seemethere/55/base -> origin/gh/seemethere/55/base 2025-08-14T21:18:06.1966963Z * [new branch] gh/seemethere/55/head -> origin/gh/seemethere/55/head 2025-08-14T21:18:06.1967120Z * [new branch] gh/seemethere/55/orig -> origin/gh/seemethere/55/orig 2025-08-14T21:18:06.1970964Z * [new branch] gh/seemethere/56/base -> origin/gh/seemethere/56/base 2025-08-14T21:18:06.1971119Z * [new branch] gh/seemethere/56/head -> origin/gh/seemethere/56/head 2025-08-14T21:18:06.1971391Z * [new branch] gh/seemethere/56/orig -> origin/gh/seemethere/56/orig 2025-08-14T21:18:06.1971529Z * [new branch] gh/seemethere/57/base -> origin/gh/seemethere/57/base 2025-08-14T21:18:06.1971660Z * [new branch] gh/seemethere/57/head -> origin/gh/seemethere/57/head 2025-08-14T21:18:06.1971827Z * [new branch] gh/seemethere/57/orig -> origin/gh/seemethere/57/orig 2025-08-14T21:18:06.1973255Z * [new branch] gh/seemethere/58/base -> origin/gh/seemethere/58/base 2025-08-14T21:18:06.1973405Z * [new branch] gh/seemethere/58/head -> origin/gh/seemethere/58/head 2025-08-14T21:18:06.1973840Z * [new branch] gh/seemethere/58/orig -> origin/gh/seemethere/58/orig 2025-08-14T21:18:06.1976604Z * [new branch] gh/seemethere/59/base -> origin/gh/seemethere/59/base 2025-08-14T21:18:06.1976761Z * [new branch] gh/seemethere/59/head -> origin/gh/seemethere/59/head 2025-08-14T21:18:06.1976909Z * [new branch] gh/seemethere/59/orig -> origin/gh/seemethere/59/orig 2025-08-14T21:18:06.1977049Z * [new branch] gh/seemethere/7/head -> origin/gh/seemethere/7/head 2025-08-14T21:18:06.1978281Z * [new branch] gh/shunting314/145/base -> origin/gh/shunting314/145/base 2025-08-14T21:18:06.1978661Z * [new branch] gh/shunting314/145/head -> origin/gh/shunting314/145/head 2025-08-14T21:18:06.1979618Z * [new branch] gh/shunting314/145/orig -> origin/gh/shunting314/145/orig 2025-08-14T21:18:06.1980655Z * [new branch] gh/shunting314/176/base -> origin/gh/shunting314/176/base 2025-08-14T21:18:06.1981160Z * [new branch] gh/shunting314/176/head -> origin/gh/shunting314/176/head 2025-08-14T21:18:06.1982008Z * [new branch] gh/shunting314/176/orig -> origin/gh/shunting314/176/orig 2025-08-14T21:18:06.1982922Z * [new branch] gh/shunting314/211/base -> origin/gh/shunting314/211/base 2025-08-14T21:18:06.1983218Z * [new branch] gh/shunting314/211/head -> origin/gh/shunting314/211/head 2025-08-14T21:18:06.1984129Z * [new branch] gh/shunting314/211/orig -> origin/gh/shunting314/211/orig 2025-08-14T21:18:06.1984841Z * [new branch] gh/shunting314/212/base -> origin/gh/shunting314/212/base 2025-08-14T21:18:06.1987651Z * [new branch] gh/shunting314/212/head -> origin/gh/shunting314/212/head 2025-08-14T21:18:06.1987887Z * [new branch] gh/shunting314/212/orig -> origin/gh/shunting314/212/orig 2025-08-14T21:18:06.1989743Z * [new branch] gh/shunting314/213/base -> origin/gh/shunting314/213/base 2025-08-14T21:18:06.1993667Z * [new branch] gh/shunting314/213/head -> origin/gh/shunting314/213/head 2025-08-14T21:18:06.1997717Z * [new branch] gh/shunting314/213/orig -> origin/gh/shunting314/213/orig 2025-08-14T21:18:06.2001881Z * [new branch] gh/silverguo/1/base -> origin/gh/silverguo/1/base 2025-08-14T21:18:06.2006003Z * [new branch] gh/silverguo/1/head -> origin/gh/silverguo/1/head 2025-08-14T21:18:06.2010045Z * [new branch] gh/silverguo/2/base -> origin/gh/silverguo/2/base 2025-08-14T21:18:06.2014299Z * [new branch] gh/silverguo/2/head -> origin/gh/silverguo/2/head 2025-08-14T21:18:06.2018291Z * [new branch] gh/silverguo/3/base -> origin/gh/silverguo/3/base 2025-08-14T21:18:06.2018564Z * [new branch] gh/silverguo/3/head -> origin/gh/silverguo/3/head 2025-08-14T21:18:06.2018873Z * [new branch] gh/silverguo/4/base -> origin/gh/silverguo/4/base 2025-08-14T21:18:06.2019006Z * [new branch] gh/silverguo/4/head -> origin/gh/silverguo/4/head 2025-08-14T21:18:06.2019141Z * [new branch] gh/sinhaanhsul/1/base -> origin/gh/sinhaanhsul/1/base 2025-08-14T21:18:06.2019452Z * [new branch] gh/sinhaanhsul/1/head -> origin/gh/sinhaanhsul/1/head 2025-08-14T21:18:06.2019583Z * [new branch] gh/skarjala/11/base -> origin/gh/skarjala/11/base 2025-08-14T21:18:06.2019699Z * [new branch] gh/skarjala/11/head -> origin/gh/skarjala/11/head 2025-08-14T21:18:06.2019833Z * [new branch] gh/skarjala/11/orig -> origin/gh/skarjala/11/orig 2025-08-14T21:18:06.2019946Z * [new branch] gh/skarjala/13/base -> origin/gh/skarjala/13/base 2025-08-14T21:18:06.2020062Z * [new branch] gh/skarjala/13/head -> origin/gh/skarjala/13/head 2025-08-14T21:18:06.2020175Z * [new branch] gh/skarjala/13/orig -> origin/gh/skarjala/13/orig 2025-08-14T21:18:06.2020288Z * [new branch] gh/skarjala/14/base -> origin/gh/skarjala/14/base 2025-08-14T21:18:06.2020408Z * [new branch] gh/skarjala/14/head -> origin/gh/skarjala/14/head 2025-08-14T21:18:06.2020520Z * [new branch] gh/skarjala/14/orig -> origin/gh/skarjala/14/orig 2025-08-14T21:18:06.2020631Z * [new branch] gh/skarjala/15/base -> origin/gh/skarjala/15/base 2025-08-14T21:18:06.2020749Z * [new branch] gh/skarjala/15/head -> origin/gh/skarjala/15/head 2025-08-14T21:18:06.2020869Z * [new branch] gh/skarjala/15/orig -> origin/gh/skarjala/15/orig 2025-08-14T21:18:06.2020987Z * [new branch] gh/skarjala/16/base -> origin/gh/skarjala/16/base 2025-08-14T21:18:06.2021103Z * [new branch] gh/skarjala/16/head -> origin/gh/skarjala/16/head 2025-08-14T21:18:06.2021233Z * [new branch] gh/skarjala/16/orig -> origin/gh/skarjala/16/orig 2025-08-14T21:18:06.2021347Z * [new branch] gh/skarjala/17/base -> origin/gh/skarjala/17/base 2025-08-14T21:18:06.2021462Z * [new branch] gh/skarjala/17/head -> origin/gh/skarjala/17/head 2025-08-14T21:18:06.2021578Z * [new branch] gh/skarjala/17/orig -> origin/gh/skarjala/17/orig 2025-08-14T21:18:06.2021694Z * [new branch] gh/skarjala/18/base -> origin/gh/skarjala/18/base 2025-08-14T21:18:06.2021812Z * [new branch] gh/skarjala/18/head -> origin/gh/skarjala/18/head 2025-08-14T21:18:06.2021993Z * [new branch] gh/skarjala/18/orig -> origin/gh/skarjala/18/orig 2025-08-14T21:18:06.2022113Z * [new branch] gh/skarjala/19/base -> origin/gh/skarjala/19/base 2025-08-14T21:18:06.2022234Z * [new branch] gh/skarjala/19/head -> origin/gh/skarjala/19/head 2025-08-14T21:18:06.2022353Z * [new branch] gh/skarjala/19/orig -> origin/gh/skarjala/19/orig 2025-08-14T21:18:06.2022485Z * [new branch] gh/soulitzer/269/base -> origin/gh/soulitzer/269/base 2025-08-14T21:18:06.2022619Z * [new branch] gh/soulitzer/269/head -> origin/gh/soulitzer/269/head 2025-08-14T21:18:06.2022744Z * [new branch] gh/soulitzer/269/orig -> origin/gh/soulitzer/269/orig 2025-08-14T21:18:06.2022870Z * [new branch] gh/soulitzer/276/base -> origin/gh/soulitzer/276/base 2025-08-14T21:18:06.2022991Z * [new branch] gh/soulitzer/276/head -> origin/gh/soulitzer/276/head 2025-08-14T21:18:06.2023114Z * [new branch] gh/soulitzer/276/orig -> origin/gh/soulitzer/276/orig 2025-08-14T21:18:06.2023242Z * [new branch] gh/soulitzer/287/base -> origin/gh/soulitzer/287/base 2025-08-14T21:18:06.2023371Z * [new branch] gh/soulitzer/287/head -> origin/gh/soulitzer/287/head 2025-08-14T21:18:06.2023500Z * [new branch] gh/soulitzer/287/orig -> origin/gh/soulitzer/287/orig 2025-08-14T21:18:06.2023626Z * [new branch] gh/soulitzer/296/base -> origin/gh/soulitzer/296/base 2025-08-14T21:18:06.2023798Z * [new branch] gh/soulitzer/296/head -> origin/gh/soulitzer/296/head 2025-08-14T21:18:06.2024493Z * [new branch] gh/soulitzer/296/orig -> origin/gh/soulitzer/296/orig 2025-08-14T21:18:06.2027929Z * [new branch] gh/soulitzer/299/base -> origin/gh/soulitzer/299/base 2025-08-14T21:18:06.2028238Z * [new branch] gh/soulitzer/299/head -> origin/gh/soulitzer/299/head 2025-08-14T21:18:06.2028414Z * [new branch] gh/soulitzer/299/orig -> origin/gh/soulitzer/299/orig 2025-08-14T21:18:06.2028594Z * [new branch] gh/soulitzer/300/base -> origin/gh/soulitzer/300/base 2025-08-14T21:18:06.2028730Z * [new branch] gh/soulitzer/300/head -> origin/gh/soulitzer/300/head 2025-08-14T21:18:06.2029809Z * [new branch] gh/soulitzer/300/orig -> origin/gh/soulitzer/300/orig 2025-08-14T21:18:06.2030317Z * [new branch] gh/soulitzer/301/base -> origin/gh/soulitzer/301/base 2025-08-14T21:18:06.2033231Z * [new branch] gh/soulitzer/301/head -> origin/gh/soulitzer/301/head 2025-08-14T21:18:06.2033546Z * [new branch] gh/soulitzer/301/orig -> origin/gh/soulitzer/301/orig 2025-08-14T21:18:06.2033758Z * [new branch] gh/soulitzer/313/base -> origin/gh/soulitzer/313/base 2025-08-14T21:18:06.2033978Z * [new branch] gh/soulitzer/313/head -> origin/gh/soulitzer/313/head 2025-08-14T21:18:06.2034122Z * [new branch] gh/soulitzer/313/orig -> origin/gh/soulitzer/313/orig 2025-08-14T21:18:06.2034338Z * [new branch] gh/soulitzer/319/base -> origin/gh/soulitzer/319/base 2025-08-14T21:18:06.2035647Z * [new branch] gh/soulitzer/319/head -> origin/gh/soulitzer/319/head 2025-08-14T21:18:06.2035876Z * [new branch] gh/soulitzer/319/orig -> origin/gh/soulitzer/319/orig 2025-08-14T21:18:06.2037565Z * [new branch] gh/soulitzer/320/base -> origin/gh/soulitzer/320/base 2025-08-14T21:18:06.2037883Z * [new branch] gh/soulitzer/320/head -> origin/gh/soulitzer/320/head 2025-08-14T21:18:06.2038054Z * [new branch] gh/soulitzer/320/orig -> origin/gh/soulitzer/320/orig 2025-08-14T21:18:06.2039368Z * [new branch] gh/soulitzer/336/base -> origin/gh/soulitzer/336/base 2025-08-14T21:18:06.2039815Z * [new branch] gh/soulitzer/336/head -> origin/gh/soulitzer/336/head 2025-08-14T21:18:06.2039962Z * [new branch] gh/soulitzer/336/orig -> origin/gh/soulitzer/336/orig 2025-08-14T21:18:06.2044120Z * [new branch] gh/soulitzer/347/base -> origin/gh/soulitzer/347/base 2025-08-14T21:18:06.2044428Z * [new branch] gh/soulitzer/347/head -> origin/gh/soulitzer/347/head 2025-08-14T21:18:06.2044590Z * [new branch] gh/soulitzer/347/orig -> origin/gh/soulitzer/347/orig 2025-08-14T21:18:06.2044762Z * [new branch] gh/soulitzer/349/base -> origin/gh/soulitzer/349/base 2025-08-14T21:18:06.2044917Z * [new branch] gh/soulitzer/349/head -> origin/gh/soulitzer/349/head 2025-08-14T21:18:06.2045492Z * [new branch] gh/soulitzer/349/orig -> origin/gh/soulitzer/349/orig 2025-08-14T21:18:06.2045656Z * [new branch] gh/soulitzer/350/base -> origin/gh/soulitzer/350/base 2025-08-14T21:18:06.2045953Z * [new branch] gh/soulitzer/350/head -> origin/gh/soulitzer/350/head 2025-08-14T21:18:06.2047206Z * [new branch] gh/soulitzer/350/orig -> origin/gh/soulitzer/350/orig 2025-08-14T21:18:06.2047692Z * [new branch] gh/soulitzer/351/base -> origin/gh/soulitzer/351/base 2025-08-14T21:18:06.2049299Z * [new branch] gh/soulitzer/351/head -> origin/gh/soulitzer/351/head 2025-08-14T21:18:06.2049734Z * [new branch] gh/soulitzer/351/orig -> origin/gh/soulitzer/351/orig 2025-08-14T21:18:06.2049997Z * [new branch] gh/soulitzer/353/base -> origin/gh/soulitzer/353/base 2025-08-14T21:18:06.2050374Z * [new branch] gh/soulitzer/353/head -> origin/gh/soulitzer/353/head 2025-08-14T21:18:06.2051674Z * [new branch] gh/soulitzer/353/orig -> origin/gh/soulitzer/353/orig 2025-08-14T21:18:06.2053643Z * [new branch] gh/soulitzer/358/base -> origin/gh/soulitzer/358/base 2025-08-14T21:18:06.2053954Z * [new branch] gh/soulitzer/358/head -> origin/gh/soulitzer/358/head 2025-08-14T21:18:06.2054099Z * [new branch] gh/soulitzer/358/orig -> origin/gh/soulitzer/358/orig 2025-08-14T21:18:06.2057367Z * [new branch] gh/soulitzer/359/base -> origin/gh/soulitzer/359/base 2025-08-14T21:18:06.2057663Z * [new branch] gh/soulitzer/359/head -> origin/gh/soulitzer/359/head 2025-08-14T21:18:06.2057835Z * [new branch] gh/soulitzer/359/orig -> origin/gh/soulitzer/359/orig 2025-08-14T21:18:06.2057964Z * [new branch] gh/soulitzer/362/base -> origin/gh/soulitzer/362/base 2025-08-14T21:18:06.2058207Z * [new branch] gh/soulitzer/362/head -> origin/gh/soulitzer/362/head 2025-08-14T21:18:06.2058401Z * [new branch] gh/soulitzer/362/orig -> origin/gh/soulitzer/362/orig 2025-08-14T21:18:06.2059625Z * [new branch] gh/soulitzer/372/base -> origin/gh/soulitzer/372/base 2025-08-14T21:18:06.2060100Z * [new branch] gh/soulitzer/372/head -> origin/gh/soulitzer/372/head 2025-08-14T21:18:06.2060607Z * [new branch] gh/soulitzer/372/orig -> origin/gh/soulitzer/372/orig 2025-08-14T21:18:06.2061898Z * [new branch] gh/swolchok/728/next -> origin/gh/swolchok/728/next 2025-08-14T21:18:06.2062748Z * [new branch] gh/swolchok/758/base -> origin/gh/swolchok/758/base 2025-08-14T21:18:06.2062972Z * [new branch] gh/swolchok/758/head -> origin/gh/swolchok/758/head 2025-08-14T21:18:06.2063902Z * [new branch] gh/swolchok/758/orig -> origin/gh/swolchok/758/orig 2025-08-14T21:18:06.2069120Z * [new branch] gh/swolchok/767/base -> origin/gh/swolchok/767/base 2025-08-14T21:18:06.2073356Z * [new branch] gh/swolchok/767/head -> origin/gh/swolchok/767/head 2025-08-14T21:18:06.2077517Z * [new branch] gh/swolchok/767/orig -> origin/gh/swolchok/767/orig 2025-08-14T21:18:06.2081506Z * [new branch] gh/swolchok/768/base -> origin/gh/swolchok/768/base 2025-08-14T21:18:06.2085277Z * [new branch] gh/swolchok/768/head -> origin/gh/swolchok/768/head 2025-08-14T21:18:06.2089373Z * [new branch] gh/swolchok/768/orig -> origin/gh/swolchok/768/orig 2025-08-14T21:18:06.2092486Z * [new branch] gh/swolchok/769/base -> origin/gh/swolchok/769/base 2025-08-14T21:18:06.2092686Z * [new branch] gh/swolchok/769/head -> origin/gh/swolchok/769/head 2025-08-14T21:18:06.2092815Z * [new branch] gh/swolchok/769/orig -> origin/gh/swolchok/769/orig 2025-08-14T21:18:06.2092943Z * [new branch] gh/swolchok/771/base -> origin/gh/swolchok/771/base 2025-08-14T21:18:06.2093078Z * [new branch] gh/swolchok/771/head -> origin/gh/swolchok/771/head 2025-08-14T21:18:06.2093198Z * [new branch] gh/swolchok/771/orig -> origin/gh/swolchok/771/orig 2025-08-14T21:18:06.2093324Z * [new branch] gh/swolchok/772/base -> origin/gh/swolchok/772/base 2025-08-14T21:18:06.2093444Z * [new branch] gh/swolchok/772/head -> origin/gh/swolchok/772/head 2025-08-14T21:18:06.2093563Z * [new branch] gh/swolchok/772/orig -> origin/gh/swolchok/772/orig 2025-08-14T21:18:06.2093873Z * [new branch] gh/swolchok/773/base -> origin/gh/swolchok/773/base 2025-08-14T21:18:06.2094003Z * [new branch] gh/swolchok/773/head -> origin/gh/swolchok/773/head 2025-08-14T21:18:06.2094128Z * [new branch] gh/swolchok/773/orig -> origin/gh/swolchok/773/orig 2025-08-14T21:18:06.2094245Z * [new branch] gh/swolchok/786/base -> origin/gh/swolchok/786/base 2025-08-14T21:18:06.2094369Z * [new branch] gh/swolchok/786/head -> origin/gh/swolchok/786/head 2025-08-14T21:18:06.2094490Z * [new branch] gh/swolchok/786/orig -> origin/gh/swolchok/786/orig 2025-08-14T21:18:06.2094606Z * [new branch] gh/swolchok/787/base -> origin/gh/swolchok/787/base 2025-08-14T21:18:06.2094728Z * [new branch] gh/swolchok/787/head -> origin/gh/swolchok/787/head 2025-08-14T21:18:06.2094847Z * [new branch] gh/swolchok/787/orig -> origin/gh/swolchok/787/orig 2025-08-14T21:18:06.2094980Z * [new branch] gh/syed-ahmed/2/base -> origin/gh/syed-ahmed/2/base 2025-08-14T21:18:06.2095105Z * [new branch] gh/syed-ahmed/2/head -> origin/gh/syed-ahmed/2/head 2025-08-14T21:18:06.2095221Z * [new branch] gh/syed-ahmed/2/orig -> origin/gh/syed-ahmed/2/orig 2025-08-14T21:18:06.2095336Z * [new branch] gh/syed-ahmed/3/base -> origin/gh/syed-ahmed/3/base 2025-08-14T21:18:06.2095463Z * [new branch] gh/syed-ahmed/3/head -> origin/gh/syed-ahmed/3/head 2025-08-14T21:18:06.2095581Z * [new branch] gh/syed-ahmed/3/orig -> origin/gh/syed-ahmed/3/orig 2025-08-14T21:18:06.2095699Z * [new branch] gh/syed-ahmed/4/base -> origin/gh/syed-ahmed/4/base 2025-08-14T21:18:06.2095812Z * [new branch] gh/syed-ahmed/4/head -> origin/gh/syed-ahmed/4/head 2025-08-14T21:18:06.2095924Z * [new branch] gh/syed-ahmed/4/orig -> origin/gh/syed-ahmed/4/orig 2025-08-14T21:18:06.2096054Z * [new branch] gh/teja-rao/3/base -> origin/gh/teja-rao/3/base 2025-08-14T21:18:06.2096174Z * [new branch] gh/teja-rao/3/head -> origin/gh/teja-rao/3/head 2025-08-14T21:18:06.2096294Z * [new branch] gh/teja-rao/3/orig -> origin/gh/teja-rao/3/orig 2025-08-14T21:18:06.2096412Z * [new branch] gh/tianyu-l/2/base -> origin/gh/tianyu-l/2/base 2025-08-14T21:18:06.2096597Z * [new branch] gh/tianyu-l/2/head -> origin/gh/tianyu-l/2/head 2025-08-14T21:18:06.2096712Z * [new branch] gh/tianyu-l/2/orig -> origin/gh/tianyu-l/2/orig 2025-08-14T21:18:06.2096842Z * [new branch] gh/titaiwangms/1/base -> origin/gh/titaiwangms/1/base 2025-08-14T21:18:06.2096973Z * [new branch] gh/titaiwangms/1/head -> origin/gh/titaiwangms/1/head 2025-08-14T21:18:06.2097097Z * [new branch] gh/titaiwangms/1/orig -> origin/gh/titaiwangms/1/orig 2025-08-14T21:18:06.2098047Z * [new branch] gh/titaiwangms/2/base -> origin/gh/titaiwangms/2/base 2025-08-14T21:18:06.2098246Z * [new branch] gh/titaiwangms/2/head -> origin/gh/titaiwangms/2/head 2025-08-14T21:18:06.2098446Z * [new branch] gh/titaiwangms/2/orig -> origin/gh/titaiwangms/2/orig 2025-08-14T21:18:06.2098590Z * [new branch] gh/titaiwangms/3/base -> origin/gh/titaiwangms/3/base 2025-08-14T21:18:06.2098809Z * [new branch] gh/titaiwangms/3/head -> origin/gh/titaiwangms/3/head 2025-08-14T21:18:06.2099720Z * [new branch] gh/titaiwangms/3/orig -> origin/gh/titaiwangms/3/orig 2025-08-14T21:18:06.2101890Z * [new branch] gh/titaiwangms/4/base -> origin/gh/titaiwangms/4/base 2025-08-14T21:18:06.2102112Z * [new branch] gh/titaiwangms/4/head -> origin/gh/titaiwangms/4/head 2025-08-14T21:18:06.2102420Z * [new branch] gh/titaiwangms/4/orig -> origin/gh/titaiwangms/4/orig 2025-08-14T21:18:06.2102850Z * [new branch] gh/titaiwangms/5/base -> origin/gh/titaiwangms/5/base 2025-08-14T21:18:06.2103400Z * [new branch] gh/titaiwangms/5/head -> origin/gh/titaiwangms/5/head 2025-08-14T21:18:06.2104370Z * [new branch] gh/titaiwangms/5/orig -> origin/gh/titaiwangms/5/orig 2025-08-14T21:18:06.2104962Z * [new branch] gh/titaiwangms/6/base -> origin/gh/titaiwangms/6/base 2025-08-14T21:18:06.2105646Z * [new branch] gh/titaiwangms/6/head -> origin/gh/titaiwangms/6/head 2025-08-14T21:18:06.2106095Z * [new branch] gh/titaiwangms/6/orig -> origin/gh/titaiwangms/6/orig 2025-08-14T21:18:06.2109127Z * [new branch] gh/titaiwangms/7/base -> origin/gh/titaiwangms/7/base 2025-08-14T21:18:06.2109353Z * [new branch] gh/titaiwangms/7/head -> origin/gh/titaiwangms/7/head 2025-08-14T21:18:06.2109518Z * [new branch] gh/titaiwangms/7/orig -> origin/gh/titaiwangms/7/orig 2025-08-14T21:18:06.2109650Z * [new branch] gh/titaiwangms/8/base -> origin/gh/titaiwangms/8/base 2025-08-14T21:18:06.2109934Z * [new branch] gh/titaiwangms/8/head -> origin/gh/titaiwangms/8/head 2025-08-14T21:18:06.2110077Z * [new branch] gh/titaiwangms/8/orig -> origin/gh/titaiwangms/8/orig 2025-08-14T21:18:06.2113543Z * [new branch] gh/tugsbayasgalan/1/base -> origin/gh/tugsbayasgalan/1/base 2025-08-14T21:18:06.2113860Z * [new branch] gh/tugsbayasgalan/1/head -> origin/gh/tugsbayasgalan/1/head 2025-08-14T21:18:06.2114073Z * [new branch] gh/tugsbayasgalan/1/orig -> origin/gh/tugsbayasgalan/1/orig 2025-08-14T21:18:06.2114274Z * [new branch] gh/v0i0/1/base -> origin/gh/v0i0/1/base 2025-08-14T21:18:06.2114916Z * [new branch] gh/v0i0/1/head -> origin/gh/v0i0/1/head 2025-08-14T21:18:06.2115067Z * [new branch] gh/v0i0/1/orig -> origin/gh/v0i0/1/orig 2025-08-14T21:18:06.2115569Z * [new branch] gh/v0i0/2/base -> origin/gh/v0i0/2/base 2025-08-14T21:18:06.2117807Z * [new branch] gh/v0i0/2/head -> origin/gh/v0i0/2/head 2025-08-14T21:18:06.2118090Z * [new branch] gh/v0i0/2/orig -> origin/gh/v0i0/2/orig 2025-08-14T21:18:06.2118498Z * [new branch] gh/v0i0/3/base -> origin/gh/v0i0/3/base 2025-08-14T21:18:06.2118622Z * [new branch] gh/v0i0/3/head -> origin/gh/v0i0/3/head 2025-08-14T21:18:06.2119087Z * [new branch] gh/v0i0/3/orig -> origin/gh/v0i0/3/orig 2025-08-14T21:18:06.2122426Z * [new branch] gh/v0i0/4/base -> origin/gh/v0i0/4/base 2025-08-14T21:18:06.2122712Z * [new branch] gh/v0i0/4/head -> origin/gh/v0i0/4/head 2025-08-14T21:18:06.2122856Z * [new branch] gh/v0i0/4/orig -> origin/gh/v0i0/4/orig 2025-08-14T21:18:06.2123036Z * [new branch] gh/v0i0/5/base -> origin/gh/v0i0/5/base 2025-08-14T21:18:06.2123350Z * [new branch] gh/v0i0/5/head -> origin/gh/v0i0/5/head 2025-08-14T21:18:06.2124005Z * [new branch] gh/v0i0/5/orig -> origin/gh/v0i0/5/orig 2025-08-14T21:18:06.2124187Z * [new branch] gh/v0i0/6/base -> origin/gh/v0i0/6/base 2025-08-14T21:18:06.2126915Z * [new branch] gh/v0i0/6/head -> origin/gh/v0i0/6/head 2025-08-14T21:18:06.2127211Z * [new branch] gh/v0i0/6/orig -> origin/gh/v0i0/6/orig 2025-08-14T21:18:06.2127369Z * [new branch] gh/vkuzo/1/next -> origin/gh/vkuzo/1/next 2025-08-14T21:18:06.2127563Z * [new branch] gh/vkuzo/2/next -> origin/gh/vkuzo/2/next 2025-08-14T21:18:06.2128217Z * [new branch] gh/vkuzo/3/next -> origin/gh/vkuzo/3/next 2025-08-14T21:18:06.2130316Z * [new branch] gh/wconstab/392/base -> origin/gh/wconstab/392/base 2025-08-14T21:18:06.2130630Z * [new branch] gh/wconstab/392/head -> origin/gh/wconstab/392/head 2025-08-14T21:18:06.2130792Z * [new branch] gh/wconstab/392/orig -> origin/gh/wconstab/392/orig 2025-08-14T21:18:06.2132103Z * [new branch] gh/wconstab/419/base -> origin/gh/wconstab/419/base 2025-08-14T21:18:06.2132353Z * [new branch] gh/wconstab/419/head -> origin/gh/wconstab/419/head 2025-08-14T21:18:06.2132651Z * [new branch] gh/wconstab/419/orig -> origin/gh/wconstab/419/orig 2025-08-14T21:18:06.2134197Z * [new branch] gh/wconstab/424/base -> origin/gh/wconstab/424/base 2025-08-14T21:18:06.2134500Z * [new branch] gh/wconstab/424/head -> origin/gh/wconstab/424/head 2025-08-14T21:18:06.2134652Z * [new branch] gh/wconstab/424/orig -> origin/gh/wconstab/424/orig 2025-08-14T21:18:06.2138094Z * [new branch] gh/wconstab/425/base -> origin/gh/wconstab/425/base 2025-08-14T21:18:06.2138400Z * [new branch] gh/wconstab/425/head -> origin/gh/wconstab/425/head 2025-08-14T21:18:06.2138604Z * [new branch] gh/wconstab/425/orig -> origin/gh/wconstab/425/orig 2025-08-14T21:18:06.2138820Z * [new branch] gh/wconstab/426/base -> origin/gh/wconstab/426/base 2025-08-14T21:18:06.2139520Z * [new branch] gh/wconstab/426/head -> origin/gh/wconstab/426/head 2025-08-14T21:18:06.2139672Z * [new branch] gh/wconstab/426/orig -> origin/gh/wconstab/426/orig 2025-08-14T21:18:06.2139803Z * [new branch] gh/wconstab/427/base -> origin/gh/wconstab/427/base 2025-08-14T21:18:06.2140685Z * [new branch] gh/wconstab/427/head -> origin/gh/wconstab/427/head 2025-08-14T21:18:06.2141031Z * [new branch] gh/wconstab/427/orig -> origin/gh/wconstab/427/orig 2025-08-14T21:18:06.2142254Z * [new branch] gh/wconstab/428/base -> origin/gh/wconstab/428/base 2025-08-14T21:18:06.2142492Z * [new branch] gh/wconstab/428/head -> origin/gh/wconstab/428/head 2025-08-14T21:18:06.2143840Z * [new branch] gh/wconstab/428/orig -> origin/gh/wconstab/428/orig 2025-08-14T21:18:06.2147445Z * [new branch] gh/wconstab/429/base -> origin/gh/wconstab/429/base 2025-08-14T21:18:06.2147609Z * [new branch] gh/wconstab/429/head -> origin/gh/wconstab/429/head 2025-08-14T21:18:06.2147750Z * [new branch] gh/wconstab/429/orig -> origin/gh/wconstab/429/orig 2025-08-14T21:18:06.2147870Z * [new branch] gh/wconstab/430/base -> origin/gh/wconstab/430/base 2025-08-14T21:18:06.2153223Z * [new branch] gh/wconstab/430/head -> origin/gh/wconstab/430/head 2025-08-14T21:18:06.2156840Z * [new branch] gh/wconstab/430/orig -> origin/gh/wconstab/430/orig 2025-08-14T21:18:06.2160236Z * [new branch] gh/wconstab/431/base -> origin/gh/wconstab/431/base 2025-08-14T21:18:06.2163398Z * [new branch] gh/wconstab/431/head -> origin/gh/wconstab/431/head 2025-08-14T21:18:06.2166577Z * [new branch] gh/wconstab/431/orig -> origin/gh/wconstab/431/orig 2025-08-14T21:18:06.2169803Z * [new branch] gh/wconstab/432/base -> origin/gh/wconstab/432/base 2025-08-14T21:18:06.2169973Z * [new branch] gh/wconstab/432/head -> origin/gh/wconstab/432/head 2025-08-14T21:18:06.2170108Z * [new branch] gh/wconstab/432/orig -> origin/gh/wconstab/432/orig 2025-08-14T21:18:06.2170228Z * [new branch] gh/wconstab/433/base -> origin/gh/wconstab/433/base 2025-08-14T21:18:06.2170496Z * [new branch] gh/wconstab/433/head -> origin/gh/wconstab/433/head 2025-08-14T21:18:06.2170623Z * [new branch] gh/wconstab/433/orig -> origin/gh/wconstab/433/orig 2025-08-14T21:18:06.2170745Z * [new branch] gh/wconstab/434/base -> origin/gh/wconstab/434/base 2025-08-14T21:18:06.2170870Z * [new branch] gh/wconstab/434/head -> origin/gh/wconstab/434/head 2025-08-14T21:18:06.2170998Z * [new branch] gh/wconstab/434/orig -> origin/gh/wconstab/434/orig 2025-08-14T21:18:06.2171120Z * [new branch] gh/wconstab/435/base -> origin/gh/wconstab/435/base 2025-08-14T21:18:06.2171241Z * [new branch] gh/wconstab/435/head -> origin/gh/wconstab/435/head 2025-08-14T21:18:06.2171357Z * [new branch] gh/wconstab/435/orig -> origin/gh/wconstab/435/orig 2025-08-14T21:18:06.2171479Z * [new branch] gh/wconstab/436/base -> origin/gh/wconstab/436/base 2025-08-14T21:18:06.2171600Z * [new branch] gh/wconstab/436/head -> origin/gh/wconstab/436/head 2025-08-14T21:18:06.2171718Z * [new branch] gh/wconstab/436/orig -> origin/gh/wconstab/436/orig 2025-08-14T21:18:06.2171839Z * [new branch] gh/wconstab/437/base -> origin/gh/wconstab/437/base 2025-08-14T21:18:06.2171958Z * [new branch] gh/wconstab/437/head -> origin/gh/wconstab/437/head 2025-08-14T21:18:06.2172086Z * [new branch] gh/wconstab/437/orig -> origin/gh/wconstab/437/orig 2025-08-14T21:18:06.2172206Z * [new branch] gh/wconstab/438/base -> origin/gh/wconstab/438/base 2025-08-14T21:18:06.2172324Z * [new branch] gh/wconstab/438/head -> origin/gh/wconstab/438/head 2025-08-14T21:18:06.2172447Z * [new branch] gh/wconstab/438/orig -> origin/gh/wconstab/438/orig 2025-08-14T21:18:06.2172565Z * [new branch] gh/wconstab/439/base -> origin/gh/wconstab/439/base 2025-08-14T21:18:06.2172688Z * [new branch] gh/wconstab/439/head -> origin/gh/wconstab/439/head 2025-08-14T21:18:06.2172804Z * [new branch] gh/wconstab/439/orig -> origin/gh/wconstab/439/orig 2025-08-14T21:18:06.2172919Z * [new branch] gh/wconstab/440/base -> origin/gh/wconstab/440/base 2025-08-14T21:18:06.2173037Z * [new branch] gh/wconstab/440/head -> origin/gh/wconstab/440/head 2025-08-14T21:18:06.2173218Z * [new branch] gh/wconstab/440/orig -> origin/gh/wconstab/440/orig 2025-08-14T21:18:06.2173337Z * [new branch] gh/wconstab/441/base -> origin/gh/wconstab/441/base 2025-08-14T21:18:06.2173459Z * [new branch] gh/wconstab/441/head -> origin/gh/wconstab/441/head 2025-08-14T21:18:06.2173579Z * [new branch] gh/wconstab/441/orig -> origin/gh/wconstab/441/orig 2025-08-14T21:18:06.2173704Z * [new branch] gh/wconstab/442/base -> origin/gh/wconstab/442/base 2025-08-14T21:18:06.2173824Z * [new branch] gh/wconstab/442/head -> origin/gh/wconstab/442/head 2025-08-14T21:18:06.2173940Z * [new branch] gh/wconstab/442/orig -> origin/gh/wconstab/442/orig 2025-08-14T21:18:06.2177566Z * [new branch] gh/weifengpy/27/base -> origin/gh/weifengpy/27/base 2025-08-14T21:18:06.2179714Z * [new branch] gh/weifengpy/27/head -> origin/gh/weifengpy/27/head 2025-08-14T21:18:06.2180009Z * [new branch] gh/weifengpy/27/orig -> origin/gh/weifengpy/27/orig 2025-08-14T21:18:06.2180156Z * [new branch] gh/weifengpy/30/base -> origin/gh/weifengpy/30/base 2025-08-14T21:18:06.2180355Z * [new branch] gh/weifengpy/30/head -> origin/gh/weifengpy/30/head 2025-08-14T21:18:06.2180497Z * [new branch] gh/weifengpy/30/orig -> origin/gh/weifengpy/30/orig 2025-08-14T21:18:06.2181326Z * [new branch] gh/weifengpy/31/base -> origin/gh/weifengpy/31/base 2025-08-14T21:18:06.2181510Z * [new branch] gh/weifengpy/31/head -> origin/gh/weifengpy/31/head 2025-08-14T21:18:06.2181639Z * [new branch] gh/weifengpy/31/orig -> origin/gh/weifengpy/31/orig 2025-08-14T21:18:06.2181768Z * [new branch] gh/weifengpy/32/base -> origin/gh/weifengpy/32/base 2025-08-14T21:18:06.2181894Z * [new branch] gh/weifengpy/32/head -> origin/gh/weifengpy/32/head 2025-08-14T21:18:06.2182014Z * [new branch] gh/weifengpy/32/orig -> origin/gh/weifengpy/32/orig 2025-08-14T21:18:06.2182131Z * [new branch] gh/weifengpy/33/base -> origin/gh/weifengpy/33/base 2025-08-14T21:18:06.2182255Z * [new branch] gh/weifengpy/33/head -> origin/gh/weifengpy/33/head 2025-08-14T21:18:06.2182478Z * [new branch] gh/weifengpy/33/orig -> origin/gh/weifengpy/33/orig 2025-08-14T21:18:06.2184240Z * [new branch] gh/williamwen42/196/base -> origin/gh/williamwen42/196/base 2025-08-14T21:18:06.2184728Z * [new branch] gh/williamwen42/196/head -> origin/gh/williamwen42/196/head 2025-08-14T21:18:06.2185549Z * [new branch] gh/williamwen42/196/orig -> origin/gh/williamwen42/196/orig 2025-08-14T21:18:06.2189089Z * [new branch] gh/williamwen42/209/base -> origin/gh/williamwen42/209/base 2025-08-14T21:18:06.2189258Z * [new branch] gh/williamwen42/209/head -> origin/gh/williamwen42/209/head 2025-08-14T21:18:06.2189388Z * [new branch] gh/williamwen42/209/orig -> origin/gh/williamwen42/209/orig 2025-08-14T21:18:06.2189520Z * [new branch] gh/williamwen42/250/base -> origin/gh/williamwen42/250/base 2025-08-14T21:18:06.2189646Z * [new branch] gh/williamwen42/250/head -> origin/gh/williamwen42/250/head 2025-08-14T21:18:06.2193423Z * [new branch] gh/williamwen42/250/orig -> origin/gh/williamwen42/250/orig 2025-08-14T21:18:06.2197074Z * [new branch] gh/williamwen42/252/base -> origin/gh/williamwen42/252/base 2025-08-14T21:18:06.2198868Z * [new branch] gh/williamwen42/252/head -> origin/gh/williamwen42/252/head 2025-08-14T21:18:06.2199118Z * [new branch] gh/williamwen42/252/orig -> origin/gh/williamwen42/252/orig 2025-08-14T21:18:06.2203111Z * [new branch] gh/williamwen42/256/base -> origin/gh/williamwen42/256/base 2025-08-14T21:18:06.2207107Z * [new branch] gh/williamwen42/256/head -> origin/gh/williamwen42/256/head 2025-08-14T21:18:06.2208898Z * [new branch] gh/williamwen42/256/orig -> origin/gh/williamwen42/256/orig 2025-08-14T21:18:06.2209155Z * [new branch] gh/williamwen42/258/base -> origin/gh/williamwen42/258/base 2025-08-14T21:18:06.2213118Z * [new branch] gh/williamwen42/258/head -> origin/gh/williamwen42/258/head 2025-08-14T21:18:06.2213440Z * [new branch] gh/williamwen42/258/orig -> origin/gh/williamwen42/258/orig 2025-08-14T21:18:06.2213616Z * [new branch] gh/williamwen42/260/base -> origin/gh/williamwen42/260/base 2025-08-14T21:18:06.2213774Z * [new branch] gh/williamwen42/260/head -> origin/gh/williamwen42/260/head 2025-08-14T21:18:06.2213940Z * [new branch] gh/williamwen42/260/orig -> origin/gh/williamwen42/260/orig 2025-08-14T21:18:06.2214104Z * [new branch] gh/williamwen42/261/base -> origin/gh/williamwen42/261/base 2025-08-14T21:18:06.2214765Z * [new branch] gh/williamwen42/261/head -> origin/gh/williamwen42/261/head 2025-08-14T21:18:06.2214940Z * [new branch] gh/williamwen42/261/orig -> origin/gh/williamwen42/261/orig 2025-08-14T21:18:06.2215077Z * [new branch] gh/williamwen42/262/base -> origin/gh/williamwen42/262/base 2025-08-14T21:18:06.2215392Z * [new branch] gh/williamwen42/262/head -> origin/gh/williamwen42/262/head 2025-08-14T21:18:06.2215532Z * [new branch] gh/williamwen42/262/orig -> origin/gh/williamwen42/262/orig 2025-08-14T21:18:06.2215663Z * [new branch] gh/williamwen42/263/base -> origin/gh/williamwen42/263/base 2025-08-14T21:18:06.2215792Z * [new branch] gh/williamwen42/263/head -> origin/gh/williamwen42/263/head 2025-08-14T21:18:06.2215929Z * [new branch] gh/williamwen42/263/orig -> origin/gh/williamwen42/263/orig 2025-08-14T21:18:06.2216058Z * [new branch] gh/williamwen42/264/base -> origin/gh/williamwen42/264/base 2025-08-14T21:18:06.2216183Z * [new branch] gh/williamwen42/264/head -> origin/gh/williamwen42/264/head 2025-08-14T21:18:06.2216314Z * [new branch] gh/williamwen42/264/orig -> origin/gh/williamwen42/264/orig 2025-08-14T21:18:06.2216439Z * [new branch] gh/williamwen42/265/base -> origin/gh/williamwen42/265/base 2025-08-14T21:18:06.2216573Z * [new branch] gh/williamwen42/265/head -> origin/gh/williamwen42/265/head 2025-08-14T21:18:06.2216700Z * [new branch] gh/williamwen42/265/orig -> origin/gh/williamwen42/265/orig 2025-08-14T21:18:06.2216824Z * [new branch] gh/williamwen42/266/base -> origin/gh/williamwen42/266/base 2025-08-14T21:18:06.2216955Z * [new branch] gh/williamwen42/266/head -> origin/gh/williamwen42/266/head 2025-08-14T21:18:06.2217085Z * [new branch] gh/williamwen42/266/orig -> origin/gh/williamwen42/266/orig 2025-08-14T21:18:06.2217218Z * [new branch] gh/williamwen42/267/base -> origin/gh/williamwen42/267/base 2025-08-14T21:18:06.2217356Z * [new branch] gh/williamwen42/267/head -> origin/gh/williamwen42/267/head 2025-08-14T21:18:06.2217481Z * [new branch] gh/williamwen42/267/orig -> origin/gh/williamwen42/267/orig 2025-08-14T21:18:06.2217615Z * [new branch] gh/williamwen42/268/base -> origin/gh/williamwen42/268/base 2025-08-14T21:18:06.2217745Z * [new branch] gh/williamwen42/268/head -> origin/gh/williamwen42/268/head 2025-08-14T21:18:06.2217871Z * [new branch] gh/williamwen42/268/orig -> origin/gh/williamwen42/268/orig 2025-08-14T21:18:06.2218012Z * [new branch] gh/williamwen42/269/base -> origin/gh/williamwen42/269/base 2025-08-14T21:18:06.2218198Z * [new branch] gh/williamwen42/269/head -> origin/gh/williamwen42/269/head 2025-08-14T21:18:06.2218334Z * [new branch] gh/williamwen42/269/orig -> origin/gh/williamwen42/269/orig 2025-08-14T21:18:06.2218459Z * [new branch] gh/williamwen42/270/base -> origin/gh/williamwen42/270/base 2025-08-14T21:18:06.2218599Z * [new branch] gh/williamwen42/270/head -> origin/gh/williamwen42/270/head 2025-08-14T21:18:06.2219545Z * [new branch] gh/williamwen42/270/orig -> origin/gh/williamwen42/270/orig 2025-08-14T21:18:06.2220419Z * [new branch] gh/williamwen42/271/base -> origin/gh/williamwen42/271/base 2025-08-14T21:18:06.2220926Z * [new branch] gh/williamwen42/271/head -> origin/gh/williamwen42/271/head 2025-08-14T21:18:06.2221444Z * [new branch] gh/williamwen42/271/orig -> origin/gh/williamwen42/271/orig 2025-08-14T21:18:06.2222567Z * [new branch] gh/williamwen42/272/base -> origin/gh/williamwen42/272/base 2025-08-14T21:18:06.2222838Z * [new branch] gh/williamwen42/272/head -> origin/gh/williamwen42/272/head 2025-08-14T21:18:06.2223743Z * [new branch] gh/williamwen42/272/orig -> origin/gh/williamwen42/272/orig 2025-08-14T21:18:06.2224617Z * [new branch] gh/williamwen42/273/base -> origin/gh/williamwen42/273/base 2025-08-14T21:18:06.2225029Z * [new branch] gh/williamwen42/273/head -> origin/gh/williamwen42/273/head 2025-08-14T21:18:06.2227175Z * [new branch] gh/williamwen42/273/orig -> origin/gh/williamwen42/273/orig 2025-08-14T21:18:06.2227512Z * [new branch] gh/williamwen42/274/base -> origin/gh/williamwen42/274/base 2025-08-14T21:18:06.2227690Z * [new branch] gh/williamwen42/274/head -> origin/gh/williamwen42/274/head 2025-08-14T21:18:06.2227849Z * [new branch] gh/williamwen42/274/orig -> origin/gh/williamwen42/274/orig 2025-08-14T21:18:06.2228596Z * [new branch] gh/williamwen42/275/base -> origin/gh/williamwen42/275/base 2025-08-14T21:18:06.2229052Z * [new branch] gh/williamwen42/275/head -> origin/gh/williamwen42/275/head 2025-08-14T21:18:06.2231223Z * [new branch] gh/williamwen42/276/base -> origin/gh/williamwen42/276/base 2025-08-14T21:18:06.2231534Z * [new branch] gh/williamwen42/276/head -> origin/gh/williamwen42/276/head 2025-08-14T21:18:06.2231737Z * [new branch] gh/williamwen42/276/orig -> origin/gh/williamwen42/276/orig 2025-08-14T21:18:06.2231968Z * [new branch] gh/williamwen42/277/base -> origin/gh/williamwen42/277/base 2025-08-14T21:18:06.2232692Z * [new branch] gh/williamwen42/277/head -> origin/gh/williamwen42/277/head 2025-08-14T21:18:06.2233106Z * [new branch] gh/williamwen42/277/orig -> origin/gh/williamwen42/277/orig 2025-08-14T21:18:06.2234812Z * [new branch] gh/williamwen42/278/base -> origin/gh/williamwen42/278/base 2025-08-14T21:18:06.2235139Z * [new branch] gh/williamwen42/278/head -> origin/gh/williamwen42/278/head 2025-08-14T21:18:06.2235290Z * [new branch] gh/williamwen42/278/orig -> origin/gh/williamwen42/278/orig 2025-08-14T21:18:06.2236042Z * [new branch] gh/williamwen42/279/base -> origin/gh/williamwen42/279/base 2025-08-14T21:18:06.2236471Z * [new branch] gh/williamwen42/279/head -> origin/gh/williamwen42/279/head 2025-08-14T21:18:06.2238191Z * [new branch] gh/williamwen42/279/orig -> origin/gh/williamwen42/279/orig 2025-08-14T21:18:06.2238501Z * [new branch] gh/xmfan/169/base -> origin/gh/xmfan/169/base 2025-08-14T21:18:06.2239632Z * [new branch] gh/xmfan/169/head -> origin/gh/xmfan/169/head 2025-08-14T21:18:06.2240149Z * [new branch] gh/xmfan/170/base -> origin/gh/xmfan/170/base 2025-08-14T21:18:06.2240721Z * [new branch] gh/xmfan/170/head -> origin/gh/xmfan/170/head 2025-08-14T21:18:06.2244034Z * [new branch] gh/xmfan/18/base -> origin/gh/xmfan/18/base 2025-08-14T21:18:06.2244319Z * [new branch] gh/xmfan/18/head -> origin/gh/xmfan/18/head 2025-08-14T21:18:06.2244469Z * [new branch] gh/xmfan/228/base -> origin/gh/xmfan/228/base 2025-08-14T21:18:06.2244686Z * [new branch] gh/xmfan/228/head -> origin/gh/xmfan/228/head 2025-08-14T21:18:06.2244829Z * [new branch] gh/xmfan/228/orig -> origin/gh/xmfan/228/orig 2025-08-14T21:18:06.2245132Z * [new branch] gh/xmfan/229/base -> origin/gh/xmfan/229/base 2025-08-14T21:18:06.2246340Z * [new branch] gh/xmfan/229/head -> origin/gh/xmfan/229/head 2025-08-14T21:18:06.2246622Z * [new branch] gh/xmfan/229/orig -> origin/gh/xmfan/229/orig 2025-08-14T21:18:06.2247110Z * [new branch] gh/xmfan/237/base -> origin/gh/xmfan/237/base 2025-08-14T21:18:06.2248772Z * [new branch] gh/xmfan/237/head -> origin/gh/xmfan/237/head 2025-08-14T21:18:06.2249075Z * [new branch] gh/xmfan/237/orig -> origin/gh/xmfan/237/orig 2025-08-14T21:18:06.2249520Z * [new branch] gh/xmfan/244/base -> origin/gh/xmfan/244/base 2025-08-14T21:18:06.2252570Z * [new branch] gh/xmfan/244/head -> origin/gh/xmfan/244/head 2025-08-14T21:18:06.2252878Z * [new branch] gh/xmfan/244/orig -> origin/gh/xmfan/244/orig 2025-08-14T21:18:06.2252999Z * [new branch] gh/xmfan/246/base -> origin/gh/xmfan/246/base 2025-08-14T21:18:06.2253114Z * [new branch] gh/xmfan/246/head -> origin/gh/xmfan/246/head 2025-08-14T21:18:06.2253405Z * [new branch] gh/xmfan/246/orig -> origin/gh/xmfan/246/orig 2025-08-14T21:18:06.2253748Z * [new branch] gh/xmfan/253/base -> origin/gh/xmfan/253/base 2025-08-14T21:18:06.2254647Z * [new branch] gh/xmfan/253/head -> origin/gh/xmfan/253/head 2025-08-14T21:18:06.2254950Z * [new branch] gh/xmfan/253/orig -> origin/gh/xmfan/253/orig 2025-08-14T21:18:06.2256860Z * [new branch] gh/xmfan/254/base -> origin/gh/xmfan/254/base 2025-08-14T21:18:06.2257001Z * [new branch] gh/xmfan/254/head -> origin/gh/xmfan/254/head 2025-08-14T21:18:06.2257139Z * [new branch] gh/xmfan/254/orig -> origin/gh/xmfan/254/orig 2025-08-14T21:18:06.2258578Z * [new branch] gh/xmfan/260/base -> origin/gh/xmfan/260/base 2025-08-14T21:18:06.2258725Z * [new branch] gh/xmfan/260/head -> origin/gh/xmfan/260/head 2025-08-14T21:18:06.2259132Z * [new branch] gh/xmfan/260/orig -> origin/gh/xmfan/260/orig 2025-08-14T21:18:06.2260174Z * [new branch] gh/xmfan/262/base -> origin/gh/xmfan/262/base 2025-08-14T21:18:06.2260410Z * [new branch] gh/xmfan/262/head -> origin/gh/xmfan/262/head 2025-08-14T21:18:06.2261375Z * [new branch] gh/xmfan/262/orig -> origin/gh/xmfan/262/orig 2025-08-14T21:18:06.2262238Z * [new branch] gh/xmfan/263/base -> origin/gh/xmfan/263/base 2025-08-14T21:18:06.2262522Z * [new branch] gh/xmfan/263/head -> origin/gh/xmfan/263/head 2025-08-14T21:18:06.2263394Z * [new branch] gh/xmfan/263/orig -> origin/gh/xmfan/263/orig 2025-08-14T21:18:06.2264321Z * [new branch] gh/xmfan/264/base -> origin/gh/xmfan/264/base 2025-08-14T21:18:06.2264449Z * [new branch] gh/xmfan/264/head -> origin/gh/xmfan/264/head 2025-08-14T21:18:06.2267865Z * [new branch] gh/xmfan/264/orig -> origin/gh/xmfan/264/orig 2025-08-14T21:18:06.2268163Z * [new branch] gh/xmfan/268/base -> origin/gh/xmfan/268/base 2025-08-14T21:18:06.2268282Z * [new branch] gh/xmfan/268/head -> origin/gh/xmfan/268/head 2025-08-14T21:18:06.2268398Z * [new branch] gh/xmfan/268/orig -> origin/gh/xmfan/268/orig 2025-08-14T21:18:06.2268673Z * [new branch] gh/xmfan/269/base -> origin/gh/xmfan/269/base 2025-08-14T21:18:06.2273521Z * [new branch] gh/xmfan/269/head -> origin/gh/xmfan/269/head 2025-08-14T21:18:06.2273828Z * [new branch] gh/xmfan/269/orig -> origin/gh/xmfan/269/orig 2025-08-14T21:18:06.2273999Z * [new branch] gh/xmfan/270/base -> origin/gh/xmfan/270/base 2025-08-14T21:18:06.2274133Z * [new branch] gh/xmfan/270/head -> origin/gh/xmfan/270/head 2025-08-14T21:18:06.2274687Z * [new branch] gh/xmfan/270/orig -> origin/gh/xmfan/270/orig 2025-08-14T21:18:06.2278035Z * [new branch] gh/xmfan/271/base -> origin/gh/xmfan/271/base 2025-08-14T21:18:06.2278339Z * [new branch] gh/xmfan/271/head -> origin/gh/xmfan/271/head 2025-08-14T21:18:06.2278478Z * [new branch] gh/xmfan/271/orig -> origin/gh/xmfan/271/orig 2025-08-14T21:18:06.2278675Z * [new branch] gh/xmfan/272/base -> origin/gh/xmfan/272/base 2025-08-14T21:18:06.2278803Z * [new branch] gh/xmfan/272/head -> origin/gh/xmfan/272/head 2025-08-14T21:18:06.2279057Z * [new branch] gh/xmfan/272/orig -> origin/gh/xmfan/272/orig 2025-08-14T21:18:06.2280764Z * [new branch] gh/xmfan/273/base -> origin/gh/xmfan/273/base 2025-08-14T21:18:06.2281059Z * [new branch] gh/xmfan/273/head -> origin/gh/xmfan/273/head 2025-08-14T21:18:06.2281188Z * [new branch] gh/xmfan/273/orig -> origin/gh/xmfan/273/orig 2025-08-14T21:18:06.2282512Z * [new branch] gh/xmfan/274/base -> origin/gh/xmfan/274/base 2025-08-14T21:18:06.2282810Z * [new branch] gh/xmfan/274/head -> origin/gh/xmfan/274/head 2025-08-14T21:18:06.2283084Z * [new branch] gh/xmfan/274/orig -> origin/gh/xmfan/274/orig 2025-08-14T21:18:06.2284890Z * [new branch] gh/xmfan/275/base -> origin/gh/xmfan/275/base 2025-08-14T21:18:06.2285190Z * [new branch] gh/xmfan/275/head -> origin/gh/xmfan/275/head 2025-08-14T21:18:06.2285358Z * [new branch] gh/xmfan/275/orig -> origin/gh/xmfan/275/orig 2025-08-14T21:18:06.2286590Z * [new branch] gh/xmfan/276/base -> origin/gh/xmfan/276/base 2025-08-14T21:18:06.2286760Z * [new branch] gh/xmfan/276/head -> origin/gh/xmfan/276/head 2025-08-14T21:18:06.2288740Z * [new branch] gh/xmfan/276/orig -> origin/gh/xmfan/276/orig 2025-08-14T21:18:06.2289070Z * [new branch] gh/xmfan/277/base -> origin/gh/xmfan/277/base 2025-08-14T21:18:06.2289227Z * [new branch] gh/xmfan/277/head -> origin/gh/xmfan/277/head 2025-08-14T21:18:06.2289354Z * [new branch] gh/xmfan/277/orig -> origin/gh/xmfan/277/orig 2025-08-14T21:18:06.2293190Z * [new branch] gh/xuanzhang816/12/base -> origin/gh/xuanzhang816/12/base 2025-08-14T21:18:06.2293501Z * [new branch] gh/xuanzhang816/12/head -> origin/gh/xuanzhang816/12/head 2025-08-14T21:18:06.2293697Z * [new branch] gh/xuanzhang816/12/orig -> origin/gh/xuanzhang816/12/orig 2025-08-14T21:18:06.2293903Z * [new branch] gh/xuanzhang816/14/base -> origin/gh/xuanzhang816/14/base 2025-08-14T21:18:06.2294550Z * [new branch] gh/xuanzhang816/14/head -> origin/gh/xuanzhang816/14/head 2025-08-14T21:18:06.2294868Z * [new branch] gh/xuanzhang816/14/orig -> origin/gh/xuanzhang816/14/orig 2025-08-14T21:18:06.2295322Z * [new branch] gh/xuanzhang816/18/base -> origin/gh/xuanzhang816/18/base 2025-08-14T21:18:06.2295928Z * [new branch] gh/xuanzhang816/18/head -> origin/gh/xuanzhang816/18/head 2025-08-14T21:18:06.2297294Z * [new branch] gh/xuanzhang816/18/orig -> origin/gh/xuanzhang816/18/orig 2025-08-14T21:18:06.2297457Z * [new branch] gh/xuanzhang816/19/base -> origin/gh/xuanzhang816/19/base 2025-08-14T21:18:06.2299003Z * [new branch] gh/xuanzhang816/19/head -> origin/gh/xuanzhang816/19/head 2025-08-14T21:18:06.2299168Z * [new branch] gh/xuanzhang816/19/orig -> origin/gh/xuanzhang816/19/orig 2025-08-14T21:18:06.2299308Z * [new branch] gh/xuanzhang816/20/base -> origin/gh/xuanzhang816/20/base 2025-08-14T21:18:06.2299443Z * [new branch] gh/xuanzhang816/20/head -> origin/gh/xuanzhang816/20/head 2025-08-14T21:18:06.2299580Z * [new branch] gh/xuanzhang816/20/orig -> origin/gh/xuanzhang816/20/orig 2025-08-14T21:18:06.2300635Z * [new branch] gh/xuanzhang816/21/base -> origin/gh/xuanzhang816/21/base 2025-08-14T21:18:06.2300962Z * [new branch] gh/xuanzhang816/21/head -> origin/gh/xuanzhang816/21/head 2025-08-14T21:18:06.2301888Z * [new branch] gh/xuanzhang816/21/orig -> origin/gh/xuanzhang816/21/orig 2025-08-14T21:18:06.2302326Z * [new branch] gh/xuanzhang816/22/base -> origin/gh/xuanzhang816/22/base 2025-08-14T21:18:06.2303481Z * [new branch] gh/xuanzhang816/22/head -> origin/gh/xuanzhang816/22/head 2025-08-14T21:18:06.2303626Z * [new branch] gh/xuanzhang816/22/orig -> origin/gh/xuanzhang816/22/orig 2025-08-14T21:18:06.2305030Z * [new branch] gh/xuanzhang816/23/base -> origin/gh/xuanzhang816/23/base 2025-08-14T21:18:06.2305290Z * [new branch] gh/xuanzhang816/23/head -> origin/gh/xuanzhang816/23/head 2025-08-14T21:18:06.2305718Z * [new branch] gh/xuanzhang816/23/orig -> origin/gh/xuanzhang816/23/orig 2025-08-14T21:18:06.2307243Z * [new branch] gh/xuanzhang816/24/base -> origin/gh/xuanzhang816/24/base 2025-08-14T21:18:06.2307551Z * [new branch] gh/xuanzhang816/24/head -> origin/gh/xuanzhang816/24/head 2025-08-14T21:18:06.2307908Z * [new branch] gh/xuanzhang816/24/orig -> origin/gh/xuanzhang816/24/orig 2025-08-14T21:18:06.2311223Z * [new branch] gh/yanbing-j/11/base -> origin/gh/yanbing-j/11/base 2025-08-14T21:18:06.2311541Z * [new branch] gh/yanbing-j/11/head -> origin/gh/yanbing-j/11/head 2025-08-14T21:18:06.2311716Z * [new branch] gh/yanbing-j/11/orig -> origin/gh/yanbing-j/11/orig 2025-08-14T21:18:06.2311876Z * [new branch] gh/yanbing-j/12/base -> origin/gh/yanbing-j/12/base 2025-08-14T21:18:06.2312552Z * [new branch] gh/yanbing-j/12/head -> origin/gh/yanbing-j/12/head 2025-08-14T21:18:06.2312849Z * [new branch] gh/yanbing-j/12/orig -> origin/gh/yanbing-j/12/orig 2025-08-14T21:18:06.2313044Z * [new branch] gh/yanbing-j/13/base -> origin/gh/yanbing-j/13/base 2025-08-14T21:18:06.2314074Z * [new branch] gh/yanbing-j/13/head -> origin/gh/yanbing-j/13/head 2025-08-14T21:18:06.2314224Z * [new branch] gh/yanbing-j/13/orig -> origin/gh/yanbing-j/13/orig 2025-08-14T21:18:06.2315779Z * [new branch] gh/yanbing-j/14/base -> origin/gh/yanbing-j/14/base 2025-08-14T21:18:06.2316091Z * [new branch] gh/yanbing-j/14/head -> origin/gh/yanbing-j/14/head 2025-08-14T21:18:06.2316235Z * [new branch] gh/yanbing-j/14/orig -> origin/gh/yanbing-j/14/orig 2025-08-14T21:18:06.2319680Z * [new branch] gh/yanbing-j/15/base -> origin/gh/yanbing-j/15/base 2025-08-14T21:18:06.2320140Z * [new branch] gh/yanbing-j/15/head -> origin/gh/yanbing-j/15/head 2025-08-14T21:18:06.2320309Z * [new branch] gh/yanbing-j/15/orig -> origin/gh/yanbing-j/15/orig 2025-08-14T21:18:06.2320460Z * [new branch] gh/yanbing-j/18/base -> origin/gh/yanbing-j/18/base 2025-08-14T21:18:06.2321037Z * [new branch] gh/yanbing-j/18/head -> origin/gh/yanbing-j/18/head 2025-08-14T21:18:06.2321191Z * [new branch] gh/yanbing-j/18/orig -> origin/gh/yanbing-j/18/orig 2025-08-14T21:18:06.2321335Z * [new branch] gh/yanbing-j/19/base -> origin/gh/yanbing-j/19/base 2025-08-14T21:18:06.2321673Z * [new branch] gh/yanbing-j/19/head -> origin/gh/yanbing-j/19/head 2025-08-14T21:18:06.2322603Z * [new branch] gh/yanbing-j/19/orig -> origin/gh/yanbing-j/19/orig 2025-08-14T21:18:06.2323035Z * [new branch] gh/yanbing-j/20/base -> origin/gh/yanbing-j/20/base 2025-08-14T21:18:06.2324913Z * [new branch] gh/yanbing-j/20/head -> origin/gh/yanbing-j/20/head 2025-08-14T21:18:06.2325218Z * [new branch] gh/yanbing-j/20/orig -> origin/gh/yanbing-j/20/orig 2025-08-14T21:18:06.2325356Z * [new branch] gh/yanbing-j/21/base -> origin/gh/yanbing-j/21/base 2025-08-14T21:18:06.2325717Z * [new branch] gh/yanbing-j/21/head -> origin/gh/yanbing-j/21/head 2025-08-14T21:18:06.2329648Z * [new branch] gh/yanbing-j/22/base -> origin/gh/yanbing-j/22/base 2025-08-14T21:18:06.2330094Z * [new branch] gh/yanbing-j/22/head -> origin/gh/yanbing-j/22/head 2025-08-14T21:18:06.2330360Z * [new branch] gh/yanbing-j/22/orig -> origin/gh/yanbing-j/22/orig 2025-08-14T21:18:06.2330507Z * [new branch] gh/yanbing-j/23/base -> origin/gh/yanbing-j/23/base 2025-08-14T21:18:06.2330636Z * [new branch] gh/yanbing-j/23/head -> origin/gh/yanbing-j/23/head 2025-08-14T21:18:06.2330772Z * [new branch] gh/yanbing-j/23/orig -> origin/gh/yanbing-j/23/orig 2025-08-14T21:18:06.2330929Z * [new branch] gh/yanbing-j/24/base -> origin/gh/yanbing-j/24/base 2025-08-14T21:18:06.2332353Z * [new branch] gh/yanbing-j/24/head -> origin/gh/yanbing-j/24/head 2025-08-14T21:18:06.2332654Z * [new branch] gh/yanbing-j/24/orig -> origin/gh/yanbing-j/24/orig 2025-08-14T21:18:06.2332852Z * [new branch] gh/yanbing-j/25/base -> origin/gh/yanbing-j/25/base 2025-08-14T21:18:06.2334243Z * [new branch] gh/yanbing-j/25/head -> origin/gh/yanbing-j/25/head 2025-08-14T21:18:06.2334552Z * [new branch] gh/yanbing-j/25/orig -> origin/gh/yanbing-j/25/orig 2025-08-14T21:18:06.2334887Z * [new branch] gh/yanbing-j/26/base -> origin/gh/yanbing-j/26/base 2025-08-14T21:18:06.2335818Z * [new branch] gh/yanbing-j/26/head -> origin/gh/yanbing-j/26/head 2025-08-14T21:18:06.2336071Z * [new branch] gh/yanbing-j/26/orig -> origin/gh/yanbing-j/26/orig 2025-08-14T21:18:06.2337869Z * [new branch] gh/yanbing-j/36/base -> origin/gh/yanbing-j/36/base 2025-08-14T21:18:06.2338026Z * [new branch] gh/yanbing-j/36/head -> origin/gh/yanbing-j/36/head 2025-08-14T21:18:06.2338217Z * [new branch] gh/yanbing-j/36/orig -> origin/gh/yanbing-j/36/orig 2025-08-14T21:18:06.2339924Z * [new branch] gh/yanbing-j/37/base -> origin/gh/yanbing-j/37/base 2025-08-14T21:18:06.2340082Z * [new branch] gh/yanbing-j/37/head -> origin/gh/yanbing-j/37/head 2025-08-14T21:18:06.2340337Z * [new branch] gh/yanbing-j/37/orig -> origin/gh/yanbing-j/37/orig 2025-08-14T21:18:06.2341351Z * [new branch] gh/yanbing-j/39/base -> origin/gh/yanbing-j/39/base 2025-08-14T21:18:06.2341626Z * [new branch] gh/yanbing-j/39/head -> origin/gh/yanbing-j/39/head 2025-08-14T21:18:06.2342533Z * [new branch] gh/yanbing-j/39/orig -> origin/gh/yanbing-j/39/orig 2025-08-14T21:18:06.2343524Z * [new branch] gh/yangw-dev/1/base -> origin/gh/yangw-dev/1/base 2025-08-14T21:18:06.2344102Z * [new branch] gh/yangw-dev/10/base -> origin/gh/yangw-dev/10/base 2025-08-14T21:18:06.2345360Z * [new branch] gh/yangw-dev/10/head -> origin/gh/yangw-dev/10/head 2025-08-14T21:18:06.2345508Z * [new branch] gh/yangw-dev/10/orig -> origin/gh/yangw-dev/10/orig 2025-08-14T21:18:06.2347825Z * [new branch] gh/yangw-dev/11/base -> origin/gh/yangw-dev/11/base 2025-08-14T21:18:06.2347976Z * [new branch] gh/yangw-dev/11/head -> origin/gh/yangw-dev/11/head 2025-08-14T21:18:06.2348104Z * [new branch] gh/yangw-dev/11/orig -> origin/gh/yangw-dev/11/orig 2025-08-14T21:18:06.2348349Z * [new branch] gh/yangw-dev/12/base -> origin/gh/yangw-dev/12/base 2025-08-14T21:18:06.2348774Z * [new branch] gh/yangw-dev/12/head -> origin/gh/yangw-dev/12/head 2025-08-14T21:18:06.2349686Z * [new branch] gh/yangw-dev/12/orig -> origin/gh/yangw-dev/12/orig 2025-08-14T21:18:06.2351601Z * [new branch] gh/yangw-dev/13/base -> origin/gh/yangw-dev/13/base 2025-08-14T21:18:06.2351897Z * [new branch] gh/yangw-dev/13/head -> origin/gh/yangw-dev/13/head 2025-08-14T21:18:06.2352228Z * [new branch] gh/yangw-dev/13/orig -> origin/gh/yangw-dev/13/orig 2025-08-14T21:18:06.2352448Z * [new branch] gh/yangw-dev/14/base -> origin/gh/yangw-dev/14/base 2025-08-14T21:18:06.2353251Z * [new branch] gh/yangw-dev/14/head -> origin/gh/yangw-dev/14/head 2025-08-14T21:18:06.2353596Z * [new branch] gh/yangw-dev/14/orig -> origin/gh/yangw-dev/14/orig 2025-08-14T21:18:06.2355224Z * [new branch] gh/yangw-dev/15/base -> origin/gh/yangw-dev/15/base 2025-08-14T21:18:06.2355523Z * [new branch] gh/yangw-dev/15/head -> origin/gh/yangw-dev/15/head 2025-08-14T21:18:06.2355663Z * [new branch] gh/yangw-dev/15/orig -> origin/gh/yangw-dev/15/orig 2025-08-14T21:18:06.2359039Z * [new branch] gh/yangw-dev/16/base -> origin/gh/yangw-dev/16/base 2025-08-14T21:18:06.2359341Z * [new branch] gh/yangw-dev/16/head -> origin/gh/yangw-dev/16/head 2025-08-14T21:18:06.2359499Z * [new branch] gh/yangw-dev/16/orig -> origin/gh/yangw-dev/16/orig 2025-08-14T21:18:06.2359622Z * [new branch] gh/yangw-dev/17/base -> origin/gh/yangw-dev/17/base 2025-08-14T21:18:06.2359861Z * [new branch] gh/yangw-dev/17/head -> origin/gh/yangw-dev/17/head 2025-08-14T21:18:06.2360500Z * [new branch] gh/yangw-dev/17/orig -> origin/gh/yangw-dev/17/orig 2025-08-14T21:18:06.2360663Z * [new branch] gh/yangw-dev/18/base -> origin/gh/yangw-dev/18/base 2025-08-14T21:18:06.2361184Z * [new branch] gh/yangw-dev/18/head -> origin/gh/yangw-dev/18/head 2025-08-14T21:18:06.2363321Z * [new branch] gh/yangw-dev/18/orig -> origin/gh/yangw-dev/18/orig 2025-08-14T21:18:06.2363622Z * [new branch] gh/yangw-dev/19/base -> origin/gh/yangw-dev/19/base 2025-08-14T21:18:06.2363786Z * [new branch] gh/yangw-dev/19/head -> origin/gh/yangw-dev/19/head 2025-08-14T21:18:06.2363943Z * [new branch] gh/yangw-dev/19/orig -> origin/gh/yangw-dev/19/orig 2025-08-14T21:18:06.2367443Z * [new branch] gh/yangw-dev/2/base -> origin/gh/yangw-dev/2/base 2025-08-14T21:18:06.2367750Z * [new branch] gh/yangw-dev/2/head -> origin/gh/yangw-dev/2/head 2025-08-14T21:18:06.2368138Z * [new branch] gh/yangw-dev/3/base -> origin/gh/yangw-dev/3/base 2025-08-14T21:18:06.2368277Z * [new branch] gh/yangw-dev/3/head -> origin/gh/yangw-dev/3/head 2025-08-14T21:18:06.2368392Z * [new branch] gh/yangw-dev/4/base -> origin/gh/yangw-dev/4/base 2025-08-14T21:18:06.2368629Z * [new branch] gh/yangw-dev/4/head -> origin/gh/yangw-dev/4/head 2025-08-14T21:18:06.2369135Z * [new branch] gh/yangw-dev/5/base -> origin/gh/yangw-dev/5/base 2025-08-14T21:18:06.2369607Z * [new branch] gh/yangw-dev/5/head -> origin/gh/yangw-dev/5/head 2025-08-14T21:18:06.2373155Z * [new branch] gh/yangw-dev/6/base -> origin/gh/yangw-dev/6/base 2025-08-14T21:18:06.2373315Z * [new branch] gh/yangw-dev/6/head -> origin/gh/yangw-dev/6/head 2025-08-14T21:18:06.2373440Z * [new branch] gh/yangw-dev/7/base -> origin/gh/yangw-dev/7/base 2025-08-14T21:18:06.2373580Z * [new branch] gh/yangw-dev/7/head -> origin/gh/yangw-dev/7/head 2025-08-14T21:18:06.2373704Z * [new branch] gh/yangw-dev/8/base -> origin/gh/yangw-dev/8/base 2025-08-14T21:18:06.2374079Z * [new branch] gh/yangw-dev/8/head -> origin/gh/yangw-dev/8/head 2025-08-14T21:18:06.2374537Z * [new branch] gh/yangw-dev/8/orig -> origin/gh/yangw-dev/8/orig 2025-08-14T21:18:06.2375863Z * [new branch] gh/yangw-dev/9/base -> origin/gh/yangw-dev/9/base 2025-08-14T21:18:06.2376124Z * [new branch] gh/yangw-dev/9/head -> origin/gh/yangw-dev/9/head 2025-08-14T21:18:06.2376534Z * [new branch] gh/yangw-dev/9/orig -> origin/gh/yangw-dev/9/orig 2025-08-14T21:18:06.2377873Z * [new branch] gh/ydwu4/233/base -> origin/gh/ydwu4/233/base 2025-08-14T21:18:06.2378169Z * [new branch] gh/ydwu4/233/head -> origin/gh/ydwu4/233/head 2025-08-14T21:18:06.2379042Z * [new branch] gh/ydwu4/233/orig -> origin/gh/ydwu4/233/orig 2025-08-14T21:18:06.2380273Z * [new branch] gh/ydwu4/246/base -> origin/gh/ydwu4/246/base 2025-08-14T21:18:06.2380503Z * [new branch] gh/ydwu4/246/head -> origin/gh/ydwu4/246/head 2025-08-14T21:18:06.2381490Z * [new branch] gh/ydwu4/246/orig -> origin/gh/ydwu4/246/orig 2025-08-14T21:18:06.2382414Z * [new branch] gh/ydwu4/253/base -> origin/gh/ydwu4/253/base 2025-08-14T21:18:06.2382558Z * [new branch] gh/ydwu4/253/head -> origin/gh/ydwu4/253/head 2025-08-14T21:18:06.2383689Z * [new branch] gh/ydwu4/253/orig -> origin/gh/ydwu4/253/orig 2025-08-14T21:18:06.2384449Z * [new branch] gh/ydwu4/255/base -> origin/gh/ydwu4/255/base 2025-08-14T21:18:06.2384955Z * [new branch] gh/ydwu4/255/head -> origin/gh/ydwu4/255/head 2025-08-14T21:18:06.2389899Z * [new branch] gh/ydwu4/255/orig -> origin/gh/ydwu4/255/orig 2025-08-14T21:18:06.2391681Z * [new branch] gh/ydwu4/259/base -> origin/gh/ydwu4/259/base 2025-08-14T21:18:06.2394675Z * [new branch] gh/ydwu4/259/head -> origin/gh/ydwu4/259/head 2025-08-14T21:18:06.2394803Z * [new branch] gh/ydwu4/259/orig -> origin/gh/ydwu4/259/orig 2025-08-14T21:18:06.2394926Z * [new branch] gh/ydwu4/262/base -> origin/gh/ydwu4/262/base 2025-08-14T21:18:06.2395046Z * [new branch] gh/ydwu4/262/head -> origin/gh/ydwu4/262/head 2025-08-14T21:18:06.2395164Z * [new branch] gh/ydwu4/262/orig -> origin/gh/ydwu4/262/orig 2025-08-14T21:18:06.2399406Z * [new branch] gh/ydwu4/263/base -> origin/gh/ydwu4/263/base 2025-08-14T21:18:06.2399712Z * [new branch] gh/ydwu4/263/head -> origin/gh/ydwu4/263/head 2025-08-14T21:18:06.2400224Z * [new branch] gh/ydwu4/263/orig -> origin/gh/ydwu4/263/orig 2025-08-14T21:18:06.2400444Z * [new branch] gh/ydwu4/269/base -> origin/gh/ydwu4/269/base 2025-08-14T21:18:06.2401080Z * [new branch] gh/ydwu4/269/head -> origin/gh/ydwu4/269/head 2025-08-14T21:18:06.2401239Z * [new branch] gh/ydwu4/269/orig -> origin/gh/ydwu4/269/orig 2025-08-14T21:18:06.2401356Z * [new branch] gh/ydwu4/270/base -> origin/gh/ydwu4/270/base 2025-08-14T21:18:06.2401506Z * [new branch] gh/ydwu4/270/head -> origin/gh/ydwu4/270/head 2025-08-14T21:18:06.2401619Z * [new branch] gh/ydwu4/270/orig -> origin/gh/ydwu4/270/orig 2025-08-14T21:18:06.2401730Z * [new branch] gh/ydwu4/272/base -> origin/gh/ydwu4/272/base 2025-08-14T21:18:06.2406297Z * [new branch] gh/ydwu4/272/head -> origin/gh/ydwu4/272/head 2025-08-14T21:18:06.2406601Z * [new branch] gh/ydwu4/272/orig -> origin/gh/ydwu4/272/orig 2025-08-14T21:18:06.2406810Z * [new branch] gh/ydwu4/275/base -> origin/gh/ydwu4/275/base 2025-08-14T21:18:06.2407010Z * [new branch] gh/ydwu4/275/head -> origin/gh/ydwu4/275/head 2025-08-14T21:18:06.2407143Z * [new branch] gh/ydwu4/275/orig -> origin/gh/ydwu4/275/orig 2025-08-14T21:18:06.2407256Z * [new branch] gh/ydwu4/276/base -> origin/gh/ydwu4/276/base 2025-08-14T21:18:06.2407746Z * [new branch] gh/ydwu4/276/head -> origin/gh/ydwu4/276/head 2025-08-14T21:18:06.2411721Z * [new branch] gh/ydwu4/276/orig -> origin/gh/ydwu4/276/orig 2025-08-14T21:18:06.2412017Z * [new branch] gh/ydwu4/277/base -> origin/gh/ydwu4/277/base 2025-08-14T21:18:06.2412225Z * [new branch] gh/ydwu4/277/head -> origin/gh/ydwu4/277/head 2025-08-14T21:18:06.2412437Z * [new branch] gh/ydwu4/277/orig -> origin/gh/ydwu4/277/orig 2025-08-14T21:18:06.2413118Z * [new branch] gh/ydwu4/278/base -> origin/gh/ydwu4/278/base 2025-08-14T21:18:06.2413266Z * [new branch] gh/ydwu4/278/head -> origin/gh/ydwu4/278/head 2025-08-14T21:18:06.2413394Z * [new branch] gh/ydwu4/278/orig -> origin/gh/ydwu4/278/orig 2025-08-14T21:18:06.2413513Z * [new branch] gh/ydwu4/279/base -> origin/gh/ydwu4/279/base 2025-08-14T21:18:06.2413646Z * [new branch] gh/ydwu4/279/head -> origin/gh/ydwu4/279/head 2025-08-14T21:18:06.2413757Z * [new branch] gh/ydwu4/279/orig -> origin/gh/ydwu4/279/orig 2025-08-14T21:18:06.2417588Z * [new branch] gh/ydwu4/280/base -> origin/gh/ydwu4/280/base 2025-08-14T21:18:06.2417887Z * [new branch] gh/ydwu4/280/head -> origin/gh/ydwu4/280/head 2025-08-14T21:18:06.2418106Z * [new branch] gh/ydwu4/280/orig -> origin/gh/ydwu4/280/orig 2025-08-14T21:18:06.2418244Z * [new branch] gh/ydwu4/281/base -> origin/gh/ydwu4/281/base 2025-08-14T21:18:06.2418460Z * [new branch] gh/ydwu4/281/head -> origin/gh/ydwu4/281/head 2025-08-14T21:18:06.2418625Z * [new branch] gh/ydwu4/281/orig -> origin/gh/ydwu4/281/orig 2025-08-14T21:18:06.2419178Z * [new branch] gh/ydwu4/282/base -> origin/gh/ydwu4/282/base 2025-08-14T21:18:06.2419341Z * [new branch] gh/ydwu4/282/head -> origin/gh/ydwu4/282/head 2025-08-14T21:18:06.2419457Z * [new branch] gh/ydwu4/282/orig -> origin/gh/ydwu4/282/orig 2025-08-14T21:18:06.2419579Z * [new branch] gh/ydwu4/283/base -> origin/gh/ydwu4/283/base 2025-08-14T21:18:06.2419974Z * [new branch] gh/ydwu4/283/head -> origin/gh/ydwu4/283/head 2025-08-14T21:18:06.2420517Z * [new branch] gh/ydwu4/283/orig -> origin/gh/ydwu4/283/orig 2025-08-14T21:18:06.2421471Z * [new branch] gh/ydwu4/284/base -> origin/gh/ydwu4/284/base 2025-08-14T21:18:06.2421728Z * [new branch] gh/ydwu4/284/head -> origin/gh/ydwu4/284/head 2025-08-14T21:18:06.2422714Z * [new branch] gh/ydwu4/284/orig -> origin/gh/ydwu4/284/orig 2025-08-14T21:18:06.2423228Z * [new branch] gh/ydwu4/285/base -> origin/gh/ydwu4/285/base 2025-08-14T21:18:06.2424133Z * [new branch] gh/ydwu4/285/head -> origin/gh/ydwu4/285/head 2025-08-14T21:18:06.2424460Z * [new branch] gh/ydwu4/285/orig -> origin/gh/ydwu4/285/orig 2025-08-14T21:18:06.2425692Z * [new branch] gh/ydwu4/286/base -> origin/gh/ydwu4/286/base 2025-08-14T21:18:06.2425905Z * [new branch] gh/ydwu4/286/head -> origin/gh/ydwu4/286/head 2025-08-14T21:18:06.2426862Z * [new branch] gh/ydwu4/286/orig -> origin/gh/ydwu4/286/orig 2025-08-14T21:18:06.2427787Z * [new branch] gh/ydwu4/287/base -> origin/gh/ydwu4/287/base 2025-08-14T21:18:06.2428703Z * [new branch] gh/ydwu4/287/head -> origin/gh/ydwu4/287/head 2025-08-14T21:18:06.2428921Z * [new branch] gh/ydwu4/287/orig -> origin/gh/ydwu4/287/orig 2025-08-14T21:18:06.2430190Z * [new branch] gh/ydwu4/288/base -> origin/gh/ydwu4/288/base 2025-08-14T21:18:06.2430459Z * [new branch] gh/ydwu4/288/head -> origin/gh/ydwu4/288/head 2025-08-14T21:18:06.2431365Z * [new branch] gh/ydwu4/288/orig -> origin/gh/ydwu4/288/orig 2025-08-14T21:18:06.2432304Z * [new branch] gh/ydwu4/289/base -> origin/gh/ydwu4/289/base 2025-08-14T21:18:06.2432604Z * [new branch] gh/ydwu4/289/head -> origin/gh/ydwu4/289/head 2025-08-14T21:18:06.2433501Z * [new branch] gh/ydwu4/289/orig -> origin/gh/ydwu4/289/orig 2025-08-14T21:18:06.2434695Z * [new branch] gh/ydwu4/290/base -> origin/gh/ydwu4/290/base 2025-08-14T21:18:06.2435076Z * [new branch] gh/ydwu4/290/head -> origin/gh/ydwu4/290/head 2025-08-14T21:18:06.2435951Z * [new branch] gh/ydwu4/290/orig -> origin/gh/ydwu4/290/orig 2025-08-14T21:18:06.2436800Z * [new branch] gh/ydwu4/291/base -> origin/gh/ydwu4/291/base 2025-08-14T21:18:06.2437076Z * [new branch] gh/ydwu4/291/head -> origin/gh/ydwu4/291/head 2025-08-14T21:18:06.2438057Z * [new branch] gh/ydwu4/291/orig -> origin/gh/ydwu4/291/orig 2025-08-14T21:18:06.2438869Z * [new branch] gh/ydwu4/292/base -> origin/gh/ydwu4/292/base 2025-08-14T21:18:06.2439120Z * [new branch] gh/ydwu4/292/head -> origin/gh/ydwu4/292/head 2025-08-14T21:18:06.2440158Z * [new branch] gh/ydwu4/292/orig -> origin/gh/ydwu4/292/orig 2025-08-14T21:18:06.2441179Z * [new branch] gh/ydwu4/293/base -> origin/gh/ydwu4/293/base 2025-08-14T21:18:06.2441534Z * [new branch] gh/ydwu4/293/head -> origin/gh/ydwu4/293/head 2025-08-14T21:18:06.2442394Z * [new branch] gh/ydwu4/293/orig -> origin/gh/ydwu4/293/orig 2025-08-14T21:18:06.2443278Z * [new branch] gh/ydwu4/294/base -> origin/gh/ydwu4/294/base 2025-08-14T21:18:06.2443468Z * [new branch] gh/ydwu4/294/head -> origin/gh/ydwu4/294/head 2025-08-14T21:18:06.2445200Z * [new branch] gh/ydwu4/294/orig -> origin/gh/ydwu4/294/orig 2025-08-14T21:18:06.2445346Z * [new branch] gh/ydwu4/295/base -> origin/gh/ydwu4/295/base 2025-08-14T21:18:06.2445663Z * [new branch] gh/ydwu4/295/head -> origin/gh/ydwu4/295/head 2025-08-14T21:18:06.2447175Z * [new branch] gh/ydwu4/295/orig -> origin/gh/ydwu4/295/orig 2025-08-14T21:18:06.2447296Z * [new branch] gh/ydwu4/296/base -> origin/gh/ydwu4/296/base 2025-08-14T21:18:06.2448016Z * [new branch] gh/ydwu4/296/head -> origin/gh/ydwu4/296/head 2025-08-14T21:18:06.2448822Z * [new branch] gh/ydwu4/296/orig -> origin/gh/ydwu4/296/orig 2025-08-14T21:18:06.2450051Z * [new branch] gh/ydwu4/297/base -> origin/gh/ydwu4/297/base 2025-08-14T21:18:06.2450200Z * [new branch] gh/ydwu4/297/head -> origin/gh/ydwu4/297/head 2025-08-14T21:18:06.2450596Z * [new branch] gh/ydwu4/297/orig -> origin/gh/ydwu4/297/orig 2025-08-14T21:18:06.2451888Z * [new branch] gh/ydwu4/298/base -> origin/gh/ydwu4/298/base 2025-08-14T21:18:06.2452185Z * [new branch] gh/ydwu4/298/head -> origin/gh/ydwu4/298/head 2025-08-14T21:18:06.2452523Z * [new branch] gh/ydwu4/298/orig -> origin/gh/ydwu4/298/orig 2025-08-14T21:18:06.2456009Z * [new branch] gh/ydwu4/299/base -> origin/gh/ydwu4/299/base 2025-08-14T21:18:06.2456313Z * [new branch] gh/ydwu4/299/head -> origin/gh/ydwu4/299/head 2025-08-14T21:18:06.2456457Z * [new branch] gh/ydwu4/299/orig -> origin/gh/ydwu4/299/orig 2025-08-14T21:18:06.2456581Z * [new branch] gh/ydwu4/300/base -> origin/gh/ydwu4/300/base 2025-08-14T21:18:06.2457218Z * [new branch] gh/ydwu4/300/head -> origin/gh/ydwu4/300/head 2025-08-14T21:18:06.2457817Z * [new branch] gh/ydwu4/300/orig -> origin/gh/ydwu4/300/orig 2025-08-14T21:18:06.2460953Z * [new branch] gh/ydwu4/301/base -> origin/gh/ydwu4/301/base 2025-08-14T21:18:06.2461103Z * [new branch] gh/ydwu4/301/head -> origin/gh/ydwu4/301/head 2025-08-14T21:18:06.2461241Z * [new branch] gh/ydwu4/301/orig -> origin/gh/ydwu4/301/orig 2025-08-14T21:18:06.2461356Z * [new branch] gh/ydwu4/302/base -> origin/gh/ydwu4/302/base 2025-08-14T21:18:06.2461476Z * [new branch] gh/ydwu4/302/head -> origin/gh/ydwu4/302/head 2025-08-14T21:18:06.2462092Z * [new branch] gh/ydwu4/302/orig -> origin/gh/ydwu4/302/orig 2025-08-14T21:18:06.2463048Z * [new branch] gh/ydwu4/303/base -> origin/gh/ydwu4/303/base 2025-08-14T21:18:06.2463505Z * [new branch] gh/ydwu4/303/head -> origin/gh/ydwu4/303/head 2025-08-14T21:18:06.2463976Z * [new branch] gh/ydwu4/303/orig -> origin/gh/ydwu4/303/orig 2025-08-14T21:18:06.2466801Z * [new branch] gh/ydwu4/304/base -> origin/gh/ydwu4/304/base 2025-08-14T21:18:06.2466953Z * [new branch] gh/ydwu4/304/head -> origin/gh/ydwu4/304/head 2025-08-14T21:18:06.2467091Z * [new branch] gh/ydwu4/304/orig -> origin/gh/ydwu4/304/orig 2025-08-14T21:18:06.2467213Z * [new branch] gh/ydwu4/305/base -> origin/gh/ydwu4/305/base 2025-08-14T21:18:06.2467688Z * [new branch] gh/ydwu4/305/head -> origin/gh/ydwu4/305/head 2025-08-14T21:18:06.2468472Z * [new branch] gh/ydwu4/305/orig -> origin/gh/ydwu4/305/orig 2025-08-14T21:18:06.2470582Z * [new branch] gh/ydwu4/306/base -> origin/gh/ydwu4/306/base 2025-08-14T21:18:06.2470898Z * [new branch] gh/ydwu4/306/head -> origin/gh/ydwu4/306/head 2025-08-14T21:18:06.2471056Z * [new branch] gh/ydwu4/306/orig -> origin/gh/ydwu4/306/orig 2025-08-14T21:18:06.2471184Z * [new branch] gh/ydwu4/307/base -> origin/gh/ydwu4/307/base 2025-08-14T21:18:06.2471903Z * [new branch] gh/ydwu4/307/head -> origin/gh/ydwu4/307/head 2025-08-14T21:18:06.2472511Z * [new branch] gh/ydwu4/307/orig -> origin/gh/ydwu4/307/orig 2025-08-14T21:18:06.2474585Z * [new branch] gh/ydwu4/308/base -> origin/gh/ydwu4/308/base 2025-08-14T21:18:06.2474883Z * [new branch] gh/ydwu4/308/head -> origin/gh/ydwu4/308/head 2025-08-14T21:18:06.2475131Z * [new branch] gh/ydwu4/308/orig -> origin/gh/ydwu4/308/orig 2025-08-14T21:18:06.2475421Z * [new branch] gh/ydwu4/309/base -> origin/gh/ydwu4/309/base 2025-08-14T21:18:06.2476068Z * [new branch] gh/ydwu4/309/head -> origin/gh/ydwu4/309/head 2025-08-14T21:18:06.2476680Z * [new branch] gh/ydwu4/309/orig -> origin/gh/ydwu4/309/orig 2025-08-14T21:18:06.2477110Z * [new branch] gh/ydwu4/310/base -> origin/gh/ydwu4/310/base 2025-08-14T21:18:06.2478573Z * [new branch] gh/ydwu4/310/head -> origin/gh/ydwu4/310/head 2025-08-14T21:18:06.2478862Z * [new branch] gh/ydwu4/310/orig -> origin/gh/ydwu4/310/orig 2025-08-14T21:18:06.2478999Z * [new branch] gh/ydwu4/311/base -> origin/gh/ydwu4/311/base 2025-08-14T21:18:06.2480635Z * [new branch] gh/ydwu4/311/head -> origin/gh/ydwu4/311/head 2025-08-14T21:18:06.2480783Z * [new branch] gh/ydwu4/311/orig -> origin/gh/ydwu4/311/orig 2025-08-14T21:18:06.2481672Z * [new branch] gh/yf225/133/base -> origin/gh/yf225/133/base 2025-08-14T21:18:06.2482261Z * [new branch] gh/yf225/133/head -> origin/gh/yf225/133/head 2025-08-14T21:18:06.2485719Z * [new branch] gh/yf225/171/base -> origin/gh/yf225/171/base 2025-08-14T21:18:06.2485864Z * [new branch] gh/yf225/171/head -> origin/gh/yf225/171/head 2025-08-14T21:18:06.2485995Z * [new branch] gh/yf225/171/orig -> origin/gh/yf225/171/orig 2025-08-14T21:18:06.2486110Z * [new branch] gh/yf225/172/base -> origin/gh/yf225/172/base 2025-08-14T21:18:06.2486695Z * [new branch] gh/yf225/172/head -> origin/gh/yf225/172/head 2025-08-14T21:18:06.2487119Z * [new branch] gh/yf225/172/orig -> origin/gh/yf225/172/orig 2025-08-14T21:18:06.2490339Z * [new branch] gh/yf225/93/base -> origin/gh/yf225/93/base 2025-08-14T21:18:06.2490491Z * [new branch] gh/yf225/93/head -> origin/gh/yf225/93/head 2025-08-14T21:18:06.2490642Z * [new branch] gh/yifuwang/152/base -> origin/gh/yifuwang/152/base 2025-08-14T21:18:06.2490779Z * [new branch] gh/yifuwang/152/head -> origin/gh/yifuwang/152/head 2025-08-14T21:18:06.2491661Z * [new branch] gh/yifuwang/152/orig -> origin/gh/yifuwang/152/orig 2025-08-14T21:18:06.2494758Z * [new branch] gh/yifuwang/195/base -> origin/gh/yifuwang/195/base 2025-08-14T21:18:06.2494910Z * [new branch] gh/yifuwang/195/head -> origin/gh/yifuwang/195/head 2025-08-14T21:18:06.2495038Z * [new branch] gh/yifuwang/195/orig -> origin/gh/yifuwang/195/orig 2025-08-14T21:18:06.2495172Z * [new branch] gh/yiming0416/1/base -> origin/gh/yiming0416/1/base 2025-08-14T21:18:06.2495450Z * [new branch] gh/yiming0416/1/head -> origin/gh/yiming0416/1/head 2025-08-14T21:18:06.2496785Z * [new branch] gh/yiming0416/2/base -> origin/gh/yiming0416/2/base 2025-08-14T21:18:06.2497088Z * [new branch] gh/yiming0416/2/head -> origin/gh/yiming0416/2/head 2025-08-14T21:18:06.2498669Z * [new branch] gh/ysiraichi/79/base -> origin/gh/ysiraichi/79/base 2025-08-14T21:18:06.2498823Z * [new branch] gh/ysiraichi/79/head -> origin/gh/ysiraichi/79/head 2025-08-14T21:18:06.2499476Z * [new branch] gh/ysiraichi/79/orig -> origin/gh/ysiraichi/79/orig 2025-08-14T21:18:06.2502541Z * [new branch] gh/ysiraichi/81/base -> origin/gh/ysiraichi/81/base 2025-08-14T21:18:06.2502693Z * [new branch] gh/ysiraichi/81/head -> origin/gh/ysiraichi/81/head 2025-08-14T21:18:06.2502818Z * [new branch] gh/ysiraichi/81/orig -> origin/gh/ysiraichi/81/orig 2025-08-14T21:18:06.2502937Z * [new branch] gh/ysiraichi/84/base -> origin/gh/ysiraichi/84/base 2025-08-14T21:18:06.2503208Z * [new branch] gh/ysiraichi/84/head -> origin/gh/ysiraichi/84/head 2025-08-14T21:18:06.2503567Z * [new branch] gh/ysiraichi/84/orig -> origin/gh/ysiraichi/84/orig 2025-08-14T21:18:06.2504727Z * [new branch] gh/ysiraichi/85/base -> origin/gh/ysiraichi/85/base 2025-08-14T21:18:06.2505191Z * [new branch] gh/ysiraichi/85/head -> origin/gh/ysiraichi/85/head 2025-08-14T21:18:06.2506970Z * [new branch] gh/ysiraichi/85/orig -> origin/gh/ysiraichi/85/orig 2025-08-14T21:18:06.2507346Z * [new branch] gh/ysiraichi/86/base -> origin/gh/ysiraichi/86/base 2025-08-14T21:18:06.2507795Z * [new branch] gh/ysiraichi/86/head -> origin/gh/ysiraichi/86/head 2025-08-14T21:18:06.2508717Z * [new branch] gh/ysiraichi/86/orig -> origin/gh/ysiraichi/86/orig 2025-08-14T21:18:06.2511755Z * [new branch] gh/ysiraichi/87/base -> origin/gh/ysiraichi/87/base 2025-08-14T21:18:06.2512098Z * [new branch] gh/ysiraichi/87/head -> origin/gh/ysiraichi/87/head 2025-08-14T21:18:06.2512237Z * [new branch] gh/ysiraichi/87/orig -> origin/gh/ysiraichi/87/orig 2025-08-14T21:18:06.2512363Z * [new branch] gh/ysiraichi/88/base -> origin/gh/ysiraichi/88/base 2025-08-14T21:18:06.2512495Z * [new branch] gh/ysiraichi/88/head -> origin/gh/ysiraichi/88/head 2025-08-14T21:18:06.2512675Z * [new branch] gh/ysiraichi/88/orig -> origin/gh/ysiraichi/88/orig 2025-08-14T21:18:06.2516564Z * [new branch] gh/yuguo68/1/base -> origin/gh/yuguo68/1/base 2025-08-14T21:18:06.2516711Z * [new branch] gh/yuguo68/1/head -> origin/gh/yuguo68/1/head 2025-08-14T21:18:06.2516825Z * [new branch] gh/yuguo68/1/orig -> origin/gh/yuguo68/1/orig 2025-08-14T21:18:06.2516947Z * [new branch] gh/yuguo68/2/base -> origin/gh/yuguo68/2/base 2025-08-14T21:18:06.2517072Z * [new branch] gh/yuguo68/2/head -> origin/gh/yuguo68/2/head 2025-08-14T21:18:06.2519803Z * [new branch] gh/yuguo68/2/orig -> origin/gh/yuguo68/2/orig 2025-08-14T21:18:06.2520012Z * [new branch] gh/zhxchen17/25/base -> origin/gh/zhxchen17/25/base 2025-08-14T21:18:06.2520140Z * [new branch] gh/zhxchen17/25/head -> origin/gh/zhxchen17/25/head 2025-08-14T21:18:06.2520274Z * [new branch] gh/zhxchen17/25/orig -> origin/gh/zhxchen17/25/orig 2025-08-14T21:18:06.2522385Z * [new branch] gh/zhxchen17/31/base -> origin/gh/zhxchen17/31/base 2025-08-14T21:18:06.2522629Z * [new branch] gh/zhxchen17/31/head -> origin/gh/zhxchen17/31/head 2025-08-14T21:18:06.2522768Z * [new branch] gh/zhxchen17/31/orig -> origin/gh/zhxchen17/31/orig 2025-08-14T21:18:06.2522915Z * [new branch] gh/zhxchen17/33/base -> origin/gh/zhxchen17/33/base 2025-08-14T21:18:06.2526113Z * [new branch] gh/zhxchen17/33/head -> origin/gh/zhxchen17/33/head 2025-08-14T21:18:06.2526247Z * [new branch] gh/zhxchen17/33/orig -> origin/gh/zhxchen17/33/orig 2025-08-14T21:18:06.2526515Z * [new branch] gh/zhxchen17/34/base -> origin/gh/zhxchen17/34/base 2025-08-14T21:18:06.2526778Z * [new branch] gh/zhxchen17/34/head -> origin/gh/zhxchen17/34/head 2025-08-14T21:18:06.2526905Z * [new branch] gh/zhxchen17/35/base -> origin/gh/zhxchen17/35/base 2025-08-14T21:18:06.2527200Z * [new branch] gh/zhxchen17/35/head -> origin/gh/zhxchen17/35/head 2025-08-14T21:18:06.2529389Z * [new branch] gh/zhxchen17/36/base -> origin/gh/zhxchen17/36/base 2025-08-14T21:18:06.2529545Z * [new branch] gh/zhxchen17/36/head -> origin/gh/zhxchen17/36/head 2025-08-14T21:18:06.2529689Z * [new branch] gh/zhxchen17/36/orig -> origin/gh/zhxchen17/36/orig 2025-08-14T21:18:06.2530226Z * [new branch] gh/zklaus/1/base -> origin/gh/zklaus/1/base 2025-08-14T21:18:06.2530815Z * [new branch] gh/zklaus/1/head -> origin/gh/zklaus/1/head 2025-08-14T21:18:06.2531344Z * [new branch] gh/zklaus/1/orig -> origin/gh/zklaus/1/orig 2025-08-14T21:18:06.2534480Z * [new branch] gh/zklaus/10/base -> origin/gh/zklaus/10/base 2025-08-14T21:18:06.2534621Z * [new branch] gh/zklaus/10/head -> origin/gh/zklaus/10/head 2025-08-14T21:18:06.2534742Z * [new branch] gh/zklaus/10/orig -> origin/gh/zklaus/10/orig 2025-08-14T21:18:06.2534852Z * [new branch] gh/zklaus/11/base -> origin/gh/zklaus/11/base 2025-08-14T21:18:06.2534966Z * [new branch] gh/zklaus/11/head -> origin/gh/zklaus/11/head 2025-08-14T21:18:06.2535708Z * [new branch] gh/zklaus/11/orig -> origin/gh/zklaus/11/orig 2025-08-14T21:18:06.2536545Z * [new branch] gh/zklaus/12/base -> origin/gh/zklaus/12/base 2025-08-14T21:18:06.2536825Z * [new branch] gh/zklaus/12/head -> origin/gh/zklaus/12/head 2025-08-14T21:18:06.2538034Z * [new branch] gh/zklaus/12/orig -> origin/gh/zklaus/12/orig 2025-08-14T21:18:06.2538263Z * [new branch] gh/zklaus/14/base -> origin/gh/zklaus/14/base 2025-08-14T21:18:06.2538822Z * [new branch] gh/zklaus/14/head -> origin/gh/zklaus/14/head 2025-08-14T21:18:06.2541588Z * [new branch] gh/zklaus/14/orig -> origin/gh/zklaus/14/orig 2025-08-14T21:18:06.2541889Z * [new branch] gh/zklaus/15/base -> origin/gh/zklaus/15/base 2025-08-14T21:18:06.2542031Z * [new branch] gh/zklaus/15/head -> origin/gh/zklaus/15/head 2025-08-14T21:18:06.2542247Z * [new branch] gh/zklaus/15/orig -> origin/gh/zklaus/15/orig 2025-08-14T21:18:06.2543018Z * [new branch] gh/zklaus/16/base -> origin/gh/zklaus/16/base 2025-08-14T21:18:06.2543225Z * [new branch] gh/zklaus/16/head -> origin/gh/zklaus/16/head 2025-08-14T21:18:06.2547252Z * [new branch] gh/zklaus/16/orig -> origin/gh/zklaus/16/orig 2025-08-14T21:18:06.2547509Z * [new branch] gh/zklaus/17/base -> origin/gh/zklaus/17/base 2025-08-14T21:18:06.2551020Z * [new branch] gh/zklaus/17/head -> origin/gh/zklaus/17/head 2025-08-14T21:18:06.2552909Z * [new branch] gh/zklaus/17/orig -> origin/gh/zklaus/17/orig 2025-08-14T21:18:06.2553144Z * [new branch] gh/zklaus/18/base -> origin/gh/zklaus/18/base 2025-08-14T21:18:06.2557558Z * [new branch] gh/zklaus/18/head -> origin/gh/zklaus/18/head 2025-08-14T21:18:06.2560919Z * [new branch] gh/zklaus/18/orig -> origin/gh/zklaus/18/orig 2025-08-14T21:18:06.2562798Z * [new branch] gh/zklaus/19/base -> origin/gh/zklaus/19/base 2025-08-14T21:18:06.2563047Z * [new branch] gh/zklaus/19/head -> origin/gh/zklaus/19/head 2025-08-14T21:18:06.2567201Z * [new branch] gh/zklaus/19/orig -> origin/gh/zklaus/19/orig 2025-08-14T21:18:06.2567655Z * [new branch] gh/zklaus/7/base -> origin/gh/zklaus/7/base 2025-08-14T21:18:06.2567921Z * [new branch] gh/zklaus/7/head -> origin/gh/zklaus/7/head 2025-08-14T21:18:06.2568147Z * [new branch] gh/zklaus/7/orig -> origin/gh/zklaus/7/orig 2025-08-14T21:18:06.2568670Z * [new branch] gh/zklaus/9/base -> origin/gh/zklaus/9/base 2025-08-14T21:18:06.2572951Z * [new branch] gh/zklaus/9/head -> origin/gh/zklaus/9/head 2025-08-14T21:18:06.2576424Z * [new branch] gh/zklaus/9/orig -> origin/gh/zklaus/9/orig 2025-08-14T21:18:06.2579966Z * [new branch] gh/zou3519/1175/base -> origin/gh/zou3519/1175/base 2025-08-14T21:18:06.2580253Z * [new branch] gh/zou3519/1175/head -> origin/gh/zou3519/1175/head 2025-08-14T21:18:06.2580395Z * [new branch] gh/zou3519/1175/orig -> origin/gh/zou3519/1175/orig 2025-08-14T21:18:06.2580610Z * [new branch] gh/zou3519/1177/base -> origin/gh/zou3519/1177/base 2025-08-14T21:18:06.2580745Z * [new branch] gh/zou3519/1177/head -> origin/gh/zou3519/1177/head 2025-08-14T21:18:06.2581300Z * [new branch] gh/zou3519/1177/orig -> origin/gh/zou3519/1177/orig 2025-08-14T21:18:06.2581479Z * [new branch] gh/zou3519/1187/base -> origin/gh/zou3519/1187/base 2025-08-14T21:18:06.2581595Z * [new branch] gh/zou3519/1187/head -> origin/gh/zou3519/1187/head 2025-08-14T21:18:06.2581990Z * [new branch] gh/zou3519/1187/orig -> origin/gh/zou3519/1187/orig 2025-08-14T21:18:06.2582126Z * [new branch] gh/zou3519/1188/base -> origin/gh/zou3519/1188/base 2025-08-14T21:18:06.2582245Z * [new branch] gh/zou3519/1188/head -> origin/gh/zou3519/1188/head 2025-08-14T21:18:06.2582365Z * [new branch] gh/zou3519/1188/orig -> origin/gh/zou3519/1188/orig 2025-08-14T21:18:06.2582494Z * [new branch] gh/zou3519/1189/base -> origin/gh/zou3519/1189/base 2025-08-14T21:18:06.2582606Z * [new branch] gh/zou3519/1189/head -> origin/gh/zou3519/1189/head 2025-08-14T21:18:06.2582724Z * [new branch] gh/zou3519/1189/orig -> origin/gh/zou3519/1189/orig 2025-08-14T21:18:06.2582835Z * [new branch] gh/zou3519/1190/base -> origin/gh/zou3519/1190/base 2025-08-14T21:18:06.2582958Z * [new branch] gh/zou3519/1190/head -> origin/gh/zou3519/1190/head 2025-08-14T21:18:06.2583074Z * [new branch] gh/zou3519/1190/orig -> origin/gh/zou3519/1190/orig 2025-08-14T21:18:06.2583188Z * [new branch] gh/zou3519/1191/base -> origin/gh/zou3519/1191/base 2025-08-14T21:18:06.2583305Z * [new branch] gh/zou3519/1191/head -> origin/gh/zou3519/1191/head 2025-08-14T21:18:06.2583416Z * [new branch] gh/zou3519/1191/orig -> origin/gh/zou3519/1191/orig 2025-08-14T21:18:06.2583547Z * [new branch] gh/zpcore/1/base -> origin/gh/zpcore/1/base 2025-08-14T21:18:06.2583672Z * [new branch] gh/zpcore/1/head -> origin/gh/zpcore/1/head 2025-08-14T21:18:06.2583792Z * [new branch] gh/zpcore/10/base -> origin/gh/zpcore/10/base 2025-08-14T21:18:06.2583910Z * [new branch] gh/zpcore/10/head -> origin/gh/zpcore/10/head 2025-08-14T21:18:06.2584020Z * [new branch] gh/zpcore/10/orig -> origin/gh/zpcore/10/orig 2025-08-14T21:18:06.2584131Z * [new branch] gh/zpcore/11/base -> origin/gh/zpcore/11/base 2025-08-14T21:18:06.2584347Z * [new branch] gh/zpcore/11/head -> origin/gh/zpcore/11/head 2025-08-14T21:18:06.2584463Z * [new branch] gh/zpcore/11/orig -> origin/gh/zpcore/11/orig 2025-08-14T21:18:06.2584580Z * [new branch] gh/zpcore/12/base -> origin/gh/zpcore/12/base 2025-08-14T21:18:06.2584860Z * [new branch] gh/zpcore/12/head -> origin/gh/zpcore/12/head 2025-08-14T21:18:06.2584973Z * [new branch] gh/zpcore/12/orig -> origin/gh/zpcore/12/orig 2025-08-14T21:18:06.2585093Z * [new branch] gh/zpcore/2/base -> origin/gh/zpcore/2/base 2025-08-14T21:18:06.2585205Z * [new branch] gh/zpcore/2/head -> origin/gh/zpcore/2/head 2025-08-14T21:18:06.2585314Z * [new branch] gh/zpcore/3/base -> origin/gh/zpcore/3/base 2025-08-14T21:18:06.2585432Z * [new branch] gh/zpcore/3/head -> origin/gh/zpcore/3/head 2025-08-14T21:18:06.2585540Z * [new branch] gh/zpcore/4/base -> origin/gh/zpcore/4/base 2025-08-14T21:18:06.2585666Z * [new branch] gh/zpcore/4/head -> origin/gh/zpcore/4/head 2025-08-14T21:18:06.2585775Z * [new branch] gh/zpcore/5/base -> origin/gh/zpcore/5/base 2025-08-14T21:18:06.2585887Z * [new branch] gh/zpcore/5/head -> origin/gh/zpcore/5/head 2025-08-14T21:18:06.2586005Z * [new branch] gh/zpcore/6/base -> origin/gh/zpcore/6/base 2025-08-14T21:18:06.2586112Z * [new branch] gh/zpcore/6/head -> origin/gh/zpcore/6/head 2025-08-14T21:18:06.2586224Z * [new branch] gh/zpcore/7/base -> origin/gh/zpcore/7/base 2025-08-14T21:18:06.2586336Z * [new branch] gh/zpcore/7/head -> origin/gh/zpcore/7/head 2025-08-14T21:18:06.2587942Z * [new branch] gh/zpcore/8/base -> origin/gh/zpcore/8/base 2025-08-14T21:18:06.2588273Z * [new branch] gh/zpcore/8/head -> origin/gh/zpcore/8/head 2025-08-14T21:18:06.2588696Z * [new branch] gh/zpcore/9/head -> origin/gh/zpcore/9/head 2025-08-14T21:18:06.2592061Z * [new branch] gh/zpcore/9/orig -> origin/gh/zpcore/9/orig 2025-08-14T21:18:06.2592362Z * [new branch] google-main -> origin/google-main 2025-08-14T21:18:06.2592530Z * [new branch] guangyey/external_stream -> origin/guangyey/external_stream 2025-08-14T21:18:06.2592668Z * [new branch] guangyey/host_alloc -> origin/guangyey/host_alloc 2025-08-14T21:18:06.2592909Z * [new branch] guangyey/test_2025 -> origin/guangyey/test_2025 2025-08-14T21:18:06.2593953Z * [new branch] guilhermeleobas/cherry-pick-55d87d9dfd9 -> origin/guilhermeleobas/cherry-pick-55d87d9dfd9 2025-08-14T21:18:06.2594441Z * [new branch] haozhe/bf16-dynamic-shape -> origin/haozhe/bf16-dynamic-shape 2025-08-14T21:18:06.2596609Z * [new branch] hc_baseline -> origin/hc_baseline 2025-08-14T21:18:06.2596941Z * [new branch] headeronlyScalarType -> origin/headeronlyScalarType 2025-08-14T21:18:06.2597149Z * [new branch] hf_update -> origin/hf_update 2025-08-14T21:18:06.2597403Z * [new branch] hhh_decomp_mul -> origin/hhh_decomp_mul 2025-08-14T21:18:06.2597734Z * [new branch] hhh_rand -> origin/hhh_rand 2025-08-14T21:18:06.2599191Z * [new branch] hoy/mmsplitk -> origin/hoy/mmsplitk 2025-08-14T21:18:06.2599501Z * [new branch] hoy/triton-PR3973 -> origin/hoy/triton-PR3973 2025-08-14T21:18:06.2599920Z * [new branch] hoy/triton-coalescing-baseline -> origin/hoy/triton-coalescing-baseline 2025-08-14T21:18:06.2601209Z * [new branch] hoy/triton-coalescing-min -> origin/hoy/triton-coalescing-min 2025-08-14T21:18:06.2601482Z * [new branch] hoy/triton-coalescing-new -> origin/hoy/triton-coalescing-new 2025-08-14T21:18:06.2601902Z * [new branch] hoy/triton-coalescing-vec -> origin/hoy/triton-coalescing-vec 2025-08-14T21:18:06.2602808Z * [new branch] inductordecompfix -> origin/inductordecompfix 2025-08-14T21:18:06.2603481Z * [new branch] inline -> origin/inline 2025-08-14T21:18:06.2603875Z * [new branch] inlining -> origin/inlining 2025-08-14T21:18:06.2605887Z * [new branch] inlining-ezyang -> origin/inlining-ezyang 2025-08-14T21:18:06.2606178Z * [new branch] int8_sdpa -> origin/int8_sdpa 2025-08-14T21:18:06.2606346Z * [new branch] invoke-subgraph -> origin/invoke-subgraph 2025-08-14T21:18:06.2606622Z * [new branch] issue#58739 -> origin/issue#58739 2025-08-14T21:18:06.2607956Z * [new branch] issue-154849 -> origin/issue-154849 2025-08-14T21:18:06.2608439Z * [new branch] ivanov/cherry-pick-ckpt-fixes -> origin/ivanov/cherry-pick-ckpt-fixes 2025-08-14T21:18:06.2609834Z * [new branch] jcaip/test-cusparselt-version-0.6.2 -> origin/jcaip/test-cusparselt-version-0.6.2 2025-08-14T21:18:06.2610144Z * [new branch] jcaip/update-cusparselt-0.6.2 -> origin/jcaip/update-cusparselt-0.6.2 2025-08-14T21:18:06.2611858Z * [new branch] jithunnair-amd-patch-1 -> origin/jithunnair-amd-patch-1 2025-08-14T21:18:06.2612197Z * [new branch] justinchu/attention-tests -> origin/justinchu/attention-tests 2025-08-14T21:18:06.2612355Z * [new branch] justinchu/native-qdq -> origin/justinchu/native-qdq 2025-08-14T21:18:06.2616592Z * [new branch] justinchuby/JitScalarType -> origin/justinchuby/JitScalarType 2025-08-14T21:18:06.2616935Z * [new branch] justinchuby/dynamo-true -> origin/justinchuby/dynamo-true 2025-08-14T21:18:06.2617156Z * [new branch] justinchuby/opset-20 -> origin/justinchuby/opset-20 2025-08-14T21:18:06.2617324Z * [new branch] kainan666/xlf_debug -> origin/kainan666/xlf_debug 2025-08-14T21:18:06.2617831Z * [new branch] kainan_test -> origin/kainan_test 2025-08-14T21:18:06.2618037Z * [new branch] leslie/enable_poc_reduction_fusion -> origin/leslie/enable_poc_reduction_fusion 2025-08-14T21:18:06.2618220Z * [new branch] leslie/test_group_gemm_epilogues -> origin/leslie/test_group_gemm_epilogues 2025-08-14T21:18:06.2618426Z * [new branch] lessw2020/fix_cutlass_cache_error -> origin/lessw2020/fix_cutlass_cache_error 2025-08-14T21:18:06.2619642Z * [new branch] liaoxuan/shm_all_reduce -> origin/liaoxuan/shm_all_reduce 2025-08-14T21:18:06.2619823Z * [new branch] liaoxuan/tags_issue -> origin/liaoxuan/tags_issue 2025-08-14T21:18:06.2621940Z * [new branch] liaoxuan/test_fa_disable_softmax -> origin/liaoxuan/test_fa_disable_softmax 2025-08-14T21:18:06.2622169Z * [new branch] liaoxuan/test_int8_sdpa -> origin/liaoxuan/test_int8_sdpa 2025-08-14T21:18:06.2622325Z * [new branch] lintbuilddocker -> origin/lintbuilddocker 2025-08-14T21:18:06.2622627Z * [new branch] llama4-stable -> origin/llama4-stable 2025-08-14T21:18:06.2622900Z * [new branch] logdetfix -> origin/logdetfix 2025-08-14T21:18:06.2626186Z * [new branch] lts/release/1.8 -> origin/lts/release/1.8 2025-08-14T21:18:06.2629753Z * [new branch] lucaskabela/#94773 -> origin/lucaskabela/#94773 2025-08-14T21:18:06.2634192Z * [new branch] lucaskabela/fix_157452 -> origin/lucaskabela/fix_157452 2025-08-14T21:18:06.2637661Z * [new branch] lucaskabela/fix_circular_import_158120 -> origin/lucaskabela/fix_circular_import_158120 2025-08-14T21:18:06.2641777Z * [new branch] lucaskabela/func_under_decomp -> origin/lucaskabela/func_under_decomp 2025-08-14T21:18:06.2645895Z * [new branch] lucaskabela/functional_in_dynamo -> origin/lucaskabela/functional_in_dynamo 2025-08-14T21:18:06.2649386Z * [new branch] lucaskabela/install_params_as_graph_attr -> origin/lucaskabela/install_params_as_graph_attr 2025-08-14T21:18:06.2652924Z * [new branch] lucaskabela/issue_120648 -> origin/lucaskabela/issue_120648 2025-08-14T21:18:06.2656403Z * [new branch] lucaskabela/parameters_as_graph_attr -> origin/lucaskabela/parameters_as_graph_attr 2025-08-14T21:18:06.2656727Z * [new branch] lucaskabela/registry_fix -> origin/lucaskabela/registry_fix 2025-08-14T21:18:06.2657046Z * [new branch] lucaskabela/remove_aot_dispatcher_metadata -> origin/lucaskabela/remove_aot_dispatcher_metadata 2025-08-14T21:18:06.2657328Z * [new branch] lucaskabela/type_guards -> origin/lucaskabela/type_guards 2025-08-14T21:18:06.2657883Z * [new branch] lucaskabela/typing-misc -> origin/lucaskabela/typing-misc 2025-08-14T21:18:06.2658169Z * [new branch] lucaskabela/typing_backends -> origin/lucaskabela/typing_backends 2025-08-14T21:18:06.2658419Z * [new branch] lucaskabela/typing_bytecode_analysis_transform -> origin/lucaskabela/typing_bytecode_analysis_transform 2025-08-14T21:18:06.2658585Z * [new branch] lucaskabela/typing_cache_files -> origin/lucaskabela/typing_cache_files 2025-08-14T21:18:06.2658763Z * [new branch] lucaskabela/typing_compile_autograd -> origin/lucaskabela/typing_compile_autograd 2025-08-14T21:18:06.2659060Z * [new branch] lucaskabela/typing_debug_utils.py -> origin/lucaskabela/typing_debug_utils.py 2025-08-14T21:18:06.2659227Z * [new branch] lucaskabela/typing_decorators -> origin/lucaskabela/typing_decorators 2025-08-14T21:18:06.2659375Z * [new branch] lucaskabela/typing_eval_frame -> origin/lucaskabela/typing_eval_frame 2025-08-14T21:18:06.2659537Z * [new branch] lucaskabela/typing_for_codegen -> origin/lucaskabela/typing_for_codegen 2025-08-14T21:18:06.2659702Z * [new branch] lucaskabela/typing_output_graph -> origin/lucaskabela/typing_output_graph 2025-08-14T21:18:06.2659855Z * [new branch] lucaskabela/typing_side_effects -> origin/lucaskabela/typing_side_effects 2025-08-14T21:18:06.2660012Z * [new branch] lucaskabela/typing_source_guard -> origin/lucaskabela/typing_source_guard 2025-08-14T21:18:06.2660159Z * [new branch] lucaskabela/typing_trace_rules -> origin/lucaskabela/typing_trace_rules 2025-08-14T21:18:06.2660313Z * [new branch] lucaskabela/typing_utils.py -> origin/lucaskabela/typing_utils.py 2025-08-14T21:18:06.2660494Z * [new branch] lucaskabela/typing_utils_improvements -> origin/lucaskabela/typing_utils_improvements 2025-08-14T21:18:06.2660599Z * [new branch] main -> origin/main 2025-08-14T21:18:06.2660795Z * [new branch] main-enable-b200-distributed-tests -> origin/main-enable-b200-distributed-tests 2025-08-14T21:18:06.2660921Z * [new branch] malfet-patch-1 -> origin/malfet-patch-1 2025-08-14T21:18:06.2661050Z * [new branch] malfet-patch-10 -> origin/malfet-patch-10 2025-08-14T21:18:06.2661165Z * [new branch] malfet-patch-11 -> origin/malfet-patch-11 2025-08-14T21:18:06.2661276Z * [new branch] malfet-patch-13 -> origin/malfet-patch-13 2025-08-14T21:18:06.2661392Z * [new branch] malfet-patch-14 -> origin/malfet-patch-14 2025-08-14T21:18:06.2661508Z * [new branch] malfet-patch-2 -> origin/malfet-patch-2 2025-08-14T21:18:06.2661616Z * [new branch] malfet-patch-3 -> origin/malfet-patch-3 2025-08-14T21:18:06.2661741Z * [new branch] malfet-patch-4 -> origin/malfet-patch-4 2025-08-14T21:18:06.2661848Z * [new branch] malfet-patch-5 -> origin/malfet-patch-5 2025-08-14T21:18:06.2662016Z * [new branch] malfet-patch-6 -> origin/malfet-patch-6 2025-08-14T21:18:06.2662124Z * [new branch] malfet-patch-7 -> origin/malfet-patch-7 2025-08-14T21:18:06.2662229Z * [new branch] malfet-patch-8 -> origin/malfet-patch-8 2025-08-14T21:18:06.2662343Z * [new branch] malfet-patch-9 -> origin/malfet-patch-9 2025-08-14T21:18:06.2662492Z * [new branch] malfet/delete-upsteam-cuda -> origin/malfet/delete-upsteam-cuda 2025-08-14T21:18:06.2662648Z * [new branch] malfet/mps-implement-col2im -> origin/malfet/mps-implement-col2im 2025-08-14T21:18:06.2662820Z * [new branch] manuel/fix_multidim_boolean_indexing -> origin/manuel/fix_multidim_boolean_indexing 2025-08-14T21:18:06.2662952Z * [new branch] manuel/np_empty_ellipsis -> origin/manuel/np_empty_ellipsis 2025-08-14T21:18:06.2663138Z * [new branch] manuel/test-ops-common-allow-mps -> origin/manuel/test-ops-common-allow-mps 2025-08-14T21:18:06.2663265Z * [new branch] metascroy-patch-1 -> origin/metascroy-patch-1 2025-08-14T21:18:06.2663399Z * [new branch] mlazos/S429861-debug -> origin/mlazos/S429861-debug 2025-08-14T21:18:06.2663504Z * [new branch] mlazos/aa -> origin/mlazos/aa 2025-08-14T21:18:06.2663626Z * [new branch] mlazos/arg-renames -> origin/mlazos/arg-renames 2025-08-14T21:18:06.2663856Z * [new branch] mlazos/backup-test-branch -> origin/mlazos/backup-test-branch 2025-08-14T21:18:06.2663991Z * [new branch] mlazos/bad-cudagraphs -> origin/mlazos/bad-cudagraphs 2025-08-14T21:18:06.2664111Z * [new branch] mlazos/baseline -> origin/mlazos/baseline 2025-08-14T21:18:06.2664364Z * [new branch] mlazos/baseline-graph-breaks -> origin/mlazos/baseline-graph-breaks 2025-08-14T21:18:06.2664497Z * [new branch] mlazos/beta-tensor -> origin/mlazos/beta-tensor 2025-08-14T21:18:06.2664627Z * [new branch] mlazos/buffers -> origin/mlazos/buffers 2025-08-14T21:18:06.2664739Z * [new branch] mlazos/buffers2 -> origin/mlazos/buffers2 2025-08-14T21:18:06.2664857Z * [new branch] mlazos/buffers3 -> origin/mlazos/buffers3 2025-08-14T21:18:06.2664962Z * [new branch] mlazos/ck2 -> origin/mlazos/ck2 2025-08-14T21:18:06.2665140Z * [new branch] mlazos/combokernels -> origin/mlazos/combokernels 2025-08-14T21:18:06.2667083Z * [new branch] mlazos/ctx-cleanup -> origin/mlazos/ctx-cleanup 2025-08-14T21:18:06.2667412Z * [new branch] mlazos/cudagraph-tests -> origin/mlazos/cudagraph-tests 2025-08-14T21:18:06.2667643Z * [new branch] mlazos/cudagraphs-measurement -> origin/mlazos/cudagraphs-measurement 2025-08-14T21:18:06.2667936Z * [new branch] mlazos/cutlass-test -> origin/mlazos/cutlass-test 2025-08-14T21:18:06.2668846Z * [new branch] mlazos/cutlass-topo-bug -> origin/mlazos/cutlass-topo-bug 2025-08-14T21:18:06.2669275Z * [new branch] mlazos/data-gather -> origin/mlazos/data-gather 2025-08-14T21:18:06.2670787Z * [new branch] mlazos/data-ptrs2 -> origin/mlazos/data-ptrs2 2025-08-14T21:18:06.2671076Z * [new branch] mlazos/data-ptrs3 -> origin/mlazos/data-ptrs3 2025-08-14T21:18:06.2671259Z * [new branch] mlazos/dataclass-proxy -> origin/mlazos/dataclass-proxy 2025-08-14T21:18:06.2671741Z * [new branch] mlazos/dc-attrs -> origin/mlazos/dc-attrs 2025-08-14T21:18:06.2672454Z * [new branch] mlazos/dc-helion -> origin/mlazos/dc-helion 2025-08-14T21:18:06.2672791Z * [new branch] mlazos/dict-fix -> origin/mlazos/dict-fix 2025-08-14T21:18:06.2674979Z * [new branch] mlazos/disable-closures -> origin/mlazos/disable-closures 2025-08-14T21:18:06.2675272Z * [new branch] mlazos/disable-tf -> origin/mlazos/disable-tf 2025-08-14T21:18:06.2675427Z * [new branch] mlazos/dupe-fix -> origin/mlazos/dupe-fix 2025-08-14T21:18:06.2675625Z * [new branch] mlazos/dyn-batch -> origin/mlazos/dyn-batch 2025-08-14T21:18:06.2675758Z * [new branch] mlazos/evt -> origin/mlazos/evt 2025-08-14T21:18:06.2679088Z * [new branch] mlazos/exp_disable -> origin/mlazos/exp_disable 2025-08-14T21:18:06.2679422Z * [new branch] mlazos/extract-examples -> origin/mlazos/extract-examples 2025-08-14T21:18:06.2679635Z * [new branch] mlazos/foreach-op -> origin/mlazos/foreach-op 2025-08-14T21:18:06.2679790Z * [new branch] mlazos/fp8 -> origin/mlazos/fp8 2025-08-14T21:18:06.2679993Z * [new branch] mlazos/fp8-bias -> origin/mlazos/fp8-bias 2025-08-14T21:18:06.2680144Z * [new branch] mlazos/fp8-bias-fusion -> origin/mlazos/fp8-bias-fusion 2025-08-14T21:18:06.2680742Z * [new branch] mlazos/freezing -> origin/mlazos/freezing 2025-08-14T21:18:06.2680899Z * [new branch] mlazos/h-comp -> origin/mlazos/h-comp 2025-08-14T21:18:06.2681356Z * [new branch] mlazos/h-comp2 -> origin/mlazos/h-comp2 2025-08-14T21:18:06.2682547Z * [new branch] mlazos/hash-hop -> origin/mlazos/hash-hop 2025-08-14T21:18:06.2682688Z * [new branch] mlazos/hc -> origin/mlazos/hc 2025-08-14T21:18:06.2683178Z * [new branch] mlazos/hc-cycles -> origin/mlazos/hc-cycles 2025-08-14T21:18:06.2684295Z * [new branch] mlazos/hc-fixes -> origin/mlazos/hc-fixes 2025-08-14T21:18:06.2684458Z * [new branch] mlazos/hc-fixes3 -> origin/mlazos/hc-fixes3 2025-08-14T21:18:06.2685063Z * [new branch] mlazos/hc-fixes4 -> origin/mlazos/hc-fixes4 2025-08-14T21:18:06.2686437Z * [new branch] mlazos/hc-hf -> origin/mlazos/hc-hf 2025-08-14T21:18:06.2686724Z * [new branch] mlazos/hc-mut -> origin/mlazos/hc-mut 2025-08-14T21:18:06.2686916Z * [new branch] mlazos/hc10 -> origin/mlazos/hc10 2025-08-14T21:18:06.2689192Z * [new branch] mlazos/hc11 -> origin/mlazos/hc11 2025-08-14T21:18:06.2689469Z * [new branch] mlazos/hc12 -> origin/mlazos/hc12 2025-08-14T21:18:06.2689597Z * [new branch] mlazos/hc13 -> origin/mlazos/hc13 2025-08-14T21:18:06.2689819Z * [new branch] mlazos/hc14 -> origin/mlazos/hc14 2025-08-14T21:18:06.2691106Z * [new branch] mlazos/hc15 -> origin/mlazos/hc15 2025-08-14T21:18:06.2691243Z * [new branch] mlazos/hc2 -> origin/mlazos/hc2 2025-08-14T21:18:06.2691572Z * [new branch] mlazos/hc4 -> origin/mlazos/hc4 2025-08-14T21:18:06.2692723Z * [new branch] mlazos/hc5 -> origin/mlazos/hc5 2025-08-14T21:18:06.2693012Z * [new branch] mlazos/hc6 -> origin/mlazos/hc6 2025-08-14T21:18:06.2693404Z * [new branch] mlazos/hc7 -> origin/mlazos/hc7 2025-08-14T21:18:06.2694936Z * [new branch] mlazos/hc8 -> origin/mlazos/hc8 2025-08-14T21:18:06.2695246Z * [new branch] mlazos/hc9 -> origin/mlazos/hc9 2025-08-14T21:18:06.2695403Z * [new branch] mlazos/hc_baseline2 -> origin/mlazos/hc_baseline2 2025-08-14T21:18:06.2695813Z * [new branch] mlazos/hop-modes -> origin/mlazos/hop-modes 2025-08-14T21:18:06.2696596Z * [new branch] mlazos/init-per-param -> origin/mlazos/init-per-param 2025-08-14T21:18:06.2697046Z * [new branch] mlazos/init_per_param -> origin/mlazos/init_per_param 2025-08-14T21:18:06.2698081Z * [new branch] mlazos/less-guards -> origin/mlazos/less-guards 2025-08-14T21:18:06.2698687Z * [new branch] mlazos/lr-composibility -> origin/mlazos/lr-composibility 2025-08-14T21:18:06.2698916Z * [new branch] mlazos/main -> origin/mlazos/main 2025-08-14T21:18:06.2700191Z * [new branch] mlazos/main-test-enablement -> origin/mlazos/main-test-enablement 2025-08-14T21:18:06.2700331Z * [new branch] mlazos/main2 -> origin/mlazos/main2 2025-08-14T21:18:06.2700868Z * [new branch] mlazos/mcg -> origin/mlazos/mcg 2025-08-14T21:18:06.2701405Z * [new branch] mlazos/mcg2 -> origin/mlazos/mcg2 2025-08-14T21:18:06.2702138Z * [new branch] mlazos/meta-guards -> origin/mlazos/meta-guards 2025-08-14T21:18:06.2703023Z * [new branch] mlazos/mlazos/ck2 -> origin/mlazos/mlazos/ck2 2025-08-14T21:18:06.2703424Z * [new branch] mlazos/mlazos/foreach-map-adam -> origin/mlazos/mlazos/foreach-map-adam 2025-08-14T21:18:06.2704447Z * [new branch] mlazos/mlazos/tf-mode-backup -> origin/mlazos/mlazos/tf-mode-backup 2025-08-14T21:18:06.2704579Z * [new branch] mlazos/mod-fix -> origin/mlazos/mod-fix 2025-08-14T21:18:06.2707540Z * [new branch] mlazos/mode-fix -> origin/mlazos/mode-fix 2025-08-14T21:18:06.2707756Z * [new branch] mlazos/more-tests -> origin/mlazos/more-tests 2025-08-14T21:18:06.2707886Z * [new branch] mlazos/nested-dc -> origin/mlazos/nested-dc 2025-08-14T21:18:06.2708003Z * [new branch] mlazos/no-cpp -> origin/mlazos/no-cpp 2025-08-14T21:18:06.2711331Z * [new branch] mlazos/no-init-group-handling -> origin/mlazos/no-init-group-handling 2025-08-14T21:18:06.2714838Z * [new branch] mlazos/offsets -> origin/mlazos/offsets 2025-08-14T21:18:06.2718359Z * [new branch] mlazos/opt-bench-exp2 -> origin/mlazos/opt-bench-exp2 2025-08-14T21:18:06.2721911Z * [new branch] mlazos/opt-incr -> origin/mlazos/opt-incr 2025-08-14T21:18:06.2725436Z * [new branch] mlazos/proxy-ctors -> origin/mlazos/proxy-ctors 2025-08-14T21:18:06.2728997Z * [new branch] mlazos/proxy-opt -> origin/mlazos/proxy-opt 2025-08-14T21:18:06.2732532Z * [new branch] mlazos/quant-fix -> origin/mlazos/quant-fix 2025-08-14T21:18:06.2732837Z * [new branch] mlazos/rm-buf-names -> origin/mlazos/rm-buf-names 2025-08-14T21:18:06.2733063Z * [new branch] mlazos/rm-spam -> origin/mlazos/rm-spam 2025-08-14T21:18:06.2733189Z * [new branch] mlazos/rtp -> origin/mlazos/rtp 2025-08-14T21:18:06.2733414Z * [new branch] mlazos/static-idx-dbg -> origin/mlazos/static-idx-dbg 2025-08-14T21:18:06.2733967Z * [new branch] mlazos/static-inputs-log -> origin/mlazos/static-inputs-log 2025-08-14T21:18:06.2737748Z * [new branch] mlazos/sub-param-fix -> origin/mlazos/sub-param-fix 2025-08-14T21:18:06.2738048Z * [new branch] mlazos/td-fix2 -> origin/mlazos/td-fix2 2025-08-14T21:18:06.2738285Z * [new branch] mlazos/tensor-hasattr2 -> origin/mlazos/tensor-hasattr2 2025-08-14T21:18:06.2738473Z * [new branch] mlazos/test -> origin/mlazos/test 2025-08-14T21:18:06.2738612Z * [new branch] mlazos/tf-mode -> origin/mlazos/tf-mode 2025-08-14T21:18:06.2738893Z * [new branch] mlazos/tf-mode-backup2 -> origin/mlazos/tf-mode-backup2 2025-08-14T21:18:06.2739040Z * [new branch] mlazos/tf-mode-reland -> origin/mlazos/tf-mode-reland 2025-08-14T21:18:06.2739173Z * [new branch] mlazos/tf-mode-reland2 -> origin/mlazos/tf-mode-reland2 2025-08-14T21:18:06.2739315Z * [new branch] mlazos/tf-mode-reland3 -> origin/mlazos/tf-mode-reland3 2025-08-14T21:18:06.2739447Z * [new branch] mlazos/topo-fix -> origin/mlazos/topo-fix 2025-08-14T21:18:06.2739588Z * [new branch] mlazos/triton-no-epi -> origin/mlazos/triton-no-epi 2025-08-14T21:18:06.2739722Z * [new branch] mlazos/tune-proto -> origin/mlazos/tune-proto 2025-08-14T21:18:06.2739848Z * [new branch] mlazos/tuple-fixes -> origin/mlazos/tuple-fixes 2025-08-14T21:18:06.2739976Z * [new branch] mlazos/tuple-fixes2 -> origin/mlazos/tuple-fixes2 2025-08-14T21:18:06.2740118Z * [new branch] mlazos/tuple-handling -> origin/mlazos/tuple-handling 2025-08-14T21:18:06.2740243Z * [new branch] mlazos/user-streams -> origin/mlazos/user-streams 2025-08-14T21:18:06.2740369Z * [new branch] mlazos/vary-beta -> origin/mlazos/vary-beta 2025-08-14T21:18:06.2740486Z * [new branch] mlazos/vary-beta2 -> origin/mlazos/vary-beta2 2025-08-14T21:18:06.2740609Z * [new branch] mlazos/weird-perf1 -> origin/mlazos/weird-perf1 2025-08-14T21:18:06.2740786Z * [new branch] mm_out_dtype_compile -> origin/mm_out_dtype_compile 2025-08-14T21:18:06.2740912Z * [new branch] modify-setupvllm -> origin/modify-setupvllm 2025-08-14T21:18:06.2741051Z * [new branch] move-theme-out-docker -> origin/move-theme-out-docker 2025-08-14T21:18:06.2741166Z * [new branch] mps-linear-1d -> origin/mps-linear-1d 2025-08-14T21:18:06.2741283Z * [new branch] msaroufim/be1 -> origin/msaroufim/be1 2025-08-14T21:18:06.2741408Z * [new branch] msaroufim/cn_path -> origin/msaroufim/cn_path 2025-08-14T21:18:06.2741559Z * [new branch] msaroufim/dtensorfusedadam -> origin/msaroufim/dtensorfusedadam 2025-08-14T21:18:06.2741676Z * [new branch] msaroufim/reduce -> origin/msaroufim/reduce 2025-08-14T21:18:06.2741797Z * [new branch] mtia/basic-cmake -> origin/mtia/basic-cmake 2025-08-14T21:18:06.2741903Z * [new branch] muon_dev -> origin/muon_dev 2025-08-14T21:18:06.2742043Z * [new branch] new-modifiy-setupvllm -> origin/new-modifiy-setupvllm 2025-08-14T21:18:06.2742158Z * [new branch] new-setupvllm -> origin/new-setupvllm 2025-08-14T21:18:06.2742272Z * [new branch] newtest-base -> origin/newtest-base 2025-08-14T21:18:06.2742396Z * [new branch] ngimel/cat_perf -> origin/ngimel/cat_perf 2025-08-14T21:18:06.2742527Z * [new branch] ngimel/cudamoduleload -> origin/ngimel/cudamoduleload 2025-08-14T21:18:06.2742682Z * [new branch] ngimel/fabric_driver_version -> origin/ngimel/fabric_driver_version 2025-08-14T21:18:06.2742801Z * [new branch] ngimel/fabric_symm -> origin/ngimel/fabric_symm 2025-08-14T21:18:06.2742907Z * [new branch] ngimel/gg_new -> origin/ngimel/gg_new 2025-08-14T21:18:06.2743059Z * [new branch] ngimel/grouped_mm_checks -> origin/ngimel/grouped_mm_checks 2025-08-14T21:18:06.2743182Z * [new branch] ngimel/guardfabric -> origin/ngimel/guardfabric 2025-08-14T21:18:06.2743303Z * [new branch] ngimel/index_None -> origin/ngimel/index_None 2025-08-14T21:18:06.2743417Z * [new branch] ngimel/modeguard -> origin/ngimel/modeguard 2025-08-14T21:18:06.2743578Z * [new branch] ngimel/multicast_fix -> origin/ngimel/multicast_fix 2025-08-14T21:18:06.2743718Z * [new branch] ngimel/unbind_multimem -> origin/ngimel/unbind_multimem 2025-08-14T21:18:06.2743844Z * [new branch] nightly -> origin/nightly 2025-08-14T21:18:06.2744782Z * [new branch] nmacchioni-patch-10 -> origin/nmacchioni-patch-10 2025-08-14T21:18:06.2747235Z * [new branch] nmacchioni-patch-7 -> origin/nmacchioni-patch-7 2025-08-14T21:18:06.2747560Z * [new branch] nmacchioni-patch-8 -> origin/nmacchioni-patch-8 2025-08-14T21:18:06.2747767Z * [new branch] nmacchioni-patch-9 -> origin/nmacchioni-patch-9 2025-08-14T21:18:06.2748008Z * [new branch] nullplay_fuse_matmul -> origin/nullplay_fuse_matmul 2025-08-14T21:18:06.2748442Z * [new branch] nweidia/enable-B200-inductor-nightly-ci -> origin/nweidia/enable-B200-inductor-nightly-ci 2025-08-14T21:18:06.2749351Z * [new branch] one-off -> origin/one-off 2025-08-14T21:18:06.2751139Z * [new branch] orig/release/1.10 -> origin/orig/release/1.10 2025-08-14T21:18:06.2751377Z * [new branch] orig/release/1.11 -> origin/orig/release/1.11 2025-08-14T21:18:06.2754944Z * [new branch] orig/release/1.12 -> origin/orig/release/1.12 2025-08-14T21:18:06.2758469Z * [new branch] orig/release/1.13 -> origin/orig/release/1.13 2025-08-14T21:18:06.2762059Z * [new branch] orig/release/1.6 -> origin/orig/release/1.6 2025-08-14T21:18:06.2765321Z * [new branch] orig/release/1.7 -> origin/orig/release/1.7 2025-08-14T21:18:06.2768761Z * [new branch] orig/release/1.8 -> origin/orig/release/1.8 2025-08-14T21:18:06.2772352Z * [new branch] orig/release/1.9 -> origin/orig/release/1.9 2025-08-14T21:18:06.2772661Z * [new branch] orig/release/2.0 -> origin/orig/release/2.0 2025-08-14T21:18:06.2772827Z * [new branch] orig/release/2.1 -> origin/orig/release/2.1 2025-08-14T21:18:06.2772939Z * [new branch] orig/release/2.2 -> origin/orig/release/2.2 2025-08-14T21:18:06.2773049Z * [new branch] orig/release/2.3 -> origin/orig/release/2.3 2025-08-14T21:18:06.2773171Z * [new branch] orig/release/2.4 -> origin/orig/release/2.4 2025-08-14T21:18:06.2773293Z * [new branch] orig/release/2.5 -> origin/orig/release/2.5 2025-08-14T21:18:06.2773410Z * [new branch] orig/release/2.6 -> origin/orig/release/2.6 2025-08-14T21:18:06.2773523Z * [new branch] orig/release/2.7 -> origin/orig/release/2.7 2025-08-14T21:18:06.2773632Z * [new branch] orig/release/2.8 -> origin/orig/release/2.8 2025-08-14T21:18:06.2773761Z * [new branch] oulgen/fx_graph -> origin/oulgen/fx_graph 2025-08-14T21:18:06.2773883Z * [new branch] padded-tensor -> origin/padded-tensor 2025-08-14T21:18:06.2774013Z * [new branch] parallel_cat -> origin/parallel_cat 2025-08-14T21:18:06.2774114Z * [new branch] pca2 -> origin/pca2 2025-08-14T21:18:06.2774244Z * [new branch] pianpwk-patch-1 -> origin/pianpwk-patch-1 2025-08-14T21:18:06.2774443Z * [new branch] pianpwk/backed_size_oblivious_export -> origin/pianpwk/backed_size_oblivious_export 2025-08-14T21:18:06.2774581Z * [new branch] pianpwk/dde_repeat_cat -> origin/pianpwk/dde_repeat_cat 2025-08-14T21:18:06.2774747Z * [new branch] pianpwk/draft_export_normalize -> origin/pianpwk/draft_export_normalize 2025-08-14T21:18:06.2774902Z * [new branch] pianpwk/dynamic_source_dim -> origin/pianpwk/dynamic_source_dim 2025-08-14T21:18:06.2775194Z * [new branch] pianpwk/invalidate_fake_memo -> origin/pianpwk/invalidate_fake_memo 2025-08-14T21:18:06.2775351Z * [new branch] pianpwk/lru_cache_bound_sympy -> origin/pianpwk/lru_cache_bound_sympy 2025-08-14T21:18:06.2775480Z * [new branch] pianpwk/max_1_strides -> origin/pianpwk/max_1_strides 2025-08-14T21:18:06.2775611Z * [new branch] pianpwk/nonzero_memo -> origin/pianpwk/nonzero_memo 2025-08-14T21:18:06.2775808Z * [new branch] pianpwk/oblivious_reshape_view_better -> origin/pianpwk/oblivious_reshape_view_better 2025-08-14T21:18:06.2775962Z * [new branch] pianpwk/oblivious_should_swap -> origin/pianpwk/oblivious_should_swap 2025-08-14T21:18:06.2776139Z * [new branch] pianpwk/oblivious_slice_forward -> origin/pianpwk/oblivious_slice_forward 2025-08-14T21:18:06.2776276Z * [new branch] pianpwk/oblivious_where -> origin/pianpwk/oblivious_where 2025-08-14T21:18:06.2776432Z * [new branch] pianpwk/param_static_pgo -> origin/pianpwk/param_static_pgo 2025-08-14T21:18:06.2776578Z * [new branch] pianpwk/pre_forward_hook -> origin/pianpwk/pre_forward_hook 2025-08-14T21:18:06.2776738Z * [new branch] pianpwk/remove_guard_fail_break -> origin/pianpwk/remove_guard_fail_break 2025-08-14T21:18:06.2776890Z * [new branch] pianpwk/slice_fresh_symbols -> origin/pianpwk/slice_fresh_symbols 2025-08-14T21:18:06.2777204Z * [new branch] pianpwk/sym_sym -> origin/pianpwk/sym_sym 2025-08-14T21:18:06.2777408Z * [new branch] pianpwk/test_slice_fake_impl -> origin/pianpwk/test_slice_fake_impl 2025-08-14T21:18:06.2778358Z * [new branch] pianpwk/unbacked_channels_last -> origin/pianpwk/unbacked_channels_last 2025-08-14T21:18:06.2778714Z * [new branch] pianpwk/unbacked_safe_conv1d -> origin/pianpwk/unbacked_safe_conv1d 2025-08-14T21:18:06.2780670Z * [new branch] pianpwk/unbacked_sdpa_flash -> origin/pianpwk/unbacked_sdpa_flash 2025-08-14T21:18:06.2780862Z * [new branch] pianpwk/unbacked_should_swap -> origin/pianpwk/unbacked_should_swap 2025-08-14T21:18:06.2781030Z * [new branch] pianpwk/unbacked_should_swap_2 -> origin/pianpwk/unbacked_should_swap_2 2025-08-14T21:18:06.2781464Z * [new branch] pianpwk/unbacked_slice_binding -> origin/pianpwk/unbacked_slice_binding 2025-08-14T21:18:06.2781934Z * [new branch] pianpwk/unbacked_slice_forward -> origin/pianpwk/unbacked_slice_forward 2025-08-14T21:18:06.2782528Z * [new branch] pianpwk/verbose_tensor_guards -> origin/pianpwk/verbose_tensor_guards 2025-08-14T21:18:06.2783108Z * [new branch] pianpwk/wan21_reshape -> origin/pianpwk/wan21_reshape 2025-08-14T21:18:06.2783676Z * [new branch] pianpwk/whitelist_optimizer -> origin/pianpwk/whitelist_optimizer 2025-08-14T21:18:06.2784942Z * [new branch] pin-torchao -> origin/pin-torchao 2025-08-14T21:18:06.2787316Z * [new branch] piz/fall_back_missing_0705 -> origin/piz/fall_back_missing_0705 2025-08-14T21:18:06.2787624Z * [new branch] piz/fall_back_missing_0716 -> origin/piz/fall_back_missing_0716 2025-08-14T21:18:06.2787813Z * [new branch] piz/fill_dist_cost_0702-3 -> origin/piz/fill_dist_cost_0702-3 2025-08-14T21:18:06.2787957Z * [new branch] piz/fill_dist_cost_0702-4 -> origin/piz/fill_dist_cost_0702-4 2025-08-14T21:18:06.2788524Z * [new branch] piz/fill_dist_cost_0702-5 -> origin/piz/fill_dist_cost_0702-5 2025-08-14T21:18:06.2789062Z * [new branch] piz/fix_sort_ -> origin/piz/fix_sort_ 2025-08-14T21:18:06.2790446Z * [new branch] piz/improve_scatter_0808 -> origin/piz/improve_scatter_0808 2025-08-14T21:18:06.2790750Z * [new branch] pool-separate -> origin/pool-separate 2025-08-14T21:18:06.2791084Z * [new branch] pr-156087 -> origin/pr-156087 2025-08-14T21:18:06.2794517Z * [new branch] pr/131860 -> origin/pr/131860 2025-08-14T21:18:06.2794810Z * [new branch] predispatch_to -> origin/predispatch_to 2025-08-14T21:18:06.2794953Z * [new branch] pt-opt-cuda3 -> origin/pt-opt-cuda3 2025-08-14T21:18:06.2795187Z * [new branch] pt2e-cache-model-device -> origin/pt2e-cache-model-device 2025-08-14T21:18:06.2795349Z * [new branch] pull-latest-theme -> origin/pull-latest-theme 2025-08-14T21:18:06.2795903Z * [new branch] pyobjectslot -> origin/pyobjectslot 2025-08-14T21:18:06.2797655Z * [new branch] python_compiled_autograd -> origin/python_compiled_autograd 2025-08-14T21:18:06.2797983Z * [new branch] qchip/export-D54134695 -> origin/qchip/export-D54134695 2025-08-14T21:18:06.2798323Z * [new branch] quint-bits -> origin/quint-bits 2025-08-14T21:18:06.2802160Z * [new branch] release/1.10 -> origin/release/1.10 2025-08-14T21:18:06.2802444Z * [new branch] release/1.11 -> origin/release/1.11 2025-08-14T21:18:06.2802587Z * [new branch] release/1.12 -> origin/release/1.12 2025-08-14T21:18:06.2802690Z * [new branch] release/1.13 -> origin/release/1.13 2025-08-14T21:18:06.2802992Z * [new branch] release/1.4 -> origin/release/1.4 2025-08-14T21:18:06.2803239Z * [new branch] release/1.4.1 -> origin/release/1.4.1 2025-08-14T21:18:06.2803363Z * [new branch] release/1.5 -> origin/release/1.5 2025-08-14T21:18:06.2803818Z * [new branch] release/1.6 -> origin/release/1.6 2025-08-14T21:18:06.2807123Z * [new branch] release/1.7 -> origin/release/1.7 2025-08-14T21:18:06.2807419Z * [new branch] release/1.8 -> origin/release/1.8 2025-08-14T21:18:06.2807555Z * [new branch] release/1.9 -> origin/release/1.9 2025-08-14T21:18:06.2807742Z * [new branch] release/2.0 -> origin/release/2.0 2025-08-14T21:18:06.2807968Z * [new branch] release/2.1 -> origin/release/2.1 2025-08-14T21:18:06.2808532Z * [new branch] release/2.2 -> origin/release/2.2 2025-08-14T21:18:06.2809906Z * [new branch] release/2.3 -> origin/release/2.3 2025-08-14T21:18:06.2810187Z * [new branch] release/2.4 -> origin/release/2.4 2025-08-14T21:18:06.2811662Z * [new branch] release/2.5 -> origin/release/2.5 2025-08-14T21:18:06.2811948Z * [new branch] release/2.6 -> origin/release/2.6 2025-08-14T21:18:06.2812338Z * [new branch] release/2.7 -> origin/release/2.7 2025-08-14T21:18:06.2813616Z * [new branch] release/2.8 -> origin/release/2.8 2025-08-14T21:18:06.2813860Z * [new branch] release_notes -> origin/release_notes 2025-08-14T21:18:06.2815260Z * [new branch] remove-actionable-label -> origin/remove-actionable-label 2025-08-14T21:18:06.2815538Z * [new branch] remove-ao -> origin/remove-ao 2025-08-14T21:18:06.2815785Z * [new branch] replace-pytorch-labs-20250812-195836 -> origin/replace-pytorch-labs-20250812-195836 2025-08-14T21:18:06.2816318Z * [new branch] replace-pytorch-labs-20250812-200248 -> origin/replace-pytorch-labs-20250812-200248 2025-08-14T21:18:06.2816911Z * [new branch] replace-pytorch-labs-20250812-200324 -> origin/replace-pytorch-labs-20250812-200324 2025-08-14T21:18:06.2817871Z * [new branch] replace-pytorch-labs-20250812-204020 -> origin/replace-pytorch-labs-20250812-204020 2025-08-14T21:18:06.2818269Z * [new branch] replace-pytorch-labs-20250812-204125 -> origin/replace-pytorch-labs-20250812-204125 2025-08-14T21:18:06.2819731Z * [new branch] replace-pytorch-labs-20250812-205624 -> origin/replace-pytorch-labs-20250812-205624 2025-08-14T21:18:06.2820276Z * [new branch] revert-131069-gh/krzysztofjordan/1/head -> origin/revert-131069-gh/krzysztofjordan/1/head 2025-08-14T21:18:06.2823997Z * [new branch] revert-131469-gh/andrewor14/51/head -> origin/revert-131469-gh/andrewor14/51/head 2025-08-14T21:18:06.2824291Z * [new branch] revert-156870-gh/skarjala/3/head -> origin/revert-156870-gh/skarjala/3/head 2025-08-14T21:18:06.2824585Z * [new branch] revert-157914-cherry-pick-157503-by-pytorch_bot_bot_ -> origin/revert-157914-cherry-pick-157503-by-pytorch_bot_bot_ 2025-08-14T21:18:06.2824743Z * [new branch] revert-direct-updates -> origin/revert-direct-updates 2025-08-14T21:18:06.2828696Z * [new branch] rocm-monitoring -> origin/rocm-monitoring 2025-08-14T21:18:06.2830629Z * [new branch] ryanguo99/cleanup-dynamo-expected-failures -> origin/ryanguo99/cleanup-dynamo-expected-failures 2025-08-14T21:18:06.2830915Z * [new branch] ryanguo99/fix-closure-var -> origin/ryanguo99/fix-closure-var 2025-08-14T21:18:06.2834948Z * [new branch] rzou/faketensor_bench -> origin/rzou/faketensor_bench 2025-08-14T21:18:06.2837120Z * [new branch] rzou/njt -> origin/rzou/njt 2025-08-14T21:18:06.2837294Z * [new branch] rzou/operator -> origin/rzou/operator 2025-08-14T21:18:06.2837408Z * [new branch] rzou/pca -> origin/rzou/pca 2025-08-14T21:18:06.2837538Z * [new branch] rzou/pipe_split -> origin/rzou/pipe_split 2025-08-14T21:18:06.2837667Z * [new branch] rzou/realprop -> origin/rzou/realprop 2025-08-14T21:18:06.2837795Z * [new branch] rzou/setup_context -> origin/rzou/setup_context 2025-08-14T21:18:06.2837998Z * [new branch] sanchitintel/refactor_aten_int8_woq_gemm -> origin/sanchitintel/refactor_aten_int8_woq_gemm 2025-08-14T21:18:06.2838269Z * [new branch] sanchitintel/weird_thing_with_test_cpu_select_algorithm -> origin/sanchitintel/weird_thing_with_test_cpu_select_algorithm 2025-08-14T21:18:06.2838431Z * [new branch] sapling-pr-archive-SS-JIA -> origin/sapling-pr-archive-SS-JIA 2025-08-14T21:18:06.2838539Z * [new branch] save -> origin/save 2025-08-14T21:18:06.2838644Z * [new branch] sdym/2.5.1 -> origin/sdym/2.5.1 2025-08-14T21:18:06.2838779Z * [new branch] seemethere-patch-1 -> origin/seemethere-patch-1 2025-08-14T21:18:06.2838903Z * [new branch] setup-torchci -> origin/setup-torchci 2025-08-14T21:18:06.2839009Z * [new branch] setupvllm -> origin/setupvllm 2025-08-14T21:18:06.2839134Z * [new branch] share_and_pin_fork -> origin/share_and_pin_fork 2025-08-14T21:18:06.2839266Z * [new branch] shengf/fx-xform-perf -> origin/shengf/fx-xform-perf 2025-08-14T21:18:06.2843217Z * [new branch] shikaili_fp8_allgather -> origin/shikaili_fp8_allgather 2025-08-14T21:18:06.2847308Z * [new branch] shoumikhin-patch-12 -> origin/shoumikhin-patch-12 2025-08-14T21:18:06.2851326Z * [new branch] simplify-fq-per-channel -> origin/simplify-fq-per-channel 2025-08-14T21:18:06.2855373Z * [new branch] solve-accuracy-fix -> origin/solve-accuracy-fix 2025-08-14T21:18:06.2857481Z * [new branch] sqzhang/flight4 -> origin/sqzhang/flight4 2025-08-14T21:18:06.2857952Z * [new branch] sqzhang/flight4plus -> origin/sqzhang/flight4plus 2025-08-14T21:18:06.2858147Z * [new branch] sraikund/record_funct_test -> origin/sraikund/record_funct_test 2025-08-14T21:18:06.2858351Z * [new branch] sraikund16/test -> origin/sraikund16/test 2025-08-14T21:18:06.2859036Z * [new branch] stablize-compilation-time -> origin/stablize-compilation-time 2025-08-14T21:18:06.2859211Z * [new branch] standalone-templates -> origin/standalone-templates 2025-08-14T21:18:06.2859379Z * [new branch] standalone_package_weights -> origin/standalone_package_weights 2025-08-14T21:18:06.2859539Z * [new branch] starterTaskUpdate -> origin/starterTaskUpdate 2025-08-14T21:18:06.2859660Z * [new branch] step2vllmsetup -> origin/step2vllmsetup 2025-08-14T21:18:06.2859781Z * [new branch] subgraph_fuse -> origin/subgraph_fuse 2025-08-14T21:18:06.2859944Z * [new branch] support-uv-in-collect_env -> origin/support-uv-in-collect_env 2025-08-14T21:18:06.2860083Z * [new branch] suryasub/fix-nccl-hang -> origin/suryasub/fix-nccl-hang 2025-08-14T21:18:06.2860198Z * [new branch] sve-poc -> origin/sve-poc 2025-08-14T21:18:06.2860324Z * [new branch] svekars-patch-1 -> origin/svekars-patch-1 2025-08-14T21:18:06.2860451Z * [new branch] svekars-patch-2 -> origin/svekars-patch-2 2025-08-14T21:18:06.2860710Z * [new branch] switch-bn -> origin/switch-bn 2025-08-14T21:18:06.2860859Z * [new branch] sympy-bottleneck-repro -> origin/sympy-bottleneck-repro 2025-08-14T21:18:06.2861024Z * [new branch] tenpercent/ck_inductor_gfx950 -> origin/tenpercent/ck_inductor_gfx950 2025-08-14T21:18:06.2861157Z * [new branch] tensordict_integration -> origin/tensordict_integration 2025-08-14T21:18:06.2861330Z * [new branch] test-half-migration-internally -> origin/test-half-migration-internally 2025-08-14T21:18:06.2861459Z * [new branch] test-internal-et -> origin/test-internal-et 2025-08-14T21:18:06.2861591Z * [new branch] test-move-conda-builds -> origin/test-move-conda-builds 2025-08-14T21:18:06.2861757Z * [new branch] test-myst-markdown-docstring -> origin/test-myst-markdown-docstring 2025-08-14T21:18:06.2861863Z * [new branch] test-old -> origin/test-old 2025-08-14T21:18:06.2862026Z * [new branch] test-vec-migration-internally -> origin/test-vec-migration-internally 2025-08-14T21:18:06.2862143Z * [new branch] test/bmm_heur -> origin/test/bmm_heur 2025-08-14T21:18:06.2862253Z * [new branch] test/inductor -> origin/test/inductor 2025-08-14T21:18:06.2862398Z * [new branch] tidy_performance_cyy -> origin/tidy_performance_cyy 2025-08-14T21:18:06.2862509Z * [new branch] torchtitan_ep -> origin/torchtitan_ep 2025-08-14T21:18:06.2862648Z * [new branch] trace_fsdp_torchtune_lora -> origin/trace_fsdp_torchtune_lora 2025-08-14T21:18:06.2862793Z * [new branch] traceable_fsdp_unit_tests -> origin/traceable_fsdp_unit_tests 2025-08-14T21:18:06.2862915Z * [new branch] trackMonitor -> origin/trackMonitor 2025-08-14T21:18:06.2863040Z * [new branch] tree_loop_vec_base -> origin/tree_loop_vec_base 2025-08-14T21:18:06.2863144Z * [new branch] tree_vec_base -> origin/tree_vec_base 2025-08-14T21:18:06.2863255Z * [new branch] triton-update -> origin/triton-update 2025-08-14T21:18:06.2863367Z * [new branch] triton_kernel -> origin/triton_kernel 2025-08-14T21:18:06.2863654Z * [new branch] triton_kernel_perf -> origin/triton_kernel_perf 2025-08-14T21:18:06.2865077Z * [new branch] try-runllm -> origin/try-runllm 2025-08-14T21:18:06.2865304Z * [new branch] type_dec -> origin/type_dec 2025-08-14T21:18:06.2865777Z * [new branch] udate-sphinx-dependancies -> origin/udate-sphinx-dependancies 2025-08-14T21:18:06.2867274Z * [new branch] update-audio-commit-hash/16307312222-1661-1 -> origin/update-audio-commit-hash/16307312222-1661-1 2025-08-14T21:18:06.2867671Z * [new branch] update-audio-commit-hash/16431348808-1673-1 -> origin/update-audio-commit-hash/16431348808-1673-1 2025-08-14T21:18:06.2868006Z * [new branch] update-audio-commit-hash/16510774365-1683-1 -> origin/update-audio-commit-hash/16510774365-1683-1 2025-08-14T21:18:06.2868726Z * [new branch] update-audio-commit-hash/16583472358-1693-1 -> origin/update-audio-commit-hash/16583472358-1693-1 2025-08-14T21:18:06.2871815Z * [new branch] update-audio-commit-hash/16663082088-1700-1 -> origin/update-audio-commit-hash/16663082088-1700-1 2025-08-14T21:18:06.2872197Z * [new branch] update-audio-commit-hash/16737365217-1704-1 -> origin/update-audio-commit-hash/16737365217-1704-1 2025-08-14T21:18:06.2872519Z * [new branch] update-audio-commit-hash/16791960928-1711-1 -> origin/update-audio-commit-hash/16791960928-1711-1 2025-08-14T21:18:06.2873128Z * [new branch] update-audio-commit-hash/16818882925-1712-1 -> origin/update-audio-commit-hash/16818882925-1712-1 2025-08-14T21:18:06.2873522Z * [new branch] update-audio-commit-hash/16895560422-1720-1 -> origin/update-audio-commit-hash/16895560422-1720-1 2025-08-14T21:18:06.2873739Z * [new branch] update-audio-commit-hash/16924174496-1738-1 -> origin/update-audio-commit-hash/16924174496-1738-1 2025-08-14T21:18:06.2873989Z * [new branch] update-dynamic-shapes-doc -> origin/update-dynamic-shapes-doc 2025-08-14T21:18:06.2875343Z * [new branch] update-executorch-commit-hash/15694981040-1626-1 -> origin/update-executorch-commit-hash/15694981040-1626-1 2025-08-14T21:18:06.2875611Z * [new branch] update-triton-commit-hash/13663274526-1487-2 -> origin/update-triton-commit-hash/13663274526-1487-2 2025-08-14T21:18:06.2877619Z * [new branch] update-vision-commit-hash/15336342773-1607-1 -> origin/update-vision-commit-hash/15336342773-1607-1 2025-08-14T21:18:06.2878031Z * [new branch] update-vllm-commit-hash/16431348808-1673-1 -> origin/update-vllm-commit-hash/16431348808-1673-1 2025-08-14T21:18:06.2878346Z * [new branch] update-vllm-commit-hash/16484773233-1682-1 -> origin/update-vllm-commit-hash/16484773233-1682-1 2025-08-14T21:18:06.2878647Z * [new branch] update-vllm-commit-hash/16510774365-1683-1 -> origin/update-vllm-commit-hash/16510774365-1683-1 2025-08-14T21:18:06.2879248Z * [new branch] update-vllm-commit-hash/16534031105-1684-1 -> origin/update-vllm-commit-hash/16534031105-1684-1 2025-08-14T21:18:06.2879761Z * [new branch] update-vllm-commit-hash/16545403308-1687-1 -> origin/update-vllm-commit-hash/16545403308-1687-1 2025-08-14T21:18:06.2881557Z * [new branch] update-vllm-commit-hash/16557202787-1688-1 -> origin/update-vllm-commit-hash/16557202787-1688-1 2025-08-14T21:18:06.2881937Z * [new branch] update-vllm-commit-hash/16583472358-1693-1 -> origin/update-vllm-commit-hash/16583472358-1693-1 2025-08-14T21:18:06.2882253Z * [new branch] update-vllm-commit-hash/16663082088-1700-1 -> origin/update-vllm-commit-hash/16663082088-1700-1 2025-08-14T21:18:06.2882823Z * [new branch] update-vllm-commit-hash/16737365217-1704-1 -> origin/update-vllm-commit-hash/16737365217-1704-1 2025-08-14T21:18:06.2883426Z * [new branch] update-vllm-commit-hash/16843157111-1713-1 -> origin/update-vllm-commit-hash/16843157111-1713-1 2025-08-14T21:18:06.2883822Z * [new branch] update-vllm-commit-hash/16855312394-1714-1 -> origin/update-vllm-commit-hash/16855312394-1714-1 2025-08-14T21:18:06.2885503Z * [new branch] update-vllm-commit-hash/16924174496-1738-1 -> origin/update-vllm-commit-hash/16924174496-1738-1 2025-08-14T21:18:06.2885889Z * [new branch] update-vllm-commit-hash/16952608705-1745-1 -> origin/update-vllm-commit-hash/16952608705-1745-1 2025-08-14T21:18:06.2886192Z * [new branch] update-xla-commit-hash/16260974441-194-1 -> origin/update-xla-commit-hash/16260974441-194-1 2025-08-14T21:18:06.2886502Z * [new branch] update-xla-commit-hash/16717126778-197-1 -> origin/update-xla-commit-hash/16717126778-197-1 2025-08-14T21:18:06.2887034Z * [new branch] update-xla-commit-hash/16873912760-198-1 -> origin/update-xla-commit-hash/16873912760-198-1 2025-08-14T21:18:06.2888403Z * [new branch] update_docs_torch_multinomial_issue#125388 -> origin/update_docs_torch_multinomial_issue#125388 2025-08-14T21:18:06.2888700Z * [new branch] update_executorch_pin -> origin/update_executorch_pin 2025-08-14T21:18:06.2890033Z * [new branch] update_slow_tests_1722488736 -> origin/update_slow_tests_1722488736 2025-08-14T21:18:06.2890351Z * [new branch] update_slow_tests_1722879173 -> origin/update_slow_tests_1722879173 2025-08-14T21:18:06.2890562Z * [new branch] update_slow_tests_1752478971 -> origin/update_slow_tests_1752478971 2025-08-14T21:18:06.2891850Z * [new branch] update_submodule_FBGEMM -> origin/update_submodule_FBGEMM 2025-08-14T21:18:06.2892037Z * [new branch] update_submodule_kineto -> origin/update_submodule_kineto 2025-08-14T21:18:06.2892409Z * [new branch] update_submodule_tensorpipe -> origin/update_submodule_tensorpipe 2025-08-14T21:18:06.2893658Z * [new branch] v0.1.2 -> origin/v0.1.2 2025-08-14T21:18:06.2893967Z * [new branch] v1.0.1 -> origin/v1.0.1 2025-08-14T21:18:06.2897941Z * [new branch] v1.0.3 -> origin/v1.0.3 2025-08-14T21:18:06.2898217Z * [new branch] v1.1.0 -> origin/v1.1.0 2025-08-14T21:18:06.2898333Z * [new branch] v1.2.0 -> origin/v1.2.0 2025-08-14T21:18:06.2898518Z * [new branch] v1.3.0 -> origin/v1.3.0 2025-08-14T21:18:06.2898729Z * [new branch] v1.3.1 -> origin/v1.3.1 2025-08-14T21:18:06.2898867Z * [new branch] validate_fn -> origin/validate_fn 2025-08-14T21:18:06.2899118Z * [new branch] validations_2.6 -> origin/validations_2.6 2025-08-14T21:18:06.2899661Z * [new branch] validations_2.8 -> origin/validations_2.8 2025-08-14T21:18:06.2900859Z * [new branch] viable/strict -> origin/viable/strict 2025-08-14T21:18:06.2901056Z * [new branch] vllmbuildci -> origin/vllmbuildci 2025-08-14T21:18:06.2902696Z * [new branch] vllmpin -> origin/vllmpin 2025-08-14T21:18:06.2902987Z * [new branch] vllmpintest -> origin/vllmpintest 2025-08-14T21:18:06.2903139Z * [new branch] wdvr-patch-1 -> origin/wdvr-patch-1 2025-08-14T21:18:06.2906662Z * [new branch] wdvr-patch-2 -> origin/wdvr-patch-2 2025-08-14T21:18:06.2910258Z * [new branch] wdvr/conda_devcontainer -> origin/wdvr/conda_devcontainer 2025-08-14T21:18:06.2914354Z * [new branch] wdvr/fix_logging_test -> origin/wdvr/fix_logging_test 2025-08-14T21:18:06.2915950Z * [new branch] wdvr/iss_145259 -> origin/wdvr/iss_145259 2025-08-14T21:18:06.2916100Z * [new branch] weight_sharing_cpp -> origin/weight_sharing_cpp 2025-08-14T21:18:06.2916397Z * [new branch] whc/flight -> origin/whc/flight 2025-08-14T21:18:06.2916633Z * [new branch] whc/flight4 -> origin/whc/flight4 2025-08-14T21:18:06.2919968Z * [new branch] whc/flight51 -> origin/whc/flight51 2025-08-14T21:18:06.2923531Z * [new branch] whc/flight53 -> origin/whc/flight53 2025-08-14T21:18:06.2925657Z * [new branch] whc/p2phang -> origin/whc/p2phang 2025-08-14T21:18:06.2925896Z * [new branch] whc/stage2 -> origin/whc/stage2 2025-08-14T21:18:06.2929373Z * [new branch] whc/uneven -> origin/whc/uneven 2025-08-14T21:18:06.2932863Z * [new branch] whc/uneven-merge -> origin/whc/uneven-merge 2025-08-14T21:18:06.2936937Z * [new branch] win_warnings -> origin/win_warnings 2025-08-14T21:18:06.2938761Z * [new branch] workonoldcommit -> origin/workonoldcommit 2025-08-14T21:18:06.2938967Z * [new branch] wwen/programming-model-2.8 -> origin/wwen/programming-model-2.8 2025-08-14T21:18:06.2939096Z * [new branch] xmfan/ca_0516 -> origin/xmfan/ca_0516 2025-08-14T21:18:06.2939222Z * [new branch] xmfan/ca_1051b93192 -> origin/xmfan/ca_1051b93192 2025-08-14T21:18:06.2939468Z * [new branch] xmfan/ca_1a722f62c248391fc4a542e8851a5559aa356ae8 -> origin/xmfan/ca_1a722f62c248391fc4a542e8851a5559aa356ae8 2025-08-14T21:18:06.2939727Z * [new branch] xmfan/ca_5a2be192d1 -> origin/xmfan/ca_5a2be192d1 2025-08-14T21:18:06.2939853Z * [new branch] xmfan/ca_9d59b516e9 -> origin/xmfan/ca_9d59b516e9 2025-08-14T21:18:06.2939977Z * [new branch] xmfan/ca_api -> origin/xmfan/ca_api 2025-08-14T21:18:06.2940086Z * [new branch] xmfan/ca_apr8 -> origin/xmfan/ca_apr8 2025-08-14T21:18:06.2940208Z * [new branch] xmfan/ca_base -> origin/xmfan/ca_base 2025-08-14T21:18:06.2940334Z * [new branch] xmfan/ca_cudagraphs -> origin/xmfan/ca_cudagraphs 2025-08-14T21:18:06.2940451Z * [new branch] xmfan/ca_dynamic -> origin/xmfan/ca_dynamic 2025-08-14T21:18:06.2940567Z * [new branch] xmfan/ca_fix_dyn -> origin/xmfan/ca_fix_dyn 2025-08-14T21:18:06.2940701Z * [new branch] xmfan/ca_fix_lowering -> origin/xmfan/ca_fix_lowering 2025-08-14T21:18:06.2940834Z * [new branch] xmfan/ca_fix_polyfills -> origin/xmfan/ca_fix_polyfills 2025-08-14T21:18:06.2940947Z * [new branch] xmfan/ca_jan3 -> origin/xmfan/ca_jan3 2025-08-14T21:18:06.2941061Z * [new branch] xmfan/ca_jun18 -> origin/xmfan/ca_jun18 2025-08-14T21:18:06.2941174Z * [new branch] xmfan/ca_jun24 -> origin/xmfan/ca_jun24 2025-08-14T21:18:06.2941292Z * [new branch] xmfan/ca_mem_base -> origin/xmfan/ca_mem_base 2025-08-14T21:18:06.2941402Z * [new branch] xmfan/ca_mem_fix -> origin/xmfan/ca_mem_fix 2025-08-14T21:18:06.2941523Z * [new branch] xmfan/ca_memory_fix -> origin/xmfan/ca_memory_fix 2025-08-14T21:18:06.2941661Z * [new branch] xmfan/ca_memory_fix_rebased -> origin/xmfan/ca_memory_fix_rebased 2025-08-14T21:18:06.2941801Z * [new branch] xmfan/ca_memory_fix_rebased2 -> origin/xmfan/ca_memory_fix_rebased2 2025-08-14T21:18:06.2941930Z * [new branch] xmfan/ca_move_to_cuda -> origin/xmfan/ca_move_to_cuda 2025-08-14T21:18:06.2942040Z * [new branch] xmfan/ca_nested -> origin/xmfan/ca_nested 2025-08-14T21:18:06.2942158Z * [new branch] xmfan/ca_overhead -> origin/xmfan/ca_overhead 2025-08-14T21:18:06.2942299Z * [new branch] xmfan/ca_overhead_0eba7e5451 -> origin/xmfan/ca_overhead_0eba7e5451 2025-08-14T21:18:06.2942456Z * [new branch] xmfan/ca_scalar -> origin/xmfan/ca_scalar 2025-08-14T21:18:06.2942594Z * [new branch] xmfan/ca_subclass_mem_fix -> origin/xmfan/ca_subclass_mem_fix 2025-08-14T21:18:06.2942708Z * [new branch] xmfan/ca_warm_mem -> origin/xmfan/ca_warm_mem 2025-08-14T21:18:06.2942831Z * [new branch] xmfan/ca_warm_mem_base -> origin/xmfan/ca_warm_mem_base 2025-08-14T21:18:06.2942941Z * [new branch] xmfan/cacu_jun18 -> origin/xmfan/cacu_jun18 2025-08-14T21:18:06.2943053Z * [new branch] xmfan/cacu_jun19 -> origin/xmfan/cacu_jun19 2025-08-14T21:18:06.2943167Z * [new branch] xmfan/cacu_jun4 -> origin/xmfan/cacu_jun4 2025-08-14T21:18:06.2943275Z * [new branch] xmfan/cacu_may27 -> origin/xmfan/cacu_may27 2025-08-14T21:18:06.2943406Z * [new branch] xmfan/circular_dep -> origin/xmfan/circular_dep 2025-08-14T21:18:06.2943561Z * [new branch] xmfan/compiled_autograd_feb_29 -> origin/xmfan/compiled_autograd_feb_29 2025-08-14T21:18:06.2943744Z * [new branch] xmfan/compiled_autograd_graph_breaks -> origin/xmfan/compiled_autograd_graph_breaks 2025-08-14T21:18:06.2943890Z * [new branch] xmfan/disable_duck_shape -> origin/xmfan/disable_duck_shape 2025-08-14T21:18:06.2944057Z * [new branch] xmfan/fca_cpp_node_passthrough -> origin/xmfan/fca_cpp_node_passthrough 2025-08-14T21:18:06.2944317Z * [new branch] xmfan/issue_123374 -> origin/xmfan/issue_123374 2025-08-14T21:18:06.2944577Z * [new branch] xmfan/post_3945954741e2d37023c5d6954f9483008e0892f9 -> origin/xmfan/post_3945954741e2d37023c5d6954f9483008e0892f9 2025-08-14T21:18:06.2944803Z * [new branch] xmfan/pre_3945954741e2d37023c5d6954f9483008e0892f9 -> origin/xmfan/pre_3945954741e2d37023c5d6954f9483008e0892f9 2025-08-14T21:18:06.2944936Z * [new branch] xmfan/segfault_test -> origin/xmfan/segfault_test 2025-08-14T21:18:06.2945053Z * [new branch] xmfan/single_step -> origin/xmfan/single_step 2025-08-14T21:18:06.2945166Z * [new branch] xmfan/sth_0829 -> origin/xmfan/sth_0829 2025-08-14T21:18:06.2945274Z * [new branch] xmfan/test -> origin/xmfan/test 2025-08-14T21:18:06.2945434Z * [new branch] y-do-we-have-7-build-systems -> origin/y-do-we-have-7-build-systems 2025-08-14T21:18:06.2945595Z * [new branch] yguo/debug-0226-constexpr -> origin/yguo/debug-0226-constexpr 2025-08-14T21:18:06.2945732Z * [new branch] yguo/new_latest_changes -> origin/yguo/new_latest_changes 2025-08-14T21:18:06.2947228Z * [new branch] yguo/patch_constexpr_changes -> origin/yguo/patch_constexpr_changes 2025-08-14T21:18:06.2947529Z * [new branch] yihan_quantization -> origin/yihan_quantization 2025-08-14T21:18:06.2947825Z * [new branch] yiming/add_nativert_benchmark -> origin/yiming/add_nativert_benchmark 2025-08-14T21:18:06.2948997Z * [new branch] yiming/bootcamp -> origin/yiming/bootcamp 2025-08-14T21:18:06.2949261Z * [new branch] zainr/canary-test -> origin/zainr/canary-test 2025-08-14T21:18:06.2950819Z * [new branch] zainr/cleanup-gh-runners -> origin/zainr/cleanup-gh-runners 2025-08-14T21:18:06.2951109Z * [new branch] zainr/fixlint -> origin/zainr/fixlint 2025-08-14T21:18:06.2951286Z * [new branch] zainr/git-push-v2 -> origin/zainr/git-push-v2 2025-08-14T21:18:06.2951792Z * [new branch] zainr/lint-py3.9 -> origin/zainr/lint-py3.9 2025-08-14T21:18:06.2952887Z * [new branch] zainr/mypy15-claude -> origin/zainr/mypy15-claude 2025-08-14T21:18:06.2953203Z * [new branch] zainr/pre-push-hooks -> origin/zainr/pre-push-hooks 2025-08-14T21:18:06.2953719Z * [new branch] zainr/pull-migration-c -> origin/zainr/pull-migration-c 2025-08-14T21:18:06.2954985Z * [new branch] zainr/test2 -> origin/zainr/test2 2025-08-14T21:18:06.2955201Z * [new branch] zainr/unstable -> origin/zainr/unstable 2025-08-14T21:18:06.2956649Z * [new branch] zainr/unstable-xla -> origin/zainr/unstable-xla 2025-08-14T21:18:06.2956971Z * [new branch] zainr/uv-pip-fix -> origin/zainr/uv-pip-fix 2025-08-14T21:18:06.2957165Z * [new branch] zainr/vs-aarch64 -> origin/zainr/vs-aarch64 2025-08-14T21:18:06.2957960Z * [new branch] zasdfgbnm-patch-3 -> origin/zasdfgbnm-patch-3 2025-08-14T21:18:06.2958410Z * [new branch] zb2p -> origin/zb2p 2025-08-14T21:18:06.2960177Z * [new branch] zdevito-patch-1 -> origin/zdevito-patch-1 2025-08-14T21:18:06.2960517Z * [new branch] zeros-and-scatter-part2 -> origin/zeros-and-scatter-part2 2025-08-14T21:18:06.2960914Z * [new branch] zhxchen17/nativert/0 -> origin/zhxchen17/nativert/0 2025-08-14T21:18:06.2964075Z * [new branch] zhxchen17/scratch/0 -> origin/zhxchen17/scratch/0 2025-08-14T21:18:06.2964391Z * [new branch] zhxhcen17/moodycamel -> origin/zhxhcen17/moodycamel 2025-08-14T21:18:06.2964557Z * [new branch] zxiiro/bazel -> origin/zxiiro/bazel 2025-08-14T21:18:06.2964834Z * [new branch] zxiiro/get-hardware -> origin/zxiiro/get-hardware 2025-08-14T21:18:06.2965247Z * [new branch] zxiiro/main -> origin/zxiiro/main 2025-08-14T21:18:06.2965863Z * [new branch] zxiiro/test -> origin/zxiiro/test 2025-08-14T21:18:06.2966395Z * [new tag] bc2caa7fdf006894eff7af936babde69ab5a40f8-huydhn-debug -> bc2caa7fdf006894eff7af936babde69ab5a40f8-huydhn-debug 2025-08-14T21:18:06.2967824Z * [new tag] ci/binaries/77164 -> ci/binaries/77164 2025-08-14T21:18:06.2968120Z * [new tag] ciflow/binaries/138996 -> ciflow/binaries/138996 2025-08-14T21:18:06.2968252Z * [new tag] ciflow/binaries/143959 -> ciflow/binaries/143959 2025-08-14T21:18:06.2968522Z * [new tag] ciflow/binaries/154595 -> ciflow/binaries/154595 2025-08-14T21:18:06.2968948Z * [new tag] ciflow/binaries/156049 -> ciflow/binaries/156049 2025-08-14T21:18:06.2969326Z * [new tag] ciflow/binaries/156712 -> ciflow/binaries/156712 2025-08-14T21:18:06.2969984Z * [new tag] ciflow/binaries/157432 -> ciflow/binaries/157432 2025-08-14T21:18:06.2970152Z * [new tag] ciflow/binaries/157685 -> ciflow/binaries/157685 2025-08-14T21:18:06.2970766Z * [new tag] ciflow/binaries/157689 -> ciflow/binaries/157689 2025-08-14T21:18:06.2970979Z * [new tag] ciflow/binaries/158104 -> ciflow/binaries/158104 2025-08-14T21:18:06.2972587Z * [new tag] ciflow/binaries/158623 -> ciflow/binaries/158623 2025-08-14T21:18:06.2972870Z * [new tag] ciflow/binaries/159827 -> ciflow/binaries/159827 2025-08-14T21:18:06.2972999Z * [new tag] ciflow/binaries/159869 -> ciflow/binaries/159869 2025-08-14T21:18:06.2973202Z * [new tag] ciflow/binaries/160593 -> ciflow/binaries/160593 2025-08-14T21:18:06.2973612Z * [new tag] ciflow/binaries_libtorch/143959 -> ciflow/binaries_libtorch/143959 2025-08-14T21:18:06.2974294Z * [new tag] ciflow/binaries_libtorch/156049 -> ciflow/binaries_libtorch/156049 2025-08-14T21:18:06.2974487Z * [new tag] ciflow/binaries_libtorch/157432 -> ciflow/binaries_libtorch/157432 2025-08-14T21:18:06.2975191Z * [new tag] ciflow/binaries_wheel/143959 -> ciflow/binaries_wheel/143959 2025-08-14T21:18:06.2975331Z * [new tag] ciflow/binaries_wheel/156049 -> ciflow/binaries_wheel/156049 2025-08-14T21:18:06.2975676Z * [new tag] ciflow/binaries_wheel/157432 -> ciflow/binaries_wheel/157432 2025-08-14T21:18:06.2977090Z * [new tag] ciflow/binaries_wheel/158733 -> ciflow/binaries_wheel/158733 2025-08-14T21:18:06.2977242Z * [new tag] ciflow/binaries_wheel/160301 -> ciflow/binaries_wheel/160301 2025-08-14T21:18:06.2977389Z * [new tag] ciflow/binaries_wheel/160496 -> ciflow/binaries_wheel/160496 2025-08-14T21:18:06.2977743Z * [new tag] ciflow/h100-distributed/156703 -> ciflow/h100-distributed/156703 2025-08-14T21:18:06.2978777Z * [new tag] ciflow/h100-symm-mem/151845 -> ciflow/h100-symm-mem/151845 2025-08-14T21:18:06.2978924Z * [new tag] ciflow/h100-symm-mem/155923 -> ciflow/h100-symm-mem/155923 2025-08-14T21:18:06.2979216Z * [new tag] ciflow/h100-symm-mem/157635 -> ciflow/h100-symm-mem/157635 2025-08-14T21:18:06.2979519Z * [new tag] ciflow/h100-symm-mem/159118 -> ciflow/h100-symm-mem/159118 2025-08-14T21:18:06.2980238Z * [new tag] ciflow/h100-symm-mem/159562 -> ciflow/h100-symm-mem/159562 2025-08-14T21:18:06.2980581Z * [new tag] ciflow/h100-symm-mem/159889 -> ciflow/h100-symm-mem/159889 2025-08-14T21:18:06.2980930Z * [new tag] ciflow/h100/159158 -> ciflow/h100/159158 2025-08-14T21:18:06.2984162Z * [new tag] ciflow/h100/160450 -> ciflow/h100/160450 2025-08-14T21:18:06.2984407Z * [new tag] ciflow/h100/160480 -> ciflow/h100/160480 2025-08-14T21:18:06.2984519Z * [new tag] ciflow/h100/160614 -> ciflow/h100/160614 2025-08-14T21:18:06.2984869Z * [new tag] ciflow/inductor-perf-test-nightly-rocm/151845 -> ciflow/inductor-perf-test-nightly-rocm/151845 2025-08-14T21:18:06.2985105Z * [new tag] ciflow/inductor-perf-test-nightly-rocm/160538 -> ciflow/inductor-perf-test-nightly-rocm/160538 2025-08-14T21:18:06.2985338Z * [new tag] ciflow/inductor-perf-test-nightly-x86-zen/156599 -> ciflow/inductor-perf-test-nightly-x86-zen/156599 2025-08-14T21:18:06.2985657Z * [new tag] ciflow/inductor-periodic/160406 -> ciflow/inductor-periodic/160406 2025-08-14T21:18:06.2985816Z * [new tag] ciflow/inductor-periodic/160538 -> ciflow/inductor-periodic/160538 2025-08-14T21:18:06.2987786Z * [new tag] ciflow/inductor-rocm/151845 -> ciflow/inductor-rocm/151845 2025-08-14T21:18:06.2988097Z * [new tag] ciflow/inductor-rocm/159158 -> ciflow/inductor-rocm/159158 2025-08-14T21:18:06.2988271Z * [new tag] ciflow/inductor-rocm/160073 -> ciflow/inductor-rocm/160073 2025-08-14T21:18:06.2988440Z * [new tag] ciflow/inductor-rocm/160538 -> ciflow/inductor-rocm/160538 2025-08-14T21:18:06.2988573Z * [new tag] ciflow/inductor/134881 -> ciflow/inductor/134881 2025-08-14T21:18:06.2988733Z * [new tag] ciflow/inductor/137400 -> ciflow/inductor/137400 2025-08-14T21:18:06.2989239Z * [new tag] ciflow/inductor/144516 -> ciflow/inductor/144516 2025-08-14T21:18:06.2989690Z * [new tag] ciflow/inductor/146506 -> ciflow/inductor/146506 2025-08-14T21:18:06.2990046Z * [new tag] ciflow/inductor/147360 -> ciflow/inductor/147360 2025-08-14T21:18:06.2990432Z * [new tag] ciflow/inductor/147990 -> ciflow/inductor/147990 2025-08-14T21:18:06.2990831Z * [new tag] ciflow/inductor/148180 -> ciflow/inductor/148180 2025-08-14T21:18:06.2991222Z * [new tag] ciflow/inductor/148328 -> ciflow/inductor/148328 2025-08-14T21:18:06.2991921Z * [new tag] ciflow/inductor/148484 -> ciflow/inductor/148484 2025-08-14T21:18:06.2992097Z * [new tag] ciflow/inductor/148492 -> ciflow/inductor/148492 2025-08-14T21:18:06.2992509Z * [new tag] ciflow/inductor/150302 -> ciflow/inductor/150302 2025-08-14T21:18:06.2993002Z * [new tag] ciflow/inductor/151845 -> ciflow/inductor/151845 2025-08-14T21:18:06.2993615Z * [new tag] ciflow/inductor/152198 -> ciflow/inductor/152198 2025-08-14T21:18:06.2993953Z * [new tag] ciflow/inductor/152624 -> ciflow/inductor/152624 2025-08-14T21:18:06.2994371Z * [new tag] ciflow/inductor/153966 -> ciflow/inductor/153966 2025-08-14T21:18:06.2994801Z * [new tag] ciflow/inductor/154193 -> ciflow/inductor/154193 2025-08-14T21:18:06.2995737Z * [new tag] ciflow/inductor/154650 -> ciflow/inductor/154650 2025-08-14T21:18:06.2995860Z * [new tag] ciflow/inductor/154694 -> ciflow/inductor/154694 2025-08-14T21:18:06.2996206Z * [new tag] ciflow/inductor/155072 -> ciflow/inductor/155072 2025-08-14T21:18:06.2996556Z * [new tag] ciflow/inductor/155152 -> ciflow/inductor/155152 2025-08-14T21:18:06.2998103Z * [new tag] ciflow/inductor/155153 -> ciflow/inductor/155153 2025-08-14T21:18:06.2998399Z * [new tag] ciflow/inductor/155154 -> ciflow/inductor/155154 2025-08-14T21:18:06.2998527Z * [new tag] ciflow/inductor/155501 -> ciflow/inductor/155501 2025-08-14T21:18:06.2998906Z * [new tag] ciflow/inductor/155502 -> ciflow/inductor/155502 2025-08-14T21:18:06.2999185Z * [new tag] ciflow/inductor/155503 -> ciflow/inductor/155503 2025-08-14T21:18:06.2999770Z * [new tag] ciflow/inductor/155504 -> ciflow/inductor/155504 2025-08-14T21:18:06.2999909Z * [new tag] ciflow/inductor/155557 -> ciflow/inductor/155557 2025-08-14T21:18:06.3000075Z * [new tag] ciflow/inductor/155608 -> ciflow/inductor/155608 2025-08-14T21:18:06.3000407Z * [new tag] ciflow/inductor/155923 -> ciflow/inductor/155923 2025-08-14T21:18:06.3001075Z * [new tag] ciflow/inductor/155928 -> ciflow/inductor/155928 2025-08-14T21:18:06.3001610Z * [new tag] ciflow/inductor/155958 -> ciflow/inductor/155958 2025-08-14T21:18:06.3001747Z * [new tag] ciflow/inductor/156049 -> ciflow/inductor/156049 2025-08-14T21:18:06.3002175Z * [new tag] ciflow/inductor/156851 -> ciflow/inductor/156851 2025-08-14T21:18:06.3002565Z * [new tag] ciflow/inductor/156967 -> ciflow/inductor/156967 2025-08-14T21:18:06.3002982Z * [new tag] ciflow/inductor/157148 -> ciflow/inductor/157148 2025-08-14T21:18:06.3003447Z * [new tag] ciflow/inductor/157149 -> ciflow/inductor/157149 2025-08-14T21:18:06.3003926Z * [new tag] ciflow/inductor/157152 -> ciflow/inductor/157152 2025-08-14T21:18:06.3004200Z * [new tag] ciflow/inductor/157542 -> ciflow/inductor/157542 2025-08-14T21:18:06.3004611Z * [new tag] ciflow/inductor/157572 -> ciflow/inductor/157572 2025-08-14T21:18:06.3005007Z * [new tag] ciflow/inductor/157635 -> ciflow/inductor/157635 2025-08-14T21:18:06.3005414Z * [new tag] ciflow/inductor/157685 -> ciflow/inductor/157685 2025-08-14T21:18:06.3005849Z * [new tag] ciflow/inductor/157686 -> ciflow/inductor/157686 2025-08-14T21:18:06.3006224Z * [new tag] ciflow/inductor/157689 -> ciflow/inductor/157689 2025-08-14T21:18:06.3006699Z * [new tag] ciflow/inductor/157699 -> ciflow/inductor/157699 2025-08-14T21:18:06.3007163Z * [new tag] ciflow/inductor/157743 -> ciflow/inductor/157743 2025-08-14T21:18:06.3007792Z * [new tag] ciflow/inductor/157944 -> ciflow/inductor/157944 2025-08-14T21:18:06.3008182Z * [new tag] ciflow/inductor/157971 -> ciflow/inductor/157971 2025-08-14T21:18:06.3008325Z * [new tag] ciflow/inductor/157994 -> ciflow/inductor/157994 2025-08-14T21:18:06.3009074Z * [new tag] ciflow/inductor/158061 -> ciflow/inductor/158061 2025-08-14T21:18:06.3009200Z * [new tag] ciflow/inductor/158091 -> ciflow/inductor/158091 2025-08-14T21:18:06.3009677Z * [new tag] ciflow/inductor/158097 -> ciflow/inductor/158097 2025-08-14T21:18:06.3010057Z * [new tag] ciflow/inductor/158098 -> ciflow/inductor/158098 2025-08-14T21:18:06.3011773Z * [new tag] ciflow/inductor/158104 -> ciflow/inductor/158104 2025-08-14T21:18:06.3012056Z * [new tag] ciflow/inductor/158168 -> ciflow/inductor/158168 2025-08-14T21:18:06.3012199Z * [new tag] ciflow/inductor/158250 -> ciflow/inductor/158250 2025-08-14T21:18:06.3012390Z * [new tag] ciflow/inductor/158321 -> ciflow/inductor/158321 2025-08-14T21:18:06.3012513Z * [new tag] ciflow/inductor/158609 -> ciflow/inductor/158609 2025-08-14T21:18:06.3012659Z * [new tag] ciflow/inductor/158647 -> ciflow/inductor/158647 2025-08-14T21:18:06.3013231Z * [new tag] ciflow/inductor/158914 -> ciflow/inductor/158914 2025-08-14T21:18:06.3013761Z * [new tag] ciflow/inductor/158932 -> ciflow/inductor/158932 2025-08-14T21:18:06.3014098Z * [new tag] ciflow/inductor/158987 -> ciflow/inductor/158987 2025-08-14T21:18:06.3014716Z * [new tag] ciflow/inductor/159009 -> ciflow/inductor/159009 2025-08-14T21:18:06.3015178Z * [new tag] ciflow/inductor/159010 -> ciflow/inductor/159010 2025-08-14T21:18:06.3015950Z * [new tag] ciflow/inductor/159093 -> ciflow/inductor/159093 2025-08-14T21:18:06.3016085Z * [new tag] ciflow/inductor/159158 -> ciflow/inductor/159158 2025-08-14T21:18:06.3016464Z * [new tag] ciflow/inductor/159197 -> ciflow/inductor/159197 2025-08-14T21:18:06.3018077Z * [new tag] ciflow/inductor/159274 -> ciflow/inductor/159274 2025-08-14T21:18:06.3018222Z * [new tag] ciflow/inductor/159281 -> ciflow/inductor/159281 2025-08-14T21:18:06.3018343Z * [new tag] ciflow/inductor/159329 -> ciflow/inductor/159329 2025-08-14T21:18:06.3018458Z * [new tag] ciflow/inductor/159361 -> ciflow/inductor/159361 2025-08-14T21:18:06.3018625Z * [new tag] ciflow/inductor/159365 -> ciflow/inductor/159365 2025-08-14T21:18:06.3019022Z * [new tag] ciflow/inductor/159366 -> ciflow/inductor/159366 2025-08-14T21:18:06.3019430Z * [new tag] ciflow/inductor/159367 -> ciflow/inductor/159367 2025-08-14T21:18:06.3019948Z * [new tag] ciflow/inductor/159368 -> ciflow/inductor/159368 2025-08-14T21:18:06.3020346Z * [new tag] ciflow/inductor/159473 -> ciflow/inductor/159473 2025-08-14T21:18:06.3020825Z * [new tag] ciflow/inductor/159483 -> ciflow/inductor/159483 2025-08-14T21:18:06.3021167Z * [new tag] ciflow/inductor/159508 -> ciflow/inductor/159508 2025-08-14T21:18:06.3021567Z * [new tag] ciflow/inductor/159523 -> ciflow/inductor/159523 2025-08-14T21:18:06.3022014Z * [new tag] ciflow/inductor/159678 -> ciflow/inductor/159678 2025-08-14T21:18:06.3022442Z * [new tag] ciflow/inductor/159691 -> ciflow/inductor/159691 2025-08-14T21:18:06.3023046Z * [new tag] ciflow/inductor/159778 -> ciflow/inductor/159778 2025-08-14T21:18:06.3023337Z * [new tag] ciflow/inductor/159786 -> ciflow/inductor/159786 2025-08-14T21:18:06.3023729Z * [new tag] ciflow/inductor/159817 -> ciflow/inductor/159817 2025-08-14T21:18:06.3024157Z * [new tag] ciflow/inductor/159842 -> ciflow/inductor/159842 2025-08-14T21:18:06.3025899Z * [new tag] ciflow/inductor/159864 -> ciflow/inductor/159864 2025-08-14T21:18:06.3026184Z * [new tag] ciflow/inductor/159865 -> ciflow/inductor/159865 2025-08-14T21:18:06.3026314Z * [new tag] ciflow/inductor/159869 -> ciflow/inductor/159869 2025-08-14T21:18:06.3026472Z * [new tag] ciflow/inductor/159875 -> ciflow/inductor/159875 2025-08-14T21:18:06.3026611Z * [new tag] ciflow/inductor/159889 -> ciflow/inductor/159889 2025-08-14T21:18:06.3026730Z * [new tag] ciflow/inductor/159902 -> ciflow/inductor/159902 2025-08-14T21:18:06.3028041Z * [new tag] ciflow/inductor/159923 -> ciflow/inductor/159923 2025-08-14T21:18:06.3028340Z * [new tag] ciflow/inductor/159944 -> ciflow/inductor/159944 2025-08-14T21:18:06.3028478Z * [new tag] ciflow/inductor/160004 -> ciflow/inductor/160004 2025-08-14T21:18:06.3028664Z * [new tag] ciflow/inductor/160080 -> ciflow/inductor/160080 2025-08-14T21:18:06.3029051Z * [new tag] ciflow/inductor/160108 -> ciflow/inductor/160108 2025-08-14T21:18:06.3029519Z * [new tag] ciflow/inductor/160109 -> ciflow/inductor/160109 2025-08-14T21:18:06.3030240Z * [new tag] ciflow/inductor/160111 -> ciflow/inductor/160111 2025-08-14T21:18:06.3030425Z * [new tag] ciflow/inductor/160113 -> ciflow/inductor/160113 2025-08-14T21:18:06.3030815Z * [new tag] ciflow/inductor/160127 -> ciflow/inductor/160127 2025-08-14T21:18:06.3032127Z * [new tag] ciflow/inductor/160131 -> ciflow/inductor/160131 2025-08-14T21:18:06.3032439Z * [new tag] ciflow/inductor/160132 -> ciflow/inductor/160132 2025-08-14T21:18:06.3032568Z * [new tag] ciflow/inductor/160136 -> ciflow/inductor/160136 2025-08-14T21:18:06.3032770Z * [new tag] ciflow/inductor/160138 -> ciflow/inductor/160138 2025-08-14T21:18:06.3033001Z * [new tag] ciflow/inductor/160151 -> ciflow/inductor/160151 2025-08-14T21:18:06.3033314Z * [new tag] ciflow/inductor/160152 -> ciflow/inductor/160152 2025-08-14T21:18:06.3033734Z * [new tag] ciflow/inductor/160154 -> ciflow/inductor/160154 2025-08-14T21:18:06.3034153Z * [new tag] ciflow/inductor/160156 -> ciflow/inductor/160156 2025-08-14T21:18:06.3034709Z * [new tag] ciflow/inductor/160161 -> ciflow/inductor/160161 2025-08-14T21:18:06.3035395Z * [new tag] ciflow/inductor/160166 -> ciflow/inductor/160166 2025-08-14T21:18:06.3035582Z * [new tag] ciflow/inductor/160168 -> ciflow/inductor/160168 2025-08-14T21:18:06.3036005Z * [new tag] ciflow/inductor/160174 -> ciflow/inductor/160174 2025-08-14T21:18:06.3036399Z * [new tag] ciflow/inductor/160181 -> ciflow/inductor/160181 2025-08-14T21:18:06.3036844Z * [new tag] ciflow/inductor/160183 -> ciflow/inductor/160183 2025-08-14T21:18:06.3037564Z * [new tag] ciflow/inductor/160190 -> ciflow/inductor/160190 2025-08-14T21:18:06.3037942Z * [new tag] ciflow/inductor/160198 -> ciflow/inductor/160198 2025-08-14T21:18:06.3038386Z * [new tag] ciflow/inductor/160201 -> ciflow/inductor/160201 2025-08-14T21:18:06.3039682Z * [new tag] ciflow/inductor/160209 -> ciflow/inductor/160209 2025-08-14T21:18:06.3039977Z * [new tag] ciflow/inductor/160218 -> ciflow/inductor/160218 2025-08-14T21:18:06.3040372Z * [new tag] ciflow/inductor/160239 -> ciflow/inductor/160239 2025-08-14T21:18:06.3040501Z * [new tag] ciflow/inductor/160250 -> ciflow/inductor/160250 2025-08-14T21:18:06.3040821Z * [new tag] ciflow/inductor/160253 -> ciflow/inductor/160253 2025-08-14T21:18:06.3041236Z * [new tag] ciflow/inductor/160266 -> ciflow/inductor/160266 2025-08-14T21:18:06.3041647Z * [new tag] ciflow/inductor/160282 -> ciflow/inductor/160282 2025-08-14T21:18:06.3042103Z * [new tag] ciflow/inductor/160298 -> ciflow/inductor/160298 2025-08-14T21:18:06.3042448Z * [new tag] ciflow/inductor/160301 -> ciflow/inductor/160301 2025-08-14T21:18:06.3043042Z * [new tag] ciflow/inductor/160310 -> ciflow/inductor/160310 2025-08-14T21:18:06.3043470Z * [new tag] ciflow/inductor/160323 -> ciflow/inductor/160323 2025-08-14T21:18:06.3045461Z * [new tag] ciflow/inductor/160324 -> ciflow/inductor/160324 2025-08-14T21:18:06.3045746Z * [new tag] ciflow/inductor/160325 -> ciflow/inductor/160325 2025-08-14T21:18:06.3045879Z * [new tag] ciflow/inductor/160326 -> ciflow/inductor/160326 2025-08-14T21:18:06.3046164Z * [new tag] ciflow/inductor/160327 -> ciflow/inductor/160327 2025-08-14T21:18:06.3047175Z * [new tag] ciflow/inductor/160328 -> ciflow/inductor/160328 2025-08-14T21:18:06.3047456Z * [new tag] ciflow/inductor/160329 -> ciflow/inductor/160329 2025-08-14T21:18:06.3047743Z * [new tag] ciflow/inductor/160351 -> ciflow/inductor/160351 2025-08-14T21:18:06.3048034Z * [new tag] ciflow/inductor/160353 -> ciflow/inductor/160353 2025-08-14T21:18:06.3048450Z * [new tag] ciflow/inductor/160362 -> ciflow/inductor/160362 2025-08-14T21:18:06.3048848Z * [new tag] ciflow/inductor/160363 -> ciflow/inductor/160363 2025-08-14T21:18:06.3049269Z * [new tag] ciflow/inductor/160364 -> ciflow/inductor/160364 2025-08-14T21:18:06.3049689Z * [new tag] ciflow/inductor/160365 -> ciflow/inductor/160365 2025-08-14T21:18:06.3050092Z * [new tag] ciflow/inductor/160366 -> ciflow/inductor/160366 2025-08-14T21:18:06.3050569Z * [new tag] ciflow/inductor/160367 -> ciflow/inductor/160367 2025-08-14T21:18:06.3050931Z * [new tag] ciflow/inductor/160368 -> ciflow/inductor/160368 2025-08-14T21:18:06.3051341Z * [new tag] ciflow/inductor/160369 -> ciflow/inductor/160369 2025-08-14T21:18:06.3051742Z * [new tag] ciflow/inductor/160371 -> ciflow/inductor/160371 2025-08-14T21:18:06.3052165Z * [new tag] ciflow/inductor/160374 -> ciflow/inductor/160374 2025-08-14T21:18:06.3052665Z * [new tag] ciflow/inductor/160375 -> ciflow/inductor/160375 2025-08-14T21:18:06.3052986Z * [new tag] ciflow/inductor/160377 -> ciflow/inductor/160377 2025-08-14T21:18:06.3053397Z * [new tag] ciflow/inductor/160380 -> ciflow/inductor/160380 2025-08-14T21:18:06.3054625Z * [new tag] ciflow/inductor/160381 -> ciflow/inductor/160381 2025-08-14T21:18:06.3054918Z * [new tag] ciflow/inductor/160383 -> ciflow/inductor/160383 2025-08-14T21:18:06.3055109Z * [new tag] ciflow/inductor/160394 -> ciflow/inductor/160394 2025-08-14T21:18:06.3056480Z * [new tag] ciflow/inductor/160401 -> ciflow/inductor/160401 2025-08-14T21:18:06.3056771Z * [new tag] ciflow/inductor/160402 -> ciflow/inductor/160402 2025-08-14T21:18:06.3056912Z * [new tag] ciflow/inductor/160403 -> ciflow/inductor/160403 2025-08-14T21:18:06.3057095Z * [new tag] ciflow/inductor/160424 -> ciflow/inductor/160424 2025-08-14T21:18:06.3057474Z * [new tag] ciflow/inductor/160426 -> ciflow/inductor/160426 2025-08-14T21:18:06.3057796Z * [new tag] ciflow/inductor/160431 -> ciflow/inductor/160431 2025-08-14T21:18:06.3058333Z * [new tag] ciflow/inductor/160448 -> ciflow/inductor/160448 2025-08-14T21:18:06.3058931Z * [new tag] ciflow/inductor/160450 -> ciflow/inductor/160450 2025-08-14T21:18:06.3059086Z * [new tag] ciflow/inductor/160455 -> ciflow/inductor/160455 2025-08-14T21:18:06.3060111Z * [new tag] ciflow/inductor/160456 -> ciflow/inductor/160456 2025-08-14T21:18:06.3060226Z * [new tag] ciflow/inductor/160461 -> ciflow/inductor/160461 2025-08-14T21:18:06.3060608Z * [new tag] ciflow/inductor/160462 -> ciflow/inductor/160462 2025-08-14T21:18:06.3060958Z * [new tag] ciflow/inductor/160467 -> ciflow/inductor/160467 2025-08-14T21:18:06.3061377Z * [new tag] ciflow/inductor/160470 -> ciflow/inductor/160470 2025-08-14T21:18:06.3061839Z * [new tag] ciflow/inductor/160473 -> ciflow/inductor/160473 2025-08-14T21:18:06.3062477Z * [new tag] ciflow/inductor/160476 -> ciflow/inductor/160476 2025-08-14T21:18:06.3062942Z * [new tag] ciflow/inductor/160480 -> ciflow/inductor/160480 2025-08-14T21:18:06.3063162Z * [new tag] ciflow/inductor/160481 -> ciflow/inductor/160481 2025-08-14T21:18:06.3063715Z * [new tag] ciflow/inductor/160482 -> ciflow/inductor/160482 2025-08-14T21:18:06.3063959Z * [new tag] ciflow/inductor/160483 -> ciflow/inductor/160483 2025-08-14T21:18:06.3064471Z * [new tag] ciflow/inductor/160485 -> ciflow/inductor/160485 2025-08-14T21:18:06.3065657Z * [new tag] ciflow/inductor/160486 -> ciflow/inductor/160486 2025-08-14T21:18:06.3065918Z * [new tag] ciflow/inductor/160503 -> ciflow/inductor/160503 2025-08-14T21:18:06.3066046Z * [new tag] ciflow/inductor/160510 -> ciflow/inductor/160510 2025-08-14T21:18:06.3066173Z * [new tag] ciflow/inductor/160527 -> ciflow/inductor/160527 2025-08-14T21:18:06.3066627Z * [new tag] ciflow/inductor/160530 -> ciflow/inductor/160530 2025-08-14T21:18:06.3067042Z * [new tag] ciflow/inductor/160531 -> ciflow/inductor/160531 2025-08-14T21:18:06.3067468Z * [new tag] ciflow/inductor/160538 -> ciflow/inductor/160538 2025-08-14T21:18:06.3070912Z * [new tag] ciflow/inductor/160539 -> ciflow/inductor/160539 2025-08-14T21:18:06.3071204Z * [new tag] ciflow/inductor/160540 -> ciflow/inductor/160540 2025-08-14T21:18:06.3071349Z * [new tag] ciflow/inductor/160548 -> ciflow/inductor/160548 2025-08-14T21:18:06.3071481Z * [new tag] ciflow/inductor/160561 -> ciflow/inductor/160561 2025-08-14T21:18:06.3071589Z * [new tag] ciflow/inductor/160576 -> ciflow/inductor/160576 2025-08-14T21:18:06.3071695Z * [new tag] ciflow/inductor/160578 -> ciflow/inductor/160578 2025-08-14T21:18:06.3071933Z * [new tag] ciflow/inductor/160580 -> ciflow/inductor/160580 2025-08-14T21:18:06.3072043Z * [new tag] ciflow/inductor/160583 -> ciflow/inductor/160583 2025-08-14T21:18:06.3072292Z * [new tag] ciflow/inductor/160589 -> ciflow/inductor/160589 2025-08-14T21:18:06.3073500Z * [new tag] ciflow/inductor/160590 -> ciflow/inductor/160590 2025-08-14T21:18:06.3073786Z * [new tag] ciflow/inductor/160592 -> ciflow/inductor/160592 2025-08-14T21:18:06.3073918Z * [new tag] ciflow/inductor/160596 -> ciflow/inductor/160596 2025-08-14T21:18:06.3074479Z * [new tag] ciflow/inductor/160601 -> ciflow/inductor/160601 2025-08-14T21:18:06.3075617Z * [new tag] ciflow/inductor/160607 -> ciflow/inductor/160607 2025-08-14T21:18:06.3075911Z * [new tag] ciflow/inductor/160608 -> ciflow/inductor/160608 2025-08-14T21:18:06.3076041Z * [new tag] ciflow/inductor/160611 -> ciflow/inductor/160611 2025-08-14T21:18:06.3076332Z * [new tag] ciflow/inductor/160614 -> ciflow/inductor/160614 2025-08-14T21:18:06.3076765Z * [new tag] ciflow/inductor/160616 -> ciflow/inductor/160616 2025-08-14T21:18:06.3077253Z * [new tag] ciflow/inductor/160619 -> ciflow/inductor/160619 2025-08-14T21:18:06.3077599Z * [new tag] ciflow/inductor/160625 -> ciflow/inductor/160625 2025-08-14T21:18:06.3078005Z * [new tag] ciflow/inductor/160635 -> ciflow/inductor/160635 2025-08-14T21:18:06.3079947Z * [new tag] ciflow/inductor/160649 -> ciflow/inductor/160649 2025-08-14T21:18:06.3080244Z * [new tag] ciflow/inductor/160658 -> ciflow/inductor/160658 2025-08-14T21:18:06.3080381Z * [new tag] ciflow/inductor/160662 -> ciflow/inductor/160662 2025-08-14T21:18:06.3080491Z * [new tag] ciflow/inductor/160668 -> ciflow/inductor/160668 2025-08-14T21:18:06.3080738Z * [new tag] ciflow/inductor/160669 -> ciflow/inductor/160669 2025-08-14T21:18:06.3081338Z * [new tag] ciflow/inductor/160670 -> ciflow/inductor/160670 2025-08-14T21:18:06.3081614Z * [new tag] ciflow/inductor/160671 -> ciflow/inductor/160671 2025-08-14T21:18:06.3081767Z * [new tag] ciflow/inductor/160677 -> ciflow/inductor/160677 2025-08-14T21:18:06.3082078Z * [new tag] ciflow/inductor/160679 -> ciflow/inductor/160679 2025-08-14T21:18:06.3082848Z * [new tag] ciflow/inductor/3b9a386 -> ciflow/inductor/3b9a386 2025-08-14T21:18:06.3083190Z * [new tag] ciflow/inductor/3d4b92b -> ciflow/inductor/3d4b92b 2025-08-14T21:18:06.3085088Z * [new tag] ciflow/inductor/d224ac7 -> ciflow/inductor/d224ac7 2025-08-14T21:18:06.3085393Z * [new tag] ciflow/linux-aarch64/147855 -> ciflow/linux-aarch64/147855 2025-08-14T21:18:06.3085555Z * [new tag] ciflow/linux-aarch64/157994 -> ciflow/linux-aarch64/157994 2025-08-14T21:18:06.3085695Z * [new tag] ciflow/linux-aarch64/159737 -> ciflow/linux-aarch64/159737 2025-08-14T21:18:06.3085904Z * [new tag] ciflow/linux-aarch64/160078 -> ciflow/linux-aarch64/160078 2025-08-14T21:18:06.3086035Z * [new tag] ciflow/linux-aarch64/160299 -> ciflow/linux-aarch64/160299 2025-08-14T21:18:06.3086495Z * [new tag] ciflow/linux-aarch64/160301 -> ciflow/linux-aarch64/160301 2025-08-14T21:18:06.3087266Z * [new tag] ciflow/mps/155923 -> ciflow/mps/155923 2025-08-14T21:18:06.3087387Z * [new tag] ciflow/mps/157553 -> ciflow/mps/157553 2025-08-14T21:18:06.3087805Z * [new tag] ciflow/mps/157635 -> ciflow/mps/157635 2025-08-14T21:18:06.3088138Z * [new tag] ciflow/mps/160541 -> ciflow/mps/160541 2025-08-14T21:18:06.3089675Z * [new tag] ciflow/nightly/156049 -> ciflow/nightly/156049 2025-08-14T21:18:06.3089960Z * [new tag] ciflow/nightly/158104 -> ciflow/nightly/158104 2025-08-14T21:18:06.3090129Z * [new tag] ciflow/op-benchmark/157994 -> ciflow/op-benchmark/157994 2025-08-14T21:18:06.3090411Z * [new tag] ciflow/periodic-rocm-mi300/139971 -> ciflow/periodic-rocm-mi300/139971 2025-08-14T21:18:06.3090603Z * [new tag] ciflow/periodic-rocm-mi300/160073 -> ciflow/periodic-rocm-mi300/160073 2025-08-14T21:18:06.3091379Z * [new tag] ciflow/periodic-rocm-mi300/160538 -> ciflow/periodic-rocm-mi300/160538 2025-08-14T21:18:06.3091557Z * [new tag] ciflow/periodic/054a2fd -> ciflow/periodic/054a2fd 2025-08-14T21:18:06.3091970Z * [new tag] ciflow/periodic/131296 -> ciflow/periodic/131296 2025-08-14T21:18:06.3092441Z * [new tag] ciflow/periodic/139971 -> ciflow/periodic/139971 2025-08-14T21:18:06.3092763Z * [new tag] ciflow/periodic/143959 -> ciflow/periodic/143959 2025-08-14T21:18:06.3093158Z * [new tag] ciflow/periodic/154595 -> ciflow/periodic/154595 2025-08-14T21:18:06.3093569Z * [new tag] ciflow/periodic/156703 -> ciflow/periodic/156703 2025-08-14T21:18:06.3093952Z * [new tag] ciflow/periodic/160201 -> ciflow/periodic/160201 2025-08-14T21:18:06.3094422Z * [new tag] ciflow/periodic/160424 -> ciflow/periodic/160424 2025-08-14T21:18:06.3094879Z * [new tag] ciflow/periodic/160538 -> ciflow/periodic/160538 2025-08-14T21:18:06.3098477Z * [new tag] ciflow/periodic/1febab2a89302464f6c7d69cfbef7a24c421ea65 -> ciflow/periodic/1febab2a89302464f6c7d69cfbef7a24c421ea65 2025-08-14T21:18:06.3098788Z * [new tag] ciflow/periodic/2a6d37d -> ciflow/periodic/2a6d37d 2025-08-14T21:18:06.3099114Z * [new tag] ciflow/periodic/2ee22e435131369a7e4f8cc4732579acc29a941b -> ciflow/periodic/2ee22e435131369a7e4f8cc4732579acc29a941b 2025-08-14T21:18:06.3099903Z * [new tag] ciflow/periodic/317eeb8 -> ciflow/periodic/317eeb8 2025-08-14T21:18:06.3100060Z * [new tag] ciflow/periodic/3c32 -> ciflow/periodic/3c32 2025-08-14T21:18:06.3100190Z * [new tag] ciflow/periodic/3e98831 -> ciflow/periodic/3e98831 2025-08-14T21:18:06.3100473Z * [new tag] ciflow/periodic/4a773e1e867f28a8ff0b15203e5cd9548f74fcee -> ciflow/periodic/4a773e1e867f28a8ff0b15203e5cd9548f74fcee 2025-08-14T21:18:06.3101263Z * [new tag] ciflow/periodic/5f5f508aa836a46dfe88857fb223049616b94e93 -> ciflow/periodic/5f5f508aa836a46dfe88857fb223049616b94e93 2025-08-14T21:18:06.3101429Z * [new tag] ciflow/periodic/94512-point -> ciflow/periodic/94512-point 2025-08-14T21:18:06.3101577Z * [new tag] ciflow/periodic/csl/test87519 -> ciflow/periodic/csl/test87519 2025-08-14T21:18:06.3102104Z * [new tag] ciflow/periodic/csltest88275 -> ciflow/periodic/csltest88275 2025-08-14T21:18:06.3102532Z * [new tag] ciflow/periodic/csltest88761 -> ciflow/periodic/csltest88761 2025-08-14T21:18:06.3103647Z * [new tag] ciflow/periodic/d7114f05b10de8e6de81ffc567d63944c3117d51 -> ciflow/periodic/d7114f05b10de8e6de81ffc567d63944c3117d51 2025-08-14T21:18:06.3103789Z * [new tag] ciflow/periodic/release_1.12 -> ciflow/periodic/release_1.12 2025-08-14T21:18:06.3104603Z * [new tag] ciflow/periodic/release_1.12.0 -> ciflow/periodic/release_1.12.0 2025-08-14T21:18:06.3105059Z * [new tag] ciflow/periodic/sha-ec5b83 -> ciflow/periodic/sha-ec5b83 2025-08-14T21:18:06.3107681Z * [new tag] ciflow/rocm-mi300/151360 -> ciflow/rocm-mi300/151360 2025-08-14T21:18:06.3107977Z * [new tag] ciflow/rocm-mi300/159158 -> ciflow/rocm-mi300/159158 2025-08-14T21:18:06.3108116Z * [new tag] ciflow/rocm-mi300/160073 -> ciflow/rocm-mi300/160073 2025-08-14T21:18:06.3108223Z * [new tag] ciflow/rocm-mi300/160468 -> ciflow/rocm-mi300/160468 2025-08-14T21:18:06.3108465Z * [new tag] ciflow/rocm-mi300/160538 -> ciflow/rocm-mi300/160538 2025-08-14T21:18:06.3108573Z * [new tag] ciflow/rocm-mi355/160215 -> ciflow/rocm-mi355/160215 2025-08-14T21:18:06.3109050Z * [new tag] ciflow/rocm/148492 -> ciflow/rocm/148492 2025-08-14T21:18:06.3109329Z * [new tag] ciflow/rocm/151360 -> ciflow/rocm/151360 2025-08-14T21:18:06.3109432Z * [new tag] ciflow/rocm/151845 -> ciflow/rocm/151845 2025-08-14T21:18:06.3109667Z * [new tag] ciflow/rocm/154864 -> ciflow/rocm/154864 2025-08-14T21:18:06.3110016Z * [new tag] ciflow/rocm/156491 -> ciflow/rocm/156491 2025-08-14T21:18:06.3110417Z * [new tag] ciflow/rocm/158219 -> ciflow/rocm/158219 2025-08-14T21:18:06.3110830Z * [new tag] ciflow/rocm/158220 -> ciflow/rocm/158220 2025-08-14T21:18:06.3111983Z * [new tag] ciflow/rocm/158224 -> ciflow/rocm/158224 2025-08-14T21:18:06.3112269Z * [new tag] ciflow/rocm/159158 -> ciflow/rocm/159158 2025-08-14T21:18:06.3112390Z * [new tag] ciflow/rocm/160215 -> ciflow/rocm/160215 2025-08-14T21:18:06.3112489Z * [new tag] ciflow/rocm/160468 -> ciflow/rocm/160468 2025-08-14T21:18:06.3113832Z * [new tag] ciflow/rocm/160538 -> ciflow/rocm/160538 2025-08-14T21:18:06.3114126Z * [new tag] ciflow/s390/143959 -> ciflow/s390/143959 2025-08-14T21:18:06.3114254Z * [new tag] ciflow/slow/01c7106 -> ciflow/slow/01c7106 2025-08-14T21:18:06.3114554Z * [new tag] ciflow/slow/0577043 -> ciflow/slow/0577043 2025-08-14T21:18:06.3116133Z * [new tag] ciflow/slow/0d5b74da0cab798fbfdb9caa53fad816999c8386-sdym -> ciflow/slow/0d5b74da0cab798fbfdb9caa53fad816999c8386-sdym 2025-08-14T21:18:06.3116557Z * [new tag] ciflow/slow/0e81104 -> ciflow/slow/0e81104 2025-08-14T21:18:06.3116811Z * [new tag] ciflow/slow/154595 -> ciflow/slow/154595 2025-08-14T21:18:06.3116929Z * [new tag] ciflow/slow/1732077 -> ciflow/slow/1732077 2025-08-14T21:18:06.3117458Z * [new tag] ciflow/slow/187eb7c -> ciflow/slow/187eb7c 2025-08-14T21:18:06.3117839Z * [new tag] ciflow/slow/1faef89 -> ciflow/slow/1faef89 2025-08-14T21:18:06.3118845Z * [new tag] ciflow/slow/3920ec1 -> ciflow/slow/3920ec1 2025-08-14T21:18:06.3119184Z * [new tag] ciflow/slow/3b7c6b2 -> ciflow/slow/3b7c6b2 2025-08-14T21:18:06.3120480Z * [new tag] ciflow/slow/59a3759 -> ciflow/slow/59a3759 2025-08-14T21:18:06.3120759Z * [new tag] ciflow/slow/70ef0bb -> ciflow/slow/70ef0bb 2025-08-14T21:18:06.3120909Z * [new tag] ciflow/slow/788ff06 -> ciflow/slow/788ff06 2025-08-14T21:18:06.3122262Z * [new tag] ciflow/slow/8751002215790a3a88750faa8f4366933e296693-sdym -> ciflow/slow/8751002215790a3a88750faa8f4366933e296693-sdym 2025-08-14T21:18:06.3122557Z * [new tag] ciflow/slow/9d85864 -> ciflow/slow/9d85864 2025-08-14T21:18:06.3122687Z * [new tag] ciflow/slow/9ffad5b -> ciflow/slow/9ffad5b 2025-08-14T21:18:06.3123014Z * [new tag] ciflow/slow/a206e8b -> ciflow/slow/a206e8b 2025-08-14T21:18:06.3123612Z * [new tag] ciflow/slow/a837609 -> ciflow/slow/a837609 2025-08-14T21:18:06.3124448Z * [new tag] ciflow/slow/af841f3 -> ciflow/slow/af841f3 2025-08-14T21:18:06.3124766Z * [new tag] ciflow/slow/da3aba1e46157c4df504b067477cdf2b3c96b194-sdym -> ciflow/slow/da3aba1e46157c4df504b067477cdf2b3c96b194-sdym 2025-08-14T21:18:06.3125170Z * [new tag] ciflow/trunk/131296 -> ciflow/trunk/131296 2025-08-14T21:18:06.3125566Z * [new tag] ciflow/trunk/137400 -> ciflow/trunk/137400 2025-08-14T21:18:06.3126005Z * [new tag] ciflow/trunk/138996 -> ciflow/trunk/138996 2025-08-14T21:18:06.3126422Z * [new tag] ciflow/trunk/139971 -> ciflow/trunk/139971 2025-08-14T21:18:06.3126789Z * [new tag] ciflow/trunk/147360 -> ciflow/trunk/147360 2025-08-14T21:18:06.3127227Z * [new tag] ciflow/trunk/147855 -> ciflow/trunk/147855 2025-08-14T21:18:06.3127591Z * [new tag] ciflow/trunk/148180 -> ciflow/trunk/148180 2025-08-14T21:18:06.3127995Z * [new tag] ciflow/trunk/148328 -> ciflow/trunk/148328 2025-08-14T21:18:06.3128400Z * [new tag] ciflow/trunk/148492 -> ciflow/trunk/148492 2025-08-14T21:18:06.3131729Z * [new tag] ciflow/trunk/150282 -> ciflow/trunk/150282 2025-08-14T21:18:06.3132033Z * [new tag] ciflow/trunk/150302 -> ciflow/trunk/150302 2025-08-14T21:18:06.3132182Z * [new tag] ciflow/trunk/151845 -> ciflow/trunk/151845 2025-08-14T21:18:06.3132296Z * [new tag] ciflow/trunk/152624 -> ciflow/trunk/152624 2025-08-14T21:18:06.3132481Z * [new tag] ciflow/trunk/154193 -> ciflow/trunk/154193 2025-08-14T21:18:06.3132599Z * [new tag] ciflow/trunk/154595 -> ciflow/trunk/154595 2025-08-14T21:18:06.3133212Z * [new tag] ciflow/trunk/154650 -> ciflow/trunk/154650 2025-08-14T21:18:06.3133354Z * [new tag] ciflow/trunk/154694 -> ciflow/trunk/154694 2025-08-14T21:18:06.3133460Z * [new tag] ciflow/trunk/155958 -> ciflow/trunk/155958 2025-08-14T21:18:06.3133561Z * [new tag] ciflow/trunk/156049 -> ciflow/trunk/156049 2025-08-14T21:18:06.3133947Z * [new tag] ciflow/trunk/156703 -> ciflow/trunk/156703 2025-08-14T21:18:06.3134167Z * [new tag] ciflow/trunk/156851 -> ciflow/trunk/156851 2025-08-14T21:18:06.3134277Z * [new tag] ciflow/trunk/157148 -> ciflow/trunk/157148 2025-08-14T21:18:06.3134912Z * [new tag] ciflow/trunk/157152 -> ciflow/trunk/157152 2025-08-14T21:18:06.3135210Z * [new tag] ciflow/trunk/157432 -> ciflow/trunk/157432 2025-08-14T21:18:06.3135578Z * [new tag] ciflow/trunk/157685 -> ciflow/trunk/157685 2025-08-14T21:18:06.3135982Z * [new tag] ciflow/trunk/157689 -> ciflow/trunk/157689 2025-08-14T21:18:06.3136421Z * [new tag] ciflow/trunk/157699 -> ciflow/trunk/157699 2025-08-14T21:18:06.3136819Z * [new tag] ciflow/trunk/157813 -> ciflow/trunk/157813 2025-08-14T21:18:06.3137333Z * [new tag] ciflow/trunk/157994 -> ciflow/trunk/157994 2025-08-14T21:18:06.3137636Z * [new tag] ciflow/trunk/158091 -> ciflow/trunk/158091 2025-08-14T21:18:06.3138038Z * [new tag] ciflow/trunk/158104 -> ciflow/trunk/158104 2025-08-14T21:18:06.3138457Z * [new tag] ciflow/trunk/158219 -> ciflow/trunk/158219 2025-08-14T21:18:06.3138856Z * [new tag] ciflow/trunk/158220 -> ciflow/trunk/158220 2025-08-14T21:18:06.3139318Z * [new tag] ciflow/trunk/158224 -> ciflow/trunk/158224 2025-08-14T21:18:06.3139729Z * [new tag] ciflow/trunk/158529 -> ciflow/trunk/158529 2025-08-14T21:18:06.3141060Z * [new tag] ciflow/trunk/158647 -> ciflow/trunk/158647 2025-08-14T21:18:06.3141190Z * [new tag] ciflow/trunk/158810 -> ciflow/trunk/158810 2025-08-14T21:18:06.3141297Z * [new tag] ciflow/trunk/158812 -> ciflow/trunk/158812 2025-08-14T21:18:06.3141484Z * [new tag] ciflow/trunk/158863 -> ciflow/trunk/158863 2025-08-14T21:18:06.3141851Z * [new tag] ciflow/trunk/158864 -> ciflow/trunk/158864 2025-08-14T21:18:06.3142263Z * [new tag] ciflow/trunk/158883 -> ciflow/trunk/158883 2025-08-14T21:18:06.3142674Z * [new tag] ciflow/trunk/158914 -> ciflow/trunk/158914 2025-08-14T21:18:06.3143148Z * [new tag] ciflow/trunk/158965 -> ciflow/trunk/158965 2025-08-14T21:18:06.3143483Z * [new tag] ciflow/trunk/158987 -> ciflow/trunk/158987 2025-08-14T21:18:06.3144243Z * [new tag] ciflow/trunk/159033 -> ciflow/trunk/159033 2025-08-14T21:18:06.3144834Z * [new tag] ciflow/trunk/159140 -> ciflow/trunk/159140 2025-08-14T21:18:06.3144984Z * [new tag] ciflow/trunk/159158 -> ciflow/trunk/159158 2025-08-14T21:18:06.3145354Z * [new tag] ciflow/trunk/159553 -> ciflow/trunk/159553 2025-08-14T21:18:06.3145754Z * [new tag] ciflow/trunk/159562 -> ciflow/trunk/159562 2025-08-14T21:18:06.3146555Z * [new tag] ciflow/trunk/159682 -> ciflow/trunk/159682 2025-08-14T21:18:06.3146681Z * [new tag] ciflow/trunk/159691 -> ciflow/trunk/159691 2025-08-14T21:18:06.3147073Z * [new tag] ciflow/trunk/159842 -> ciflow/trunk/159842 2025-08-14T21:18:06.3147494Z * [new tag] ciflow/trunk/159889 -> ciflow/trunk/159889 2025-08-14T21:18:06.3147914Z * [new tag] ciflow/trunk/159923 -> ciflow/trunk/159923 2025-08-14T21:18:06.3148458Z * [new tag] ciflow/trunk/160004 -> ciflow/trunk/160004 2025-08-14T21:18:06.3148995Z * [new tag] ciflow/trunk/160113 -> ciflow/trunk/160113 2025-08-14T21:18:06.3149439Z * [new tag] ciflow/trunk/160161 -> ciflow/trunk/160161 2025-08-14T21:18:06.3149705Z * [new tag] ciflow/trunk/160168 -> ciflow/trunk/160168 2025-08-14T21:18:06.3150029Z * [new tag] ciflow/trunk/160181 -> ciflow/trunk/160181 2025-08-14T21:18:06.3150465Z * [new tag] ciflow/trunk/160183 -> ciflow/trunk/160183 2025-08-14T21:18:06.3150851Z * [new tag] ciflow/trunk/160190 -> ciflow/trunk/160190 2025-08-14T21:18:06.3151321Z * [new tag] ciflow/trunk/160198 -> ciflow/trunk/160198 2025-08-14T21:18:06.3151678Z * [new tag] ciflow/trunk/160205 -> ciflow/trunk/160205 2025-08-14T21:18:06.3153530Z * [new tag] ciflow/trunk/160219 -> ciflow/trunk/160219 2025-08-14T21:18:06.3153813Z * [new tag] ciflow/trunk/160224 -> ciflow/trunk/160224 2025-08-14T21:18:06.3153952Z * [new tag] ciflow/trunk/160250 -> ciflow/trunk/160250 2025-08-14T21:18:06.3154147Z * [new tag] ciflow/trunk/160253 -> ciflow/trunk/160253 2025-08-14T21:18:06.3154271Z * [new tag] ciflow/trunk/160335 -> ciflow/trunk/160335 2025-08-14T21:18:06.3154482Z * [new tag] ciflow/trunk/160338 -> ciflow/trunk/160338 2025-08-14T21:18:06.3154985Z * [new tag] ciflow/trunk/160383 -> ciflow/trunk/160383 2025-08-14T21:18:06.3155584Z * [new tag] ciflow/trunk/160401 -> ciflow/trunk/160401 2025-08-14T21:18:06.3156007Z * [new tag] ciflow/trunk/160403 -> ciflow/trunk/160403 2025-08-14T21:18:06.3156218Z * [new tag] ciflow/trunk/160430 -> ciflow/trunk/160430 2025-08-14T21:18:06.3157894Z * [new tag] ciflow/trunk/160431 -> ciflow/trunk/160431 2025-08-14T21:18:06.3158175Z * [new tag] ciflow/trunk/160439 -> ciflow/trunk/160439 2025-08-14T21:18:06.3158323Z * [new tag] ciflow/trunk/160449 -> ciflow/trunk/160449 2025-08-14T21:18:06.3158503Z * [new tag] ciflow/trunk/160454 -> ciflow/trunk/160454 2025-08-14T21:18:06.3158691Z * [new tag] ciflow/trunk/160468 -> ciflow/trunk/160468 2025-08-14T21:18:06.3159552Z * [new tag] ciflow/trunk/160481 -> ciflow/trunk/160481 2025-08-14T21:18:06.3159903Z * [new tag] ciflow/trunk/160485 -> ciflow/trunk/160485 2025-08-14T21:18:06.3160026Z * [new tag] ciflow/trunk/160519 -> ciflow/trunk/160519 2025-08-14T21:18:06.3162032Z * [new tag] ciflow/trunk/160527 -> ciflow/trunk/160527 2025-08-14T21:18:06.3162321Z * [new tag] ciflow/trunk/160560 -> ciflow/trunk/160560 2025-08-14T21:18:06.3162442Z * [new tag] ciflow/trunk/160578 -> ciflow/trunk/160578 2025-08-14T21:18:06.3162625Z * [new tag] ciflow/trunk/160589 -> ciflow/trunk/160589 2025-08-14T21:18:06.3162863Z * [new tag] ciflow/trunk/160592 -> ciflow/trunk/160592 2025-08-14T21:18:06.3162966Z * [new tag] ciflow/trunk/160649 -> ciflow/trunk/160649 2025-08-14T21:18:06.3163084Z * [new tag] ciflow/trunk/160656 -> ciflow/trunk/160656 2025-08-14T21:18:06.3164935Z * [new tag] ciflow/unstable/123 -> ciflow/unstable/123 2025-08-14T21:18:06.3165080Z * [new tag] ciflow/vllm/160116 -> ciflow/vllm/160116 2025-08-14T21:18:06.3165197Z * [new tag] ciflow/vllm/160583 -> ciflow/vllm/160583 2025-08-14T21:18:06.3165298Z * [new tag] ciflow/vllm/160619 -> ciflow/vllm/160619 2025-08-14T21:18:06.3165726Z * [new tag] ciflow/vllm/160625 -> ciflow/vllm/160625 2025-08-14T21:18:06.3165874Z * [new tag] ciflow/vllm/160627 -> ciflow/vllm/160627 2025-08-14T21:18:06.3166676Z * [new tag] ciflow/win-arm64/156049 -> ciflow/win-arm64/156049 2025-08-14T21:18:06.3166811Z * [new tag] ciflow/win-arm64/158104 -> ciflow/win-arm64/158104 2025-08-14T21:18:06.3167207Z * [new tag] ciflow/win-arm64/159553 -> ciflow/win-arm64/159553 2025-08-14T21:18:06.3167611Z * [new tag] ciflow/win-arm64/159562 -> ciflow/win-arm64/159562 2025-08-14T21:18:06.3168897Z * [new tag] ciflow/win-arm64/159777 -> ciflow/win-arm64/159777 2025-08-14T21:18:06.3169194Z * [new tag] ciflow/win-arm64/159780 -> ciflow/win-arm64/159780 2025-08-14T21:18:06.3169321Z * [new tag] ciflow/win-arm64/159842 -> ciflow/win-arm64/159842 2025-08-14T21:18:06.3169517Z * [new tag] ciflow/win-arm64/160250 -> ciflow/win-arm64/160250 2025-08-14T21:18:06.3169746Z * [new tag] ciflow/win-arm64/160253 -> ciflow/win-arm64/160253 2025-08-14T21:18:06.3170093Z * [new tag] ciflow/win-arm64/160454 -> ciflow/win-arm64/160454 2025-08-14T21:18:06.3170477Z * [new tag] ciflow/win-arm64/160560 -> ciflow/win-arm64/160560 2025-08-14T21:18:06.3171274Z * [new tag] ciflow/xpu/138996 -> ciflow/xpu/138996 2025-08-14T21:18:06.3171823Z * [new tag] ciflow/xpu/139971 -> ciflow/xpu/139971 2025-08-14T21:18:06.3172107Z * [new tag] ciflow/xpu/140972 -> ciflow/xpu/140972 2025-08-14T21:18:06.3172226Z * [new tag] ciflow/xpu/143553 -> ciflow/xpu/143553 2025-08-14T21:18:06.3173145Z * [new tag] ciflow/xpu/156272 -> ciflow/xpu/156272 2025-08-14T21:18:06.3173263Z * [new tag] ciflow/xpu/156812 -> ciflow/xpu/156812 2025-08-14T21:18:06.3173562Z * [new tag] ciflow/xpu/157699 -> ciflow/xpu/157699 2025-08-14T21:18:06.3173956Z * [new tag] ciflow/xpu/157994 -> ciflow/xpu/157994 2025-08-14T21:18:06.3174375Z * [new tag] ciflow/xpu/158336 -> ciflow/xpu/158336 2025-08-14T21:18:06.3174786Z * [new tag] ciflow/xpu/158733 -> ciflow/xpu/158733 2025-08-14T21:18:06.3175198Z * [new tag] ciflow/xpu/159033 -> ciflow/xpu/159033 2025-08-14T21:18:06.3176367Z * [new tag] ciflow/xpu/159118 -> ciflow/xpu/159118 2025-08-14T21:18:06.3176728Z * [new tag] ciflow/xpu/159140 -> ciflow/xpu/159140 2025-08-14T21:18:06.3176839Z * [new tag] ciflow/xpu/159241 -> ciflow/xpu/159241 2025-08-14T21:18:06.3178106Z * [new tag] ciflow/xpu/159473 -> ciflow/xpu/159473 2025-08-14T21:18:06.3178241Z * [new tag] ciflow/xpu/159474 -> ciflow/xpu/159474 2025-08-14T21:18:06.3178539Z * [new tag] ciflow/xpu/159553 -> ciflow/xpu/159553 2025-08-14T21:18:06.3178952Z * [new tag] ciflow/xpu/159944 -> ciflow/xpu/159944 2025-08-14T21:18:06.3179709Z * [new tag] ciflow/xpu/160062 -> ciflow/xpu/160062 2025-08-14T21:18:06.3180093Z * [new tag] ciflow/xpu/160067 -> ciflow/xpu/160067 2025-08-14T21:18:06.3180636Z * [new tag] ciflow/xpu/160158 -> ciflow/xpu/160158 2025-08-14T21:18:06.3180795Z * [new tag] ciflow/xpu/160173 -> ciflow/xpu/160173 2025-08-14T21:18:06.3182711Z * [new tag] ciflow/xpu/160183 -> ciflow/xpu/160183 2025-08-14T21:18:06.3182851Z * [new tag] ciflow/xpu/160301 -> ciflow/xpu/160301 2025-08-14T21:18:06.3182953Z * [new tag] ciflow/xpu/160403 -> ciflow/xpu/160403 2025-08-14T21:18:06.3183049Z * [new tag] ciflow/xpu/160606 -> ciflow/xpu/160606 2025-08-14T21:18:06.3183186Z * [new tag] cslpull75 -> cslpull75 2025-08-14T21:18:06.3183928Z * [new tag] cslpull76 -> cslpull76 2025-08-14T21:18:06.3184066Z * [new tag] cslpull77 -> cslpull77 2025-08-14T21:18:06.3184789Z * [new tag] cslpull78 -> cslpull78 2025-08-14T21:18:06.3187452Z * [new tag] cslpull79 -> cslpull79 2025-08-14T21:18:06.3191114Z * [new tag] cslpull80 -> cslpull80 2025-08-14T21:18:06.3193171Z * [new tag] cslpull81 -> cslpull81 2025-08-14T21:18:06.3193403Z * [new tag] cslpull82 -> cslpull82 2025-08-14T21:18:06.3193615Z * [new tag] cslpull83 -> cslpull83 2025-08-14T21:18:06.3193707Z * [new tag] cslpull84 -> cslpull84 2025-08-14T21:18:06.3193879Z * [new tag] cslpull85 -> cslpull85 2025-08-14T21:18:06.3194176Z * [new tag] cslpull86 -> cslpull86 2025-08-14T21:18:06.3194272Z * [new tag] cslpull87 -> cslpull87 2025-08-14T21:18:06.3194360Z * [new tag] cslpull88 -> cslpull88 2025-08-14T21:18:06.3194885Z * [new tag] cslpull89 -> cslpull89 2025-08-14T21:18:06.3194994Z * [new tag] cslpull90 -> cslpull90 2025-08-14T21:18:06.3195105Z * [new tag] cslpull91 -> cslpull91 2025-08-14T21:18:06.3195189Z * [new tag] cslpull92 -> cslpull92 2025-08-14T21:18:06.3195288Z * [new tag] flight_5 -> flight_5 2025-08-14T21:18:06.3195387Z * [new tag] flight_5.1 -> flight_5.1 2025-08-14T21:18:06.3195477Z * [new tag] flight_5.2 -> flight_5.2 2025-08-14T21:18:06.3195569Z * [new tag] flight_5.3 -> flight_5.3 2025-08-14T21:18:06.3195664Z * [new tag] forpull1 -> forpull1 2025-08-14T21:18:06.3199304Z * [new tag] malfet/tag-2ef5611 -> malfet/tag-2ef5611 2025-08-14T21:18:06.3202784Z * [new tag] malfet/tag-317b1a0 -> malfet/tag-317b1a0 2025-08-14T21:18:06.3206275Z * [new tag] malfet/tag-ec6f767 -> malfet/tag-ec6f767 2025-08-14T21:18:06.3210462Z * [new tag] nightly-binary -> nightly-binary 2025-08-14T21:18:06.3213951Z * [new tag] sqzhang_flight4_plus -> sqzhang_flight4_plus 2025-08-14T21:18:06.3217419Z * [new tag] sqzhang_flight_3 -> sqzhang_flight_3 2025-08-14T21:18:06.3219717Z * [new tag] trunk/01584d2a7d029c9749eb73678cf1dc313cc35df6 -> trunk/01584d2a7d029c9749eb73678cf1dc313cc35df6 2025-08-14T21:18:06.3219990Z * [new tag] trunk/017259f9c65b6fad55fb9597d7077e2543eaae46 -> trunk/017259f9c65b6fad55fb9597d7077e2543eaae46 2025-08-14T21:18:06.3220208Z * [new tag] trunk/01bcf9a40dea937637d2cdd530bed2652510943d -> trunk/01bcf9a40dea937637d2cdd530bed2652510943d 2025-08-14T21:18:06.3220424Z * [new tag] trunk/01f66d08d93365015f4af005a252f439c4d4013a -> trunk/01f66d08d93365015f4af005a252f439c4d4013a 2025-08-14T21:18:06.3220629Z * [new tag] trunk/03b254e49f2d4c092e6ca712e5702cf2895aa47e -> trunk/03b254e49f2d4c092e6ca712e5702cf2895aa47e 2025-08-14T21:18:06.3220851Z * [new tag] trunk/05029ad1c30865d3f7e7fd13384db9d826e563eb -> trunk/05029ad1c30865d3f7e7fd13384db9d826e563eb 2025-08-14T21:18:06.3221055Z * [new tag] trunk/05c19d1acecc01b0d2512364183058a6885b9869 -> trunk/05c19d1acecc01b0d2512364183058a6885b9869 2025-08-14T21:18:06.3221261Z * [new tag] trunk/05c417715f791875fbf28cfc3fc86142de1a3206 -> trunk/05c417715f791875fbf28cfc3fc86142de1a3206 2025-08-14T21:18:06.3221637Z * [new tag] trunk/06824f3c7268bb807a422b663047cd0900ddd126 -> trunk/06824f3c7268bb807a422b663047cd0900ddd126 2025-08-14T21:18:06.3221859Z * [new tag] trunk/077cb389746a7d61cfc018aad2ba29a8aa195610 -> trunk/077cb389746a7d61cfc018aad2ba29a8aa195610 2025-08-14T21:18:06.3222075Z * [new tag] trunk/089c4a1ba007ed4abb3e5e0eafd97b7584566057 -> trunk/089c4a1ba007ed4abb3e5e0eafd97b7584566057 2025-08-14T21:18:06.3222293Z * [new tag] trunk/09381f5dacda7bbbfa361f5df76bde5cd309adc1 -> trunk/09381f5dacda7bbbfa361f5df76bde5cd309adc1 2025-08-14T21:18:06.3222507Z * [new tag] trunk/0bd3af4fb87445f4de3a1f9b823e399c8b3cefde -> trunk/0bd3af4fb87445f4de3a1f9b823e399c8b3cefde 2025-08-14T21:18:06.3222712Z * [new tag] trunk/0d3461bac0fb5177e35152d980b301ea3a0aa2c4 -> trunk/0d3461bac0fb5177e35152d980b301ea3a0aa2c4 2025-08-14T21:18:06.3222917Z * [new tag] trunk/0d40ff3b496e68193bc16d5391fa2e3623709f81 -> trunk/0d40ff3b496e68193bc16d5391fa2e3623709f81 2025-08-14T21:18:06.3223133Z * [new tag] trunk/0d71ca2c46753bb268bfdcf815c14415c122a289 -> trunk/0d71ca2c46753bb268bfdcf815c14415c122a289 2025-08-14T21:18:06.3223338Z * [new tag] trunk/0d88593dd826544c9e7bd4aa615ef86847a78d2b -> trunk/0d88593dd826544c9e7bd4aa615ef86847a78d2b 2025-08-14T21:18:06.3223550Z * [new tag] trunk/0e3e377bd5126cfcc69d70c4d77b352d3404cc11 -> trunk/0e3e377bd5126cfcc69d70c4d77b352d3404cc11 2025-08-14T21:18:06.3223769Z * [new tag] trunk/0f3b10b8eebe68e3c75d473d499b87dfe14a2eca -> trunk/0f3b10b8eebe68e3c75d473d499b87dfe14a2eca 2025-08-14T21:18:06.3223976Z * [new tag] trunk/101276f81b4d2a8c31bfd6796b986d4c1bfdf483 -> trunk/101276f81b4d2a8c31bfd6796b986d4c1bfdf483 2025-08-14T21:18:06.3224260Z * [new tag] trunk/1028c5e2d50e121865bf98307e7c035f549a24b2 -> trunk/1028c5e2d50e121865bf98307e7c035f549a24b2 2025-08-14T21:18:06.3224480Z * [new tag] trunk/10bc36fe840cb3510fab84d2ea22663b76702f1e -> trunk/10bc36fe840cb3510fab84d2ea22663b76702f1e 2025-08-14T21:18:06.3224683Z * [new tag] trunk/10e3514c962b58cbbee994257872a626ff76d51b -> trunk/10e3514c962b58cbbee994257872a626ff76d51b 2025-08-14T21:18:06.3224885Z * [new tag] trunk/1128f4c2a822cbe34a9d966306af15097179ffe1 -> trunk/1128f4c2a822cbe34a9d966306af15097179ffe1 2025-08-14T21:18:06.3225138Z * [new tag] trunk/114a6c40434bfb9cfa5abc30e9e34d81300d743e -> trunk/114a6c40434bfb9cfa5abc30e9e34d81300d743e 2025-08-14T21:18:06.3225344Z * [new tag] trunk/118bc97b14c24ac88a4b0c0750a9e7bf93154c76 -> trunk/118bc97b14c24ac88a4b0c0750a9e7bf93154c76 2025-08-14T21:18:06.3225541Z * [new tag] trunk/1196bb1c2e4d5a7edc09f2260e3034132f0c6c91 -> trunk/1196bb1c2e4d5a7edc09f2260e3034132f0c6c91 2025-08-14T21:18:06.3225748Z * [new tag] trunk/11a3565f1872bbad9c253a127e8d4ce7a1b40ec8 -> trunk/11a3565f1872bbad9c253a127e8d4ce7a1b40ec8 2025-08-14T21:18:06.3225945Z * [new tag] trunk/15e49f61643e4c0eef420f0981609709ef55b848 -> trunk/15e49f61643e4c0eef420f0981609709ef55b848 2025-08-14T21:18:06.3226141Z * [new tag] trunk/16d15445f8bd8740095b23de4af89d757af793ca -> trunk/16d15445f8bd8740095b23de4af89d757af793ca 2025-08-14T21:18:06.3226343Z * [new tag] trunk/178515d0ff6833c8e9221482b2a650ab31e00019 -> trunk/178515d0ff6833c8e9221482b2a650ab31e00019 2025-08-14T21:18:06.3226546Z * [new tag] trunk/182efe31dbe43376e7eef7338356aaf94d5bcabe -> trunk/182efe31dbe43376e7eef7338356aaf94d5bcabe 2025-08-14T21:18:06.3226761Z * [new tag] trunk/194fcfcfbdad0add1a1b695321e31a576058f4cf -> trunk/194fcfcfbdad0add1a1b695321e31a576058f4cf 2025-08-14T21:18:06.3226969Z * [new tag] trunk/195b5c2e27eb8f21cbc8ad1e90f42db5a8cfccca -> trunk/195b5c2e27eb8f21cbc8ad1e90f42db5a8cfccca 2025-08-14T21:18:06.3227179Z * [new tag] trunk/198b5fd2d47fa3d5110ceba6827a3b18e0064014 -> trunk/198b5fd2d47fa3d5110ceba6827a3b18e0064014 2025-08-14T21:18:06.3227417Z * [new tag] trunk/199e9abb6a366bbd27c39d1da7c3123b4eea9b5a -> trunk/199e9abb6a366bbd27c39d1da7c3123b4eea9b5a 2025-08-14T21:18:06.3227621Z * [new tag] trunk/19b4283884b2d9b3a0eb364da10b1540d14ab7a7 -> trunk/19b4283884b2d9b3a0eb364da10b1540d14ab7a7 2025-08-14T21:18:06.3227824Z * [new tag] trunk/1c2587119152cec3905647a47c65d3d26619c5a8 -> trunk/1c2587119152cec3905647a47c65d3d26619c5a8 2025-08-14T21:18:06.3228023Z * [new tag] trunk/1c26c53851c212a7c90a325549a72f0571613a8c -> trunk/1c26c53851c212a7c90a325549a72f0571613a8c 2025-08-14T21:18:06.3228239Z * [new tag] trunk/1c2cba17eab2b09d87142883da2bdbdbcf018613 -> trunk/1c2cba17eab2b09d87142883da2bdbdbcf018613 2025-08-14T21:18:06.3228454Z * [new tag] trunk/1d80d697a269234b47ec7ede192faf3bb9b159e3 -> trunk/1d80d697a269234b47ec7ede192faf3bb9b159e3 2025-08-14T21:18:06.3228663Z * [new tag] trunk/1ea688f9a2602fbcde32c0302b822526ca4219dc -> trunk/1ea688f9a2602fbcde32c0302b822526ca4219dc 2025-08-14T21:18:06.3228869Z * [new tag] trunk/1f4057c11ac941fb324386ca594d0a6882185aad -> trunk/1f4057c11ac941fb324386ca594d0a6882185aad 2025-08-14T21:18:06.3229069Z * [new tag] trunk/1fc683cf17c8c673044538d10266c00f92987be2 -> trunk/1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:18:06.3229286Z * [new tag] trunk/1febab2a89302464f6c7d69cfbef7a24c421ea65 -> trunk/1febab2a89302464f6c7d69cfbef7a24c421ea65 2025-08-14T21:18:06.3229485Z * [new tag] trunk/206c1eef6571f906c2792d899a09136b3fce9673 -> trunk/206c1eef6571f906c2792d899a09136b3fce9673 2025-08-14T21:18:06.3229689Z * [new tag] trunk/20bdabbb3c5d6b118a94b2e045c777662563d5bb -> trunk/20bdabbb3c5d6b118a94b2e045c777662563d5bb 2025-08-14T21:18:06.3229888Z * [new tag] trunk/21392c0e06ac2b2621950455975ca6332f0bf641 -> trunk/21392c0e06ac2b2621950455975ca6332f0bf641 2025-08-14T21:18:06.3230084Z * [new tag] trunk/2247aa6d1d43e256255f5c74a781c3190a4387b6 -> trunk/2247aa6d1d43e256255f5c74a781c3190a4387b6 2025-08-14T21:18:06.3230292Z * [new tag] trunk/2259dbed4e0d3f2a8174b5847fd0741aed42451d -> trunk/2259dbed4e0d3f2a8174b5847fd0741aed42451d 2025-08-14T21:18:06.3230485Z * [new tag] trunk/231c72240d80091f099c95e326d3600cba866eee -> trunk/231c72240d80091f099c95e326d3600cba866eee 2025-08-14T21:18:06.3230721Z * [new tag] trunk/24257f5bfaa37795f74d9f64c1b43584128d4b8c -> trunk/24257f5bfaa37795f74d9f64c1b43584128d4b8c 2025-08-14T21:18:06.3230926Z * [new tag] trunk/24f43d0da7ad9c6e95a09a2fee610387728cc1cd -> trunk/24f43d0da7ad9c6e95a09a2fee610387728cc1cd 2025-08-14T21:18:06.3231129Z * [new tag] trunk/2898d3f965e5cd9d02fc2ecdab7c580fd457fea9 -> trunk/2898d3f965e5cd9d02fc2ecdab7c580fd457fea9 2025-08-14T21:18:06.3231333Z * [new tag] trunk/28ccc9e7247798980fe00a11bcd64a8016b5f227 -> trunk/28ccc9e7247798980fe00a11bcd64a8016b5f227 2025-08-14T21:18:06.3231530Z * [new tag] trunk/29712314dd5cf500a8ea3d1c69483a3cb768ca72 -> trunk/29712314dd5cf500a8ea3d1c69483a3cb768ca72 2025-08-14T21:18:06.3231743Z * [new tag] trunk/29d20d49f0b7f4e362e1cefdcdc4b5659969312c -> trunk/29d20d49f0b7f4e362e1cefdcdc4b5659969312c 2025-08-14T21:18:06.3231949Z * [new tag] trunk/2c5e10a5fceb208b11c3d569ae02e348b5893b31 -> trunk/2c5e10a5fceb208b11c3d569ae02e348b5893b31 2025-08-14T21:18:06.3232157Z * [new tag] trunk/2d0cdee394bccadcd0abe19dd4623ed978a331ad -> trunk/2d0cdee394bccadcd0abe19dd4623ed978a331ad 2025-08-14T21:18:06.3232370Z * [new tag] trunk/2e4e5ab4be9e0aeffd9c49b5b2f9f820bd0895b1 -> trunk/2e4e5ab4be9e0aeffd9c49b5b2f9f820bd0895b1 2025-08-14T21:18:06.3232574Z * [new tag] trunk/2ea40fba841b3af8103f332ba62e54f350ba9a51 -> trunk/2ea40fba841b3af8103f332ba62e54f350ba9a51 2025-08-14T21:18:06.3232807Z * [new tag] trunk/2ee22e435131369a7e4f8cc4732579acc29a941b -> trunk/2ee22e435131369a7e4f8cc4732579acc29a941b 2025-08-14T21:18:06.3233008Z * [new tag] trunk/2f4c2226175512af787725c4d5ad7313c60d4db1 -> trunk/2f4c2226175512af787725c4d5ad7313c60d4db1 2025-08-14T21:18:06.3233212Z * [new tag] trunk/3008d985a8fc155eb89374afff50cb33a6bd10d5 -> trunk/3008d985a8fc155eb89374afff50cb33a6bd10d5 2025-08-14T21:18:06.3233610Z * [new tag] trunk/3028fa6ce9d9c96671722ab8213a1a30670d7cf2 -> trunk/3028fa6ce9d9c96671722ab8213a1a30670d7cf2 2025-08-14T21:18:06.3234212Z * [new tag] trunk/303c614f3df95ae2b659c5f6c1838b14e4776ce6 -> trunk/303c614f3df95ae2b659c5f6c1838b14e4776ce6 2025-08-14T21:18:06.3234579Z * [new tag] trunk/305fa2239365ad17ac9c534a68bba8a149c42d67 -> trunk/305fa2239365ad17ac9c534a68bba8a149c42d67 2025-08-14T21:18:06.3234882Z * [new tag] trunk/31c9ac4319c0cc2ed8c6be701c6ccf73f6cb4706 -> trunk/31c9ac4319c0cc2ed8c6be701c6ccf73f6cb4706 2025-08-14T21:18:06.3235382Z * [new tag] trunk/32099961d588fc19ead8afe805d6b5108de75669 -> trunk/32099961d588fc19ead8afe805d6b5108de75669 2025-08-14T21:18:06.3236037Z * [new tag] trunk/32e5e2f596d55bb9441d5d53f3c58bcb55828047 -> trunk/32e5e2f596d55bb9441d5d53f3c58bcb55828047 2025-08-14T21:18:06.3236427Z * [new tag] trunk/334b38ccc4427b1d14981c48a3a0b92180d58225 -> trunk/334b38ccc4427b1d14981c48a3a0b92180d58225 2025-08-14T21:18:06.3236898Z * [new tag] trunk/334ecbd4ffe11858cae7d23d1190ddb4777c2513 -> trunk/334ecbd4ffe11858cae7d23d1190ddb4777c2513 2025-08-14T21:18:06.3237408Z * [new tag] trunk/33d94018668951611b318b7515ae96f04e48eac0 -> trunk/33d94018668951611b318b7515ae96f04e48eac0 2025-08-14T21:18:06.3237983Z * [new tag] trunk/34358f335d95213d96b6cca6a83e7bf3af6a9fcb -> trunk/34358f335d95213d96b6cca6a83e7bf3af6a9fcb 2025-08-14T21:18:06.3238467Z * [new tag] trunk/34ec5ed275f8aa875c80daa97b3e82af0b06f673 -> trunk/34ec5ed275f8aa875c80daa97b3e82af0b06f673 2025-08-14T21:18:06.3238931Z * [new tag] trunk/355462e1278d818deb9ef4a184073d5b66074816 -> trunk/355462e1278d818deb9ef4a184073d5b66074816 2025-08-14T21:18:06.3243261Z * [new tag] trunk/3626ba711b34397d1fbf0a9b1979f85cbf68b919 -> trunk/3626ba711b34397d1fbf0a9b1979f85cbf68b919 2025-08-14T21:18:06.3243639Z * [new tag] trunk/36f46d082a4954921cb8493223f000f2aab79ed7 -> trunk/36f46d082a4954921cb8493223f000f2aab79ed7 2025-08-14T21:18:06.3244013Z * [new tag] trunk/39aa3d1471549b7829c207d634dfdc1d26e346a2 -> trunk/39aa3d1471549b7829c207d634dfdc1d26e346a2 2025-08-14T21:18:06.3244300Z * [new tag] trunk/3a562374401113187ce2566b87e3f1d87d7c53aa -> trunk/3a562374401113187ce2566b87e3f1d87d7c53aa 2025-08-14T21:18:06.3244764Z * [new tag] trunk/3ac86e728dfaa7383ff7f865e9e7d33486188dae -> trunk/3ac86e728dfaa7383ff7f865e9e7d33486188dae 2025-08-14T21:18:06.3245163Z * [new tag] trunk/3be70dc30e893b552fc0f23ca06cd8f7949b6d08 -> trunk/3be70dc30e893b552fc0f23ca06cd8f7949b6d08 2025-08-14T21:18:06.3245745Z * [new tag] trunk/3cec82a7e9aea040a34dd7a2587ae6d3bd65dba0 -> trunk/3cec82a7e9aea040a34dd7a2587ae6d3bd65dba0 2025-08-14T21:18:06.3246230Z * [new tag] trunk/3cf7b4024ef83e44e9ae223dbff7c7ab68240cb2 -> trunk/3cf7b4024ef83e44e9ae223dbff7c7ab68240cb2 2025-08-14T21:18:06.3246740Z * [new tag] trunk/3ef2e1ef769582a82c6ddf150e9d11bf4bf1c44f -> trunk/3ef2e1ef769582a82c6ddf150e9d11bf4bf1c44f 2025-08-14T21:18:06.3247275Z * [new tag] trunk/3f1636ebef9b45e8a3cb0eb20d327ee6acb74be0 -> trunk/3f1636ebef9b45e8a3cb0eb20d327ee6acb74be0 2025-08-14T21:18:06.3248099Z * [new tag] trunk/3faee0a6318afcbbbb48687009a459214910d820 -> trunk/3faee0a6318afcbbbb48687009a459214910d820 2025-08-14T21:18:06.3248436Z * [new tag] trunk/3fcd79e023da7156ac584992ebab29205d3b7881 -> trunk/3fcd79e023da7156ac584992ebab29205d3b7881 2025-08-14T21:18:06.3248910Z * [new tag] trunk/3fe19a7a0af3f4d692af30476c320be18c7e8ae6 -> trunk/3fe19a7a0af3f4d692af30476c320be18c7e8ae6 2025-08-14T21:18:06.3249736Z * [new tag] trunk/41673110cd7c5960824cc74a6fcaeda1a8bc7a23 -> trunk/41673110cd7c5960824cc74a6fcaeda1a8bc7a23 2025-08-14T21:18:06.3250393Z * [new tag] trunk/4183d4ff3dcc1d87400326a9a7998c3f9e966f60 -> trunk/4183d4ff3dcc1d87400326a9a7998c3f9e966f60 2025-08-14T21:18:06.3251053Z * [new tag] trunk/422bd6808bb98cbbac31d157d9c82ad11ba9732d -> trunk/422bd6808bb98cbbac31d157d9c82ad11ba9732d 2025-08-14T21:18:06.3251318Z * [new tag] trunk/42e51cd4b3973a053fcfa80878a3f346fd158e9f -> trunk/42e51cd4b3973a053fcfa80878a3f346fd158e9f 2025-08-14T21:18:06.3252350Z * [new tag] trunk/4416433c7c625127b7f975c92f8ec98ea4c67fd3 -> trunk/4416433c7c625127b7f975c92f8ec98ea4c67fd3 2025-08-14T21:18:06.3252688Z * [new tag] trunk/45ba7ecda876685b083cbbe932450560c566826b -> trunk/45ba7ecda876685b083cbbe932450560c566826b 2025-08-14T21:18:06.3253269Z * [new tag] trunk/47a1db823dfcdacdb99f317428fc3791a18c5812 -> trunk/47a1db823dfcdacdb99f317428fc3791a18c5812 2025-08-14T21:18:06.3253661Z * [new tag] trunk/4a773e1e867f28a8ff0b15203e5cd9548f74fcee -> trunk/4a773e1e867f28a8ff0b15203e5cd9548f74fcee 2025-08-14T21:18:06.3254208Z * [new tag] trunk/4a90dc0c1f68d1f98832b169f792ed1bb195a0f3 -> trunk/4a90dc0c1f68d1f98832b169f792ed1bb195a0f3 2025-08-14T21:18:06.3254791Z * [new tag] trunk/4cde0acc0e4e795e1a12cbdd9b93c8c04c1fa05d -> trunk/4cde0acc0e4e795e1a12cbdd9b93c8c04c1fa05d 2025-08-14T21:18:06.3255265Z * [new tag] trunk/4d419a74610c32b1372f8802dcc61893740a23cf -> trunk/4d419a74610c32b1372f8802dcc61893740a23cf 2025-08-14T21:18:06.3255815Z * [new tag] trunk/4d5b3f2d5af7c8e4f41da4ffca53fafe8bb86235 -> trunk/4d5b3f2d5af7c8e4f41da4ffca53fafe8bb86235 2025-08-14T21:18:06.3257464Z * [new tag] trunk/4e2ddb5db67617f9f5309c8bba0c17adc84cadbc -> trunk/4e2ddb5db67617f9f5309c8bba0c17adc84cadbc 2025-08-14T21:18:06.3257711Z * [new tag] trunk/50a8c118754a6c5a46968f5c8e215ccba6831d42 -> trunk/50a8c118754a6c5a46968f5c8e215ccba6831d42 2025-08-14T21:18:06.3257927Z * [new tag] trunk/50f23ff6f883db5021dd6bab4c146434f98dd15d -> trunk/50f23ff6f883db5021dd6bab4c146434f98dd15d 2025-08-14T21:18:06.3258498Z * [new tag] trunk/515cb70367e84fcbad23fcc5b39eb1d7706df2aa -> trunk/515cb70367e84fcbad23fcc5b39eb1d7706df2aa 2025-08-14T21:18:06.3259119Z * [new tag] trunk/53e39494958b7e2278cc8176f63636e812e8945f -> trunk/53e39494958b7e2278cc8176f63636e812e8945f 2025-08-14T21:18:06.3259380Z * [new tag] trunk/556e2a73f4f0643f7c2aeb5c7dddda43388a40ce -> trunk/556e2a73f4f0643f7c2aeb5c7dddda43388a40ce 2025-08-14T21:18:06.3259795Z * [new tag] trunk/5665dc9ab76b84d7c90d845ffb0f6349b3621919 -> trunk/5665dc9ab76b84d7c90d845ffb0f6349b3621919 2025-08-14T21:18:06.3260089Z * [new tag] trunk/566c6d52ef1411c8262d7b9cf85e2044fdfbe1a3 -> trunk/566c6d52ef1411c8262d7b9cf85e2044fdfbe1a3 2025-08-14T21:18:06.3260647Z * [new tag] trunk/56c828bef93eada0e18d2cc013207831ca80cc99 -> trunk/56c828bef93eada0e18d2cc013207831ca80cc99 2025-08-14T21:18:06.3261103Z * [new tag] trunk/5737372862253a0ac0292407a5844796f02380ad -> trunk/5737372862253a0ac0292407a5844796f02380ad 2025-08-14T21:18:06.3261699Z * [new tag] trunk/57f738b6357cc8fcdde479a0948e723809a1a44d -> trunk/57f738b6357cc8fcdde479a0948e723809a1a44d 2025-08-14T21:18:06.3262184Z * [new tag] trunk/5a40c5784482255b9baf14086cc4b9349fc6d512 -> trunk/5a40c5784482255b9baf14086cc4b9349fc6d512 2025-08-14T21:18:06.3262700Z * [new tag] trunk/5a9c4cfce42b9eb87da0de40c5633f083115c307 -> trunk/5a9c4cfce42b9eb87da0de40c5633f083115c307 2025-08-14T21:18:06.3263383Z * [new tag] trunk/5ace061254af71aa83d1baae81aa1864c9746add -> trunk/5ace061254af71aa83d1baae81aa1864c9746add 2025-08-14T21:18:06.3263728Z * [new tag] trunk/5dddcd5b07c6644efca8d613f4eca1dc95daa87f -> trunk/5dddcd5b07c6644efca8d613f4eca1dc95daa87f 2025-08-14T21:18:06.3264433Z * [new tag] trunk/5ed4f9177907fe403ec4c4499d0d0e9be6b68fcf -> trunk/5ed4f9177907fe403ec4c4499d0d0e9be6b68fcf 2025-08-14T21:18:06.3265047Z * [new tag] trunk/5f1010fbb3850d99c8fdf9a9de2f79260cdc586a -> trunk/5f1010fbb3850d99c8fdf9a9de2f79260cdc586a 2025-08-14T21:18:06.3265315Z * [new tag] trunk/5f5f508aa836a46dfe88857fb223049616b94e93 -> trunk/5f5f508aa836a46dfe88857fb223049616b94e93 2025-08-14T21:18:06.3267349Z * [new tag] trunk/62bac0798100e0e06a86b7a4cee1788413e3d0ca -> trunk/62bac0798100e0e06a86b7a4cee1788413e3d0ca 2025-08-14T21:18:06.3267767Z * [new tag] trunk/63654ba4c5178fd12220cfc9d1c878af2fdd07cc -> trunk/63654ba4c5178fd12220cfc9d1c878af2fdd07cc 2025-08-14T21:18:06.3268096Z * [new tag] trunk/639778b3ee3b80e0894367fdc4442b58ae1b3a62 -> trunk/639778b3ee3b80e0894367fdc4442b58ae1b3a62 2025-08-14T21:18:06.3268384Z * [new tag] trunk/641ee7478150f26969968f49d8b358e199679a8a -> trunk/641ee7478150f26969968f49d8b358e199679a8a 2025-08-14T21:18:06.3269022Z * [new tag] trunk/65053c03a3d209060cb239d20a229dac37cf9dd1 -> trunk/65053c03a3d209060cb239d20a229dac37cf9dd1 2025-08-14T21:18:06.3269286Z * [new tag] trunk/652a6f5954d039d61dc6e6575ccf89d385d74537 -> trunk/652a6f5954d039d61dc6e6575ccf89d385d74537 2025-08-14T21:18:06.3269502Z * [new tag] trunk/685f15dbea66e8ffa8564752f81ad2f6cb447a14 -> trunk/685f15dbea66e8ffa8564752f81ad2f6cb447a14 2025-08-14T21:18:06.3269968Z * [new tag] trunk/68a4b4b2e336cfd4451ce6546d900568e5ddf96c -> trunk/68a4b4b2e336cfd4451ce6546d900568e5ddf96c 2025-08-14T21:18:06.3270209Z * [new tag] trunk/69a0a9aa7f5e320a02e97fa789d2f72baff1554f -> trunk/69a0a9aa7f5e320a02e97fa789d2f72baff1554f 2025-08-14T21:18:06.3270781Z * [new tag] trunk/6be6d06295c870c77a6eb69f96b3170d983520d5 -> trunk/6be6d06295c870c77a6eb69f96b3170d983520d5 2025-08-14T21:18:06.3271313Z * [new tag] trunk/6c05ea6475beaf3acc05e1bda0f3f8fe3bdc1d49 -> trunk/6c05ea6475beaf3acc05e1bda0f3f8fe3bdc1d49 2025-08-14T21:18:06.3271883Z * [new tag] trunk/6da11d9aafc0d84dc7f66030c181608ff2614f66 -> trunk/6da11d9aafc0d84dc7f66030c181608ff2614f66 2025-08-14T21:18:06.3273216Z * [new tag] trunk/6e8865fbc161270e2ffc52817e6c667df417a3f7 -> trunk/6e8865fbc161270e2ffc52817e6c667df417a3f7 2025-08-14T21:18:06.3273602Z * [new tag] trunk/6ea8376f84232048d6be0f7b2edf82aec1b61d58 -> trunk/6ea8376f84232048d6be0f7b2edf82aec1b61d58 2025-08-14T21:18:06.3273919Z * [new tag] trunk/6ee175195ac7853734d64704171993cc6265eb38 -> trunk/6ee175195ac7853734d64704171993cc6265eb38 2025-08-14T21:18:06.3274359Z * [new tag] trunk/6f0f4e0c3eacd479864319127915f869f64e1935 -> trunk/6f0f4e0c3eacd479864319127915f869f64e1935 2025-08-14T21:18:06.3275018Z * [new tag] trunk/70ccdec44b89e355a2cb03ba14a634284f7750f8 -> trunk/70ccdec44b89e355a2cb03ba14a634284f7750f8 2025-08-14T21:18:06.3275526Z * [new tag] trunk/72009ec6bebca7714f99c18449183787f202af4d -> trunk/72009ec6bebca7714f99c18449183787f202af4d 2025-08-14T21:18:06.3276025Z * [new tag] trunk/731ee31f7b6ba19307daab323f6196172b71aaf8 -> trunk/731ee31f7b6ba19307daab323f6196172b71aaf8 2025-08-14T21:18:06.3276665Z * [new tag] trunk/76a0609b6bddb2bc40f1eb4ade12885023653d59 -> trunk/76a0609b6bddb2bc40f1eb4ade12885023653d59 2025-08-14T21:18:06.3277104Z * [new tag] trunk/781e9a7724c47496e3d38a81e6dd6194cf098c41 -> trunk/781e9a7724c47496e3d38a81e6dd6194cf098c41 2025-08-14T21:18:06.3277728Z * [new tag] trunk/78a2fe1d42edeaa2ef7020b0fa0ac82ee4a640e4 -> trunk/78a2fe1d42edeaa2ef7020b0fa0ac82ee4a640e4 2025-08-14T21:18:06.3278150Z * [new tag] trunk/7a974a88f2c529a614baeabe4debd00fc8a3b299 -> trunk/7a974a88f2c529a614baeabe4debd00fc8a3b299 2025-08-14T21:18:06.3280498Z * [new tag] trunk/7ae0629d64b404e0ef5d9c931433ad25e65d6114 -> trunk/7ae0629d64b404e0ef5d9c931433ad25e65d6114 2025-08-14T21:18:06.3280756Z * [new tag] trunk/7d2ec704e47f4b740cdecda5534b305e8e1875ef -> trunk/7d2ec704e47f4b740cdecda5534b305e8e1875ef 2025-08-14T21:18:06.3280977Z * [new tag] trunk/7d87e358ac8440f666fabbfd99058bb5342be6ac -> trunk/7d87e358ac8440f666fabbfd99058bb5342be6ac 2025-08-14T21:18:06.3281180Z * [new tag] trunk/7e27347fd353928c99620495c8c531a5eba7d56b -> trunk/7e27347fd353928c99620495c8c531a5eba7d56b 2025-08-14T21:18:06.3281544Z * [new tag] trunk/7e91394955721c77645fcdb75a5d47a255d65020 -> trunk/7e91394955721c77645fcdb75a5d47a255d65020 2025-08-14T21:18:06.3281795Z * [new tag] trunk/7f4cb4a3e018a621add2a37a3a2f67b982d51001 -> trunk/7f4cb4a3e018a621add2a37a3a2f67b982d51001 2025-08-14T21:18:06.3282254Z * [new tag] trunk/7fbc22855c17741ae016992803b2e147a13aa22d -> trunk/7fbc22855c17741ae016992803b2e147a13aa22d 2025-08-14T21:18:06.3282855Z * [new tag] trunk/8047421fbb607d70ede13b9cd5a60b7b8bdfe348 -> trunk/8047421fbb607d70ede13b9cd5a60b7b8bdfe348 2025-08-14T21:18:06.3283561Z * [new tag] trunk/8088cfa592504a2897b4c78f8a46fe658ab5c2c2 -> trunk/8088cfa592504a2897b4c78f8a46fe658ab5c2c2 2025-08-14T21:18:06.3284132Z * [new tag] trunk/80cca8307943ba64168208b54028f55b2c71daff -> trunk/80cca8307943ba64168208b54028f55b2c71daff 2025-08-14T21:18:06.3284388Z * [new tag] trunk/8147370733bbdcd034cad54e9212e51885a11892 -> trunk/8147370733bbdcd034cad54e9212e51885a11892 2025-08-14T21:18:06.3286368Z * [new tag] trunk/83875cdb5594ccb3c9206b8eb5745fe1d011cf26 -> trunk/83875cdb5594ccb3c9206b8eb5745fe1d011cf26 2025-08-14T21:18:06.3291472Z * [new tag] trunk/8399cf88ce8399d2be93355f29d4cb69f51c0654 -> trunk/8399cf88ce8399d2be93355f29d4cb69f51c0654 2025-08-14T21:18:06.3295521Z * [new tag] trunk/842cc77ab9aafd518593c2fce077d6abb42a5b7f -> trunk/842cc77ab9aafd518593c2fce077d6abb42a5b7f 2025-08-14T21:18:06.3297885Z * [new tag] trunk/85db508af533649d0b3447ff3f0d5fe083150c84 -> trunk/85db508af533649d0b3447ff3f0d5fe083150c84 2025-08-14T21:18:06.3298339Z * [new tag] trunk/86eb65f7f06016bcd5d7951dc9d74bc3993a827a -> trunk/86eb65f7f06016bcd5d7951dc9d74bc3993a827a 2025-08-14T21:18:06.3298554Z * [new tag] trunk/87e6c4079d8ec7d04aff00ed82096b39836a8367 -> trunk/87e6c4079d8ec7d04aff00ed82096b39836a8367 2025-08-14T21:18:06.3298770Z * [new tag] trunk/89654db1abccf7e5f261989a150db4d1619ea2aa -> trunk/89654db1abccf7e5f261989a150db4d1619ea2aa 2025-08-14T21:18:06.3298986Z * [new tag] trunk/8a37f0c90392a2c38b7c5955471fa49edcaf5cb1 -> trunk/8a37f0c90392a2c38b7c5955471fa49edcaf5cb1 2025-08-14T21:18:06.3299193Z * [new tag] trunk/8ab5868a2199fe485c2d66533b9244ccb97e487d -> trunk/8ab5868a2199fe485c2d66533b9244ccb97e487d 2025-08-14T21:18:06.3299407Z * [new tag] trunk/8ae4d2652f64b8444b3d5314b9232bd2119bcde6 -> trunk/8ae4d2652f64b8444b3d5314b9232bd2119bcde6 2025-08-14T21:18:06.3299623Z * [new tag] trunk/8c41cb800ae0411f02ea5da34bd5ccc3790633b0 -> trunk/8c41cb800ae0411f02ea5da34bd5ccc3790633b0 2025-08-14T21:18:06.3299838Z * [new tag] trunk/8cb91e20bc205b1416648d0ffd98d1ba1f3a6fc4 -> trunk/8cb91e20bc205b1416648d0ffd98d1ba1f3a6fc4 2025-08-14T21:18:06.3300049Z * [new tag] trunk/8cfaf51d4e29c9bd9f49ecc98d955ed53df1a13d -> trunk/8cfaf51d4e29c9bd9f49ecc98d955ed53df1a13d 2025-08-14T21:18:06.3300258Z * [new tag] trunk/8d1cf529229dce7cd5ea04abb0faac83b87ca6d1 -> trunk/8d1cf529229dce7cd5ea04abb0faac83b87ca6d1 2025-08-14T21:18:06.3300681Z * [new tag] trunk/8d3d1c844303cb1d46123a1caa76d4cf83973347 -> trunk/8d3d1c844303cb1d46123a1caa76d4cf83973347 2025-08-14T21:18:06.3300887Z * [new tag] trunk/8d6d3246316e1767a57d5e855acd6208da753b75 -> trunk/8d6d3246316e1767a57d5e855acd6208da753b75 2025-08-14T21:18:06.3301090Z * [new tag] trunk/8e6a3138581152ab827a0997f34c470271399f5e -> trunk/8e6a3138581152ab827a0997f34c470271399f5e 2025-08-14T21:18:06.3301298Z * [new tag] trunk/8eee08d2279b98af2522debb6512d37e837e89e3 -> trunk/8eee08d2279b98af2522debb6512d37e837e89e3 2025-08-14T21:18:06.3301496Z * [new tag] trunk/90b78ee50f73b5c963996076a3d54b74b1b965be -> trunk/90b78ee50f73b5c963996076a3d54b74b1b965be 2025-08-14T21:18:06.3301717Z * [new tag] trunk/94b91a876327820a4bb6f5d39d156f13f2553ab6 -> trunk/94b91a876327820a4bb6f5d39d156f13f2553ab6 2025-08-14T21:18:06.3301923Z * [new tag] trunk/95210cc409dd578988c7116b47725c304dea54c7 -> trunk/95210cc409dd578988c7116b47725c304dea54c7 2025-08-14T21:18:06.3302133Z * [new tag] trunk/96bd33b2de79598566df395f32e27c4d33673f05 -> trunk/96bd33b2de79598566df395f32e27c4d33673f05 2025-08-14T21:18:06.3302334Z * [new tag] trunk/9708fcf92db88b80b9010c68662d634434da3106 -> trunk/9708fcf92db88b80b9010c68662d634434da3106 2025-08-14T21:18:06.3302551Z * [new tag] trunk/97c8c98f8dcb9c5c188b691d156e0043dba6c7f8 -> trunk/97c8c98f8dcb9c5c188b691d156e0043dba6c7f8 2025-08-14T21:18:06.3302756Z * [new tag] trunk/9903ca4f70bdc1653016256f5b4fd74fdfc609f8 -> trunk/9903ca4f70bdc1653016256f5b4fd74fdfc609f8 2025-08-14T21:18:06.3303137Z * [new tag] trunk/99bc2f94c1955657e950ebdad5f77e518785ccbd -> trunk/99bc2f94c1955657e950ebdad5f77e518785ccbd 2025-08-14T21:18:06.3303472Z * [new tag] trunk/9a06e6d0310da9d8a59ae05e8ec9c0201b55cacd -> trunk/9a06e6d0310da9d8a59ae05e8ec9c0201b55cacd 2025-08-14T21:18:06.3304095Z * [new tag] trunk/9a0f7a3bb01b235ea04581ee540970a098071b72 -> trunk/9a0f7a3bb01b235ea04581ee540970a098071b72 2025-08-14T21:18:06.3304435Z * [new tag] trunk/9b803cdbe298009f08340c1aaccb25aafbca95d8 -> trunk/9b803cdbe298009f08340c1aaccb25aafbca95d8 2025-08-14T21:18:06.3304665Z * [new tag] trunk/9ccd0f5e31ea54fcf42101dfbaacc103494e34df -> trunk/9ccd0f5e31ea54fcf42101dfbaacc103494e34df 2025-08-14T21:18:06.3305069Z * [new tag] trunk/9d37c960a4fc44d5ac334ca8bf775f85b95d76fc -> trunk/9d37c960a4fc44d5ac334ca8bf775f85b95d76fc 2025-08-14T21:18:06.3305625Z * [new tag] trunk/9e07673deb212c87b1c6fea23799a97474c476ed -> trunk/9e07673deb212c87b1c6fea23799a97474c476ed 2025-08-14T21:18:06.3306106Z * [new tag] trunk/9eedd2a20b64302d0d116ea2802b50948d2ebb09 -> trunk/9eedd2a20b64302d0d116ea2802b50948d2ebb09 2025-08-14T21:18:06.3307606Z * [new tag] trunk/9fa8ce26cf638504469852cbc3e7d04579fc8674 -> trunk/9fa8ce26cf638504469852cbc3e7d04579fc8674 2025-08-14T21:18:06.3308015Z * [new tag] trunk/a06ec54d40013c97fbffc174ea8f524ea5a95715 -> trunk/a06ec54d40013c97fbffc174ea8f524ea5a95715 2025-08-14T21:18:06.3308344Z * [new tag] trunk/a288b15ea9f87ddd665f249d492e0fb0861f5a69 -> trunk/a288b15ea9f87ddd665f249d492e0fb0861f5a69 2025-08-14T21:18:06.3308637Z * [new tag] trunk/a2fd106d670bb4990cebfd00f25ecbae4145e76c -> trunk/a2fd106d670bb4990cebfd00f25ecbae4145e76c 2025-08-14T21:18:06.3309112Z * [new tag] trunk/a354fa91e26b376d96385a2206c5ff5b42aa4600 -> trunk/a354fa91e26b376d96385a2206c5ff5b42aa4600 2025-08-14T21:18:06.3309734Z * [new tag] trunk/a4f69a5da08eace1c1e6469dec6a18aa842da73b -> trunk/a4f69a5da08eace1c1e6469dec6a18aa842da73b 2025-08-14T21:18:06.3310300Z * [new tag] trunk/a53d14d5f846ac44f6c205abb1c5bc4d2f3126ae -> trunk/a53d14d5f846ac44f6c205abb1c5bc4d2f3126ae 2025-08-14T21:18:06.3310961Z * [new tag] trunk/a5652407e4f3d772fc44486ac2abf756decf0861 -> trunk/a5652407e4f3d772fc44486ac2abf756decf0861 2025-08-14T21:18:06.3311589Z * [new tag] trunk/a7abf57aabec0ce686092e2d66e53ba185dbc56b -> trunk/a7abf57aabec0ce686092e2d66e53ba185dbc56b 2025-08-14T21:18:06.3312078Z * [new tag] trunk/a84b60c0c4016785fd93b7b8a0c04f2d0770d332 -> trunk/a84b60c0c4016785fd93b7b8a0c04f2d0770d332 2025-08-14T21:18:06.3313578Z * [new tag] trunk/aa75e917bdb0f95bb6dee81853c2d3c4ab3e1883 -> trunk/aa75e917bdb0f95bb6dee81853c2d3c4ab3e1883 2025-08-14T21:18:06.3313976Z * [new tag] trunk/adcca7d9a1c053495e99012de801b2ea237faad0 -> trunk/adcca7d9a1c053495e99012de801b2ea237faad0 2025-08-14T21:18:06.3314285Z * [new tag] trunk/af10f1f86cc4effc93142a447693d8be55966615 -> trunk/af10f1f86cc4effc93142a447693d8be55966615 2025-08-14T21:18:06.3314514Z * [new tag] trunk/af3cabc55d5699f4da528e1ca39d83338f84ae8c -> trunk/af3cabc55d5699f4da528e1ca39d83338f84ae8c 2025-08-14T21:18:06.3315058Z * [new tag] trunk/b0df7715e8c590c0001d1f9cdb97057be80c9107 -> trunk/b0df7715e8c590c0001d1f9cdb97057be80c9107 2025-08-14T21:18:06.3319003Z * [new tag] trunk/b149c7204c218e7c4d6594a89dd74f72bd480ec5 -> trunk/b149c7204c218e7c4d6594a89dd74f72bd480ec5 2025-08-14T21:18:06.3319392Z * [new tag] trunk/b1a602762e6a6674b406a3137e7e7a678885a97b -> trunk/b1a602762e6a6674b406a3137e7e7a678885a97b 2025-08-14T21:18:06.3319747Z * [new tag] trunk/b1f43548cad8fc0e30bda250f6e196310fa7a4bc -> trunk/b1f43548cad8fc0e30bda250f6e196310fa7a4bc 2025-08-14T21:18:06.3320093Z * [new tag] trunk/b219ca2a00a305753c4f1ea4c9c5d23243d54753 -> trunk/b219ca2a00a305753c4f1ea4c9c5d23243d54753 2025-08-14T21:18:06.3320738Z * [new tag] trunk/b4596895b9d85a686c2cb978938b0a7797b3690a -> trunk/b4596895b9d85a686c2cb978938b0a7797b3690a 2025-08-14T21:18:06.3321006Z * [new tag] trunk/b5fd7223b1bf44720dc9183bda7dfcf7aeccff02 -> trunk/b5fd7223b1bf44720dc9183bda7dfcf7aeccff02 2025-08-14T21:18:06.3321224Z * [new tag] trunk/b602ea9cab7d43a7ee7b4051227090f23fbd3dbf -> trunk/b602ea9cab7d43a7ee7b4051227090f23fbd3dbf 2025-08-14T21:18:06.3321442Z * [new tag] trunk/b6b74aed604bd2e96389ff99aaaf39abc64fdc64 -> trunk/b6b74aed604bd2e96389ff99aaaf39abc64fdc64 2025-08-14T21:18:06.3321792Z * [new tag] trunk/b7db86600a2614adc71c92ca42d359a7ac534d78 -> trunk/b7db86600a2614adc71c92ca42d359a7ac534d78 2025-08-14T21:18:06.3322012Z * [new tag] trunk/b9003ed3d87699e81e436719625a21996a6654e5 -> trunk/b9003ed3d87699e81e436719625a21996a6654e5 2025-08-14T21:18:06.3322225Z * [new tag] trunk/b90feeac86bda00afc2789321bcd706015ff44e3 -> trunk/b90feeac86bda00afc2789321bcd706015ff44e3 2025-08-14T21:18:06.3322606Z * [new tag] trunk/b9d7de3a094598c3dc0dd52e57bce30eb684c9d8 -> trunk/b9d7de3a094598c3dc0dd52e57bce30eb684c9d8 2025-08-14T21:18:06.3323132Z * [new tag] trunk/ba47821f524eee50a214ed39fa2e7765d54aabf4 -> trunk/ba47821f524eee50a214ed39fa2e7765d54aabf4 2025-08-14T21:18:06.3323668Z * [new tag] trunk/ba4ccf5d67e3d237f435eacc2bce3c6025f08491 -> trunk/ba4ccf5d67e3d237f435eacc2bce3c6025f08491 2025-08-14T21:18:06.3324673Z * [new tag] trunk/bcf23ecc476df2bd7479f142567213e2623308ee -> trunk/bcf23ecc476df2bd7479f142567213e2623308ee 2025-08-14T21:18:06.3325014Z * [new tag] trunk/be53f609aaf6f01e2863f490975ea9eaac3ee9ff -> trunk/be53f609aaf6f01e2863f490975ea9eaac3ee9ff 2025-08-14T21:18:06.3325264Z * [new tag] trunk/beb4d7816dedc67a5de1f82e5a45b5910f407941 -> trunk/beb4d7816dedc67a5de1f82e5a45b5910f407941 2025-08-14T21:18:06.3327262Z * [new tag] trunk/bfc873d02ec413344717493e4175a902921359fd -> trunk/bfc873d02ec413344717493e4175a902921359fd 2025-08-14T21:18:06.3327685Z * [new tag] trunk/c184cb3852f0ff2d16a489d61abc3739c309e6ca -> trunk/c184cb3852f0ff2d16a489d61abc3739c309e6ca 2025-08-14T21:18:06.3327911Z * [new tag] trunk/c24ca7f4bf79f62fd623d76346ca27e53f731431 -> trunk/c24ca7f4bf79f62fd623d76346ca27e53f731431 2025-08-14T21:18:06.3328284Z * [new tag] trunk/c3dc8dc4122977893004c49d10e4676cd0a97da4 -> trunk/c3dc8dc4122977893004c49d10e4676cd0a97da4 2025-08-14T21:18:06.3328996Z * [new tag] trunk/c5ec5458a547f7a774468ea0eb2258d3de596492 -> trunk/c5ec5458a547f7a774468ea0eb2258d3de596492 2025-08-14T21:18:06.3329269Z * [new tag] trunk/c5efc5c8a66eca84865015058b3221013ebfe685 -> trunk/c5efc5c8a66eca84865015058b3221013ebfe685 2025-08-14T21:18:06.3329653Z * [new tag] trunk/c6563341208003f64c131854a9cf029555f786d2 -> trunk/c6563341208003f64c131854a9cf029555f786d2 2025-08-14T21:18:06.3330104Z * [new tag] trunk/c6d78d4dbda53837d298d23a5fbc09af90a42d9e -> trunk/c6d78d4dbda53837d298d23a5fbc09af90a42d9e 2025-08-14T21:18:06.3330606Z * [new tag] trunk/c8205cb35435f39d2c26f6c94b45e4adeb6dcb23 -> trunk/c8205cb35435f39d2c26f6c94b45e4adeb6dcb23 2025-08-14T21:18:06.3331155Z * [new tag] trunk/c859ba7114b1fcb49527e090745fa17091d1f8d5 -> trunk/c859ba7114b1fcb49527e090745fa17091d1f8d5 2025-08-14T21:18:06.3333337Z * [new tag] trunk/c86040a8e68f754b90a84099187d3624954c7f36 -> trunk/c86040a8e68f754b90a84099187d3624954c7f36 2025-08-14T21:18:06.3333735Z * [new tag] trunk/c9671dc865aa0fc1cb86df754e355b44d8e02bb4 -> trunk/c9671dc865aa0fc1cb86df754e355b44d8e02bb4 2025-08-14T21:18:06.3334075Z * [new tag] trunk/ca7315c17162ea21b1ca5ba23f4bf6168766c7b9 -> trunk/ca7315c17162ea21b1ca5ba23f4bf6168766c7b9 2025-08-14T21:18:06.3334697Z * [new tag] trunk/cae2b5e3d223829bdc553fc8601df4b1c1554cff -> trunk/cae2b5e3d223829bdc553fc8601df4b1c1554cff 2025-08-14T21:18:06.3334944Z * [new tag] trunk/cbffde774557752cf20447d42d99ec6102673c31 -> trunk/cbffde774557752cf20447d42d99ec6102673c31 2025-08-14T21:18:06.3335188Z * [new tag] trunk/cd8d8c18f5bafdc1c73d5ac0129e7b4d76ab45bc -> trunk/cd8d8c18f5bafdc1c73d5ac0129e7b4d76ab45bc 2025-08-14T21:18:06.3335406Z * [new tag] trunk/cf0a0dcb0afa5e84b95461cc542f862b51ca96bf -> trunk/cf0a0dcb0afa5e84b95461cc542f862b51ca96bf 2025-08-14T21:18:06.3335653Z * [new tag] trunk/cf4964be68fa9f4ffc334f01cce42d7424b1cc81 -> trunk/cf4964be68fa9f4ffc334f01cce42d7424b1cc81 2025-08-14T21:18:06.3336443Z * [new tag] trunk/d0e2240f680ea2a553f7ee8188f52482e130bfd0 -> trunk/d0e2240f680ea2a553f7ee8188f52482e130bfd0 2025-08-14T21:18:06.3336896Z * [new tag] trunk/d1950d4bb5cba8fb6b23e4d283eea5b9801737e2 -> trunk/d1950d4bb5cba8fb6b23e4d283eea5b9801737e2 2025-08-14T21:18:06.3337407Z * [new tag] trunk/d20c4c20e61adecf00335c4d8c22eb1ace472cd3 -> trunk/d20c4c20e61adecf00335c4d8c22eb1ace472cd3 2025-08-14T21:18:06.3337939Z * [new tag] trunk/d25c4f954d599ea512e2f70cd6df101c21479d4c -> trunk/d25c4f954d599ea512e2f70cd6df101c21479d4c 2025-08-14T21:18:06.3338486Z * [new tag] trunk/d3d359dbafa89173a371e2637f22b47398e94a24 -> trunk/d3d359dbafa89173a371e2637f22b47398e94a24 2025-08-14T21:18:06.3339481Z * [new tag] trunk/d46768db04499d07a5b0db984112a6d1b7d3b0c1 -> trunk/d46768db04499d07a5b0db984112a6d1b7d3b0c1 2025-08-14T21:18:06.3339741Z * [new tag] trunk/d4c1a08c89f37d249a0146ff511c82ecc5c53b8f -> trunk/d4c1a08c89f37d249a0146ff511c82ecc5c53b8f 2025-08-14T21:18:06.3340475Z * [new tag] trunk/d556586448f3caab85673c7da0978fe31c7748f7 -> trunk/d556586448f3caab85673c7da0978fe31c7748f7 2025-08-14T21:18:06.3340986Z * [new tag] trunk/d670304001429a1a833255a918ed788d7ec4989a -> trunk/d670304001429a1a833255a918ed788d7ec4989a 2025-08-14T21:18:06.3341525Z * [new tag] trunk/d6786741a77aba200c78002646cc069b7a1799b0 -> trunk/d6786741a77aba200c78002646cc069b7a1799b0 2025-08-14T21:18:06.3342544Z * [new tag] trunk/d68c323692dedcbb74e670801e3502944fd790ff -> trunk/d68c323692dedcbb74e670801e3502944fd790ff 2025-08-14T21:18:06.3343220Z * [new tag] trunk/d8cb3db5339b45e4b745b2b883ef3ecde9843e2c -> trunk/d8cb3db5339b45e4b745b2b883ef3ecde9843e2c 2025-08-14T21:18:06.3343460Z * [new tag] trunk/da1f608ca33f3062535d0a4866d95db19e72fcbd -> trunk/da1f608ca33f3062535d0a4866d95db19e72fcbd 2025-08-14T21:18:06.3343703Z * [new tag] trunk/db0b7f1cc9bb3fe71aaf8b964a644147ae8e1c35 -> trunk/db0b7f1cc9bb3fe71aaf8b964a644147ae8e1c35 2025-08-14T21:18:06.3344383Z * [new tag] trunk/db32b60662b2f2bdcad980127d5dc4b66b02a7e4 -> trunk/db32b60662b2f2bdcad980127d5dc4b66b02a7e4 2025-08-14T21:18:06.3344754Z * [new tag] trunk/db763b17175553ba09637362eb9773a91997a7ad -> trunk/db763b17175553ba09637362eb9773a91997a7ad 2025-08-14T21:18:06.3345345Z * [new tag] trunk/db78943a1ca13a32a3d6045eb15e2b719ee13a2f -> trunk/db78943a1ca13a32a3d6045eb15e2b719ee13a2f 2025-08-14T21:18:06.3345833Z * [new tag] trunk/dc0d18e023d9b7e314ebba0f234b6cb1579dbcfd -> trunk/dc0d18e023d9b7e314ebba0f234b6cb1579dbcfd 2025-08-14T21:18:06.3346348Z * [new tag] trunk/dd21c8a578038ab2841a7ba809a06921093ac9d8 -> trunk/dd21c8a578038ab2841a7ba809a06921093ac9d8 2025-08-14T21:18:06.3348026Z * [new tag] trunk/deea71a90e05eb320c04bebfead5317746637f0d -> trunk/deea71a90e05eb320c04bebfead5317746637f0d 2025-08-14T21:18:06.3348289Z * [new tag] trunk/df55ec7d4b35f6d21691e9dd41c82f27de762948 -> trunk/df55ec7d4b35f6d21691e9dd41c82f27de762948 2025-08-14T21:18:06.3348513Z * [new tag] trunk/e1cf0d496ea85d1807c8c740f296e77bf7bdc1df -> trunk/e1cf0d496ea85d1807c8c740f296e77bf7bdc1df 2025-08-14T21:18:06.3348753Z * [new tag] trunk/e248719ac03c103767ab72034f6b9fd56855bf98 -> trunk/e248719ac03c103767ab72034f6b9fd56855bf98 2025-08-14T21:18:06.3349341Z * [new tag] trunk/e49762026070f66be41bfa6537fbcf9bfc24e558 -> trunk/e49762026070f66be41bfa6537fbcf9bfc24e558 2025-08-14T21:18:06.3351372Z * [new tag] trunk/e4de93f6a3e342bab34d3757cf90ec0ccc87e168 -> trunk/e4de93f6a3e342bab34d3757cf90ec0ccc87e168 2025-08-14T21:18:06.3351775Z * [new tag] trunk/e619c6bb90b9dedaccd3cbeed86a288993a4e33f -> trunk/e619c6bb90b9dedaccd3cbeed86a288993a4e33f 2025-08-14T21:18:06.3352281Z * [new tag] trunk/e63c2b21c186a7d2ab8a8953b8aa1535f2e96e58 -> trunk/e63c2b21c186a7d2ab8a8953b8aa1535f2e96e58 2025-08-14T21:18:06.3352881Z * [new tag] trunk/e7152ff8a6a929a0db7f3f4a72a5b6d471769cd3 -> trunk/e7152ff8a6a929a0db7f3f4a72a5b6d471769cd3 2025-08-14T21:18:06.3353143Z * [new tag] trunk/e96c7c4bb0f6aeae2ab3b6f040f7d67edbec199a -> trunk/e96c7c4bb0f6aeae2ab3b6f040f7d67edbec199a 2025-08-14T21:18:06.3353359Z * [new tag] trunk/e9eb2096a59a79e7a94c3e28a0715e040369f34c -> trunk/e9eb2096a59a79e7a94c3e28a0715e040369f34c 2025-08-14T21:18:06.3353581Z * [new tag] trunk/eac2d9d695a32dd456050f45cac35134ec3809f4 -> trunk/eac2d9d695a32dd456050f45cac35134ec3809f4 2025-08-14T21:18:06.3353830Z * [new tag] trunk/ecde76c764752540edf9ef62a97936c86d984b17 -> trunk/ecde76c764752540edf9ef62a97936c86d984b17 2025-08-14T21:18:06.3354165Z * [new tag] trunk/ecea81117b2fdc52907c97b3c32d779e07b5d55b -> trunk/ecea81117b2fdc52907c97b3c32d779e07b5d55b 2025-08-14T21:18:06.3356390Z * [new tag] trunk/edaa151d0d5a4e75fbec9843f49cc78770eb61fb -> trunk/edaa151d0d5a4e75fbec9843f49cc78770eb61fb 2025-08-14T21:18:06.3356793Z * [new tag] trunk/ee1b0412b919dfb358d5a697b3be49621497fbc2 -> trunk/ee1b0412b919dfb358d5a697b3be49621497fbc2 2025-08-14T21:18:06.3357125Z * [new tag] trunk/ee1fb43450c2e985657f95a91b68328d6f20f24e -> trunk/ee1fb43450c2e985657f95a91b68328d6f20f24e 2025-08-14T21:18:06.3357895Z * [new tag] trunk/ee89cc7a0acd69de25f98fe4ef828546db7b444c -> trunk/ee89cc7a0acd69de25f98fe4ef828546db7b444c 2025-08-14T21:18:06.3358147Z * [new tag] trunk/ee9f8ba11d664b871a9e0c7933fdc8571635b78c -> trunk/ee9f8ba11d664b871a9e0c7933fdc8571635b78c 2025-08-14T21:18:06.3358385Z * [new tag] trunk/eed9dbf70f43ee529fec78ac00ed9a4fd74c6e76 -> trunk/eed9dbf70f43ee529fec78ac00ed9a4fd74c6e76 2025-08-14T21:18:06.3358615Z * [new tag] trunk/f077c2402e4eb5b0ed562b4ee5b7a0503f26ef94 -> trunk/f077c2402e4eb5b0ed562b4ee5b7a0503f26ef94 2025-08-14T21:18:06.3358983Z * [new tag] trunk/f0980fc0bbd656d6c02d23ad97e945353b314f35 -> trunk/f0980fc0bbd656d6c02d23ad97e945353b314f35 2025-08-14T21:18:06.3359512Z * [new tag] trunk/f15ada5c6fad97a7dcbfa4673f067b6942dda640 -> trunk/f15ada5c6fad97a7dcbfa4673f067b6942dda640 2025-08-14T21:18:06.3359981Z * [new tag] trunk/f27232a2134150cb5e55d26a74d8c36c6a961ca5 -> trunk/f27232a2134150cb5e55d26a74d8c36c6a961ca5 2025-08-14T21:18:06.3360644Z * [new tag] trunk/f33ce40bc062a281e1a1f57e8c1926d0a7d155cc -> trunk/f33ce40bc062a281e1a1f57e8c1926d0a7d155cc 2025-08-14T21:18:06.3361080Z * [new tag] trunk/f341077ce4710172da20cfad916ee37159bfe9fe -> trunk/f341077ce4710172da20cfad916ee37159bfe9fe 2025-08-14T21:18:06.3361490Z * [new tag] trunk/f3a4d742ece08de4cb0e59dcc62e0093a7d0b0c7 -> trunk/f3a4d742ece08de4cb0e59dcc62e0093a7d0b0c7 2025-08-14T21:18:06.3363866Z * [new tag] trunk/f3f159ff8c4bad2edec99c68a941c628e983d04c -> trunk/f3f159ff8c4bad2edec99c68a941c628e983d04c 2025-08-14T21:18:06.3364251Z * [new tag] trunk/f60454cce8b93e5bbf67f2f3c88c8ac01ed65457 -> trunk/f60454cce8b93e5bbf67f2f3c88c8ac01ed65457 2025-08-14T21:18:06.3364600Z * [new tag] trunk/f7b2f3314cf7aede67d5fa5c75e4243208484344 -> trunk/f7b2f3314cf7aede67d5fa5c75e4243208484344 2025-08-14T21:18:06.3365200Z * [new tag] trunk/f8f0414a5983ff481a2188e0c18594150430c8c5 -> trunk/f8f0414a5983ff481a2188e0c18594150430c8c5 2025-08-14T21:18:06.3365456Z * [new tag] trunk/f95b58c2844b3444cd8446fed8570729dc4216eb -> trunk/f95b58c2844b3444cd8446fed8570729dc4216eb 2025-08-14T21:18:06.3365672Z * [new tag] trunk/f990490a23815ea6ee27e487c70ba2cf513ba43d -> trunk/f990490a23815ea6ee27e487c70ba2cf513ba43d 2025-08-14T21:18:06.3365883Z * [new tag] trunk/fb887c3bb588cfe782615e67f6c26db636b8539b -> trunk/fb887c3bb588cfe782615e67f6c26db636b8539b 2025-08-14T21:18:06.3366933Z * [new tag] trunk/fc25c68f20f772290927a7031b998b92615259cf -> trunk/fc25c68f20f772290927a7031b998b92615259cf 2025-08-14T21:18:06.3367204Z * [new tag] trunk/fc80f6859e0ccf66513a40f04b9e735e759d4ddb -> trunk/fc80f6859e0ccf66513a40f04b9e735e759d4ddb 2025-08-14T21:18:06.3369068Z * [new tag] trunk/fdfd69bb05488d76123db9cc1cdd90ac4137bbfb -> trunk/fdfd69bb05488d76123db9cc1cdd90ac4137bbfb 2025-08-14T21:18:06.3369487Z * [new tag] trunk/fe3f5fe4ea2ff6f56406dc5d954636ebb08d0a08 -> trunk/fe3f5fe4ea2ff6f56406dc5d954636ebb08d0a08 2025-08-14T21:18:06.3369820Z * [new tag] trunk/fea7e9dd37c02c334b130f6624af6163fde6b2ab -> trunk/fea7e9dd37c02c334b130f6624af6163fde6b2ab 2025-08-14T21:18:06.3370148Z * [new tag] trunk/ff0d56d03592aa03f3ced8359241d21df1783393 -> trunk/ff0d56d03592aa03f3ced8359241d21df1783393 2025-08-14T21:18:06.3370265Z * [new tag] v0.1.1 -> v0.1.1 2025-08-14T21:18:06.3370360Z * [new tag] v0.1.10 -> v0.1.10 2025-08-14T21:18:06.3371161Z * [new tag] v0.1.11 -> v0.1.11 2025-08-14T21:18:06.3371261Z * [new tag] v0.1.12 -> v0.1.12 2025-08-14T21:18:06.3373004Z * [new tag] v0.1.2 -> v0.1.2 2025-08-14T21:18:06.3373273Z * [new tag] v0.1.3 -> v0.1.3 2025-08-14T21:18:06.3373509Z * [new tag] v0.1.4 -> v0.1.4 2025-08-14T21:18:06.3373726Z * [new tag] v0.1.5 -> v0.1.5 2025-08-14T21:18:06.3373822Z * [new tag] v0.1.6 -> v0.1.6 2025-08-14T21:18:06.3374168Z * [new tag] v0.1.7 -> v0.1.7 2025-08-14T21:18:06.3375197Z * [new tag] v0.1.8 -> v0.1.8 2025-08-14T21:18:06.3375324Z * [new tag] v0.1.9 -> v0.1.9 2025-08-14T21:18:06.3375758Z * [new tag] v0.2.0 -> v0.2.0 2025-08-14T21:18:06.3376181Z * [new tag] v0.3.0 -> v0.3.0 2025-08-14T21:18:06.3376813Z * [new tag] v0.3.1 -> v0.3.1 2025-08-14T21:18:06.3377133Z * [new tag] v0.4.0 -> v0.4.0 2025-08-14T21:18:06.3377816Z * [new tag] v0.4.1 -> v0.4.1 2025-08-14T21:18:06.3378813Z * [new tag] v1.0.0 -> v1.0.0 2025-08-14T21:18:06.3379269Z * [new tag] v1.0.0a0 -> v1.0.0a0 2025-08-14T21:18:06.3379606Z * [new tag] v1.0.1 -> v1.0.1 2025-08-14T21:18:06.3379983Z * [new tag] v1.0rc0 -> v1.0rc0 2025-08-14T21:18:06.3380422Z * [new tag] v1.0rc1 -> v1.0rc1 2025-08-14T21:18:06.3381023Z * [new tag] v1.1.0 -> v1.1.0 2025-08-14T21:18:06.3381439Z * [new tag] v1.1.0a0 -> v1.1.0a0 2025-08-14T21:18:06.3382362Z * [new tag] v1.10.0 -> v1.10.0 2025-08-14T21:18:06.3382708Z * [new tag] v1.10.0-rc1 -> v1.10.0-rc1 2025-08-14T21:18:06.3383313Z * [new tag] v1.10.0-rc2 -> v1.10.0-rc2 2025-08-14T21:18:06.3383552Z * [new tag] v1.10.0-rc3 -> v1.10.0-rc3 2025-08-14T21:18:06.3384499Z * [new tag] v1.10.1 -> v1.10.1 2025-08-14T21:18:06.3384724Z * [new tag] v1.10.1-rc1 -> v1.10.1-rc1 2025-08-14T21:18:06.3384973Z * [new tag] v1.10.2 -> v1.10.2 2025-08-14T21:18:06.3385824Z * [new tag] v1.10.2-rc1 -> v1.10.2-rc1 2025-08-14T21:18:06.3386084Z * [new tag] v1.11.0 -> v1.11.0 2025-08-14T21:18:06.3387884Z * [new tag] v1.11.0-rc1 -> v1.11.0-rc1 2025-08-14T21:18:06.3388015Z * [new tag] v1.11.0-rc2 -> v1.11.0-rc2 2025-08-14T21:18:06.3388110Z * [new tag] v1.11.0-rc3 -> v1.11.0-rc3 2025-08-14T21:18:06.3388347Z * [new tag] v1.11.0-rc4 -> v1.11.0-rc4 2025-08-14T21:18:06.3389092Z * [new tag] v1.11.0-rc5 -> v1.11.0-rc5 2025-08-14T21:18:06.3389302Z * [new tag] v1.11.0-rc6 -> v1.11.0-rc6 2025-08-14T21:18:06.3389613Z * [new tag] v1.11.0-rc7 -> v1.11.0-rc7 2025-08-14T21:18:06.3391264Z * [new tag] v1.12.0 -> v1.12.0 2025-08-14T21:18:06.3391397Z * [new tag] v1.12.0-rc1 -> v1.12.0-rc1 2025-08-14T21:18:06.3391521Z * [new tag] v1.12.0-rc2 -> v1.12.0-rc2 2025-08-14T21:18:06.3391719Z * [new tag] v1.12.0-rc3 -> v1.12.0-rc3 2025-08-14T21:18:06.3392585Z * [new tag] v1.12.0-rc4 -> v1.12.0-rc4 2025-08-14T21:18:06.3392698Z * [new tag] v1.12.0-rc5 -> v1.12.0-rc5 2025-08-14T21:18:06.3395330Z * [new tag] v1.12.0-rc6 -> v1.12.0-rc6 2025-08-14T21:18:06.3395595Z * [new tag] v1.12.0-rc7 -> v1.12.0-rc7 2025-08-14T21:18:06.3395956Z * [new tag] v1.12.0-rc8 -> v1.12.0-rc8 2025-08-14T21:18:06.3396060Z * [new tag] v1.12.1 -> v1.12.1 2025-08-14T21:18:06.3396281Z * [new tag] v1.12.1-rc1 -> v1.12.1-rc1 2025-08-14T21:18:06.3396576Z * [new tag] v1.12.1-rc2 -> v1.12.1-rc2 2025-08-14T21:18:06.3397169Z * [new tag] v1.12.1-rc3 -> v1.12.1-rc3 2025-08-14T21:18:06.3397434Z * [new tag] v1.12.1-rc4 -> v1.12.1-rc4 2025-08-14T21:18:06.3397551Z * [new tag] v1.12.1-rc5 -> v1.12.1-rc5 2025-08-14T21:18:06.3397720Z * [new tag] v1.13.0 -> v1.13.0 2025-08-14T21:18:06.3398082Z * [new tag] v1.13.0-rc1 -> v1.13.0-rc1 2025-08-14T21:18:06.3399259Z * [new tag] v1.13.0-rc2 -> v1.13.0-rc2 2025-08-14T21:18:06.3399535Z * [new tag] v1.13.0-rc3 -> v1.13.0-rc3 2025-08-14T21:18:06.3399741Z * [new tag] v1.13.0-rc4 -> v1.13.0-rc4 2025-08-14T21:18:06.3400118Z * [new tag] v1.13.0-rc5 -> v1.13.0-rc5 2025-08-14T21:18:06.3400489Z * [new tag] v1.13.0-rc6 -> v1.13.0-rc6 2025-08-14T21:18:06.3401939Z * [new tag] v1.13.1 -> v1.13.1 2025-08-14T21:18:06.3402206Z * [new tag] v1.13.1-rc1 -> v1.13.1-rc1 2025-08-14T21:18:06.3402329Z * [new tag] v1.2.0 -> v1.2.0 2025-08-14T21:18:06.3402502Z * [new tag] v1.2.0a0 -> v1.2.0a0 2025-08-14T21:18:06.3402948Z * [new tag] v1.3.0 -> v1.3.0 2025-08-14T21:18:06.3404080Z * [new tag] v1.3.0a0 -> v1.3.0a0 2025-08-14T21:18:06.3404345Z * [new tag] v1.3.1 -> v1.3.1 2025-08-14T21:18:06.3404473Z * [new tag] v1.4.0 -> v1.4.0 2025-08-14T21:18:06.3404770Z * [new tag] v1.4.0a0 -> v1.4.0a0 2025-08-14T21:18:06.3405152Z * [new tag] v1.4.1 -> v1.4.1 2025-08-14T21:18:06.3406448Z * [new tag] v1.5.0 -> v1.5.0 2025-08-14T21:18:06.3406884Z * [new tag] v1.5.0-rc1 -> v1.5.0-rc1 2025-08-14T21:18:06.3406991Z * [new tag] v1.5.0-rc2 -> v1.5.0-rc2 2025-08-14T21:18:06.3407444Z * [new tag] v1.5.0-rc3 -> v1.5.0-rc3 2025-08-14T21:18:06.3409090Z * [new tag] v1.5.0-rc4 -> v1.5.0-rc4 2025-08-14T21:18:06.3409348Z * [new tag] v1.5.0-rc5 -> v1.5.0-rc5 2025-08-14T21:18:06.3409463Z * [new tag] v1.5.1 -> v1.5.1 2025-08-14T21:18:06.3409645Z * [new tag] v1.5.1-rc1 -> v1.5.1-rc1 2025-08-14T21:18:06.3409862Z * [new tag] v1.6.0 -> v1.6.0 2025-08-14T21:18:06.3411519Z * [new tag] v1.6.0-rc1 -> v1.6.0-rc1 2025-08-14T21:18:06.3411777Z * [new tag] v1.6.0-rc2 -> v1.6.0-rc2 2025-08-14T21:18:06.3411904Z * [new tag] v1.6.0-rc3 -> v1.6.0-rc3 2025-08-14T21:18:06.3412094Z * [new tag] v1.6.0-rc4 -> v1.6.0-rc4 2025-08-14T21:18:06.3413718Z * [new tag] v1.6.0-rc5 -> v1.6.0-rc5 2025-08-14T21:18:06.3413978Z * [new tag] v1.6.0-rc6 -> v1.6.0-rc6 2025-08-14T21:18:06.3414090Z * [new tag] v1.6.0-rc7 -> v1.6.0-rc7 2025-08-14T21:18:06.3414278Z * [new tag] v1.7.0 -> v1.7.0 2025-08-14T21:18:06.3414713Z * [new tag] v1.7.0-rc1 -> v1.7.0-rc1 2025-08-14T21:18:06.3415204Z * [new tag] v1.7.0-rc2 -> v1.7.0-rc2 2025-08-14T21:18:06.3415762Z * [new tag] v1.7.0-rc3 -> v1.7.0-rc3 2025-08-14T21:18:06.3416050Z * [new tag] v1.7.0-rc4 -> v1.7.0-rc4 2025-08-14T21:18:06.3417482Z * [new tag] v1.7.1 -> v1.7.1 2025-08-14T21:18:06.3417766Z * [new tag] v1.7.1-rc1 -> v1.7.1-rc1 2025-08-14T21:18:06.3417988Z * [new tag] v1.7.1-rc2 -> v1.7.1-rc2 2025-08-14T21:18:06.3418092Z * [new tag] v1.7.1-rc3 -> v1.7.1-rc3 2025-08-14T21:18:06.3419585Z * [new tag] v1.8.0 -> v1.8.0 2025-08-14T21:18:06.3420190Z * [new tag] v1.8.0-rc1 -> v1.8.0-rc1 2025-08-14T21:18:06.3420565Z * [new tag] v1.8.0-rc2 -> v1.8.0-rc2 2025-08-14T21:18:06.3420895Z * [new tag] v1.8.0-rc3 -> v1.8.0-rc3 2025-08-14T21:18:06.3420993Z * [new tag] v1.8.0-rc4 -> v1.8.0-rc4 2025-08-14T21:18:06.3421081Z * [new tag] v1.8.0-rc5 -> v1.8.0-rc5 2025-08-14T21:18:06.3421342Z * [new tag] v1.8.1 -> v1.8.1 2025-08-14T21:18:06.3421986Z * [new tag] v1.8.1-rc1 -> v1.8.1-rc1 2025-08-14T21:18:06.3422284Z * [new tag] v1.8.1-rc2 -> v1.8.1-rc2 2025-08-14T21:18:06.3422728Z * [new tag] v1.8.1-rc3 -> v1.8.1-rc3 2025-08-14T21:18:06.3423833Z * [new tag] v1.8.2 -> v1.8.2 2025-08-14T21:18:06.3423937Z * [new tag] v1.8.2-rc1 -> v1.8.2-rc1 2025-08-14T21:18:06.3426117Z * [new tag] v1.9.0 -> v1.9.0 2025-08-14T21:18:06.3426401Z * [new tag] v1.9.0-rc1 -> v1.9.0-rc1 2025-08-14T21:18:06.3426598Z * [new tag] v1.9.0-rc2 -> v1.9.0-rc2 2025-08-14T21:18:06.3426703Z * [new tag] v1.9.0-rc3 -> v1.9.0-rc3 2025-08-14T21:18:06.3426805Z * [new tag] v1.9.0-rc4 -> v1.9.0-rc4 2025-08-14T21:18:06.3427145Z * [new tag] v1.9.1 -> v1.9.1 2025-08-14T21:18:06.3428434Z * [new tag] v1.9.1-rc1 -> v1.9.1-rc1 2025-08-14T21:18:06.3428649Z * [new tag] v1.9.1-rc2 -> v1.9.1-rc2 2025-08-14T21:18:06.3428746Z * [new tag] v2.0.0 -> v2.0.0 2025-08-14T21:18:06.3430163Z * [new tag] v2.0.0-rc1 -> v2.0.0-rc1 2025-08-14T21:18:06.3430427Z * [new tag] v2.0.0-rc2 -> v2.0.0-rc2 2025-08-14T21:18:06.3430551Z * [new tag] v2.0.0-rc3 -> v2.0.0-rc3 2025-08-14T21:18:06.3430896Z * [new tag] v2.0.0-rc4 -> v2.0.0-rc4 2025-08-14T21:18:06.3432161Z * [new tag] v2.0.0-rc5 -> v2.0.0-rc5 2025-08-14T21:18:06.3432424Z * [new tag] v2.0.0-rc6 -> v2.0.0-rc6 2025-08-14T21:18:06.3432542Z * [new tag] v2.0.1 -> v2.0.1 2025-08-14T21:18:06.3432862Z * [new tag] v2.0.1-rc1 -> v2.0.1-rc1 2025-08-14T21:18:06.3433236Z * [new tag] v2.0.1-rc2 -> v2.0.1-rc2 2025-08-14T21:18:06.3434118Z * [new tag] v2.0.1-rc3 -> v2.0.1-rc3 2025-08-14T21:18:06.3434223Z * [new tag] v2.0.1-rc4 -> v2.0.1-rc4 2025-08-14T21:18:06.3435970Z * [new tag] v2.1.0 -> v2.1.0 2025-08-14T21:18:06.3436253Z * [new tag] v2.1.0-rc1 -> v2.1.0-rc1 2025-08-14T21:18:06.3436503Z * [new tag] v2.1.0-rc2 -> v2.1.0-rc2 2025-08-14T21:18:06.3436612Z * [new tag] v2.1.0-rc3 -> v2.1.0-rc3 2025-08-14T21:18:06.3437699Z * [new tag] v2.1.0-rc4 -> v2.1.0-rc4 2025-08-14T21:18:06.3437814Z * [new tag] v2.1.0-rc5 -> v2.1.0-rc5 2025-08-14T21:18:06.3438056Z * [new tag] v2.1.0-rc6 -> v2.1.0-rc6 2025-08-14T21:18:06.3439355Z * [new tag] v2.1.1 -> v2.1.1 2025-08-14T21:18:06.3439986Z * [new tag] v2.1.1-rc1 -> v2.1.1-rc1 2025-08-14T21:18:06.3440255Z * [new tag] v2.1.1-rc2 -> v2.1.1-rc2 2025-08-14T21:18:06.3440368Z * [new tag] v2.1.1-rc3 -> v2.1.1-rc3 2025-08-14T21:18:06.3440830Z * [new tag] v2.1.1-rc4 -> v2.1.1-rc4 2025-08-14T21:18:06.3442097Z * [new tag] v2.1.1-rc5 -> v2.1.1-rc5 2025-08-14T21:18:06.3442354Z * [new tag] v2.1.1-rc6 -> v2.1.1-rc6 2025-08-14T21:18:06.3442475Z * [new tag] v2.1.2 -> v2.1.2 2025-08-14T21:18:06.3442752Z * [new tag] v2.1.2-rc1 -> v2.1.2-rc1 2025-08-14T21:18:06.3444247Z * [new tag] v2.1.2-rc2 -> v2.1.2-rc2 2025-08-14T21:18:06.3444501Z * [new tag] v2.1.2-rc3 -> v2.1.2-rc3 2025-08-14T21:18:06.3444612Z * [new tag] v2.2.0 -> v2.2.0 2025-08-14T21:18:06.3446698Z * [new tag] v2.2.0-rc1 -> v2.2.0-rc1 2025-08-14T21:18:06.3446963Z * [new tag] v2.2.0-rc2 -> v2.2.0-rc2 2025-08-14T21:18:06.3447067Z * [new tag] v2.2.0-rc3 -> v2.2.0-rc3 2025-08-14T21:18:06.3447244Z * [new tag] v2.2.0-rc4 -> v2.2.0-rc4 2025-08-14T21:18:06.3447349Z * [new tag] v2.2.0-rc5 -> v2.2.0-rc5 2025-08-14T21:18:06.3447587Z * [new tag] v2.2.0-rc6 -> v2.2.0-rc6 2025-08-14T21:18:06.3448018Z * [new tag] v2.2.0-rc7 -> v2.2.0-rc7 2025-08-14T21:18:06.3448353Z * [new tag] v2.2.0-rc8 -> v2.2.0-rc8 2025-08-14T21:18:06.3450497Z * [new tag] v2.2.1 -> v2.2.1 2025-08-14T21:18:06.3450771Z * [new tag] v2.2.1-rc1 -> v2.2.1-rc1 2025-08-14T21:18:06.3450893Z * [new tag] v2.2.1-rc2 -> v2.2.1-rc2 2025-08-14T21:18:06.3451057Z * [new tag] v2.2.1-rc3 -> v2.2.1-rc3 2025-08-14T21:18:06.3451169Z * [new tag] v2.2.2 -> v2.2.2 2025-08-14T21:18:06.3451353Z * [new tag] v2.2.2-rc1 -> v2.2.2-rc1 2025-08-14T21:18:06.3451823Z * [new tag] v2.2.2-rc2 -> v2.2.2-rc2 2025-08-14T21:18:06.3451964Z * [new tag] v2.2.2-rc3 -> v2.2.2-rc3 2025-08-14T21:18:06.3452853Z * [new tag] v2.3.0 -> v2.3.0 2025-08-14T21:18:06.3452963Z * [new tag] v2.3.0-rc1 -> v2.3.0-rc1 2025-08-14T21:18:06.3455198Z * [new tag] v2.3.0-rc10 -> v2.3.0-rc10 2025-08-14T21:18:06.3455470Z * [new tag] v2.3.0-rc11 -> v2.3.0-rc11 2025-08-14T21:18:06.3455580Z * [new tag] v2.3.0-rc12 -> v2.3.0-rc12 2025-08-14T21:18:06.3455697Z * [new tag] v2.3.0-rc2 -> v2.3.0-rc2 2025-08-14T21:18:06.3455801Z * [new tag] v2.3.0-rc3 -> v2.3.0-rc3 2025-08-14T21:18:06.3456191Z * [new tag] v2.3.0-rc4 -> v2.3.0-rc4 2025-08-14T21:18:06.3456919Z * [new tag] v2.3.0-rc5 -> v2.3.0-rc5 2025-08-14T21:18:06.3457312Z * [new tag] v2.3.0-rc6 -> v2.3.0-rc6 2025-08-14T21:18:06.3457750Z * [new tag] v2.3.0-rc7 -> v2.3.0-rc7 2025-08-14T21:18:06.3458097Z * [new tag] v2.3.0-rc8 -> v2.3.0-rc8 2025-08-14T21:18:06.3458493Z * [new tag] v2.3.0-rc9 -> v2.3.0-rc9 2025-08-14T21:18:06.3458881Z * [new tag] v2.3.1 -> v2.3.1 2025-08-14T21:18:06.3460491Z * [new tag] v2.3.1-rc1 -> v2.3.1-rc1 2025-08-14T21:18:06.3460621Z * [new tag] v2.3.1-rc2 -> v2.3.1-rc2 2025-08-14T21:18:06.3460712Z * [new tag] v2.3.1-rc3 -> v2.3.1-rc3 2025-08-14T21:18:06.3460994Z * [new tag] v2.4.0 -> v2.4.0 2025-08-14T21:18:06.3462204Z * [new tag] v2.4.0-rc1 -> v2.4.0-rc1 2025-08-14T21:18:06.3462746Z * [new tag] v2.4.0-rc2 -> v2.4.0-rc2 2025-08-14T21:18:06.3462870Z * [new tag] v2.4.0-rc3 -> v2.4.0-rc3 2025-08-14T21:18:06.3463061Z * [new tag] v2.4.0-rc4 -> v2.4.0-rc4 2025-08-14T21:18:06.3464533Z * [new tag] v2.4.0-rc5 -> v2.4.0-rc5 2025-08-14T21:18:06.3464645Z * [new tag] v2.4.0-rc6 -> v2.4.0-rc6 2025-08-14T21:18:06.3466978Z * [new tag] v2.4.0-rc7 -> v2.4.0-rc7 2025-08-14T21:18:06.3467225Z * [new tag] v2.4.0-rc8 -> v2.4.0-rc8 2025-08-14T21:18:06.3467330Z * [new tag] v2.4.0-rc9 -> v2.4.0-rc9 2025-08-14T21:18:06.3467432Z * [new tag] v2.4.1 -> v2.4.1 2025-08-14T21:18:06.3467650Z * [new tag] v2.4.1-rc1 -> v2.4.1-rc1 2025-08-14T21:18:06.3467758Z * [new tag] v2.4.1-rc2 -> v2.4.1-rc2 2025-08-14T21:18:06.3472088Z * [new tag] v2.4.1-rc3 -> v2.4.1-rc3 2025-08-14T21:18:06.3472353Z * [new tag] v2.5.0 -> v2.5.0 2025-08-14T21:18:06.3472473Z * [new tag] v2.5.0-rc1 -> v2.5.0-rc1 2025-08-14T21:18:06.3472711Z * [new tag] v2.5.0-rc10 -> v2.5.0-rc10 2025-08-14T21:18:06.3472813Z * [new tag] v2.5.0-rc2 -> v2.5.0-rc2 2025-08-14T21:18:06.3472903Z * [new tag] v2.5.0-rc3 -> v2.5.0-rc3 2025-08-14T21:18:06.3473117Z * [new tag] v2.5.0-rc4 -> v2.5.0-rc4 2025-08-14T21:18:06.3473550Z * [new tag] v2.5.0-rc5 -> v2.5.0-rc5 2025-08-14T21:18:06.3473661Z * [new tag] v2.5.0-rc6 -> v2.5.0-rc6 2025-08-14T21:18:06.3473764Z * [new tag] v2.5.0-rc7 -> v2.5.0-rc7 2025-08-14T21:18:06.3473861Z * [new tag] v2.5.0-rc8 -> v2.5.0-rc8 2025-08-14T21:18:06.3473951Z * [new tag] v2.5.0-rc9 -> v2.5.0-rc9 2025-08-14T21:18:06.3474051Z * [new tag] v2.5.1 -> v2.5.1 2025-08-14T21:18:06.3474143Z * [new tag] v2.5.1-rc1 -> v2.5.1-rc1 2025-08-14T21:18:06.3474456Z * [new tag] v2.6.0 -> v2.6.0 2025-08-14T21:18:06.3479072Z * [new tag] v2.6.0-rc1 -> v2.6.0-rc1 2025-08-14T21:18:06.3479334Z * [new tag] v2.6.0-rc2 -> v2.6.0-rc2 2025-08-14T21:18:06.3479447Z * [new tag] v2.6.0-rc3 -> v2.6.0-rc3 2025-08-14T21:18:06.3479614Z * [new tag] v2.6.0-rc4 -> v2.6.0-rc4 2025-08-14T21:18:06.3479856Z * [new tag] v2.6.0-rc5 -> v2.6.0-rc5 2025-08-14T21:18:06.3479961Z * [new tag] v2.6.0-rc6 -> v2.6.0-rc6 2025-08-14T21:18:06.3480182Z * [new tag] v2.6.0-rc7 -> v2.6.0-rc7 2025-08-14T21:18:06.3480759Z * [new tag] v2.6.0-rc8 -> v2.6.0-rc8 2025-08-14T21:18:06.3480898Z * [new tag] v2.6.0-rc9 -> v2.6.0-rc9 2025-08-14T21:18:06.3480995Z * [new tag] v2.7.0 -> v2.7.0 2025-08-14T21:18:06.3481094Z * [new tag] v2.7.0-rc1 -> v2.7.0-rc1 2025-08-14T21:18:06.3481189Z * [new tag] v2.7.0-rc10 -> v2.7.0-rc10 2025-08-14T21:18:06.3481277Z * [new tag] v2.7.0-rc2 -> v2.7.0-rc2 2025-08-14T21:18:06.3485250Z * [new tag] v2.7.0-rc3 -> v2.7.0-rc3 2025-08-14T21:18:06.3485530Z * [new tag] v2.7.0-rc4 -> v2.7.0-rc4 2025-08-14T21:18:06.3485654Z * [new tag] v2.7.0-rc5 -> v2.7.0-rc5 2025-08-14T21:18:06.3485740Z * [new tag] v2.7.0-rc6 -> v2.7.0-rc6 2025-08-14T21:18:06.3485825Z * [new tag] v2.7.0-rc7 -> v2.7.0-rc7 2025-08-14T21:18:06.3486049Z * [new tag] v2.7.0-rc8 -> v2.7.0-rc8 2025-08-14T21:18:06.3486153Z * [new tag] v2.7.0-rc9 -> v2.7.0-rc9 2025-08-14T21:18:06.3486764Z * [new tag] v2.7.1 -> v2.7.1 2025-08-14T21:18:06.3487038Z * [new tag] v2.7.1-rc1 -> v2.7.1-rc1 2025-08-14T21:18:06.3491194Z * [new tag] v2.7.1-rc2 -> v2.7.1-rc2 2025-08-14T21:18:06.3491462Z * [new tag] v2.7.1-rc3 -> v2.7.1-rc3 2025-08-14T21:18:06.3491611Z * [new tag] v2.7.1-rc4 -> v2.7.1-rc4 2025-08-14T21:18:06.3491702Z * [new tag] v2.7.1-rc5 -> v2.7.1-rc5 2025-08-14T21:18:06.3491923Z * [new tag] v2.8.0 -> v2.8.0 2025-08-14T21:18:06.3492034Z * [new tag] v2.8.0-rc1 -> v2.8.0-rc1 2025-08-14T21:18:06.3492219Z * [new tag] v2.8.0-rc2 -> v2.8.0-rc2 2025-08-14T21:18:06.3492528Z * [new tag] v2.8.0-rc3 -> v2.8.0-rc3 2025-08-14T21:18:06.3492940Z * [new tag] v2.8.0-rc4 -> v2.8.0-rc4 2025-08-14T21:18:06.3493048Z * [new tag] v2.8.0-rc5 -> v2.8.0-rc5 2025-08-14T21:18:06.3493137Z * [new tag] v2.8.0-rc6 -> v2.8.0-rc6 2025-08-14T21:18:06.3493229Z * [new tag] v2.8.0-rc7 -> v2.8.0-rc7 2025-08-14T21:18:06.3493315Z * [new tag] v2.8.0-rc8 -> v2.8.0-rc8 2025-08-14T21:18:06.3493436Z * [new tag] whc_flight_1 -> whc_flight_1 2025-08-14T21:18:06.3493530Z * [new tag] whc_flight_2 -> whc_flight_2 2025-08-14T21:18:06.3493619Z * [new tag] whc_flight_4 -> whc_flight_4 2025-08-14T21:18:06.3912201Z [command]/usr/bin/git rev-parse --verify --quiet 1fc683cf17c8c673044538d10266c00f92987be2^{object} 2025-08-14T21:18:06.3940046Z 1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:18:06.3940635Z ##[endgroup] 2025-08-14T21:18:06.3940831Z ##[group]Determining the checkout info 2025-08-14T21:18:06.3940980Z ##[endgroup] 2025-08-14T21:18:06.3948981Z [command]/usr/bin/git sparse-checkout disable 2025-08-14T21:18:06.3991101Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-08-14T21:18:06.4019848Z ##[group]Checking out the ref 2025-08-14T21:18:06.4020072Z [command]/usr/bin/git checkout --progress --force 1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:18:07.3500484Z Note: switching to '1fc683cf17c8c673044538d10266c00f92987be2'. 2025-08-14T21:18:07.3501200Z 2025-08-14T21:18:07.3501553Z You are in 'detached HEAD' state. You can look around, make experimental 2025-08-14T21:18:07.3501920Z changes and commit them, and you can discard any commits you make in this 2025-08-14T21:18:07.3502227Z state without impacting any branches by switching back to a branch. 2025-08-14T21:18:07.3502431Z 2025-08-14T21:18:07.3502556Z If you want to create a new branch to retain commits you create, you may 2025-08-14T21:18:07.3502844Z do so (now or later) by using -c with the switch command. Example: 2025-08-14T21:18:07.3503005Z 2025-08-14T21:18:07.3503092Z git switch -c 2025-08-14T21:18:07.3503213Z 2025-08-14T21:18:07.3503285Z Or undo this operation with: 2025-08-14T21:18:07.3503401Z 2025-08-14T21:18:07.3503462Z git switch - 2025-08-14T21:18:07.3503547Z 2025-08-14T21:18:07.3503693Z Turn off this advice by setting config variable advice.detachedHead to false 2025-08-14T21:18:07.3503889Z 2025-08-14T21:18:07.3504110Z HEAD is now at 1fc683cf17c [Inductor] Allow indexing a flexible layout for extract_input_node_reduction_ranges (#160645) 2025-08-14T21:18:07.3549999Z ##[endgroup] 2025-08-14T21:18:07.3554353Z ##[group]Setting up auth for fetching submodules 2025-08-14T21:18:07.3558651Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-08-14T21:18:07.3629909Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-08-14T21:18:07.3665348Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-08-14T21:18:07.3687201Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-08-14T21:18:07.3725112Z ##[endgroup] 2025-08-14T21:18:07.3729332Z ##[group]Fetching submodules 2025-08-14T21:18:07.3733438Z [command]/usr/bin/git submodule sync --recursive 2025-08-14T21:18:07.4028434Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2025-08-14T21:18:07.4329291Z Submodule 'android/libs/fbjni' (https://github.com/facebookincubator/fbjni.git) registered for path 'android/libs/fbjni' 2025-08-14T21:18:07.4333571Z Submodule 'third_party/NNPACK_deps/FP16' (https://github.com/Maratyszcza/FP16.git) registered for path 'third_party/FP16' 2025-08-14T21:18:07.4335329Z Submodule 'third_party/NNPACK_deps/FXdiv' (https://github.com/Maratyszcza/FXdiv.git) registered for path 'third_party/FXdiv' 2025-08-14T21:18:07.4335941Z Submodule 'third_party/NNPACK' (https://github.com/Maratyszcza/NNPACK.git) registered for path 'third_party/NNPACK' 2025-08-14T21:18:07.4338994Z Submodule 'third_party/NVTX' (https://github.com/NVIDIA/NVTX.git) registered for path 'third_party/NVTX' 2025-08-14T21:18:07.4339693Z Submodule 'third_party/VulkanMemoryAllocator' (https://github.com/GPUOpen-LibrariesAndSDKs/VulkanMemoryAllocator.git) registered for path 'third_party/VulkanMemoryAllocator' 2025-08-14T21:18:07.4340371Z Submodule 'third_party/XNNPACK' (https://github.com/google/XNNPACK.git) registered for path 'third_party/XNNPACK' 2025-08-14T21:18:07.4340851Z Submodule 'third_party/aiter' (https://github.com/ROCm/aiter.git) registered for path 'third_party/aiter' 2025-08-14T21:18:07.4341345Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/benchmark' 2025-08-14T21:18:07.4341934Z Submodule 'third_party/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'third_party/composable_kernel' 2025-08-14T21:18:07.4342525Z Submodule 'third_party/cpp-httplib' (https://github.com/yhirose/cpp-httplib.git) registered for path 'third_party/cpp-httplib' 2025-08-14T21:18:07.4343040Z Submodule 'third_party/cpuinfo' (https://github.com/pytorch/cpuinfo.git) registered for path 'third_party/cpuinfo' 2025-08-14T21:18:07.4344868Z Submodule 'third_party/cudnn_frontend' (https://github.com/NVIDIA/cudnn-frontend.git) registered for path 'third_party/cudnn_frontend' 2025-08-14T21:18:07.4350401Z Submodule 'third_party/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/cutlass' 2025-08-14T21:18:07.4351096Z Submodule 'third_party/fbgemm' (https://github.com/pytorch/fbgemm) registered for path 'third_party/fbgemm' 2025-08-14T21:18:07.4355242Z Submodule 'third_party/flash-attention' (https://github.com/Dao-AILab/flash-attention.git) registered for path 'third_party/flash-attention' 2025-08-14T21:18:07.4359685Z Submodule 'third_party/flatbuffers' (https://github.com/google/flatbuffers.git) registered for path 'third_party/flatbuffers' 2025-08-14T21:18:07.4360383Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/fmt' 2025-08-14T21:18:07.4721797Z Submodule 'third_party/gemmlowp/gemmlowp' (https://github.com/google/gemmlowp.git) registered for path 'third_party/gemmlowp/gemmlowp' 2025-08-14T21:18:07.4726114Z Submodule 'third_party/gloo' (https://github.com/pytorch/gloo) registered for path 'third_party/gloo' 2025-08-14T21:18:07.4730607Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/googletest' 2025-08-14T21:18:07.4734801Z Submodule 'third_party/ideep' (https://github.com/intel/ideep) registered for path 'third_party/ideep' 2025-08-14T21:18:07.4737401Z Submodule 'third_party/ittapi' (https://github.com/intel/ittapi.git) registered for path 'third_party/ittapi' 2025-08-14T21:18:07.4738084Z Submodule 'third_party/kineto' (https://github.com/pytorch/kineto) registered for path 'third_party/kineto' 2025-08-14T21:18:07.4738725Z Submodule 'third_party/kleidiai' (https://github.com/ARM-software/kleidiai.git) registered for path 'third_party/kleidiai' 2025-08-14T21:18:07.4739263Z Submodule 'third_party/mimalloc' (https://github.com/microsoft/mimalloc.git) registered for path 'third_party/mimalloc' 2025-08-14T21:18:07.4739770Z Submodule 'third_party/nlohmann' (https://github.com/nlohmann/json.git) registered for path 'third_party/nlohmann' 2025-08-14T21:18:07.4740230Z Submodule 'third_party/onnx' (https://github.com/onnx/onnx.git) registered for path 'third_party/onnx' 2025-08-14T21:18:07.4741741Z Submodule 'third_party/opentelemetry-cpp' (https://github.com/open-telemetry/opentelemetry-cpp.git) registered for path 'third_party/opentelemetry-cpp' 2025-08-14T21:18:07.4749051Z Submodule 'third_party/pocketfft' (https://github.com/mreineck/pocketfft) registered for path 'third_party/pocketfft' 2025-08-14T21:18:07.4751105Z Submodule 'third_party/protobuf' (https://github.com/protocolbuffers/protobuf.git) registered for path 'third_party/protobuf' 2025-08-14T21:18:07.4756344Z Submodule 'third_party/NNPACK_deps/psimd' (https://github.com/Maratyszcza/psimd.git) registered for path 'third_party/psimd' 2025-08-14T21:18:07.4757140Z Submodule 'third_party/NNPACK_deps/pthreadpool' (https://github.com/Maratyszcza/pthreadpool.git) registered for path 'third_party/pthreadpool' 2025-08-14T21:18:07.4757743Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/pybind11' 2025-08-14T21:18:07.4758283Z Submodule 'third_party/python-peachpy' (https://github.com/malfet/PeachPy.git) registered for path 'third_party/python-peachpy' 2025-08-14T21:18:07.4761750Z Submodule 'third_party/sleef' (https://github.com/shibatch/sleef) registered for path 'third_party/sleef' 2025-08-14T21:18:07.4766649Z Submodule 'third_party/tensorpipe' (https://github.com/pytorch/tensorpipe.git) registered for path 'third_party/tensorpipe' 2025-08-14T21:18:07.4797967Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/android/libs/fbjni'... 2025-08-14T21:18:07.7126988Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/FXdiv'... 2025-08-14T21:18:07.7127438Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/psimd'... 2025-08-14T21:18:07.7127813Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/FP16'... 2025-08-14T21:18:07.7128404Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/NNPACK'... 2025-08-14T21:18:07.7147743Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/python-peachpy'... 2025-08-14T21:18:07.9575050Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pocketfft'... 2025-08-14T21:18:07.9575803Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ideep'... 2025-08-14T21:18:07.9576403Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/gloo'... 2025-08-14T21:18:07.9576979Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ittapi'... 2025-08-14T21:18:07.9577588Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pthreadpool'... 2025-08-14T21:18:07.9578674Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kleidiai'... 2025-08-14T21:18:07.9579438Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/benchmark'... 2025-08-14T21:18:07.9580217Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/gemmlowp/gemmlowp'... 2025-08-14T21:18:07.9580921Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/NVTX'... 2025-08-14T21:18:07.9956889Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/sleef'... 2025-08-14T21:18:08.5950302Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cpp-httplib'... 2025-08-14T21:18:08.5950811Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flash-attention'... 2025-08-14T21:18:08.5951364Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cpuinfo'... 2025-08-14T21:18:08.5951797Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe'... 2025-08-14T21:18:08.5952214Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/googletest'... 2025-08-14T21:18:08.5952646Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/mimalloc'... 2025-08-14T21:18:08.6274256Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/VulkanMemoryAllocator'... 2025-08-14T21:18:08.7818546Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cudnn_frontend'... 2025-08-14T21:18:08.7819297Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/pybind11'... 2025-08-14T21:18:08.8040330Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/XNNPACK'... 2025-08-14T21:18:19.8136055Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fmt'... 2025-08-14T21:18:19.8137510Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto'... 2025-08-14T21:18:19.8137933Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flatbuffers'... 2025-08-14T21:18:19.8138369Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm'... 2025-08-14T21:18:19.8138730Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/cutlass'... 2025-08-14T21:18:19.8139100Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/onnx'... 2025-08-14T21:18:19.8139490Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/composable_kernel'... 2025-08-14T21:18:19.8139894Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/aiter'... 2025-08-14T21:18:19.8140277Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp'... 2025-08-14T21:18:19.8140673Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/nlohmann'... 2025-08-14T21:18:19.8141046Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf'... 2025-08-14T21:18:19.8265554Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2025-08-14T21:18:19.8370557Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2025-08-14T21:18:19.8452972Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2025-08-14T21:18:19.8642764Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2025-08-14T21:18:19.9265126Z Submodule path 'third_party/NVTX': checked out '2942f167cc30c5e3a44a2aecd5b0d9c07ff61a07' 2025-08-14T21:18:19.9696093Z Submodule path 'third_party/VulkanMemoryAllocator': checked out '1d8f600fd424278486eade7ed3e877c99f0846b1' 2025-08-14T21:18:20.4690425Z Submodule path 'third_party/XNNPACK': checked out '51a0103656eff6fc9bfd39a4597923c4b542c883' 2025-08-14T21:18:20.5825208Z Submodule path 'third_party/aiter': checked out '01aae101b9e5e94d6c16a9514c9fb8df99c93150' 2025-08-14T21:18:20.5841484Z Submodule '3rdparty/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'third_party/aiter/3rdparty/composable_kernel' 2025-08-14T21:18:20.5866852Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/aiter/3rdparty/composable_kernel'... 2025-08-14T21:18:23.7812347Z Submodule path 'third_party/aiter/3rdparty/composable_kernel': checked out 'cffe8fa2a442ac8e80dd236a1a5d24fe3d7e0cbf' 2025-08-14T21:18:23.8002032Z Submodule path 'third_party/benchmark': checked out '299e5928955cc62af9968370293b916f5130916f' 2025-08-14T21:18:24.0219176Z Submodule path 'third_party/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-08-14T21:18:24.0617720Z Submodule path 'third_party/cpp-httplib': checked out '3af7f2c16147f3fbc6e4d717032daf505dc1652c' 2025-08-14T21:18:24.1401475Z Submodule path 'third_party/cpuinfo': checked out '5e3d2445e6a84d9599bee2bf78edbb4d80865e1d' 2025-08-14T21:18:24.1752608Z Submodule path 'third_party/cudnn_frontend': checked out 'f937055efc6d414d11f4c6577e3977fe74f35fb6' 2025-08-14T21:18:24.6549825Z Submodule path 'third_party/cutlass': checked out 'e51efbfe18fe4f4cbb66ab814c55bf4aa0185491' 2025-08-14T21:18:24.7570678Z Submodule path 'third_party/fbgemm': checked out '21c7d30c526c0f1ad873ecc632dca6cfa8a69067' 2025-08-14T21:18:24.7587540Z Submodule 'external/asmjit' (https://github.com/asmjit/asmjit.git) registered for path 'third_party/fbgemm/external/asmjit' 2025-08-14T21:18:24.7588655Z Submodule 'external/composable_kernel' (https://github.com/jwfromm/composable_kernel.git) registered for path 'third_party/fbgemm/external/composable_kernel' 2025-08-14T21:18:24.7589655Z Submodule 'external/cpuinfo' (https://github.com/pytorch/cpuinfo) registered for path 'third_party/fbgemm/external/cpuinfo' 2025-08-14T21:18:24.7593346Z Submodule 'external/cutlass' (https://github.com/jwfromm/cutlass) registered for path 'third_party/fbgemm/external/cutlass' 2025-08-14T21:18:24.7595333Z Submodule 'external/googletest' (https://github.com/google/googletest) registered for path 'third_party/fbgemm/external/googletest' 2025-08-14T21:18:24.7596124Z Submodule 'external/hipify_torch' (https://github.com/ROCmSoftwarePlatform/hipify_torch.git) registered for path 'third_party/fbgemm/external/hipify_torch' 2025-08-14T21:18:24.7600046Z Submodule 'external/json' (https://github.com/nlohmann/json.git) registered for path 'third_party/fbgemm/external/json' 2025-08-14T21:18:24.7621778Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/asmjit'... 2025-08-14T21:18:25.8335934Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/hipify_torch'... 2025-08-14T21:18:25.8337255Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/cpuinfo'... 2025-08-14T21:18:25.8337803Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/googletest'... 2025-08-14T21:18:25.9010969Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/composable_kernel'... 2025-08-14T21:18:26.0012573Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/cutlass'... 2025-08-14T21:18:26.9617352Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/external/json'... 2025-08-14T21:18:30.7841583Z Submodule path 'third_party/fbgemm/external/asmjit': checked out 'a3199e8857792cd10b7589ff5d58343d2c9008ea' 2025-08-14T21:18:30.9688716Z Submodule path 'third_party/fbgemm/external/composable_kernel': checked out 'b1281b8b08d973a7064f864f47eeb30f3e2596e9' 2025-08-14T21:18:31.0493133Z Submodule path 'third_party/fbgemm/external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-08-14T21:18:31.5137553Z Submodule path 'third_party/fbgemm/external/cutlass': checked out 'b40777404c174b9694a870bff5c13ce6b7f656ad' 2025-08-14T21:18:31.5510031Z Submodule path 'third_party/fbgemm/external/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-08-14T21:18:31.5615214Z Submodule path 'third_party/fbgemm/external/hipify_torch': checked out 'a4337c69fe0e2552a7b7b0669178926beeed828c' 2025-08-14T21:18:31.6428717Z Submodule path 'third_party/fbgemm/external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-08-14T21:18:31.6962535Z Submodule path 'third_party/flash-attention': checked out '979702c87a8713a8e0a5e9fee122b90d2ef13be5' 2025-08-14T21:18:31.6977442Z Submodule 'csrc/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'third_party/flash-attention/csrc/composable_kernel' 2025-08-14T21:18:31.6982113Z Submodule 'csrc/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/flash-attention/csrc/cutlass' 2025-08-14T21:18:31.7005513Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flash-attention/csrc/composable_kernel'... 2025-08-14T21:18:34.5884103Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/flash-attention/csrc/cutlass'... 2025-08-14T21:18:34.7528412Z Submodule path 'third_party/flash-attention/csrc/composable_kernel': checked out '888317e698e9803c62bd38568abc9e05d7709f33' 2025-08-14T21:18:35.1745275Z Submodule path 'third_party/flash-attention/csrc/cutlass': checked out 'c506e16788cb08416a4a57e11a9067beeee29420' 2025-08-14T21:18:35.2730408Z Submodule path 'third_party/flatbuffers': checked out 'a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757' 2025-08-14T21:18:35.2995634Z Submodule path 'third_party/fmt': checked out '40626af88bd7df9a5fb80be7b25ac85b122d6c21' 2025-08-14T21:18:35.3303453Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2025-08-14T21:18:35.3494270Z Submodule path 'third_party/gloo': checked out 'c7b7b022c124d9643957d9bd55f57ac59fce8fa2' 2025-08-14T21:18:35.3846716Z Submodule path 'third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-08-14T21:18:35.3954782Z Submodule path 'third_party/ideep': checked out '719d8e6cd7f7a0e01b155657526d693acf97c2b3' 2025-08-14T21:18:35.3968207Z Submodule 'mkl-dnn' (https://github.com/intel/mkl-dnn.git) registered for path 'third_party/ideep/mkl-dnn' 2025-08-14T21:18:35.3994331Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/ideep/mkl-dnn'... 2025-08-14T21:18:46.0163591Z Submodule path 'third_party/ideep/mkl-dnn': checked out '8d263e693366ef8db40acc569cc7d8edf644556d' 2025-08-14T21:18:46.0323964Z Submodule path 'third_party/ittapi': checked out 'dec1d23ca65ab069d225dfe40dea14f455170959' 2025-08-14T21:18:46.1100003Z Submodule path 'third_party/kineto': checked out '5e7501833f1021ce6f618572d3baf657b6319658' 2025-08-14T21:18:46.1114664Z Submodule 'libkineto/third_party/dynolog' (https://github.com/facebookincubator/dynolog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog' 2025-08-14T21:18:46.1115577Z Submodule 'libkineto/third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/fmt' 2025-08-14T21:18:46.1116446Z Submodule 'libkineto/third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/googletest' 2025-08-14T21:18:46.1142252Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog'... 2025-08-14T21:18:46.7302928Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/fmt'... 2025-08-14T21:18:47.3095753Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/googletest'... 2025-08-14T21:18:47.3740939Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out '7d04a0053a845370ae06ce317a22a48e9edcc74e' 2025-08-14T21:18:47.3756018Z Submodule 'third_party/DCGM' (https://github.com/NVIDIA/DCGM.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-08-14T21:18:47.3758373Z Submodule 'third_party/cpr' (https://github.com/libcpr/cpr.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-08-14T21:18:47.3763160Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-08-14T21:18:47.3767675Z Submodule 'third_party/gflags' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-08-14T21:18:47.3769786Z Submodule 'third_party/glog' (https://github.com/google/glog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-08-14T21:18:47.3770624Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-08-14T21:18:47.3774183Z Submodule 'third_party/json' (https://github.com/nlohmann/json.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-08-14T21:18:47.3776185Z Submodule 'third_party/pfs' (https://github.com/dtrugman/pfs.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-08-14T21:18:47.3794812Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM'... 2025-08-14T21:18:48.3952509Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/pfs'... 2025-08-14T21:18:48.3954254Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags'... 2025-08-14T21:18:48.3954855Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/cpr'... 2025-08-14T21:18:48.3955426Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/glog'... 2025-08-14T21:18:48.3956040Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/googletest'... 2025-08-14T21:18:48.4953040Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/fmt'... 2025-08-14T21:18:48.6519358Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/json'... 2025-08-14T21:18:54.1386479Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2025-08-14T21:18:54.1531956Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2025-08-14T21:18:54.1825172Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2025-08-14T21:18:54.1943356Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2025-08-14T21:18:54.1957243Z Submodule 'doc' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-08-14T21:18:54.1980191Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc'... 2025-08-14T21:18:54.4794503Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2025-08-14T21:18:54.4945440Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2025-08-14T21:18:54.5277703Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '58d77fa8070e8cec2dc1ed015d66b454c8d78850' 2025-08-14T21:18:54.6114312Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2025-08-14T21:18:54.6250010Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2025-08-14T21:18:54.6559756Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '0041a40c1350ba702d475b9c4ad62da77caea164' 2025-08-14T21:18:54.7025873Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '7aca84427f224eeed3144123d5230d5871e93347' 2025-08-14T21:18:54.7364910Z Submodule path 'third_party/kleidiai': checked out 'cca02c2f69dd18e1f12647c1c0bdc8cf90e680c7' 2025-08-14T21:18:54.7679943Z Submodule path 'third_party/mimalloc': checked out 'fbd8b99c2b828428947d70fdc046bb55609be93e' 2025-08-14T21:18:54.8697147Z Submodule path 'third_party/nlohmann': checked out '55f93686c01528224f448c19128836e7df245f72' 2025-08-14T21:18:55.1335678Z Submodule path 'third_party/onnx': checked out 'e709452ef2bbc1d113faf678c24e6d3467696e83' 2025-08-14T21:18:55.1361812Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/onnx/third_party/pybind11' 2025-08-14T21:18:55.1387563Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/onnx/third_party/pybind11'... 2025-08-14T21:18:56.0024345Z Submodule path 'third_party/onnx/third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2025-08-14T21:18:56.0521671Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2025-08-14T21:18:56.0537178Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark) registered for path 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-08-14T21:18:56.0541859Z Submodule 'third_party/googletest' (https://github.com/google/googletest) registered for path 'third_party/opentelemetry-cpp/third_party/googletest' 2025-08-14T21:18:56.0543375Z Submodule 'third_party/ms-gsl' (https://github.com/microsoft/GSL) registered for path 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-08-14T21:18:56.0544103Z Submodule 'third_party/nlohmann-json' (https://github.com/nlohmann/json) registered for path 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-08-14T21:18:56.0544922Z Submodule 'third_party/opentelemetry-proto' (https://github.com/open-telemetry/opentelemetry-proto) registered for path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-08-14T21:18:56.0545711Z Submodule 'third_party/opentracing-cpp' (https://github.com/opentracing/opentracing-cpp.git) registered for path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-08-14T21:18:56.0546424Z Submodule 'third_party/prometheus-cpp' (https://github.com/jupp0r/prometheus-cpp) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-08-14T21:18:56.0547034Z Submodule 'tools/vcpkg' (https://github.com/Microsoft/vcpkg) registered for path 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-08-14T21:18:56.0573086Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/benchmark'... 2025-08-14T21:18:56.4291631Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/opentracing-cpp'... 2025-08-14T21:18:56.4293621Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/opentelemetry-proto'... 2025-08-14T21:18:56.4299251Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/ms-gsl'... 2025-08-14T21:18:56.4301773Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp'... 2025-08-14T21:18:56.5291748Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/googletest'... 2025-08-14T21:18:57.0864130Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/nlohmann-json'... 2025-08-14T21:19:02.1137961Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/tools/vcpkg'... 2025-08-14T21:19:02.8894940Z Submodule path 'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2025-08-14T21:19:02.9220387Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2025-08-14T21:19:02.9361720Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2025-08-14T21:19:03.0196266Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2025-08-14T21:19:03.0312545Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2025-08-14T21:19:03.0436760Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2025-08-14T21:19:03.0562382Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2025-08-14T21:19:03.0576216Z Submodule 'civetweb' (https://github.com/civetweb/civetweb.git) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-08-14T21:19:03.0577519Z Submodule 'googletest' (https://github.com/google/googletest.git) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-08-14T21:19:03.0602866Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb'... 2025-08-14T21:19:04.9263143Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest'... 2025-08-14T21:19:05.1224610Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2025-08-14T21:19:05.1601968Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-08-14T21:19:05.4574658Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2025-08-14T21:19:05.4680440Z Submodule path 'third_party/pocketfft': checked out '0fa0ef591e38c2758e3184c6c23e497b9f732ffa' 2025-08-14T21:19:05.6718470Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2025-08-14T21:19:05.6735757Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/protobuf/third_party/benchmark' 2025-08-14T21:19:05.6739809Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/protobuf/third_party/googletest' 2025-08-14T21:19:05.6761912Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/benchmark'... 2025-08-14T21:19:06.7559734Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/googletest'... 2025-08-14T21:19:06.7670085Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2025-08-14T21:19:06.8238762Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2025-08-14T21:19:06.8323741Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2025-08-14T21:19:06.8426804Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2025-08-14T21:19:06.8717767Z Submodule path 'third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2025-08-14T21:19:06.8942399Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2025-08-14T21:19:06.9285383Z Submodule path 'third_party/sleef': checked out '5a1d179df9cf652951b59010a2d2075372d67f68' 2025-08-14T21:19:06.9490934Z Submodule path 'third_party/tensorpipe': checked out 'dacda0567d9f23d4bc503e1c4f84aa65f33ac38a' 2025-08-14T21:19:06.9505514Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/tensorpipe/third_party/googletest' 2025-08-14T21:19:06.9509997Z Submodule 'third_party/libnop' (https://github.com/google/libnop.git) registered for path 'third_party/tensorpipe/third_party/libnop' 2025-08-14T21:19:06.9514624Z Submodule 'third_party/libuv' (https://github.com/libuv/libuv.git) registered for path 'third_party/tensorpipe/third_party/libuv' 2025-08-14T21:19:06.9518986Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/tensorpipe/third_party/pybind11' 2025-08-14T21:19:06.9535957Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/googletest'... 2025-08-14T21:19:07.7872128Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libnop'... 2025-08-14T21:19:07.7892896Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libuv'... 2025-08-14T21:19:07.9849746Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11'... 2025-08-14T21:19:08.0302132Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2025-08-14T21:19:08.0430310Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2025-08-14T21:19:08.1001316Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '5152db2cbfeb5582e9c27c5ea1dba2cd9e10759b' 2025-08-14T21:19:08.1230995Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2025-08-14T21:19:08.1244655Z Submodule 'tools/clang' (https://github.com/wjakob/clang-cindex-python3) registered for path 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-08-14T21:19:08.1269436Z Cloning into '/home/ec2-user/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11/tools/clang'... 2025-08-14T21:19:08.3456850Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2025-08-14T21:19:08.3498658Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2025-08-14T21:19:08.3792355Z Entering 'android/libs/fbjni' 2025-08-14T21:19:08.3829234Z Entering 'third_party/FP16' 2025-08-14T21:19:08.3867838Z Entering 'third_party/FXdiv' 2025-08-14T21:19:08.3908988Z Entering 'third_party/NNPACK' 2025-08-14T21:19:08.3949323Z Entering 'third_party/NVTX' 2025-08-14T21:19:08.3984958Z Entering 'third_party/VulkanMemoryAllocator' 2025-08-14T21:19:08.4025525Z Entering 'third_party/XNNPACK' 2025-08-14T21:19:08.4074201Z Entering 'third_party/aiter' 2025-08-14T21:19:08.4112065Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-08-14T21:19:08.4160288Z Entering 'third_party/benchmark' 2025-08-14T21:19:08.4198966Z Entering 'third_party/composable_kernel' 2025-08-14T21:19:08.4242831Z Entering 'third_party/cpp-httplib' 2025-08-14T21:19:08.4278651Z Entering 'third_party/cpuinfo' 2025-08-14T21:19:08.4319424Z Entering 'third_party/cudnn_frontend' 2025-08-14T21:19:08.4357533Z Entering 'third_party/cutlass' 2025-08-14T21:19:08.4402458Z Entering 'third_party/fbgemm' 2025-08-14T21:19:08.4442105Z Entering 'third_party/fbgemm/external/asmjit' 2025-08-14T21:19:08.4478729Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-08-14T21:19:08.4523224Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-08-14T21:19:08.4562013Z Entering 'third_party/fbgemm/external/cutlass' 2025-08-14T21:19:08.4604543Z Entering 'third_party/fbgemm/external/googletest' 2025-08-14T21:19:08.4644626Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-08-14T21:19:08.4678673Z Entering 'third_party/fbgemm/external/json' 2025-08-14T21:19:08.4723780Z Entering 'third_party/flash-attention' 2025-08-14T21:19:08.4759904Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-08-14T21:19:08.4803580Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-08-14T21:19:08.4848324Z Entering 'third_party/flatbuffers' 2025-08-14T21:19:08.4884520Z Entering 'third_party/fmt' 2025-08-14T21:19:08.4924809Z Entering 'third_party/gemmlowp/gemmlowp' 2025-08-14T21:19:08.4960592Z Entering 'third_party/gloo' 2025-08-14T21:19:08.4998408Z Entering 'third_party/googletest' 2025-08-14T21:19:08.5037610Z Entering 'third_party/ideep' 2025-08-14T21:19:08.5072842Z Entering 'third_party/ideep/mkl-dnn' 2025-08-14T21:19:08.5117373Z Entering 'third_party/ittapi' 2025-08-14T21:19:08.5155149Z Entering 'third_party/kineto' 2025-08-14T21:19:08.5191991Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-08-14T21:19:08.5224570Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-08-14T21:19:08.5263678Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-08-14T21:19:08.5301198Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-08-14T21:19:08.5338387Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-08-14T21:19:08.5373574Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-08-14T21:19:08.5416537Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-08-14T21:19:08.5453159Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-08-14T21:19:08.5488474Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-08-14T21:19:08.5527880Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-08-14T21:19:08.5565850Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-08-14T21:19:08.5602819Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-08-14T21:19:08.5643525Z Entering 'third_party/kleidiai' 2025-08-14T21:19:08.5680493Z Entering 'third_party/mimalloc' 2025-08-14T21:19:08.5719674Z Entering 'third_party/nlohmann' 2025-08-14T21:19:08.5758101Z Entering 'third_party/onnx' 2025-08-14T21:19:08.5806981Z Entering 'third_party/onnx/third_party/pybind11' 2025-08-14T21:19:08.5846776Z Entering 'third_party/opentelemetry-cpp' 2025-08-14T21:19:08.5884311Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-08-14T21:19:08.5921588Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-08-14T21:19:08.5958196Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-08-14T21:19:08.5994843Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-08-14T21:19:08.6033424Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-08-14T21:19:08.6069499Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-08-14T21:19:08.6103215Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-08-14T21:19:08.6136806Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-08-14T21:19:08.6175281Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-08-14T21:19:08.6216015Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-08-14T21:19:08.6268565Z Entering 'third_party/pocketfft' 2025-08-14T21:19:08.6306091Z Entering 'third_party/protobuf' 2025-08-14T21:19:08.6345354Z Entering 'third_party/protobuf/third_party/benchmark' 2025-08-14T21:19:08.6381080Z Entering 'third_party/protobuf/third_party/googletest' 2025-08-14T21:19:08.6421843Z Entering 'third_party/psimd' 2025-08-14T21:19:08.6459880Z Entering 'third_party/pthreadpool' 2025-08-14T21:19:08.6498303Z Entering 'third_party/pybind11' 2025-08-14T21:19:08.6536101Z Entering 'third_party/python-peachpy' 2025-08-14T21:19:08.6572682Z Entering 'third_party/sleef' 2025-08-14T21:19:08.6611285Z Entering 'third_party/tensorpipe' 2025-08-14T21:19:08.6647714Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-08-14T21:19:08.6683216Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-08-14T21:19:08.6720267Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-08-14T21:19:08.6757349Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-08-14T21:19:08.6795279Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-08-14T21:19:08.6852891Z ##[endgroup] 2025-08-14T21:19:08.6857187Z ##[group]Persisting credentials for submodules 2025-08-14T21:19:08.6861726Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-08-14T21:19:08.7154098Z Entering 'android/libs/fbjni' 2025-08-14T21:19:08.7205532Z Entering 'third_party/FP16' 2025-08-14T21:19:08.7256457Z Entering 'third_party/FXdiv' 2025-08-14T21:19:08.7309234Z Entering 'third_party/NNPACK' 2025-08-14T21:19:08.7360216Z Entering 'third_party/NVTX' 2025-08-14T21:19:08.7411519Z Entering 'third_party/VulkanMemoryAllocator' 2025-08-14T21:19:08.7460638Z Entering 'third_party/XNNPACK' 2025-08-14T21:19:08.7523492Z Entering 'third_party/aiter' 2025-08-14T21:19:08.7573797Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-08-14T21:19:08.7634085Z Entering 'third_party/benchmark' 2025-08-14T21:19:08.7683230Z Entering 'third_party/composable_kernel' 2025-08-14T21:19:08.7741705Z Entering 'third_party/cpp-httplib' 2025-08-14T21:19:08.7794369Z Entering 'third_party/cpuinfo' 2025-08-14T21:19:08.7844662Z Entering 'third_party/cudnn_frontend' 2025-08-14T21:19:08.7897487Z Entering 'third_party/cutlass' 2025-08-14T21:19:08.7956092Z Entering 'third_party/fbgemm' 2025-08-14T21:19:08.8010773Z Entering 'third_party/fbgemm/external/asmjit' 2025-08-14T21:19:08.8059041Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-08-14T21:19:08.8118194Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-08-14T21:19:08.8168377Z Entering 'third_party/fbgemm/external/cutlass' 2025-08-14T21:19:08.8224479Z Entering 'third_party/fbgemm/external/googletest' 2025-08-14T21:19:08.8274923Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-08-14T21:19:08.8328507Z Entering 'third_party/fbgemm/external/json' 2025-08-14T21:19:08.8383437Z Entering 'third_party/flash-attention' 2025-08-14T21:19:08.8434254Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-08-14T21:19:08.8486477Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-08-14T21:19:08.8546009Z Entering 'third_party/flatbuffers' 2025-08-14T21:19:08.8603497Z Entering 'third_party/fmt' 2025-08-14T21:19:08.8655461Z Entering 'third_party/gemmlowp/gemmlowp' 2025-08-14T21:19:08.8708012Z Entering 'third_party/gloo' 2025-08-14T21:19:08.8761667Z Entering 'third_party/googletest' 2025-08-14T21:19:08.8818852Z Entering 'third_party/ideep' 2025-08-14T21:19:08.8866785Z Entering 'third_party/ideep/mkl-dnn' 2025-08-14T21:19:08.8925227Z Entering 'third_party/ittapi' 2025-08-14T21:19:08.8974768Z Entering 'third_party/kineto' 2025-08-14T21:19:08.9028331Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-08-14T21:19:08.9077588Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-08-14T21:19:08.9131612Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-08-14T21:19:08.9181852Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-08-14T21:19:08.9232466Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-08-14T21:19:08.9283654Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-08-14T21:19:08.9337963Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-08-14T21:19:08.9389137Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-08-14T21:19:08.9441467Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-08-14T21:19:08.9495147Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-08-14T21:19:08.9549965Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-08-14T21:19:08.9601620Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-08-14T21:19:08.9655824Z Entering 'third_party/kleidiai' 2025-08-14T21:19:08.9710383Z Entering 'third_party/mimalloc' 2025-08-14T21:19:08.9764657Z Entering 'third_party/nlohmann' 2025-08-14T21:19:08.9817341Z Entering 'third_party/onnx' 2025-08-14T21:19:08.9879526Z Entering 'third_party/onnx/third_party/pybind11' 2025-08-14T21:19:08.9935532Z Entering 'third_party/opentelemetry-cpp' 2025-08-14T21:19:08.9986992Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-08-14T21:19:09.0038201Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-08-14T21:19:09.0090526Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-08-14T21:19:09.0139313Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-08-14T21:19:09.0190274Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-08-14T21:19:09.0243814Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-08-14T21:19:09.0294294Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-08-14T21:19:09.0342618Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-08-14T21:19:09.0396640Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-08-14T21:19:09.0452143Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-08-14T21:19:09.0514695Z Entering 'third_party/pocketfft' 2025-08-14T21:19:09.0565764Z Entering 'third_party/protobuf' 2025-08-14T21:19:09.0618316Z Entering 'third_party/protobuf/third_party/benchmark' 2025-08-14T21:19:09.0668845Z Entering 'third_party/protobuf/third_party/googletest' 2025-08-14T21:19:09.0724350Z Entering 'third_party/psimd' 2025-08-14T21:19:09.0774335Z Entering 'third_party/pthreadpool' 2025-08-14T21:19:09.0828122Z Entering 'third_party/pybind11' 2025-08-14T21:19:09.0878017Z Entering 'third_party/python-peachpy' 2025-08-14T21:19:09.0930644Z Entering 'third_party/sleef' 2025-08-14T21:19:09.0981594Z Entering 'third_party/tensorpipe' 2025-08-14T21:19:09.1037528Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-08-14T21:19:09.1084801Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-08-14T21:19:09.1135166Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-08-14T21:19:09.1183501Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-08-14T21:19:09.1237301Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-08-14T21:19:09.1307033Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-08-14T21:19:09.1609211Z Entering 'android/libs/fbjni' 2025-08-14T21:19:09.1652227Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-08-14T21:19:09.1669035Z Entering 'third_party/FP16' 2025-08-14T21:19:09.1713753Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-08-14T21:19:09.1732732Z Entering 'third_party/FXdiv' 2025-08-14T21:19:09.1778614Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-08-14T21:19:09.1797450Z Entering 'third_party/NNPACK' 2025-08-14T21:19:09.1842243Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-08-14T21:19:09.1860372Z Entering 'third_party/NVTX' 2025-08-14T21:19:09.1906043Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-08-14T21:19:09.1921986Z Entering 'third_party/VulkanMemoryAllocator' 2025-08-14T21:19:09.1966835Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-08-14T21:19:09.1980634Z Entering 'third_party/XNNPACK' 2025-08-14T21:19:09.2029712Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-08-14T21:19:09.2057789Z Entering 'third_party/aiter' 2025-08-14T21:19:09.2104505Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-08-14T21:19:09.2120866Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-08-14T21:19:09.2163752Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-08-14T21:19:09.2189954Z Entering 'third_party/benchmark' 2025-08-14T21:19:09.2231683Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-08-14T21:19:09.2249348Z Entering 'third_party/composable_kernel' 2025-08-14T21:19:09.2296936Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-08-14T21:19:09.2319949Z Entering 'third_party/cpp-httplib' 2025-08-14T21:19:09.2363820Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-08-14T21:19:09.2377857Z Entering 'third_party/cpuinfo' 2025-08-14T21:19:09.2424753Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-08-14T21:19:09.2444533Z Entering 'third_party/cudnn_frontend' 2025-08-14T21:19:09.2490769Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-08-14T21:19:09.2509316Z Entering 'third_party/cutlass' 2025-08-14T21:19:09.2554633Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-08-14T21:19:09.2575883Z Entering 'third_party/fbgemm' 2025-08-14T21:19:09.2623062Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-08-14T21:19:09.2640731Z Entering 'third_party/fbgemm/external/asmjit' 2025-08-14T21:19:09.2683355Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-08-14T21:19:09.2703397Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-08-14T21:19:09.2746643Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-08-14T21:19:09.2767461Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-08-14T21:19:09.2813775Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-08-14T21:19:09.2829938Z Entering 'third_party/fbgemm/external/cutlass' 2025-08-14T21:19:09.2875440Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-08-14T21:19:09.2897320Z Entering 'third_party/fbgemm/external/googletest' 2025-08-14T21:19:09.2941438Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-08-14T21:19:09.2957843Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-08-14T21:19:09.3001238Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-08-14T21:19:09.3015715Z Entering 'third_party/fbgemm/external/json' 2025-08-14T21:19:09.3059701Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-08-14T21:19:09.3079734Z Entering 'third_party/flash-attention' 2025-08-14T21:19:09.3125075Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-08-14T21:19:09.3146168Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-08-14T21:19:09.3189865Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-08-14T21:19:09.3211520Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-08-14T21:19:09.3253433Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-08-14T21:19:09.3277041Z Entering 'third_party/flatbuffers' 2025-08-14T21:19:09.3323411Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-08-14T21:19:09.3342051Z Entering 'third_party/fmt' 2025-08-14T21:19:09.3388127Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-08-14T21:19:09.3406697Z Entering 'third_party/gemmlowp/gemmlowp' 2025-08-14T21:19:09.3451603Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-08-14T21:19:09.3468766Z Entering 'third_party/gloo' 2025-08-14T21:19:09.3514836Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-08-14T21:19:09.3533801Z Entering 'third_party/googletest' 2025-08-14T21:19:09.3582429Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-08-14T21:19:09.3595376Z Entering 'third_party/ideep' 2025-08-14T21:19:09.3640404Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-08-14T21:19:09.3654999Z Entering 'third_party/ideep/mkl-dnn' 2025-08-14T21:19:09.3702406Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-08-14T21:19:09.3726237Z Entering 'third_party/ittapi' 2025-08-14T21:19:09.3769946Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-08-14T21:19:09.3787177Z Entering 'third_party/kineto' 2025-08-14T21:19:09.3831185Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-08-14T21:19:09.3845538Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-08-14T21:19:09.3890792Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-08-14T21:19:09.3906764Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-08-14T21:19:09.3953838Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-08-14T21:19:09.3969501Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-08-14T21:19:09.4016987Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-08-14T21:19:09.4034263Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-08-14T21:19:09.4077773Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-08-14T21:19:09.4094707Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-08-14T21:19:09.4140628Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-08-14T21:19:09.4155513Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-08-14T21:19:09.4200932Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-08-14T21:19:09.4220067Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-08-14T21:19:09.4263526Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-08-14T21:19:09.4276021Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-08-14T21:19:09.4321074Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-08-14T21:19:09.4338584Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-08-14T21:19:09.4383009Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-08-14T21:19:09.4402472Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-08-14T21:19:09.4448908Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-08-14T21:19:09.4468621Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-08-14T21:19:09.4514376Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-08-14T21:19:09.4530975Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-08-14T21:19:09.4574289Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-08-14T21:19:09.4598587Z Entering 'third_party/kleidiai' 2025-08-14T21:19:09.4641048Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-08-14T21:19:09.4658414Z Entering 'third_party/mimalloc' 2025-08-14T21:19:09.4703051Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-08-14T21:19:09.4721480Z Entering 'third_party/nlohmann' 2025-08-14T21:19:09.4767262Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-08-14T21:19:09.4782469Z Entering 'third_party/onnx' 2025-08-14T21:19:09.4832359Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-08-14T21:19:09.4859805Z Entering 'third_party/onnx/third_party/pybind11' 2025-08-14T21:19:09.4901286Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-08-14T21:19:09.4922251Z Entering 'third_party/opentelemetry-cpp' 2025-08-14T21:19:09.4968448Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-08-14T21:19:09.4983190Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-08-14T21:19:09.5029901Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-08-14T21:19:09.5045918Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-08-14T21:19:09.5092180Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-08-14T21:19:09.5110259Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-08-14T21:19:09.5152204Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-08-14T21:19:09.5170221Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-08-14T21:19:09.5215918Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-08-14T21:19:09.5233626Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-08-14T21:19:09.5278827Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-08-14T21:19:09.5297827Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-08-14T21:19:09.5342582Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-08-14T21:19:09.5357772Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-08-14T21:19:09.5405608Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-08-14T21:19:09.5419415Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-08-14T21:19:09.5464127Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-08-14T21:19:09.5481915Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-08-14T21:19:09.5525929Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-08-14T21:19:09.5545569Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-08-14T21:19:09.5593095Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-08-14T21:19:09.5628074Z Entering 'third_party/pocketfft' 2025-08-14T21:19:09.5672873Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-08-14T21:19:09.5689861Z Entering 'third_party/protobuf' 2025-08-14T21:19:09.5735735Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-08-14T21:19:09.5753170Z Entering 'third_party/protobuf/third_party/benchmark' 2025-08-14T21:19:09.5796634Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-08-14T21:19:09.5813922Z Entering 'third_party/protobuf/third_party/googletest' 2025-08-14T21:19:09.5857998Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-08-14T21:19:09.5876542Z Entering 'third_party/psimd' 2025-08-14T21:19:09.5922317Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-08-14T21:19:09.5939692Z Entering 'third_party/pthreadpool' 2025-08-14T21:19:09.5983003Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-08-14T21:19:09.6004854Z Entering 'third_party/pybind11' 2025-08-14T21:19:09.6051022Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-08-14T21:19:09.6067132Z Entering 'third_party/python-peachpy' 2025-08-14T21:19:09.6110983Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-08-14T21:19:09.6128576Z Entering 'third_party/sleef' 2025-08-14T21:19:09.6171903Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-08-14T21:19:09.6185924Z Entering 'third_party/tensorpipe' 2025-08-14T21:19:09.6232210Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-08-14T21:19:09.6248001Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-08-14T21:19:09.6292604Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-08-14T21:19:09.6310053Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-08-14T21:19:09.6355986Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-08-14T21:19:09.6371378Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-08-14T21:19:09.6415779Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-08-14T21:19:09.6433246Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-08-14T21:19:09.6476135Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-08-14T21:19:09.6492405Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-08-14T21:19:09.6537134Z file:/home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-08-14T21:19:09.7682823Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-08-14T21:19:09.7971625Z Entering 'android/libs/fbjni' 2025-08-14T21:19:09.8010182Z Entering 'third_party/FP16' 2025-08-14T21:19:09.8048171Z Entering 'third_party/FXdiv' 2025-08-14T21:19:09.8084159Z Entering 'third_party/NNPACK' 2025-08-14T21:19:09.8123028Z Entering 'third_party/NVTX' 2025-08-14T21:19:09.8160743Z Entering 'third_party/VulkanMemoryAllocator' 2025-08-14T21:19:09.8202218Z Entering 'third_party/XNNPACK' 2025-08-14T21:19:09.8252638Z Entering 'third_party/aiter' 2025-08-14T21:19:09.8291222Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-08-14T21:19:09.8336618Z Entering 'third_party/benchmark' 2025-08-14T21:19:09.8374297Z Entering 'third_party/composable_kernel' 2025-08-14T21:19:09.8419906Z Entering 'third_party/cpp-httplib' 2025-08-14T21:19:09.8457673Z Entering 'third_party/cpuinfo' 2025-08-14T21:19:09.8496132Z Entering 'third_party/cudnn_frontend' 2025-08-14T21:19:09.8535079Z Entering 'third_party/cutlass' 2025-08-14T21:19:09.8579827Z Entering 'third_party/fbgemm' 2025-08-14T21:19:09.8619569Z Entering 'third_party/fbgemm/external/asmjit' 2025-08-14T21:19:09.8655902Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-08-14T21:19:09.8700161Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-08-14T21:19:09.8737924Z Entering 'third_party/fbgemm/external/cutlass' 2025-08-14T21:19:09.8779304Z Entering 'third_party/fbgemm/external/googletest' 2025-08-14T21:19:09.8819346Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-08-14T21:19:09.8855608Z Entering 'third_party/fbgemm/external/json' 2025-08-14T21:19:09.8896573Z Entering 'third_party/flash-attention' 2025-08-14T21:19:09.8933182Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-08-14T21:19:09.8975329Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-08-14T21:19:09.9021075Z Entering 'third_party/flatbuffers' 2025-08-14T21:19:09.9063151Z Entering 'third_party/fmt' 2025-08-14T21:19:09.9101543Z Entering 'third_party/gemmlowp/gemmlowp' 2025-08-14T21:19:09.9139059Z Entering 'third_party/gloo' 2025-08-14T21:19:09.9176306Z Entering 'third_party/googletest' 2025-08-14T21:19:09.9215922Z Entering 'third_party/ideep' 2025-08-14T21:19:09.9252109Z Entering 'third_party/ideep/mkl-dnn' 2025-08-14T21:19:09.9297627Z Entering 'third_party/ittapi' 2025-08-14T21:19:09.9334124Z Entering 'third_party/kineto' 2025-08-14T21:19:09.9370553Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-08-14T21:19:09.9407223Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-08-14T21:19:09.9444967Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-08-14T21:19:09.9479488Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-08-14T21:19:09.9518441Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-08-14T21:19:09.9554279Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-08-14T21:19:09.9597487Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-08-14T21:19:09.9634463Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-08-14T21:19:09.9671195Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-08-14T21:19:09.9710434Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-08-14T21:19:09.9750935Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-08-14T21:19:09.9785673Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-08-14T21:19:09.9825917Z Entering 'third_party/kleidiai' 2025-08-14T21:19:09.9864255Z Entering 'third_party/mimalloc' 2025-08-14T21:19:09.9902490Z Entering 'third_party/nlohmann' 2025-08-14T21:19:09.9941992Z Entering 'third_party/onnx' 2025-08-14T21:19:09.9991968Z Entering 'third_party/onnx/third_party/pybind11' 2025-08-14T21:19:10.0031549Z Entering 'third_party/opentelemetry-cpp' 2025-08-14T21:19:10.0069106Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-08-14T21:19:10.0105801Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-08-14T21:19:10.0143503Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-08-14T21:19:10.0180463Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-08-14T21:19:10.0220752Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-08-14T21:19:10.0257847Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-08-14T21:19:10.0293993Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-08-14T21:19:10.0329842Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-08-14T21:19:10.0367765Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-08-14T21:19:10.0409430Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-08-14T21:19:10.0460716Z Entering 'third_party/pocketfft' 2025-08-14T21:19:10.0498749Z Entering 'third_party/protobuf' 2025-08-14T21:19:10.0535717Z Entering 'third_party/protobuf/third_party/benchmark' 2025-08-14T21:19:10.0572287Z Entering 'third_party/protobuf/third_party/googletest' 2025-08-14T21:19:10.0612668Z Entering 'third_party/psimd' 2025-08-14T21:19:10.0649401Z Entering 'third_party/pthreadpool' 2025-08-14T21:19:10.0685609Z Entering 'third_party/pybind11' 2025-08-14T21:19:10.0725514Z Entering 'third_party/python-peachpy' 2025-08-14T21:19:10.0761663Z Entering 'third_party/sleef' 2025-08-14T21:19:10.0800153Z Entering 'third_party/tensorpipe' 2025-08-14T21:19:10.0837153Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-08-14T21:19:10.0872565Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-08-14T21:19:10.0910507Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-08-14T21:19:10.0948039Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-08-14T21:19:10.0981638Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-08-14T21:19:10.1040185Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-08-14T21:19:10.1332280Z Entering 'android/libs/fbjni' 2025-08-14T21:19:10.1368770Z Entering 'third_party/FP16' 2025-08-14T21:19:10.1439167Z Entering 'third_party/FXdiv' 2025-08-14T21:19:10.1475019Z Entering 'third_party/NNPACK' 2025-08-14T21:19:10.1513880Z Entering 'third_party/NVTX' 2025-08-14T21:19:10.1553011Z Entering 'third_party/VulkanMemoryAllocator' 2025-08-14T21:19:10.1590197Z Entering 'third_party/XNNPACK' 2025-08-14T21:19:10.1638491Z Entering 'third_party/aiter' 2025-08-14T21:19:10.1675927Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-08-14T21:19:10.1720310Z Entering 'third_party/benchmark' 2025-08-14T21:19:10.1757428Z Entering 'third_party/composable_kernel' 2025-08-14T21:19:10.1800611Z Entering 'third_party/cpp-httplib' 2025-08-14T21:19:10.1838922Z Entering 'third_party/cpuinfo' 2025-08-14T21:19:10.1875075Z Entering 'third_party/cudnn_frontend' 2025-08-14T21:19:10.1915433Z Entering 'third_party/cutlass' 2025-08-14T21:19:10.1957533Z Entering 'third_party/fbgemm' 2025-08-14T21:19:10.1996970Z Entering 'third_party/fbgemm/external/asmjit' 2025-08-14T21:19:10.2033724Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-08-14T21:19:10.2074949Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-08-14T21:19:10.2113974Z Entering 'third_party/fbgemm/external/cutlass' 2025-08-14T21:19:10.2155372Z Entering 'third_party/fbgemm/external/googletest' 2025-08-14T21:19:10.2194668Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-08-14T21:19:10.2234221Z Entering 'third_party/fbgemm/external/json' 2025-08-14T21:19:10.2272832Z Entering 'third_party/flash-attention' 2025-08-14T21:19:10.2310865Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-08-14T21:19:10.2352608Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-08-14T21:19:10.2399051Z Entering 'third_party/flatbuffers' 2025-08-14T21:19:10.2437037Z Entering 'third_party/fmt' 2025-08-14T21:19:10.2476395Z Entering 'third_party/gemmlowp/gemmlowp' 2025-08-14T21:19:10.2514057Z Entering 'third_party/gloo' 2025-08-14T21:19:10.2553061Z Entering 'third_party/googletest' 2025-08-14T21:19:10.2589240Z Entering 'third_party/ideep' 2025-08-14T21:19:10.2627083Z Entering 'third_party/ideep/mkl-dnn' 2025-08-14T21:19:10.2670989Z Entering 'third_party/ittapi' 2025-08-14T21:19:10.2709230Z Entering 'third_party/kineto' 2025-08-14T21:19:10.2749834Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-08-14T21:19:10.2783006Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-08-14T21:19:10.2821327Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-08-14T21:19:10.2859593Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-08-14T21:19:10.2896452Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-08-14T21:19:10.2931631Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-08-14T21:19:10.2971667Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-08-14T21:19:10.3012216Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-08-14T21:19:10.3049531Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-08-14T21:19:10.3083674Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-08-14T21:19:10.3126483Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-08-14T21:19:10.3164634Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-08-14T21:19:10.3211133Z Entering 'third_party/kleidiai' 2025-08-14T21:19:10.3250202Z Entering 'third_party/mimalloc' 2025-08-14T21:19:10.3288413Z Entering 'third_party/nlohmann' 2025-08-14T21:19:10.3327790Z Entering 'third_party/onnx' 2025-08-14T21:19:10.3375639Z Entering 'third_party/onnx/third_party/pybind11' 2025-08-14T21:19:10.3415998Z Entering 'third_party/opentelemetry-cpp' 2025-08-14T21:19:10.3453333Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-08-14T21:19:10.3491309Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-08-14T21:19:10.3528003Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-08-14T21:19:10.3565342Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-08-14T21:19:10.3603079Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-08-14T21:19:10.3640105Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-08-14T21:19:10.3676747Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-08-14T21:19:10.3715437Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-08-14T21:19:10.3753733Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-08-14T21:19:10.3796045Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-08-14T21:19:10.3847112Z Entering 'third_party/pocketfft' 2025-08-14T21:19:10.3882152Z Entering 'third_party/protobuf' 2025-08-14T21:19:10.3922180Z Entering 'third_party/protobuf/third_party/benchmark' 2025-08-14T21:19:10.3959727Z Entering 'third_party/protobuf/third_party/googletest' 2025-08-14T21:19:10.4002762Z Entering 'third_party/psimd' 2025-08-14T21:19:10.4040308Z Entering 'third_party/pthreadpool' 2025-08-14T21:19:10.4076432Z Entering 'third_party/pybind11' 2025-08-14T21:19:10.4116310Z Entering 'third_party/python-peachpy' 2025-08-14T21:19:10.4155730Z Entering 'third_party/sleef' 2025-08-14T21:19:10.4195248Z Entering 'third_party/tensorpipe' 2025-08-14T21:19:10.4232800Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-08-14T21:19:10.4268743Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-08-14T21:19:10.4309285Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-08-14T21:19:10.4350720Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-08-14T21:19:10.4382897Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-08-14T21:19:10.4448166Z ##[endgroup] 2025-08-14T21:19:10.4478274Z [command]/usr/bin/git log -1 --format=%H 2025-08-14T21:19:10.4502162Z 1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:19:10.4670885Z Prepare all required actions 2025-08-14T21:19:10.4671281Z Getting action download info 2025-08-14T21:19:10.5928480Z ##[group]Run ./.github/actions/setup-linux 2025-08-14T21:19:10.5928687Z env: 2025-08-14T21:19:10.5928829Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:10.5928989Z ##[endgroup] 2025-08-14T21:19:10.5965078Z ##[group]Run set -euo pipefail 2025-08-14T21:19:10.5965315Z set -euo pipefail 2025-08-14T21:19:10.5965504Z function get_ec2_metadata() { 2025-08-14T21:19:10.5965734Z  # Pulled from instance metadata endpoint for EC2 2025-08-14T21:19:10.5966099Z  # see https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/instancedata-data-retrieval.html 2025-08-14T21:19:10.5966410Z  category=$1 2025-08-14T21:19:10.5966622Z  # If it is GCP runner (runner name contains gcp), do not run this 2025-08-14T21:19:10.5966877Z  runner_name_str=i-0819c8fa835cec089 2025-08-14T21:19:10.5967119Z  if [[ -f /.inarc ]]; then 2025-08-14T21:19:10.5967327Z  echo "ARC Runner, no info on ec2 metadata" 2025-08-14T21:19:10.5967557Z  elif [[ $runner_name_str == *"gcp"* ]]; then 2025-08-14T21:19:10.5967827Z  echo "Runner is from Google Cloud Platform, No info on ec2 metadata" 2025-08-14T21:19:10.5968065Z  else 2025-08-14T21:19:10.5968530Z  curl -H "X-aws-ec2-metadata-token: $(curl -s -X PUT "http://169.254.169.254/latest/api/token" -H "X-aws-ec2-metadata-token-ttl-seconds: 30")" -fsSL "http://169.254.169.254/latest/meta-data/${category}" 2025-08-14T21:19:10.5969013Z  fi 2025-08-14T21:19:10.5969153Z } 2025-08-14T21:19:10.5969314Z echo "ami-id: $(get_ec2_metadata ami-id)" 2025-08-14T21:19:10.5969567Z echo "instance-id: $(get_ec2_metadata instance-id)" 2025-08-14T21:19:10.5969839Z echo "instance-type: $(get_ec2_metadata instance-type)" 2025-08-14T21:19:10.5970074Z echo "system info $(uname -a)" 2025-08-14T21:19:10.5977649Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:10.5977872Z env: 2025-08-14T21:19:10.5978015Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:10.5978171Z ##[endgroup] 2025-08-14T21:19:10.6110659Z ami-id: ami-05ffe3c48a9991133 2025-08-14T21:19:10.6203920Z instance-id: i-0819c8fa835cec089 2025-08-14T21:19:10.6291518Z instance-type: m7i-flex.8xlarge 2025-08-14T21:19:10.6302542Z system info Linux ip-10-0-18-145.ec2.internal 6.1.141-155.222.amzn2023.x86_64 #1 SMP PREEMPT_DYNAMIC Tue Jun 17 10:29:47 UTC 2025 x86_64 x86_64 x86_64 GNU/Linux 2025-08-14T21:19:10.6325643Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-08-14T21:19:10.6326160Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-08-14T21:19:10.6330062Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:10.6330289Z env: 2025-08-14T21:19:10.6330435Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:10.6330599Z ##[endgroup] 2025-08-14T21:19:10.6383274Z ##[group]Run if systemctl is-active --quiet docker; then 2025-08-14T21:19:10.6383549Z if systemctl is-active --quiet docker; then 2025-08-14T21:19:10.6383778Z  echo "Docker daemon is running..."; 2025-08-14T21:19:10.6384044Z else 2025-08-14T21:19:10.6384257Z  echo "Starting docker daemon..." && sudo systemctl start docker; 2025-08-14T21:19:10.6384494Z fi 2025-08-14T21:19:10.6388302Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:10.6388516Z env: 2025-08-14T21:19:10.6388660Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:10.6388823Z ##[endgroup] 2025-08-14T21:19:10.6504019Z Docker daemon is running... 2025-08-14T21:19:10.6569634Z ##[group]Run nick-fields/retry@v3.0.0 2025-08-14T21:19:10.6569821Z with: 2025-08-14T21:19:10.6569954Z shell: bash 2025-08-14T21:19:10.6570211Z timeout_minutes: 5 2025-08-14T21:19:10.6570373Z max_attempts: 3 2025-08-14T21:19:10.6570528Z retry_wait_seconds: 30 2025-08-14T21:19:10.6571759Z command: AWS_ACCOUNT_ID=$(aws sts get-caller-identity|grep Account|cut -f4 -d\") aws ecr get-login-password --region "$AWS_DEFAULT_REGION" | docker login --username AWS \ --password-stdin "$AWS_ACCOUNT_ID.dkr.ecr.$AWS_DEFAULT_REGION.amazonaws.com" # For LF Runners we need to make sure we also login to Meta's ECR docker registry too. META_AWS_ACCOUNT_ID=308535385114 if [ "$AWS_ACCOUNT_ID" != "$META_AWS_ACCOUNT_ID" ] ; then aws ecr get-login-password --region "$AWS_DEFAULT_REGION" | docker login --username AWS \ --password-stdin "$META_AWS_ACCOUNT_ID.dkr.ecr.$AWS_DEFAULT_REGION.amazonaws.com" fi 2025-08-14T21:19:10.6572956Z polling_interval_seconds: 1 2025-08-14T21:19:10.6573141Z warning_on_retry: true 2025-08-14T21:19:10.6573308Z continue_on_error: false 2025-08-14T21:19:10.6573468Z env: 2025-08-14T21:19:10.6573614Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:10.6573785Z AWS_RETRY_MODE: standard 2025-08-14T21:19:10.6573944Z AWS_MAX_ATTEMPTS: 5 2025-08-14T21:19:10.6574112Z AWS_DEFAULT_REGION: us-east-1 2025-08-14T21:19:10.6574292Z ##[endgroup] 2025-08-14T21:19:11.5905594Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2025-08-14T21:19:11.5909386Z Configure a credential helper to remove this warning. See 2025-08-14T21:19:11.5912926Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2025-08-14T21:19:11.5914468Z 2025-08-14T21:19:11.5914758Z Login Succeeded 2025-08-14T21:19:11.8187714Z Command completed after 1 attempt(s). 2025-08-14T21:19:11.8253338Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-08-14T21:19:11.8253670Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-08-14T21:19:11.8253937Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-08-14T21:19:11.8259258Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:11.8259482Z env: 2025-08-14T21:19:11.8259630Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:11.8259789Z ##[endgroup] 2025-08-14T21:19:11.8341105Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2025-08-14T21:19:11.8341474Z # ignore expansion of "docker ps -q" since it could be empty 2025-08-14T21:19:11.8341743Z # shellcheck disable=SC2046 2025-08-14T21:19:11.8341949Z docker stop $(docker ps -q) || true 2025-08-14T21:19:11.8342153Z # Prune all of the docker images 2025-08-14T21:19:11.8342352Z docker system prune -af 2025-08-14T21:19:11.8346130Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:11.8346349Z env: 2025-08-14T21:19:11.8346494Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:11.8346658Z ##[endgroup] 2025-08-14T21:19:11.8827717Z "docker stop" requires at least 1 argument. 2025-08-14T21:19:11.8829531Z See 'docker stop --help'. 2025-08-14T21:19:11.8829788Z 2025-08-14T21:19:11.8834275Z Usage: docker stop [OPTIONS] CONTAINER [CONTAINER...] 2025-08-14T21:19:11.8835887Z 2025-08-14T21:19:11.8840462Z Stop one or more running containers 2025-08-14T21:19:11.9094831Z Total reclaimed space: 0B 2025-08-14T21:19:11.9133118Z ##[group]Run set +e 2025-08-14T21:19:11.9133301Z set +e 2025-08-14T21:19:11.9133451Z set -x 2025-08-14T21:19:11.9133693Z  2025-08-14T21:19:11.9133846Z PT_DOMAIN=download.pytorch.org 2025-08-14T21:19:11.9134184Z # TODO: Flaky access to download.pytorch.org https://github.com/pytorch/pytorch/issues/100400, 2025-08-14T21:19:11.9134593Z # cleaning this up once the issue is fixed. There are more than one resolved IP here, the last 2025-08-14T21:19:11.9134883Z # one is returned at random 2025-08-14T21:19:11.9135148Z RESOLVED_IP=$(dig -4 +short "${PT_DOMAIN}" | tail -n1) 2025-08-14T21:19:11.9135360Z  2025-08-14T21:19:11.9135587Z if [ -z "${RESOLVED_IP}" ]; then 2025-08-14T21:19:11.9135843Z  echo "Couldn't resolve ${PT_DOMAIN}, retrying with Google DNS..." 2025-08-14T21:19:11.9136129Z  RESOLVED_IP=$(dig -4 +short "${PT_DOMAIN}" @8.8.8.8 | tail -n1) 2025-08-14T21:19:11.9136350Z  2025-08-14T21:19:11.9136501Z  if [ -z "${RESOLVED_IP}" ]; then 2025-08-14T21:19:11.9136728Z  echo "Couldn't resolve ${PT_DOMAIN}, exiting..." 2025-08-14T21:19:11.9136936Z  exit 1 2025-08-14T21:19:11.9137081Z  fi 2025-08-14T21:19:11.9137217Z fi 2025-08-14T21:19:11.9137341Z  2025-08-14T21:19:11.9137504Z if grep -r "${PT_DOMAIN}" /etc/hosts; then 2025-08-14T21:19:11.9137719Z  # Clean up any old records first 2025-08-14T21:19:11.9137923Z  sudo sed -i "/${PT_DOMAIN}/d" /etc/hosts 2025-08-14T21:19:11.9138109Z fi 2025-08-14T21:19:11.9138238Z  2025-08-14T21:19:11.9138421Z echo "${RESOLVED_IP} ${PT_DOMAIN}" | sudo tee -a /etc/hosts 2025-08-14T21:19:11.9138647Z cat /etc/hosts 2025-08-14T21:19:11.9142265Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:11.9142485Z env: 2025-08-14T21:19:11.9142622Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:11.9142787Z ##[endgroup] 2025-08-14T21:19:11.9162542Z + PT_DOMAIN=download.pytorch.org 2025-08-14T21:19:11.9170652Z ++ dig -4 +short download.pytorch.org 2025-08-14T21:19:11.9173793Z ++ tail -n1 2025-08-14T21:19:11.9903196Z + RESOLVED_IP=18.160.10.28 2025-08-14T21:19:11.9907253Z + '[' -z 18.160.10.28 ']' 2025-08-14T21:19:11.9910948Z + grep -r download.pytorch.org /etc/hosts 2025-08-14T21:19:11.9918855Z + echo '18.160.10.28 download.pytorch.org' 2025-08-14T21:19:11.9922800Z + sudo tee -a /etc/hosts 2025-08-14T21:19:12.2583257Z 18.160.10.28 download.pytorch.org 2025-08-14T21:19:12.2609251Z + cat /etc/hosts 2025-08-14T21:19:12.2619407Z 127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4 2025-08-14T21:19:12.2623954Z ::1 localhost6 localhost6.localdomain6 2025-08-14T21:19:12.2624201Z 18.160.10.28 download.pytorch.org 2025-08-14T21:19:12.2728021Z ##[group]Run pytorch/test-infra/.github/actions/calculate-docker-image@main 2025-08-14T21:19:12.2728282Z with: 2025-08-14T21:19:12.2728759Z docker-image-name: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.2729299Z use-custom-docker-registry: true 2025-08-14T21:19:12.2729496Z docker-build-dir: .ci/docker 2025-08-14T21:19:12.2729671Z docker-build-script: ./build.sh 2025-08-14T21:19:12.2729850Z working-directory: . 2025-08-14T21:19:12.2730062Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:12.2730322Z force-push: false 2025-08-14T21:19:12.2730479Z env: 2025-08-14T21:19:12.2730652Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:12.2730837Z ##[endgroup] 2025-08-14T21:19:12.2755036Z ##[group]Run set -ex 2025-08-14T21:19:12.2755243Z set -ex 2025-08-14T21:19:12.2755392Z  2025-08-14T21:19:12.2755669Z # If the docker build directory or the build script doesn't exist, the action will 2025-08-14T21:19:12.2756036Z # gracefully return the docker image name as it is. Pulling docker image in Linux 2025-08-14T21:19:12.2756348Z # job could then download the pre-built image as usual 2025-08-14T21:19:12.2756829Z if [[ -d "${DOCKER_BUILD_DIR}" ]] && [[ -f "${DOCKER_BUILD_DIR}/${DOCKER_BUILD_SCRIPT}" ]] && [[ "${USE_CUSTOM_DOCKER_REGISTRY}" == "true" ]]; then 2025-08-14T21:19:12.2757180Z  echo "skip=false" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2757369Z else 2025-08-14T21:19:12.2757534Z  echo "skip=true" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2757795Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2758028Z  2025-08-14T21:19:12.2758336Z  echo "Not using custom ECR registry. Either it was not requested or there is no Docker build script in the ${REPO_NAME} repo..." 2025-08-14T21:19:12.2758679Z  exit 0 2025-08-14T21:19:12.2758820Z fi 2025-08-14T21:19:12.2758948Z  2025-08-14T21:19:12.2759149Z if [[ "${DOCKER_IMAGE_NAME}" == *"${DOCKER_REGISTRY}/${REPO_NAME}"* ]]; then 2025-08-14T21:19:12.2759484Z  # The docker image name already includes the ECR prefix and tag, so we can just 2025-08-14T21:19:12.2759783Z  # use it as it is, but first let's extract the tag 2025-08-14T21:19:12.2760053Z  DOCKER_TAG=$(echo "${DOCKER_IMAGE_NAME}" | awk -F '[:,]' '{print $2}') 2025-08-14T21:19:12.2760342Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2760619Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2760848Z else 2025-08-14T21:19:12.2761011Z  if [[ "${DOCKER_IMAGE_NAME}" == *:* ]]; then 2025-08-14T21:19:12.2761238Z  CUSTOM_TAG_PREFIX=${DOCKER_IMAGE_NAME#*:} 2025-08-14T21:19:12.2761469Z  DOCKER_IMAGE_NAME=${DOCKER_IMAGE_NAME%%:*} 2025-08-14T21:19:12.2761659Z  fi 2025-08-14T21:19:12.2761924Z  DOCKER_TAG=${CUSTOM_TAG_PREFIX:+${CUSTOM_TAG_PREFIX}-}$(git rev-parse HEAD:"${DOCKER_BUILD_DIR}") 2025-08-14T21:19:12.2762266Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2762614Z  echo "docker-image=${DOCKER_REGISTRY}/${REPO_NAME}/${DOCKER_IMAGE_NAME}:${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2762986Z  echo "custom-tag-prefix=${CUSTOM_TAG_PREFIX}" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2763230Z fi 2025-08-14T21:19:12.2768796Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:12.2769012Z env: 2025-08-14T21:19:12.2769161Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:12.2769329Z REPO_NAME: pytorch 2025-08-14T21:19:12.2769895Z DOCKER_IMAGE_NAME: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.2770398Z DOCKER_BUILD_DIR: .ci/docker 2025-08-14T21:19:12.2770585Z DOCKER_BUILD_SCRIPT: ./build.sh 2025-08-14T21:19:12.2770830Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:12.2771073Z USE_CUSTOM_DOCKER_REGISTRY: true 2025-08-14T21:19:12.2771259Z CUSTOM_TAG_PREFIX: 2025-08-14T21:19:12.2771420Z ##[endgroup] 2025-08-14T21:19:12.2791863Z + [[ -d .ci/docker ]] 2025-08-14T21:19:12.2793445Z + [[ -f .ci/docker/./build.sh ]] 2025-08-14T21:19:12.2793674Z + [[ true == \t\r\u\e ]] 2025-08-14T21:19:12.2793834Z + echo skip=false 2025-08-14T21:19:12.2794461Z + [[ 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe == *\3\0\8\5\3\5\3\8\5\1\1\4\.\d\k\r\.\e\c\r\.\u\s\-\e\a\s\t\-\1\.\a\m\a\z\o\n\a\w\s\.\c\o\m\/\p\y\t\o\r\c\h* ]] 2025-08-14T21:19:12.2800506Z ++ echo 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.2801368Z ++ awk -F '[:,]' '{print $2}' 2025-08-14T21:19:12.2820410Z + DOCKER_TAG=pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.2821343Z + echo docker-tag=pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.2822056Z + echo docker-image=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.2852062Z ##[group]Run set +e 2025-08-14T21:19:12.2852269Z set +e 2025-08-14T21:19:12.2852417Z set -x 2025-08-14T21:19:12.2852559Z  2025-08-14T21:19:12.2852687Z login() { 2025-08-14T21:19:12.2852971Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-08-14T21:19:12.2853272Z } 2025-08-14T21:19:12.2853399Z  2025-08-14T21:19:12.2853531Z retry () { 2025-08-14T21:19:12.2853699Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-08-14T21:19:12.2853879Z } 2025-08-14T21:19:12.2854015Z  2025-08-14T21:19:12.2854160Z retry login "${DOCKER_REGISTRY}" 2025-08-14T21:19:12.2854339Z  2025-08-14T21:19:12.2854469Z START_TIME=$(date +%s) 2025-08-14T21:19:12.2854654Z # Wait up to 120 minutes 2025-08-14T21:19:12.2854876Z while [[ $(( $(date +%s) - 7200 )) -lt $START_TIME ]]; do 2025-08-14T21:19:12.2855146Z  # Check if image already exists, if it does then skip building it 2025-08-14T21:19:12.2855427Z  if docker manifest inspect "${DOCKER_IMAGE}"; then 2025-08-14T21:19:12.2855637Z  exit 0 2025-08-14T21:19:12.2855777Z  fi 2025-08-14T21:19:12.2855912Z  2025-08-14T21:19:12.2856142Z  # NB: This flag is used by Docker build workflow to push the image to ECR, so we can 2025-08-14T21:19:12.2856501Z  # use this to differentiate between the Docker build and regular build jobs. For the 2025-08-14T21:19:12.2856854Z  # latter, it will wait for the Docker images to become available before continuing 2025-08-14T21:19:12.2857155Z  if [ "${DOCKER_PUSH:-false}" == "true" ]; then 2025-08-14T21:19:12.2857393Z  # It's a Docker build job, let's build the image 2025-08-14T21:19:12.2857592Z  break 2025-08-14T21:19:12.2857730Z  else 2025-08-14T21:19:12.2857937Z  # It's a regular build job, wait for the image to become available 2025-08-14T21:19:12.2858171Z  sleep 300 2025-08-14T21:19:12.2858319Z  fi 2025-08-14T21:19:12.2858459Z done 2025-08-14T21:19:12.2858595Z  2025-08-14T21:19:12.2858795Z # NB: This part requires a full checkout. Otherwise, the merge base will 2025-08-14T21:19:12.2859186Z # be empty. The default action would be to continue rebuild the image 2025-08-14T21:19:12.2859478Z if [[ "$BASE_REVISION" = "$(git rev-parse HEAD)" ]]; then 2025-08-14T21:19:12.2859736Z  # if we're on the base branch then use the parent commit 2025-08-14T21:19:12.2859968Z  MERGE_BASE=$(git rev-parse HEAD~) 2025-08-14T21:19:12.2860154Z else 2025-08-14T21:19:12.2860351Z  # otherwise we're on a PR, so use the most recent base commit 2025-08-14T21:19:12.2860614Z  MERGE_BASE=$(git merge-base HEAD "$BASE_REVISION") 2025-08-14T21:19:12.2860821Z fi 2025-08-14T21:19:12.2860954Z  2025-08-14T21:19:12.2861102Z if [[ -z "${MERGE_BASE}" ]]; then 2025-08-14T21:19:12.2861307Z  echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2861498Z  2025-08-14T21:19:12.2861759Z  echo "Finding merge base only works with full checkout, please set fetch-depth to 0, continuing ..." 2025-08-14T21:19:12.2862048Z  exit 0 2025-08-14T21:19:12.2862189Z fi 2025-08-14T21:19:12.2862321Z  2025-08-14T21:19:12.2862497Z if ! git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}"; then 2025-08-14T21:19:12.2862922Z  echo "Directory '${DOCKER_BUILD_DIR}' not found in commit $MERGE_BASE, you should rebase onto a more recent commit" 2025-08-14T21:19:12.2863237Z  exit 1 2025-08-14T21:19:12.2863374Z fi 2025-08-14T21:19:12.2863495Z  2025-08-14T21:19:12.2863706Z PREVIOUS_DOCKER_TAG=$(git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}") 2025-08-14T21:19:12.2864064Z # If no image exists but the hash is the same as the previous hash then we should error out here 2025-08-14T21:19:12.2864384Z if [[ "${PREVIOUS_DOCKER_TAG}" == "${DOCKER_TAG}" ]]; then 2025-08-14T21:19:12.2864874Z  echo "WARNING: Something has gone wrong and the previous image isn't available for the merge-base of your branch" 2025-08-14T21:19:12.2865291Z  echo " Will re-build docker image to store in local cache, TTS may be longer" 2025-08-14T21:19:12.2865545Z fi 2025-08-14T21:19:12.2865671Z  2025-08-14T21:19:12.2865836Z echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-08-14T21:19:12.2869887Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:12.2870110Z env: 2025-08-14T21:19:12.2870256Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:12.2870420Z DOCKER_BUILD_DIR: .ci/docker 2025-08-14T21:19:12.2870637Z BASE_REVISION: 1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:19:12.2871177Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.2871843Z DOCKER_TAG: pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.2872248Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:12.2872477Z DOCKER_PUSH: 2025-08-14T21:19:12.2872620Z ##[endgroup] 2025-08-14T21:19:12.2894095Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:12.2895904Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:12.2896323Z + aws ecr get-login-password --region us-east-1 2025-08-14T21:19:12.2898276Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:12.6805956Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2025-08-14T21:19:12.6807545Z Login Succeeded 2025-08-14T21:19:12.6810081Z Configure a credential helper to remove this warning. See 2025-08-14T21:19:12.6814204Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2025-08-14T21:19:12.6818003Z 2025-08-14T21:19:12.6824290Z ++ date +%s 2025-08-14T21:19:12.6835379Z + START_TIME=1755206352 2025-08-14T21:19:12.6838889Z ++ date +%s 2025-08-14T21:19:12.6847172Z + [[ 1755199152 -lt 1755206352 ]] 2025-08-14T21:19:12.6850949Z + docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:12.9222755Z { 2025-08-14T21:19:12.9224815Z "schemaVersion": 2, 2025-08-14T21:19:12.9229088Z "mediaType": "application/vnd.docker.distribution.manifest.v2+json", 2025-08-14T21:19:12.9233209Z "config": { 2025-08-14T21:19:12.9235036Z + exit 0 2025-08-14T21:19:12.9235285Z "mediaType": "application/vnd.docker.container.image.v1+json", 2025-08-14T21:19:12.9235541Z "size": 30151, 2025-08-14T21:19:12.9235796Z "digest": "sha256:0899ae453036ee7a91795ea95b1db61000579eeb74b140edab5976919ee64bbe" 2025-08-14T21:19:12.9236072Z }, 2025-08-14T21:19:12.9236195Z "layers": [ 2025-08-14T21:19:12.9236330Z { 2025-08-14T21:19:12.9236540Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9236796Z "size": 30448173, 2025-08-14T21:19:12.9237053Z "digest": "sha256:660ffc76f83b006444a5731b215acc2e35138d8be5cac8ed1ffd40f947117495" 2025-08-14T21:19:12.9237397Z }, 2025-08-14T21:19:12.9237519Z { 2025-08-14T21:19:12.9237714Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9238173Z "size": 1554, 2025-08-14T21:19:12.9238431Z "digest": "sha256:c7b4a852a45516e27a9256df90878663d770f96d271d6155d43be78cc5225eef" 2025-08-14T21:19:12.9238741Z }, 2025-08-14T21:19:12.9238876Z { 2025-08-14T21:19:12.9239087Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9239315Z "size": 313280151, 2025-08-14T21:19:12.9239559Z "digest": "sha256:e5a28988c8932eb5797557621582a064ce48651dbb5eaed379e9978535daccb9" 2025-08-14T21:19:12.9239815Z }, 2025-08-14T21:19:12.9239936Z { 2025-08-14T21:19:12.9240119Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9240345Z "size": 793, 2025-08-14T21:19:12.9240591Z "digest": "sha256:76a69b57b6837bef07dbc1b481cf28a62dfd7c7063219d9f6e0d0d63067653c7" 2025-08-14T21:19:12.9240845Z }, 2025-08-14T21:19:12.9240967Z { 2025-08-14T21:19:12.9241157Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9241382Z "size": 106, 2025-08-14T21:19:12.9241624Z "digest": "sha256:5c785dcb4cdbf1f2ceffe4d1d8e85d73225a56d0236e7ed6e36a95c836996052" 2025-08-14T21:19:12.9241885Z }, 2025-08-14T21:19:12.9241997Z { 2025-08-14T21:19:12.9242185Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9242485Z "size": 704, 2025-08-14T21:19:12.9242721Z "digest": "sha256:836ab08052e8eb2bae68e69ae086fd23a5f04a8491c320718ab47f84f03aebb1" 2025-08-14T21:19:12.9242975Z }, 2025-08-14T21:19:12.9243090Z { 2025-08-14T21:19:12.9243274Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9243493Z "size": 1217, 2025-08-14T21:19:12.9243737Z "digest": "sha256:53b11c77468cbefca210560f7d8be8e58f9eeb415e096ab0c3fb0277f0b41caf" 2025-08-14T21:19:12.9243998Z }, 2025-08-14T21:19:12.9244109Z { 2025-08-14T21:19:12.9244296Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9244520Z "size": 485, 2025-08-14T21:19:12.9244748Z "digest": "sha256:e97311a6a967664cbe10c5027a1ec60c514caa9a1160167d8363088fd1f9fe09" 2025-08-14T21:19:12.9245007Z }, 2025-08-14T21:19:12.9245126Z { 2025-08-14T21:19:12.9245307Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9245569Z "size": 110343699, 2025-08-14T21:19:12.9245815Z "digest": "sha256:2c414689d31dc46a22fe02d4f43699f528cc1c02fb505824768383fa0bbf1c74" 2025-08-14T21:19:12.9246065Z }, 2025-08-14T21:19:12.9246185Z { 2025-08-14T21:19:12.9246370Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9246590Z "size": 4817, 2025-08-14T21:19:12.9246916Z "digest": "sha256:6d89b5f065d59e4abcaa9b5ff3bf0afded2394d493d2df0f7babf7154f7548e0" 2025-08-14T21:19:12.9247197Z }, 2025-08-14T21:19:12.9247318Z { 2025-08-14T21:19:12.9247501Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9247734Z "size": 1709, 2025-08-14T21:19:12.9247978Z "digest": "sha256:5a5cc76ada432cccf7d18e0eb79379afb95deaaa7afec482406267924d291ae4" 2025-08-14T21:19:12.9248243Z }, 2025-08-14T21:19:12.9248363Z { 2025-08-14T21:19:12.9248555Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9248785Z "size": 724, 2025-08-14T21:19:12.9249021Z "digest": "sha256:fc6b37d40530f2c5339430321eab67ae1e2e87e997587c7bc8c41504464208f9" 2025-08-14T21:19:12.9249283Z }, 2025-08-14T21:19:12.9249395Z { 2025-08-14T21:19:12.9249583Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9249814Z "size": 542, 2025-08-14T21:19:12.9250046Z "digest": "sha256:2e16579078600b91216fd14aca1e0ce0f9d1801b230689dd309980e8d2783935" 2025-08-14T21:19:12.9250297Z }, 2025-08-14T21:19:12.9250414Z { 2025-08-14T21:19:12.9250603Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9250832Z "size": 3397512507, 2025-08-14T21:19:12.9251076Z "digest": "sha256:7b92d7a4b8c766d7b7873aa33088e171fb44a8e968645e4b31dfe6de2968aead" 2025-08-14T21:19:12.9251387Z }, 2025-08-14T21:19:12.9251496Z { 2025-08-14T21:19:12.9251685Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9251914Z "size": 32, 2025-08-14T21:19:12.9252143Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9252404Z }, 2025-08-14T21:19:12.9252522Z { 2025-08-14T21:19:12.9252701Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9252931Z "size": 380, 2025-08-14T21:19:12.9253167Z "digest": "sha256:d6226eb61f823984003d5ac28f4d66fec9b27baf5d54a9513286483f5912cd88" 2025-08-14T21:19:12.9253428Z }, 2025-08-14T21:19:12.9253539Z { 2025-08-14T21:19:12.9253729Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9253963Z "size": 234681, 2025-08-14T21:19:12.9254196Z "digest": "sha256:83c70f4266a6ee5f8f44a88d4cb951382f6c960323b8250046bddc080e62268b" 2025-08-14T21:19:12.9254458Z }, 2025-08-14T21:19:12.9254577Z { 2025-08-14T21:19:12.9254757Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9254990Z "size": 231, 2025-08-14T21:19:12.9255221Z "digest": "sha256:60c725d21861c24c417efe3a5474414ba04f0f49c78c6d6451478ab9e45469ec" 2025-08-14T21:19:12.9255470Z }, 2025-08-14T21:19:12.9255593Z { 2025-08-14T21:19:12.9255783Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9256008Z "size": 4464546, 2025-08-14T21:19:12.9256252Z "digest": "sha256:a504e76e66a49926b4ea837b7a7ff3c842a27b2caaa4d80cf5057a1e55293666" 2025-08-14T21:19:12.9256511Z }, 2025-08-14T21:19:12.9256629Z { 2025-08-14T21:19:12.9256813Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9257046Z "size": 1864, 2025-08-14T21:19:12.9257285Z "digest": "sha256:fc1c200a4f77face2af0146f9b03ad04f31fe06fec216473ffd2ebd538cde056" 2025-08-14T21:19:12.9257542Z }, 2025-08-14T21:19:12.9257658Z { 2025-08-14T21:19:12.9257850Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9258073Z "size": 475, 2025-08-14T21:19:12.9258303Z "digest": "sha256:43273c22704f81f162741d2039015f745273eee1d1fdec47be35c9b2a90dcc5b" 2025-08-14T21:19:12.9258558Z }, 2025-08-14T21:19:12.9258670Z { 2025-08-14T21:19:12.9258858Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9259089Z "size": 178, 2025-08-14T21:19:12.9259329Z "digest": "sha256:89df389d042adbd7621a94d36b6e3db60ff6c559efb95c6fcc11b8afd42f0599" 2025-08-14T21:19:12.9259587Z }, 2025-08-14T21:19:12.9259704Z { 2025-08-14T21:19:12.9259932Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9260160Z "size": 586, 2025-08-14T21:19:12.9260390Z "digest": "sha256:684349f50d9456597026ee5c1bd890c51d1e498614f367adf03329c5227add79" 2025-08-14T21:19:12.9260640Z }, 2025-08-14T21:19:12.9260750Z { 2025-08-14T21:19:12.9260942Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9261174Z "size": 218, 2025-08-14T21:19:12.9261404Z "digest": "sha256:21d0eae87fb3ac753b3f0e91ae638360d23922d4cd119410a5a1b97bbe0ca435" 2025-08-14T21:19:12.9261669Z }, 2025-08-14T21:19:12.9261787Z { 2025-08-14T21:19:12.9261969Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9262200Z "size": 802, 2025-08-14T21:19:12.9262435Z "digest": "sha256:c9c2b424b8e08d943dc259a3796d66eede3a1e93a6460df5db132c0036d3d6af" 2025-08-14T21:19:12.9262697Z }, 2025-08-14T21:19:12.9262808Z { 2025-08-14T21:19:12.9262996Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9263228Z "size": 32, 2025-08-14T21:19:12.9263463Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9263728Z }, 2025-08-14T21:19:12.9263846Z { 2025-08-14T21:19:12.9264026Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9264330Z "size": 104, 2025-08-14T21:19:12.9264568Z "digest": "sha256:98dda28f339592e3ca6d589d551e69b8314f2b7fc2a1544eacc1b3c2d3378521" 2025-08-14T21:19:12.9264964Z }, 2025-08-14T21:19:12.9265091Z { 2025-08-14T21:19:12.9265283Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9265512Z "size": 1496, 2025-08-14T21:19:12.9265756Z "digest": "sha256:acf5babd87f23aa905883eb434073e9a00ff41679134f2f4827dd86949f5a9d9" 2025-08-14T21:19:12.9266026Z }, 2025-08-14T21:19:12.9266151Z { 2025-08-14T21:19:12.9266336Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9266574Z "size": 453555614, 2025-08-14T21:19:12.9266829Z "digest": "sha256:7c5050d8408d3c4f9f5e8f2cb215245473bfc2f1510fe5ee01c2a6c505068b5a" 2025-08-14T21:19:12.9267085Z }, 2025-08-14T21:19:12.9267206Z { 2025-08-14T21:19:12.9267393Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9267617Z "size": 163, 2025-08-14T21:19:12.9267858Z "digest": "sha256:7ddd14e2b548b9ae6e216a081bb20116434aacbbe571c99b40e60fb2fde22a2a" 2025-08-14T21:19:12.9268125Z }, 2025-08-14T21:19:12.9268236Z { 2025-08-14T21:19:12.9268425Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9268654Z "size": 347, 2025-08-14T21:19:12.9268886Z "digest": "sha256:4ba8e7a736c8199931fd7ff9931a5f17b7b931d0383a3e158f1b12b191a1d250" 2025-08-14T21:19:12.9269138Z }, 2025-08-14T21:19:12.9269255Z { 2025-08-14T21:19:12.9269445Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9269667Z "size": 32, 2025-08-14T21:19:12.9269908Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9270174Z }, 2025-08-14T21:19:12.9270284Z { 2025-08-14T21:19:12.9270472Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9270701Z "size": 106, 2025-08-14T21:19:12.9270930Z "digest": "sha256:907c320fee2f90da0cf5028c90a0ef49a137518baf79b483dcf7f22d5a0a497d" 2025-08-14T21:19:12.9271202Z }, 2025-08-14T21:19:12.9271320Z { 2025-08-14T21:19:12.9271499Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9271728Z "size": 425, 2025-08-14T21:19:12.9271962Z "digest": "sha256:18c4ed1ec491095788e352ae018afd84de0f251fbcfb8f74d5d893e1e9ab196d" 2025-08-14T21:19:12.9272223Z }, 2025-08-14T21:19:12.9272333Z { 2025-08-14T21:19:12.9272518Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9272749Z "size": 19308711, 2025-08-14T21:19:12.9272992Z "digest": "sha256:d7618c2df6cdb4bbf3d9870ba2d089094ac46c429b573d9adb94411fac54cfca" 2025-08-14T21:19:12.9273325Z }, 2025-08-14T21:19:12.9273443Z { 2025-08-14T21:19:12.9273626Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9273856Z "size": 108, 2025-08-14T21:19:12.9274092Z "digest": "sha256:b7bdd9a6f789ba483a46c92e5d373638850f33e88b1baa4bbe67e1c6a09cb7d0" 2025-08-14T21:19:12.9274352Z }, 2025-08-14T21:19:12.9274475Z { 2025-08-14T21:19:12.9274664Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9274885Z "size": 691, 2025-08-14T21:19:12.9275125Z "digest": "sha256:6738ba83282e002d92bff3d2b4951e3c1a67f5ec2c1bad2fd780c2f5d444748f" 2025-08-14T21:19:12.9275391Z }, 2025-08-14T21:19:12.9275515Z { 2025-08-14T21:19:12.9275701Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9275934Z "size": 724, 2025-08-14T21:19:12.9276164Z "digest": "sha256:fc6b37d40530f2c5339430321eab67ae1e2e87e997587c7bc8c41504464208f9" 2025-08-14T21:19:12.9276413Z }, 2025-08-14T21:19:12.9276536Z { 2025-08-14T21:19:12.9276724Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9276946Z "size": 116, 2025-08-14T21:19:12.9277179Z "digest": "sha256:dfb0f24886393e1d394f1f433dc9346026679dafd7a60c3a93de17d94078c1ca" 2025-08-14T21:19:12.9277480Z }, 2025-08-14T21:19:12.9277593Z { 2025-08-14T21:19:12.9277782Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9278009Z "size": 136, 2025-08-14T21:19:12.9278244Z "digest": "sha256:dc833b0762f2e144670a660f6b7ce62cec71a5fdd24df4e67b5c6173d5834451" 2025-08-14T21:19:12.9278499Z }, 2025-08-14T21:19:12.9278616Z { 2025-08-14T21:19:12.9278804Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9279024Z "size": 139, 2025-08-14T21:19:12.9279254Z "digest": "sha256:8827df8ca2da347e0032d1bff3b0312437f711c5d0b5f2164f8a60c3368a9827" 2025-08-14T21:19:12.9279512Z }, 2025-08-14T21:19:12.9279623Z { 2025-08-14T21:19:12.9279815Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9280049Z "size": 17672683360, 2025-08-14T21:19:12.9280293Z "digest": "sha256:fac8f3bd0f85eaffb43df539683dc3d861c370e583623253559fd7a1f5b00229" 2025-08-14T21:19:12.9280556Z }, 2025-08-14T21:19:12.9280676Z { 2025-08-14T21:19:12.9280854Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9281084Z "size": 214, 2025-08-14T21:19:12.9281318Z "digest": "sha256:d7cf7f140df32761610e1d58686db7f7c66a85affa4bb4b9d3c245e232443a8f" 2025-08-14T21:19:12.9281578Z }, 2025-08-14T21:19:12.9281689Z { 2025-08-14T21:19:12.9281875Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9282103Z "size": 272992162, 2025-08-14T21:19:12.9282344Z "digest": "sha256:733eedc8da8d8e7bd5a85a58d3d7818f14ed9a4fdf2dbd587038bb7725fbb9f7" 2025-08-14T21:19:12.9282612Z }, 2025-08-14T21:19:12.9282729Z { 2025-08-14T21:19:12.9282914Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9283149Z "size": 6435582332, 2025-08-14T21:19:12.9283395Z "digest": "sha256:5b092eb06909a2ea8906849acac588a10864da349670d65c0bfea342187edba2" 2025-08-14T21:19:12.9283645Z }, 2025-08-14T21:19:12.9283767Z { 2025-08-14T21:19:12.9283960Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9284191Z "size": 129, 2025-08-14T21:19:12.9284407Z "digest": "sha256:bc596103109216e154006085503386753b0b114b5900bf44758cdff324df5504" 2025-08-14T21:19:12.9284871Z }, 2025-08-14T21:19:12.9285003Z { 2025-08-14T21:19:12.9285189Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9285441Z "size": 776, 2025-08-14T21:19:12.9285686Z "digest": "sha256:0531cc34c12ab9127f1858c4cf365bb3a02bc31e8d6df5eabba2e1b6ef026ccf" 2025-08-14T21:19:12.9285950Z }, 2025-08-14T21:19:12.9286075Z { 2025-08-14T21:19:12.9286272Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9286554Z "size": 724, 2025-08-14T21:19:12.9286789Z "digest": "sha256:fc6b37d40530f2c5339430321eab67ae1e2e87e997587c7bc8c41504464208f9" 2025-08-14T21:19:12.9287051Z }, 2025-08-14T21:19:12.9287163Z { 2025-08-14T21:19:12.9287349Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9287585Z "size": 141, 2025-08-14T21:19:12.9287817Z "digest": "sha256:38c303d3b62eb463762816db04062a480014a6f3c9754386f3e83ba331ab4d1d" 2025-08-14T21:19:12.9288067Z }, 2025-08-14T21:19:12.9288191Z { 2025-08-14T21:19:12.9288383Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9288607Z "size": 32, 2025-08-14T21:19:12.9288848Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9289117Z }, 2025-08-14T21:19:12.9289230Z { 2025-08-14T21:19:12.9289420Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9289650Z "size": 160, 2025-08-14T21:19:12.9289880Z "digest": "sha256:e06d15594a2a76995baebbce7032946ff9f94e281246fbc3f8ab19d8bcc38b81" 2025-08-14T21:19:12.9290145Z }, 2025-08-14T21:19:12.9290263Z { 2025-08-14T21:19:12.9290443Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9290728Z "size": 1010, 2025-08-14T21:19:12.9290982Z "digest": "sha256:0e55deb5cb38fd36b600183f7d86eaca0dabc04d2ff4d49ec2266ee3329edc4a" 2025-08-14T21:19:12.9291254Z }, 2025-08-14T21:19:12.9291371Z { 2025-08-14T21:19:12.9291565Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9291803Z "size": 724, 2025-08-14T21:19:12.9292034Z "digest": "sha256:fc6b37d40530f2c5339430321eab67ae1e2e87e997587c7bc8c41504464208f9" 2025-08-14T21:19:12.9292299Z }, 2025-08-14T21:19:12.9292423Z { 2025-08-14T21:19:12.9292613Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9292848Z "size": 134, 2025-08-14T21:19:12.9293091Z "digest": "sha256:4a53d66dce071bb7416414aa1adbc3e4a59003300c0d42038612fabdeb5a1b01" 2025-08-14T21:19:12.9293357Z }, 2025-08-14T21:19:12.9293484Z { 2025-08-14T21:19:12.9293678Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9293912Z "size": 32, 2025-08-14T21:19:12.9294154Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9294422Z }, 2025-08-14T21:19:12.9294549Z { 2025-08-14T21:19:12.9294740Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9294978Z "size": 159, 2025-08-14T21:19:12.9295224Z "digest": "sha256:1519daa051b8b80e04125f2f2215dc412dcdbb9502711925e97aeccbda069eaf" 2025-08-14T21:19:12.9295488Z }, 2025-08-14T21:19:12.9295614Z { 2025-08-14T21:19:12.9295810Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9296038Z "size": 1371, 2025-08-14T21:19:12.9296292Z "digest": "sha256:381ed91d2119f078fbba19102a65befc4cb242f8cf47a11fb6f76ea424690692" 2025-08-14T21:19:12.9296562Z }, 2025-08-14T21:19:12.9296679Z { 2025-08-14T21:19:12.9296878Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9297113Z "size": 32, 2025-08-14T21:19:12.9297361Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9297629Z }, 2025-08-14T21:19:12.9297753Z { 2025-08-14T21:19:12.9297947Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9298178Z "size": 137, 2025-08-14T21:19:12.9298420Z "digest": "sha256:c6b0a01a96dd479640297d4b012031ffc1bd9fc0daf61d86058f9b675c0a0705" 2025-08-14T21:19:12.9298691Z }, 2025-08-14T21:19:12.9298811Z { 2025-08-14T21:19:12.9299007Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9299248Z "size": 380, 2025-08-14T21:19:12.9299490Z "digest": "sha256:62df6413daeefebde04dcc401134734952e4ea37fc85ff23c89cb9b4fbd45155" 2025-08-14T21:19:12.9299796Z }, 2025-08-14T21:19:12.9299918Z { 2025-08-14T21:19:12.9300100Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9300331Z "size": 32, 2025-08-14T21:19:12.9300567Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9300832Z }, 2025-08-14T21:19:12.9300943Z { 2025-08-14T21:19:12.9301131Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9301362Z "size": 104, 2025-08-14T21:19:12.9301594Z "digest": "sha256:7a18bc2a6881b76a6f591c98dafb47e44d903f7a905f7eba0fc3aedb5c90fff7" 2025-08-14T21:19:12.9301863Z }, 2025-08-14T21:19:12.9301981Z { 2025-08-14T21:19:12.9302163Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9302391Z "size": 407, 2025-08-14T21:19:12.9302624Z "digest": "sha256:93359cd58a8cece344fd4291b27647e57761c9399bb54bb0c18149c12af5f66a" 2025-08-14T21:19:12.9302877Z }, 2025-08-14T21:19:12.9302998Z { 2025-08-14T21:19:12.9303190Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9303416Z "size": 32, 2025-08-14T21:19:12.9303655Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9303917Z }, 2025-08-14T21:19:12.9304069Z { 2025-08-14T21:19:12.9304251Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9304480Z "size": 109, 2025-08-14T21:19:12.9304805Z "digest": "sha256:c35ba0a1f353d6894c914a4bfbea9a2c9b8ac1b526af64d34cbe9a12bd83c78e" 2025-08-14T21:19:12.9305070Z }, 2025-08-14T21:19:12.9305193Z { 2025-08-14T21:19:12.9305383Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9305609Z "size": 1896, 2025-08-14T21:19:12.9305849Z "digest": "sha256:dcf1e01c98d6a6f72674d79a4e8e4047b54796576cd06ad682c225a92820a8f5" 2025-08-14T21:19:12.9306113Z }, 2025-08-14T21:19:12.9306226Z { 2025-08-14T21:19:12.9306423Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9306660Z "size": 242635753, 2025-08-14T21:19:12.9306914Z "digest": "sha256:bad0564f61fdf377e3ae31f6fec0ec28b6922da0b9db28408b55b8e97ff1ea51" 2025-08-14T21:19:12.9307174Z }, 2025-08-14T21:19:12.9307296Z { 2025-08-14T21:19:12.9307486Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9307706Z "size": 106, 2025-08-14T21:19:12.9307939Z "digest": "sha256:539ded9057364aade7abe23ab908d2caf53966a186734aa58ae84a56bee659eb" 2025-08-14T21:19:12.9308202Z }, 2025-08-14T21:19:12.9308311Z { 2025-08-14T21:19:12.9308496Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9308724Z "size": 163, 2025-08-14T21:19:12.9308944Z "digest": "sha256:28d482062637d32514edfc447913e98745d7c13d2f277531e64ffcf090ae6d92" 2025-08-14T21:19:12.9309197Z }, 2025-08-14T21:19:12.9309317Z { 2025-08-14T21:19:12.9309500Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9309729Z "size": 7943, 2025-08-14T21:19:12.9309964Z "digest": "sha256:3245316ff51b50b27da4ef7279733c92f76cc652b3fce3877c0e3d510430e8b3" 2025-08-14T21:19:12.9310220Z }, 2025-08-14T21:19:12.9310329Z { 2025-08-14T21:19:12.9310516Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9310748Z "size": 8073, 2025-08-14T21:19:12.9310975Z "digest": "sha256:b53167d1a6df0e4b67d637d073150dff1fb87a823864c0c98d77c15e56babc24" 2025-08-14T21:19:12.9311231Z }, 2025-08-14T21:19:12.9311346Z { 2025-08-14T21:19:12.9311524Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9311752Z "size": 303, 2025-08-14T21:19:12.9311983Z "digest": "sha256:7f5277f691672469f431fd90a8c2bb702c6c68333f6be2cff868f00e416c5a1a" 2025-08-14T21:19:12.9312231Z }, 2025-08-14T21:19:12.9312348Z { 2025-08-14T21:19:12.9312535Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9312790Z "size": 32, 2025-08-14T21:19:12.9313033Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9313296Z }, 2025-08-14T21:19:12.9313416Z { 2025-08-14T21:19:12.9313596Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9313829Z "size": 108, 2025-08-14T21:19:12.9314066Z "digest": "sha256:23dff10cdaa5b1e9c7250f0c58a6279f104b35408281e951bfe9983f97e3d9ed" 2025-08-14T21:19:12.9314322Z }, 2025-08-14T21:19:12.9314442Z { 2025-08-14T21:19:12.9314630Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9314855Z "size": 54145699, 2025-08-14T21:19:12.9315105Z "digest": "sha256:9fb73296da6ac15f37f36663bd10afc98abb8a01fb40bff4848de7247d28e018" 2025-08-14T21:19:12.9315371Z }, 2025-08-14T21:19:12.9315483Z { 2025-08-14T21:19:12.9315668Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-08-14T21:19:12.9315898Z "size": 32, 2025-08-14T21:19:12.9316133Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-08-14T21:19:12.9316389Z } 2025-08-14T21:19:12.9316508Z ] 2025-08-14T21:19:12.9316631Z } 2025-08-14T21:19:12.9346685Z ##[group]Run set -eux 2025-08-14T21:19:12.9346870Z set -eux 2025-08-14T21:19:12.9347440Z aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token | jq --raw-output '.SecretString' | jq -r .docker_hub_readonly_token | docker login --username pytorchbot --password-stdin 2025-08-14T21:19:12.9353219Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:12.9353434Z env: 2025-08-14T21:19:12.9353582Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:12.9353748Z ##[endgroup] 2025-08-14T21:19:12.9377650Z + aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token 2025-08-14T21:19:12.9378134Z + jq --raw-output .SecretString 2025-08-14T21:19:12.9378784Z + jq -r .docker_hub_readonly_token 2025-08-14T21:19:12.9379088Z + docker login --username pytorchbot --password-stdin 2025-08-14T21:19:13.3563210Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2025-08-14T21:19:13.3563676Z Configure a credential helper to remove this warning. See 2025-08-14T21:19:13.3564013Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2025-08-14T21:19:13.3564249Z 2025-08-14T21:19:13.3567479Z Login Succeeded 2025-08-14T21:19:13.3641805Z ##[group]Run tag=${ECR_DOCKER_IMAGE##*:} 2025-08-14T21:19:13.3642039Z tag=${ECR_DOCKER_IMAGE##*:} 2025-08-14T21:19:13.3642280Z echo "docker pull ghcr.io/pytorch/ci-image:${tag/:/-}" 2025-08-14T21:19:13.3646516Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:13.3646740Z env: 2025-08-14T21:19:13.3646879Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:13.3647377Z ECR_DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:13.3647872Z ##[endgroup] 2025-08-14T21:19:13.3669328Z docker pull ghcr.io/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:13.3741749Z ##[group]Run pytorch/test-infra/.github/actions/pull-docker-image@main 2025-08-14T21:19:13.3742015Z with: 2025-08-14T21:19:13.3742480Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:13.3743020Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:13.3743250Z env: 2025-08-14T21:19:13.3743395Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:13.3743557Z ##[endgroup] 2025-08-14T21:19:13.3765617Z ##[group]Run set -x 2025-08-14T21:19:13.3765804Z set -x 2025-08-14T21:19:13.3765951Z set +e 2025-08-14T21:19:13.3766091Z  2025-08-14T21:19:13.3766224Z login() { 2025-08-14T21:19:13.3766518Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-08-14T21:19:13.3766813Z } 2025-08-14T21:19:13.3766942Z  2025-08-14T21:19:13.3767119Z retry () { 2025-08-14T21:19:13.3767286Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-08-14T21:19:13.3767480Z } 2025-08-14T21:19:13.3767607Z  2025-08-14T21:19:13.3767748Z retry login "${DOCKER_REGISTRY}" 2025-08-14T21:19:13.3767921Z  2025-08-14T21:19:13.3768200Z IMAGE_SIZE=$(docker manifest inspect "${DOCKER_IMAGE}" | jq '[.layers[].size, .config.size] | add / 1024 / 1024') 2025-08-14T21:19:13.3768571Z echo "Compressed size of image in MB: ${IMAGE_SIZE}" 2025-08-14T21:19:13.3768786Z  2025-08-14T21:19:13.3768917Z set -e 2025-08-14T21:19:13.3769123Z # ignore output since only exit code is used for conditional 2025-08-14T21:19:13.3769406Z # only pull docker image if it's not available locally 2025-08-14T21:19:13.3769706Z if ! docker inspect --type=image "${DOCKER_IMAGE}" >/dev/null 2>/dev/null; then 2025-08-14T21:19:13.3769989Z  retry docker pull "${DOCKER_IMAGE}" 2025-08-14T21:19:13.3770178Z fi 2025-08-14T21:19:13.3773906Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:19:13.3774129Z env: 2025-08-14T21:19:13.3774273Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:19:13.3774764Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:13.3775320Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:13.3775553Z ##[endgroup] 2025-08-14T21:19:13.3795864Z + set +e 2025-08-14T21:19:13.3799933Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:13.3802105Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:13.3802580Z + aws ecr get-login-password --region us-east-1 2025-08-14T21:19:13.3806406Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-08-14T21:19:13.7588318Z WARNING! Your password will be stored unencrypted in /home/ec2-user/.docker/config.json. 2025-08-14T21:19:13.7588651Z Login Succeeded 2025-08-14T21:19:13.7592894Z Configure a credential helper to remove this warning. See 2025-08-14T21:19:13.7596941Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2025-08-14T21:19:13.7598818Z 2025-08-14T21:19:13.7611101Z ++ docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:13.7612531Z ++ jq '[.layers[].size, .config.size] | add / 1024 / 1024' 2025-08-14T21:19:13.9923424Z + IMAGE_SIZE=27663.483686447144 2025-08-14T21:19:13.9923676Z Compressed size of image in MB: 27663.483686447144 2025-08-14T21:19:13.9927738Z + echo 'Compressed size of image in MB: 27663.483686447144' 2025-08-14T21:19:13.9931659Z + set -e 2025-08-14T21:19:13.9934319Z + docker inspect --type=image 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:14.0058406Z + retry docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:14.0059238Z + docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:19:14.2855614Z pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe: Pulling from pytorch/ci-image 2025-08-14T21:19:14.2860431Z 660ffc76f83b: Pulling fs layer 2025-08-14T21:19:14.2864318Z c7b4a852a455: Pulling fs layer 2025-08-14T21:19:14.2868616Z e5a28988c893: Pulling fs layer 2025-08-14T21:19:14.2872908Z 76a69b57b683: Pulling fs layer 2025-08-14T21:19:14.2874849Z 5c785dcb4cdb: Pulling fs layer 2025-08-14T21:19:14.2875216Z 836ab08052e8: Pulling fs layer 2025-08-14T21:19:14.2880042Z 53b11c77468c: Pulling fs layer 2025-08-14T21:19:14.2884948Z e97311a6a967: Pulling fs layer 2025-08-14T21:19:14.2889264Z 2c414689d31d: Pulling fs layer 2025-08-14T21:19:14.2889708Z 6d89b5f065d5: Pulling fs layer 2025-08-14T21:19:14.2890156Z 5a5cc76ada43: Pulling fs layer 2025-08-14T21:19:14.2890905Z fc6b37d40530: Pulling fs layer 2025-08-14T21:19:14.2917862Z 2e1657907860: Pulling fs layer 2025-08-14T21:19:14.2918204Z 7b92d7a4b8c7: Pulling fs layer 2025-08-14T21:19:14.2918392Z 4f4fb700ef54: Pulling fs layer 2025-08-14T21:19:14.2918576Z d6226eb61f82: Pulling fs layer 2025-08-14T21:19:14.2918754Z 83c70f4266a6: Pulling fs layer 2025-08-14T21:19:14.2918919Z 60c725d21861: Pulling fs layer 2025-08-14T21:19:14.2919097Z a504e76e66a4: Pulling fs layer 2025-08-14T21:19:14.2919273Z fc1c200a4f77: Pulling fs layer 2025-08-14T21:19:14.2919466Z 43273c22704f: Pulling fs layer 2025-08-14T21:19:14.2919724Z 89df389d042a: Pulling fs layer 2025-08-14T21:19:14.2919955Z 684349f50d94: Pulling fs layer 2025-08-14T21:19:14.2920170Z 21d0eae87fb3: Pulling fs layer 2025-08-14T21:19:14.2920357Z c9c2b424b8e0: Pulling fs layer 2025-08-14T21:19:14.2920750Z 98dda28f3395: Pulling fs layer 2025-08-14T21:19:14.2920930Z acf5babd87f2: Pulling fs layer 2025-08-14T21:19:14.2921097Z 7c5050d8408d: Pulling fs layer 2025-08-14T21:19:14.2921274Z 7ddd14e2b548: Pulling fs layer 2025-08-14T21:19:14.2921453Z 4ba8e7a736c8: Pulling fs layer 2025-08-14T21:19:14.2921645Z 907c320fee2f: Pulling fs layer 2025-08-14T21:19:14.2921820Z 18c4ed1ec491: Pulling fs layer 2025-08-14T21:19:14.2921992Z 836ab08052e8: Waiting 2025-08-14T21:19:14.2922153Z d7618c2df6cd: Pulling fs layer 2025-08-14T21:19:14.2922322Z 53b11c77468c: Waiting 2025-08-14T21:19:14.2922490Z b7bdd9a6f789: Pulling fs layer 2025-08-14T21:19:14.2922739Z 6738ba83282e: Pulling fs layer 2025-08-14T21:19:14.2922952Z dfb0f2488639: Pulling fs layer 2025-08-14T21:19:14.2923122Z e97311a6a967: Waiting 2025-08-14T21:19:14.2923275Z dc833b0762f2: Pulling fs layer 2025-08-14T21:19:14.2923457Z 8827df8ca2da: Pulling fs layer 2025-08-14T21:19:14.2923629Z 2c414689d31d: Waiting 2025-08-14T21:19:14.2923795Z fac8f3bd0f85: Pulling fs layer 2025-08-14T21:19:14.2923957Z 6d89b5f065d5: Waiting 2025-08-14T21:19:14.2924112Z 5a5cc76ada43: Waiting 2025-08-14T21:19:14.2924274Z d7cf7f140df3: Pulling fs layer 2025-08-14T21:19:14.2924443Z 733eedc8da8d: Pulling fs layer 2025-08-14T21:19:14.2924624Z fc6b37d40530: Waiting 2025-08-14T21:19:14.2924816Z 2e1657907860: Waiting 2025-08-14T21:19:14.2924972Z 5b092eb06909: Pulling fs layer 2025-08-14T21:19:14.2925145Z bc5961031092: Pulling fs layer 2025-08-14T21:19:14.2925312Z 7b92d7a4b8c7: Waiting 2025-08-14T21:19:14.2925459Z 4f4fb700ef54: Waiting 2025-08-14T21:19:14.2925618Z 0531cc34c12a: Pulling fs layer 2025-08-14T21:19:14.2925788Z d6226eb61f82: Waiting 2025-08-14T21:19:14.2925937Z 38c303d3b62e: Pulling fs layer 2025-08-14T21:19:14.2926106Z 83c70f4266a6: Waiting 2025-08-14T21:19:14.2926269Z e06d15594a2a: Pulling fs layer 2025-08-14T21:19:14.2926454Z 60c725d21861: Waiting 2025-08-14T21:19:14.2926717Z 0e55deb5cb38: Pulling fs layer 2025-08-14T21:19:14.2926897Z a504e76e66a4: Waiting 2025-08-14T21:19:14.2927052Z 76a69b57b683: Waiting 2025-08-14T21:19:14.2927218Z 4a53d66dce07: Pulling fs layer 2025-08-14T21:19:14.2948290Z fc1c200a4f77: Waiting 2025-08-14T21:19:14.2948594Z 43273c22704f: Waiting 2025-08-14T21:19:14.2948774Z 1519daa051b8: Pulling fs layer 2025-08-14T21:19:14.2948948Z 5c785dcb4cdb: Waiting 2025-08-14T21:19:14.2949093Z 89df389d042a: Waiting 2025-08-14T21:19:14.2949243Z 381ed91d2119: Pulling fs layer 2025-08-14T21:19:14.2949402Z 684349f50d94: Waiting 2025-08-14T21:19:14.2949542Z 21d0eae87fb3: Waiting 2025-08-14T21:19:14.2949699Z c6b0a01a96dd: Pulling fs layer 2025-08-14T21:19:14.2949859Z c9c2b424b8e0: Waiting 2025-08-14T21:19:14.2950011Z 62df6413daee: Pulling fs layer 2025-08-14T21:19:14.2950178Z 7a18bc2a6881: Pulling fs layer 2025-08-14T21:19:14.2950338Z 93359cd58a8c: Pulling fs layer 2025-08-14T21:19:14.2950492Z 98dda28f3395: Waiting 2025-08-14T21:19:14.2950660Z c35ba0a1f353: Pulling fs layer 2025-08-14T21:19:14.2950830Z dcf1e01c98d6: Pulling fs layer 2025-08-14T21:19:14.2951001Z bad0564f61fd: Pulling fs layer 2025-08-14T21:19:14.2951155Z 907c320fee2f: Waiting 2025-08-14T21:19:14.2951300Z acf5babd87f2: Waiting 2025-08-14T21:19:14.2951446Z 539ded905736: Pulling fs layer 2025-08-14T21:19:14.2951603Z 28d482062637: Pulling fs layer 2025-08-14T21:19:14.2951755Z 7c5050d8408d: Waiting 2025-08-14T21:19:14.2951895Z 3245316ff51b: Pulling fs layer 2025-08-14T21:19:14.2952051Z 7ddd14e2b548: Waiting 2025-08-14T21:19:14.2952223Z b53167d1a6df: Pulling fs layer 2025-08-14T21:19:14.2952382Z 7f5277f69167: Pulling fs layer 2025-08-14T21:19:14.2952531Z 4ba8e7a736c8: Waiting 2025-08-14T21:19:14.2952677Z 23dff10cdaa5: Pulling fs layer 2025-08-14T21:19:14.2952873Z 9fb73296da6a: Pulling fs layer 2025-08-14T21:19:14.2953033Z 18c4ed1ec491: Waiting 2025-08-14T21:19:14.2953172Z d7618c2df6cd: Waiting 2025-08-14T21:19:14.2953312Z b7bdd9a6f789: Waiting 2025-08-14T21:19:14.2953449Z 6738ba83282e: Waiting 2025-08-14T21:19:14.2953589Z dfb0f2488639: Waiting 2025-08-14T21:19:14.2953725Z c6b0a01a96dd: Waiting 2025-08-14T21:19:14.2954012Z 62df6413daee: Waiting 2025-08-14T21:19:14.2954150Z 7a18bc2a6881: Waiting 2025-08-14T21:19:14.2954284Z 93359cd58a8c: Waiting 2025-08-14T21:19:14.2954421Z c35ba0a1f353: Waiting 2025-08-14T21:19:14.2954567Z dc833b0762f2: Waiting 2025-08-14T21:19:14.2954702Z dcf1e01c98d6: Waiting 2025-08-14T21:19:14.2954841Z 8827df8ca2da: Waiting 2025-08-14T21:19:14.2954982Z bad0564f61fd: Waiting 2025-08-14T21:19:14.2955120Z 539ded905736: Waiting 2025-08-14T21:19:14.2955258Z fac8f3bd0f85: Waiting 2025-08-14T21:19:14.2955395Z 9fb73296da6a: Waiting 2025-08-14T21:19:14.2976569Z d7cf7f140df3: Waiting 2025-08-14T21:19:14.2976757Z 28d482062637: Waiting 2025-08-14T21:19:14.2976902Z 733eedc8da8d: Waiting 2025-08-14T21:19:14.2977040Z 3245316ff51b: Waiting 2025-08-14T21:19:14.2977176Z b53167d1a6df: Waiting 2025-08-14T21:19:14.2977310Z 7f5277f69167: Waiting 2025-08-14T21:19:14.2977448Z 5b092eb06909: Waiting 2025-08-14T21:19:14.2977613Z 23dff10cdaa5: Waiting 2025-08-14T21:19:14.2977755Z bc5961031092: Waiting 2025-08-14T21:19:14.2977903Z 381ed91d2119: Waiting 2025-08-14T21:19:14.2978047Z 0531cc34c12a: Waiting 2025-08-14T21:19:14.2978181Z 1519daa051b8: Waiting 2025-08-14T21:19:14.2978327Z e06d15594a2a: Waiting 2025-08-14T21:19:14.2978472Z 38c303d3b62e: Waiting 2025-08-14T21:19:14.2978608Z 4a53d66dce07: Waiting 2025-08-14T21:19:14.2978754Z 0e55deb5cb38: Waiting 2025-08-14T21:19:14.3762812Z c7b4a852a455: Verifying Checksum 2025-08-14T21:19:14.3764826Z c7b4a852a455: Download complete 2025-08-14T21:19:14.5231449Z 76a69b57b683: Verifying Checksum 2025-08-14T21:19:14.5233415Z 76a69b57b683: Download complete 2025-08-14T21:19:14.6229717Z 5c785dcb4cdb: Download complete 2025-08-14T21:19:14.6582088Z 660ffc76f83b: Download complete 2025-08-14T21:19:14.7546443Z 53b11c77468c: Verifying Checksum 2025-08-14T21:19:14.7548660Z 53b11c77468c: Download complete 2025-08-14T21:19:14.7664970Z 836ab08052e8: Verifying Checksum 2025-08-14T21:19:14.7665587Z 836ab08052e8: Download complete 2025-08-14T21:19:14.8499303Z e97311a6a967: Verifying Checksum 2025-08-14T21:19:14.8499681Z e97311a6a967: Download complete 2025-08-14T21:19:14.9551369Z 6d89b5f065d5: Download complete 2025-08-14T21:19:15.0643735Z 5a5cc76ada43: Verifying Checksum 2025-08-14T21:19:15.0648302Z 5a5cc76ada43: Download complete 2025-08-14T21:19:15.1354478Z fc6b37d40530: Verifying Checksum 2025-08-14T21:19:15.1354803Z fc6b37d40530: Download complete 2025-08-14T21:19:15.2330953Z 2e1657907860: Download complete 2025-08-14T21:19:15.6804973Z 660ffc76f83b: Pull complete 2025-08-14T21:19:15.7081705Z c7b4a852a455: Pull complete 2025-08-14T21:19:15.9216112Z 2c414689d31d: Verifying Checksum 2025-08-14T21:19:15.9220204Z 2c414689d31d: Download complete 2025-08-14T21:19:15.9296676Z 4f4fb700ef54: Verifying Checksum 2025-08-14T21:19:15.9300995Z 4f4fb700ef54: Download complete 2025-08-14T21:19:16.0065517Z d6226eb61f82: Verifying Checksum 2025-08-14T21:19:16.0069813Z d6226eb61f82: Download complete 2025-08-14T21:19:16.1127000Z 83c70f4266a6: Download complete 2025-08-14T21:19:16.2096160Z 60c725d21861: Verifying Checksum 2025-08-14T21:19:16.2098258Z 60c725d21861: Download complete 2025-08-14T21:19:16.3205394Z a504e76e66a4: Verifying Checksum 2025-08-14T21:19:16.3209250Z a504e76e66a4: Download complete 2025-08-14T21:19:16.4144241Z fc1c200a4f77: Verifying Checksum 2025-08-14T21:19:16.4146148Z fc1c200a4f77: Download complete 2025-08-14T21:19:16.4733707Z 43273c22704f: Verifying Checksum 2025-08-14T21:19:16.4738055Z 43273c22704f: Download complete 2025-08-14T21:19:16.5635181Z 89df389d042a: Verifying Checksum 2025-08-14T21:19:16.5639515Z 89df389d042a: Download complete 2025-08-14T21:19:16.6495041Z 684349f50d94: Verifying Checksum 2025-08-14T21:19:16.6499364Z 684349f50d94: Download complete 2025-08-14T21:19:16.7240275Z 21d0eae87fb3: Verifying Checksum 2025-08-14T21:19:16.7242317Z 21d0eae87fb3: Download complete 2025-08-14T21:19:16.8320973Z c9c2b424b8e0: Download complete 2025-08-14T21:19:16.9225849Z 98dda28f3395: Verifying Checksum 2025-08-14T21:19:16.9228082Z 98dda28f3395: Download complete 2025-08-14T21:19:17.0295999Z acf5babd87f2: Verifying Checksum 2025-08-14T21:19:17.0298052Z acf5babd87f2: Download complete 2025-08-14T21:19:17.4911837Z e5a28988c893: Verifying Checksum 2025-08-14T21:19:17.4913620Z e5a28988c893: Download complete 2025-08-14T21:19:17.5773747Z 7ddd14e2b548: Download complete 2025-08-14T21:19:17.6548289Z 4ba8e7a736c8: Download complete 2025-08-14T21:19:17.7595164Z 907c320fee2f: Verifying Checksum 2025-08-14T21:19:17.7597204Z 907c320fee2f: Download complete 2025-08-14T21:19:17.8470651Z 18c4ed1ec491: Verifying Checksum 2025-08-14T21:19:17.8471083Z 18c4ed1ec491: Download complete 2025-08-14T21:19:18.1096161Z d7618c2df6cd: Verifying Checksum 2025-08-14T21:19:18.1098026Z d7618c2df6cd: Download complete 2025-08-14T21:19:18.1953867Z b7bdd9a6f789: Verifying Checksum 2025-08-14T21:19:18.1956038Z b7bdd9a6f789: Download complete 2025-08-14T21:19:18.2625667Z 6738ba83282e: Verifying Checksum 2025-08-14T21:19:18.2626067Z 6738ba83282e: Download complete 2025-08-14T21:19:18.3832942Z dfb0f2488639: Verifying Checksum 2025-08-14T21:19:18.3835079Z dfb0f2488639: Download complete 2025-08-14T21:19:18.4748950Z dc833b0762f2: Download complete 2025-08-14T21:19:18.5598916Z 8827df8ca2da: Download complete 2025-08-14T21:19:21.6346056Z 7c5050d8408d: Verifying Checksum 2025-08-14T21:19:21.6348515Z 7c5050d8408d: Download complete 2025-08-14T21:19:21.7251704Z d7cf7f140df3: Verifying Checksum 2025-08-14T21:19:21.7256251Z d7cf7f140df3: Download complete 2025-08-14T21:19:24.5297155Z 733eedc8da8d: Verifying Checksum 2025-08-14T21:19:24.5297625Z 733eedc8da8d: Download complete 2025-08-14T21:19:26.6173894Z e5a28988c893: Pull complete 2025-08-14T21:19:26.8515347Z 76a69b57b683: Pull complete 2025-08-14T21:19:27.0862470Z 5c785dcb4cdb: Pull complete 2025-08-14T21:19:27.4752183Z 836ab08052e8: Pull complete 2025-08-14T21:19:27.8149273Z 53b11c77468c: Pull complete 2025-08-14T21:19:28.0789430Z e97311a6a967: Pull complete 2025-08-14T21:19:31.4060230Z 2c414689d31d: Pull complete 2025-08-14T21:19:31.7988253Z 6d89b5f065d5: Pull complete 2025-08-14T21:19:32.1240783Z 5a5cc76ada43: Pull complete 2025-08-14T21:19:32.4416393Z fc6b37d40530: Pull complete 2025-08-14T21:19:32.7213210Z 2e1657907860: Pull complete 2025-08-14T21:19:49.2789502Z 7b92d7a4b8c7: Verifying Checksum 2025-08-14T21:19:49.2791200Z 7b92d7a4b8c7: Download complete 2025-08-14T21:19:49.3571599Z bc5961031092: Verifying Checksum 2025-08-14T21:19:49.3575367Z bc5961031092: Download complete 2025-08-14T21:19:49.4541755Z 0531cc34c12a: Verifying Checksum 2025-08-14T21:19:49.4543705Z 0531cc34c12a: Download complete 2025-08-14T21:19:49.5629302Z 38c303d3b62e: Verifying Checksum 2025-08-14T21:19:49.5631373Z 38c303d3b62e: Download complete 2025-08-14T21:19:49.6419365Z e06d15594a2a: Verifying Checksum 2025-08-14T21:19:49.6421278Z e06d15594a2a: Download complete 2025-08-14T21:19:49.7343962Z 0e55deb5cb38: Verifying Checksum 2025-08-14T21:19:49.7344404Z 0e55deb5cb38: Download complete 2025-08-14T21:19:49.8236920Z 4a53d66dce07: Verifying Checksum 2025-08-14T21:19:49.8241390Z 4a53d66dce07: Download complete 2025-08-14T21:19:49.9552165Z 1519daa051b8: Verifying Checksum 2025-08-14T21:19:49.9556039Z 1519daa051b8: Download complete 2025-08-14T21:19:50.0381782Z 381ed91d2119: Verifying Checksum 2025-08-14T21:19:50.0383838Z 381ed91d2119: Download complete 2025-08-14T21:19:50.1070822Z c6b0a01a96dd: Verifying Checksum 2025-08-14T21:19:50.1072636Z c6b0a01a96dd: Download complete 2025-08-14T21:19:50.1893802Z 62df6413daee: Verifying Checksum 2025-08-14T21:19:50.1895695Z 62df6413daee: Download complete 2025-08-14T21:19:50.2802981Z 7a18bc2a6881: Verifying Checksum 2025-08-14T21:19:50.2807235Z 7a18bc2a6881: Download complete 2025-08-14T21:19:50.3812467Z 93359cd58a8c: Verifying Checksum 2025-08-14T21:19:50.3814517Z 93359cd58a8c: Download complete 2025-08-14T21:19:50.4745904Z c35ba0a1f353: Verifying Checksum 2025-08-14T21:19:50.4749783Z c35ba0a1f353: Download complete 2025-08-14T21:19:50.5975165Z dcf1e01c98d6: Verifying Checksum 2025-08-14T21:19:50.5979165Z dcf1e01c98d6: Download complete 2025-08-14T21:19:53.0784896Z bad0564f61fd: Verifying Checksum 2025-08-14T21:19:53.0785909Z bad0564f61fd: Download complete 2025-08-14T21:19:53.1552947Z 539ded905736: Verifying Checksum 2025-08-14T21:19:53.1555127Z 539ded905736: Download complete 2025-08-14T21:19:53.2456176Z 28d482062637: Verifying Checksum 2025-08-14T21:19:53.2456664Z 28d482062637: Download complete 2025-08-14T21:19:53.3134626Z 3245316ff51b: Verifying Checksum 2025-08-14T21:19:53.3136438Z 3245316ff51b: Download complete 2025-08-14T21:19:53.4036268Z b53167d1a6df: Verifying Checksum 2025-08-14T21:19:53.4037876Z b53167d1a6df: Download complete 2025-08-14T21:19:53.5101481Z 7f5277f69167: Verifying Checksum 2025-08-14T21:19:53.5105244Z 7f5277f69167: Download complete 2025-08-14T21:19:53.5824928Z 23dff10cdaa5: Verifying Checksum 2025-08-14T21:19:53.5829355Z 23dff10cdaa5: Download complete 2025-08-14T21:19:54.1969719Z 9fb73296da6a: Verifying Checksum 2025-08-14T21:19:54.1969978Z 9fb73296da6a: Download complete 2025-08-14T21:20:28.9527521Z 5b092eb06909: Verifying Checksum 2025-08-14T21:20:28.9527799Z 5b092eb06909: Download complete 2025-08-14T21:20:58.0872470Z 7b92d7a4b8c7: Pull complete 2025-08-14T21:20:58.3874032Z 4f4fb700ef54: Pull complete 2025-08-14T21:20:58.6849164Z d6226eb61f82: Pull complete 2025-08-14T21:20:58.9669573Z 83c70f4266a6: Pull complete 2025-08-14T21:20:59.1924770Z 60c725d21861: Pull complete 2025-08-14T21:20:59.5062657Z a504e76e66a4: Pull complete 2025-08-14T21:20:59.6923390Z fc1c200a4f77: Pull complete 2025-08-14T21:20:59.9616761Z 43273c22704f: Pull complete 2025-08-14T21:21:00.2388963Z 89df389d042a: Pull complete 2025-08-14T21:21:00.7247167Z 684349f50d94: Pull complete 2025-08-14T21:21:00.8228859Z 21d0eae87fb3: Pull complete 2025-08-14T21:21:00.8795881Z c9c2b424b8e0: Pull complete 2025-08-14T21:21:00.9654644Z 98dda28f3395: Pull complete 2025-08-14T21:21:01.0092488Z acf5babd87f2: Pull complete 2025-08-14T21:21:11.3138966Z 7c5050d8408d: Pull complete 2025-08-14T21:21:11.6970981Z 7ddd14e2b548: Pull complete 2025-08-14T21:21:11.9268399Z 4ba8e7a736c8: Pull complete 2025-08-14T21:21:12.4668141Z 907c320fee2f: Pull complete 2025-08-14T21:21:12.6912325Z 18c4ed1ec491: Pull complete 2025-08-14T21:21:13.4518941Z d7618c2df6cd: Pull complete 2025-08-14T21:21:13.9169626Z b7bdd9a6f789: Pull complete 2025-08-14T21:21:14.3645537Z 6738ba83282e: Pull complete 2025-08-14T21:21:15.2100276Z dfb0f2488639: Pull complete 2025-08-14T21:21:15.7525427Z dc833b0762f2: Pull complete 2025-08-14T21:21:16.2018069Z 8827df8ca2da: Pull complete 2025-08-14T21:22:15.3365926Z fac8f3bd0f85: Verifying Checksum 2025-08-14T21:22:15.3369798Z fac8f3bd0f85: Download complete 2025-08-14T21:25:45.5960496Z fac8f3bd0f85: Pull complete 2025-08-14T21:25:45.6257184Z d7cf7f140df3: Pull complete 2025-08-14T21:25:47.4428835Z 733eedc8da8d: Pull complete 2025-08-14T21:27:53.8608913Z 5b092eb06909: Pull complete 2025-08-14T21:27:54.3237205Z bc5961031092: Pull complete 2025-08-14T21:27:54.5462638Z 0531cc34c12a: Pull complete 2025-08-14T21:27:55.4105792Z 38c303d3b62e: Pull complete 2025-08-14T21:27:56.0496947Z e06d15594a2a: Pull complete 2025-08-14T21:27:56.4257179Z 0e55deb5cb38: Pull complete 2025-08-14T21:27:57.1508977Z 4a53d66dce07: Pull complete 2025-08-14T21:27:57.9383092Z 1519daa051b8: Pull complete 2025-08-14T21:27:58.2713048Z 381ed91d2119: Pull complete 2025-08-14T21:27:59.1881825Z c6b0a01a96dd: Pull complete 2025-08-14T21:27:59.6594757Z 62df6413daee: Pull complete 2025-08-14T21:28:00.2183286Z 7a18bc2a6881: Pull complete 2025-08-14T21:28:00.4705509Z 93359cd58a8c: Pull complete 2025-08-14T21:28:01.1803331Z c35ba0a1f353: Pull complete 2025-08-14T21:28:01.2490737Z dcf1e01c98d6: Pull complete 2025-08-14T21:28:08.6371833Z bad0564f61fd: Pull complete 2025-08-14T21:28:08.7998542Z 539ded905736: Pull complete 2025-08-14T21:28:08.9776720Z 28d482062637: Pull complete 2025-08-14T21:28:09.1458411Z 3245316ff51b: Pull complete 2025-08-14T21:28:09.4850945Z b53167d1a6df: Pull complete 2025-08-14T21:28:09.8573655Z 7f5277f69167: Pull complete 2025-08-14T21:28:10.5477690Z 23dff10cdaa5: Pull complete 2025-08-14T21:28:13.1618847Z 9fb73296da6a: Pull complete 2025-08-14T21:28:13.7865563Z Digest: sha256:4236794baba289041d240d08fd393bbd57497c3012e5e0ccd9fd98f61ebf35c6 2025-08-14T21:28:13.8664918Z Status: Downloaded newer image for 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:28:13.9112614Z 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:28:13.9176979Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-08-14T21:28:13.9177517Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-08-14T21:28:13.9184847Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:13.9185179Z env: 2025-08-14T21:28:13.9185345Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:13.9185540Z ##[endgroup] 2025-08-14T21:28:13.9260592Z Prepare all required actions 2025-08-14T21:28:13.9498265Z ##[group]Run ./.github/actions/get-workflow-job-id 2025-08-14T21:28:13.9498485Z with: 2025-08-14T21:28:13.9498969Z github-token: *** 2025-08-14T21:28:13.9499119Z env: 2025-08-14T21:28:13.9499262Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:13.9499433Z ##[endgroup] 2025-08-14T21:28:13.9645168Z ##[group]Run set -eux 2025-08-14T21:28:13.9645421Z set -eux 2025-08-14T21:28:13.9645688Z python3 .github/scripts/get_workflow_job_id.py "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2025-08-14T21:28:13.9649583Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:13.9649804Z env: 2025-08-14T21:28:13.9649948Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:13.9650240Z GITHUB_TOKEN: *** 2025-08-14T21:28:13.9650384Z ##[endgroup] 2025-08-14T21:28:13.9674903Z + python3 .github/scripts/get_workflow_job_id.py 16976255153 i-0819c8fa835cec089 2025-08-14T21:28:14.9125141Z Setting output job-id=48128039107 2025-08-14T21:28:14.9127410Z Setting output job-name=linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:28:14.9402605Z ##[group]Run python3 -m pip install psutil==5.9.8 dataclasses_json==0.6.7 nvidia-ml-py==11.525.84 2025-08-14T21:28:14.9403028Z python3 -m pip install psutil==5.9.8 dataclasses_json==0.6.7 nvidia-ml-py==11.525.84 2025-08-14T21:28:14.9403553Z python3 -m tools.stats.monitor --log-interval "$MONITOR_LOG_INTERVAL" --data-collect-interval "$MONITOR_DATA_COLLECT_INTERVAL" > usage_log.txt 2>&1 & 2025-08-14T21:28:14.9404009Z echo "monitor-script-pid=${!}" >> "${GITHUB_OUTPUT}" 2025-08-14T21:28:14.9409357Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:14.9409581Z env: 2025-08-14T21:28:14.9409727Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:14.9409884Z JOB_ID: 48128039107 2025-08-14T21:28:14.9410207Z JOB_NAME: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:28:14.9410564Z WORKFLOW_NAME: inductor 2025-08-14T21:28:14.9410726Z WORKFLOW_RUN_ID: 16976255153 2025-08-14T21:28:14.9410929Z MONITOR_LOG_INTERVAL: 5 2025-08-14T21:28:14.9411089Z MONITOR_DATA_COLLECT_INTERVAL: 1 2025-08-14T21:28:14.9411263Z ##[endgroup] 2025-08-14T21:28:15.4270536Z Defaulting to user installation because normal site-packages is not writeable 2025-08-14T21:28:15.6627608Z Collecting psutil==5.9.8 2025-08-14T21:28:15.6781510Z Downloading psutil-5.9.8-cp36-abi3-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (288 kB) 2025-08-14T21:28:15.8576420Z Collecting dataclasses_json==0.6.7 2025-08-14T21:28:15.8617301Z Downloading dataclasses_json-0.6.7-py3-none-any.whl (28 kB) 2025-08-14T21:28:15.9035433Z Collecting nvidia-ml-py==11.525.84 2025-08-14T21:28:15.9077264Z Downloading nvidia_ml_py-11.525.84-py3-none-any.whl (34 kB) 2025-08-14T21:28:15.9718286Z Collecting typing-inspect<1,>=0.4.0 2025-08-14T21:28:15.9756874Z Downloading typing_inspect-0.9.0-py3-none-any.whl (8.8 kB) 2025-08-14T21:28:16.1134850Z Collecting marshmallow<4.0.0,>=3.18.0 2025-08-14T21:28:16.1172373Z Downloading marshmallow-3.26.1-py3-none-any.whl (50 kB) 2025-08-14T21:28:16.2880278Z Collecting packaging>=17.0 2025-08-14T21:28:16.2920515Z Downloading packaging-25.0-py3-none-any.whl (66 kB) 2025-08-14T21:28:16.4562652Z Collecting typing-extensions>=3.7.4 2025-08-14T21:28:16.4600360Z Downloading typing_extensions-4.14.1-py3-none-any.whl (43 kB) 2025-08-14T21:28:16.5887921Z Collecting mypy-extensions>=0.3.0 2025-08-14T21:28:16.5927625Z Downloading mypy_extensions-1.1.0-py3-none-any.whl (5.0 kB) 2025-08-14T21:28:16.8748214Z Installing collected packages: typing-extensions, packaging, mypy-extensions, typing-inspect, marshmallow, psutil, nvidia-ml-py, dataclasses-json 2025-08-14T21:28:17.1844616Z Successfully installed dataclasses-json-0.6.7 marshmallow-3.26.1 mypy-extensions-1.1.0 nvidia-ml-py-11.525.84 packaging-25.0 psutil-5.9.8 typing-extensions-4.14.1 typing-inspect-0.9.0 2025-08-14T21:28:17.4123194Z Prepare all required actions 2025-08-14T21:28:17.4123467Z Getting action download info 2025-08-14T21:28:17.5959386Z Download action repository 'seemethere/download-artifact-s3@v4' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2025-08-14T21:28:18.4132437Z Download action repository 'actions/download-artifact@v4' (SHA:d3f86a106a0bac45b974a628896c90dbdf5c8093) 2025-08-14T21:28:21.2184513Z ##[group]Run ./.github/actions/download-build-artifacts 2025-08-14T21:28:21.2185113Z with: 2025-08-14T21:28:21.2185327Z name: linux-jammy-py3.9-gcc11-build 2025-08-14T21:28:21.2185611Z s3-bucket: gha-artifacts 2025-08-14T21:28:21.2185817Z env: 2025-08-14T21:28:21.2185999Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:21.2186208Z ##[endgroup] 2025-08-14T21:28:21.2292526Z ##[group]Run seemethere/download-artifact-s3@v4 2025-08-14T21:28:21.2292743Z with: 2025-08-14T21:28:21.2292902Z name: linux-jammy-py3.9-gcc11-build 2025-08-14T21:28:21.2293123Z s3-bucket: gha-artifacts 2025-08-14T21:28:21.2293326Z region: us-east-1 2025-08-14T21:28:21.2293465Z env: 2025-08-14T21:28:21.2293603Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:21.2293764Z ##[endgroup] 2025-08-14T21:28:21.8841147Z (node:47840) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-08-14T21:28:21.8842370Z 2025-08-14T21:28:21.8842661Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-08-14T21:28:21.8843011Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-08-14T21:28:21.8843334Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-08-14T21:28:23.1376499Z Found 1 objects with prefix pytorch/pytorch/16976255153/linux-jammy-py3.9-gcc11-build/ 2025-08-14T21:28:23.1377128Z Starting download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/artifacts.zip 2025-08-14T21:28:30.8224787Z Finished download (1/1): /home/ec2-user/actions-runner/_work/pytorch/pytorch/artifacts.zip 2025-08-14T21:28:30.8230234Z Artifact download has finished successfully 2025-08-14T21:28:30.8402681Z ##[group]Run unzip -o artifacts.zip 2025-08-14T21:28:30.8402900Z unzip -o artifacts.zip 2025-08-14T21:28:30.8407345Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:30.8407564Z env: 2025-08-14T21:28:30.8407703Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:30.8407867Z ##[endgroup] 2025-08-14T21:28:30.9215550Z Archive: artifacts.zip 2025-08-14T21:28:30.9305703Z creating: dist/ 2025-08-14T21:28:31.9347834Z inflating: dist/torch-2.9.0a0+git1fc683c-cp39-cp39-linux_x86_64.whl 2025-08-14T21:28:31.9348535Z creating: dist/vision/ 2025-08-14T21:28:31.9417707Z inflating: dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl 2025-08-14T21:28:31.9421458Z creating: dist/audio/ 2025-08-14T21:28:31.9512105Z inflating: dist/audio/torchaudio-2.8.0a0+bdb88e1-cp39-cp39-linux_x86_64.whl 2025-08-14T21:28:31.9516479Z creating: dist/ao/ 2025-08-14T21:28:31.9547612Z inflating: dist/ao/torchao-0.7.0+git51c87b6e-py3-none-any.whl 2025-08-14T21:28:31.9651780Z inflating: dist/.ninja_log 2025-08-14T21:28:31.9655862Z creating: build/custom_test_artifacts/ 2025-08-14T21:28:31.9659492Z creating: build/custom_test_artifacts/custom-op-build/ 2025-08-14T21:28:31.9659988Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/ 2025-08-14T21:28:31.9660967Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/pkgRedirects/ 2025-08-14T21:28:31.9661552Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeConfigureLog.yaml 2025-08-14T21:28:31.9661939Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/ 2025-08-14T21:28:31.9662335Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CMakeSystem.cmake 2025-08-14T21:28:31.9662761Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CompilerIdC/ 2025-08-14T21:28:31.9663415Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CompilerIdC/tmp/ 2025-08-14T21:28:31.9663860Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CompilerIdC/CMakeCCompilerId.c 2025-08-14T21:28:31.9664294Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CompilerIdC/a.out 2025-08-14T21:28:31.9664703Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CMakeCCompiler.cmake 2025-08-14T21:28:31.9665190Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CompilerIdCXX/ 2025-08-14T21:28:31.9665581Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CompilerIdCXX/tmp/ 2025-08-14T21:28:31.9666026Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-08-14T21:28:31.9666484Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CompilerIdCXX/a.out 2025-08-14T21:28:31.9666904Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CMakeCXXCompiler.cmake 2025-08-14T21:28:31.9667367Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CMakeDetermineCompilerABI_C.bin 2025-08-14T21:28:31.9667846Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/4.0.0/CMakeDetermineCompilerABI_CXX.bin 2025-08-14T21:28:31.9668266Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeScratch/ 2025-08-14T21:28:31.9668624Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/cmake.check_cache 2025-08-14T21:28:31.9668989Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/ 2025-08-14T21:28:31.9669390Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.ts 2025-08-14T21:28:31.9669837Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.make 2025-08-14T21:28:31.9670274Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/depend.make 2025-08-14T21:28:31.9670677Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/link.txt 2025-08-14T21:28:31.9671092Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/cmake_clean.cmake 2025-08-14T21:28:31.9671501Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/build.make 2025-08-14T21:28:31.9671922Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/DependInfo.cmake 2025-08-14T21:28:31.9672336Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/flags.make 2025-08-14T21:28:31.9672749Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/progress.make 2025-08-14T21:28:31.9686078Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o.d 2025-08-14T21:28:31.9842771Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o 2025-08-14T21:28:31.9844632Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/ 2025-08-14T21:28:31.9845214Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.ts 2025-08-14T21:28:31.9849775Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.make 2025-08-14T21:28:31.9853856Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/depend.make 2025-08-14T21:28:31.9855879Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/link.txt 2025-08-14T21:28:31.9856464Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/cmake_clean.cmake 2025-08-14T21:28:31.9860354Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/build.make 2025-08-14T21:28:31.9861235Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/DependInfo.cmake 2025-08-14T21:28:31.9861706Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/flags.make 2025-08-14T21:28:31.9866086Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/progress.make 2025-08-14T21:28:31.9870783Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o.d 2025-08-14T21:28:31.9927877Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o 2025-08-14T21:28:31.9931366Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-08-14T21:28:31.9935810Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/TargetDirectories.txt 2025-08-14T21:28:31.9937857Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/progress.marks 2025-08-14T21:28:31.9938384Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile2 2025-08-14T21:28:31.9942535Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile.cmake 2025-08-14T21:28:31.9946199Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/InstallScripts.json 2025-08-14T21:28:31.9948010Z inflating: build/custom_test_artifacts/custom-op-build/CMakeCache.txt 2025-08-14T21:28:31.9948470Z inflating: build/custom_test_artifacts/custom-op-build/Makefile 2025-08-14T21:28:31.9952602Z inflating: build/custom_test_artifacts/custom-op-build/cmake_install.cmake 2025-08-14T21:28:32.0068384Z inflating: build/custom_test_artifacts/custom-op-build/libcustom_ops.so 2025-08-14T21:28:32.0114078Z inflating: build/custom_test_artifacts/custom-op-build/test_custom_ops 2025-08-14T21:28:32.0115717Z creating: build/custom_test_artifacts/jit-hook-build/ 2025-08-14T21:28:32.0116043Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/ 2025-08-14T21:28:32.0116403Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/pkgRedirects/ 2025-08-14T21:28:32.0116900Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeConfigureLog.yaml 2025-08-14T21:28:32.0120278Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/ 2025-08-14T21:28:32.0124455Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CMakeSystem.cmake 2025-08-14T21:28:32.0127876Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CompilerIdC/ 2025-08-14T21:28:32.0132130Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CompilerIdC/tmp/ 2025-08-14T21:28:32.0135512Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CompilerIdC/CMakeCCompilerId.c 2025-08-14T21:28:32.0139370Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CompilerIdC/a.out 2025-08-14T21:28:32.0139924Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CMakeCCompiler.cmake 2025-08-14T21:28:32.0141902Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CompilerIdCXX/ 2025-08-14T21:28:32.0142321Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CompilerIdCXX/tmp/ 2025-08-14T21:28:32.0142766Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-08-14T21:28:32.0143221Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CompilerIdCXX/a.out 2025-08-14T21:28:32.0143633Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CMakeCXXCompiler.cmake 2025-08-14T21:28:32.0144079Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CMakeDetermineCompilerABI_C.bin 2025-08-14T21:28:32.0144542Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/4.0.0/CMakeDetermineCompilerABI_CXX.bin 2025-08-14T21:28:32.0145036Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeScratch/ 2025-08-14T21:28:32.0145490Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/cmake.check_cache 2025-08-14T21:28:32.0145863Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/ 2025-08-14T21:28:32.0146265Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.ts 2025-08-14T21:28:32.0146726Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.make 2025-08-14T21:28:32.0147168Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/depend.make 2025-08-14T21:28:32.0147582Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/link.txt 2025-08-14T21:28:32.0148000Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/cmake_clean.cmake 2025-08-14T21:28:32.0148430Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/build.make 2025-08-14T21:28:32.0148868Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/DependInfo.cmake 2025-08-14T21:28:32.0149288Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/flags.make 2025-08-14T21:28:32.0149703Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/progress.make 2025-08-14T21:28:32.0150149Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o.d 2025-08-14T21:28:32.0199977Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o 2025-08-14T21:28:32.0202338Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-08-14T21:28:32.0207103Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/TargetDirectories.txt 2025-08-14T21:28:32.0210575Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/progress.marks 2025-08-14T21:28:32.0214411Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile2 2025-08-14T21:28:32.0214983Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile.cmake 2025-08-14T21:28:32.0215856Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/InstallScripts.json 2025-08-14T21:28:32.0216277Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeCache.txt 2025-08-14T21:28:32.0216594Z inflating: build/custom_test_artifacts/jit-hook-build/Makefile 2025-08-14T21:28:32.0216905Z inflating: build/custom_test_artifacts/jit-hook-build/cmake_install.cmake 2025-08-14T21:28:32.0235089Z inflating: build/custom_test_artifacts/jit-hook-build/test_jit_hooks 2025-08-14T21:28:32.0235598Z creating: build/custom_test_artifacts/custom-backend-build/ 2025-08-14T21:28:32.0236045Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/ 2025-08-14T21:28:32.0236620Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/pkgRedirects/ 2025-08-14T21:28:32.0237042Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeConfigureLog.yaml 2025-08-14T21:28:32.0237440Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/ 2025-08-14T21:28:32.0237834Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CMakeSystem.cmake 2025-08-14T21:28:32.0238246Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CompilerIdC/ 2025-08-14T21:28:32.0238658Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CompilerIdC/tmp/ 2025-08-14T21:28:32.0243044Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CompilerIdC/CMakeCCompilerId.c 2025-08-14T21:28:32.0245056Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CompilerIdC/a.out 2025-08-14T21:28:32.0249260Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CMakeCCompiler.cmake 2025-08-14T21:28:32.0253342Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CompilerIdCXX/ 2025-08-14T21:28:32.0257193Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CompilerIdCXX/tmp/ 2025-08-14T21:28:32.0259020Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-08-14T21:28:32.0259663Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CompilerIdCXX/a.out 2025-08-14T21:28:32.0260242Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CMakeCXXCompiler.cmake 2025-08-14T21:28:32.0260722Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CMakeDetermineCompilerABI_C.bin 2025-08-14T21:28:32.0261215Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/4.0.0/CMakeDetermineCompilerABI_CXX.bin 2025-08-14T21:28:32.0261675Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeScratch/ 2025-08-14T21:28:32.0262067Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/cmake.check_cache 2025-08-14T21:28:32.0262474Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/ 2025-08-14T21:28:32.0262904Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.ts 2025-08-14T21:28:32.0263394Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.make 2025-08-14T21:28:32.0263862Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/depend.make 2025-08-14T21:28:32.0264302Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/link.txt 2025-08-14T21:28:32.0264748Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/cmake_clean.cmake 2025-08-14T21:28:32.0265320Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/build.make 2025-08-14T21:28:32.0265789Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/DependInfo.cmake 2025-08-14T21:28:32.0266251Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/flags.make 2025-08-14T21:28:32.0266698Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/progress.make 2025-08-14T21:28:32.0267189Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o.d 2025-08-14T21:28:32.0354315Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o 2025-08-14T21:28:32.0357736Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/ 2025-08-14T21:28:32.0359679Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.ts 2025-08-14T21:28:32.0360344Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.make 2025-08-14T21:28:32.0363932Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/depend.make 2025-08-14T21:28:32.0367400Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/link.txt 2025-08-14T21:28:32.0371727Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/cmake_clean.cmake 2025-08-14T21:28:32.0376009Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/build.make 2025-08-14T21:28:32.0379875Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/DependInfo.cmake 2025-08-14T21:28:32.0380769Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/flags.make 2025-08-14T21:28:32.0381292Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/progress.make 2025-08-14T21:28:32.0381816Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o.d 2025-08-14T21:28:32.0418450Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o 2025-08-14T21:28:32.0422358Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-08-14T21:28:32.0423021Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/TargetDirectories.txt 2025-08-14T21:28:32.0423465Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/progress.marks 2025-08-14T21:28:32.0423865Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile2 2025-08-14T21:28:32.0424260Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile.cmake 2025-08-14T21:28:32.0424669Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/InstallScripts.json 2025-08-14T21:28:32.0425144Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeCache.txt 2025-08-14T21:28:32.0425482Z inflating: build/custom_test_artifacts/custom-backend-build/Makefile 2025-08-14T21:28:32.0425811Z inflating: build/custom_test_artifacts/custom-backend-build/cmake_install.cmake 2025-08-14T21:28:32.0507431Z inflating: build/custom_test_artifacts/custom-backend-build/libcustom_backend.so 2025-08-14T21:28:32.0539302Z inflating: build/custom_test_artifacts/custom-backend-build/test_custom_backend 2025-08-14T21:28:32.0539765Z creating: build/lib/ 2025-08-14T21:28:32.0609291Z inflating: build/lib/libprotobuf-lite.a 2025-08-14T21:28:32.0984803Z inflating: build/lib/libprotobuf.a 2025-08-14T21:28:32.1404310Z inflating: build/lib/libprotoc.a 2025-08-14T21:28:32.1412677Z inflating: build/lib/libpthreadpool.a 2025-08-14T21:28:32.1419371Z inflating: build/lib/libcpuinfo.a 2025-08-14T21:28:32.1425080Z inflating: build/lib/libcpuinfo_internals.a 2025-08-14T21:28:32.1425353Z inflating: build/lib/libclog.a 2025-08-14T21:28:32.1442672Z inflating: build/lib/libpytorch_qnnpack.a 2025-08-14T21:28:32.1446267Z inflating: build/lib/libnnpack_reference_layers.a 2025-08-14T21:28:32.1603137Z inflating: build/lib/libmicrokernels-prod.a 2025-08-14T21:28:32.1618895Z inflating: build/lib/libnnpack.a 2025-08-14T21:28:32.2362036Z inflating: build/lib/libmicrokernels-all.a 2025-08-14T21:28:32.2422210Z inflating: build/lib/libgtest.a 2025-08-14T21:28:32.2436344Z inflating: build/lib/libgmock.a 2025-08-14T21:28:32.2439915Z inflating: build/lib/libgtest_main.a 2025-08-14T21:28:32.2444122Z inflating: build/lib/libgmock_main.a 2025-08-14T21:28:32.2514176Z inflating: build/lib/libXNNPACK.a 2025-08-14T21:28:32.2578180Z inflating: build/lib/libbenchmark.a 2025-08-14T21:28:32.2580053Z inflating: build/lib/libbenchmark_main.a 2025-08-14T21:28:32.2580453Z inflating: build/lib/libjitprofiling.a 2025-08-14T21:28:32.2583402Z inflating: build/lib/libittnotify.a 2025-08-14T21:28:32.2641662Z inflating: build/lib/libasmjit.a 2025-08-14T21:28:32.3611293Z inflating: build/lib/libfbgemm.a 2025-08-14T21:28:32.3637209Z inflating: build/lib/libtensorpipe_uv.a 2025-08-14T21:28:32.4099125Z inflating: build/lib/libtensorpipe.a 2025-08-14T21:28:32.4201722Z inflating: build/lib/libgloo.a 2025-08-14T21:28:32.4240953Z inflating: build/lib/libonnx_proto.a 2025-08-14T21:28:32.4842140Z inflating: build/lib/libonnx.a 2025-08-14T21:28:33.3334746Z inflating: build/lib/libdnnl.a 2025-08-14T21:28:33.3350682Z inflating: build/lib/libfmt.a 2025-08-14T21:28:33.3573225Z inflating: build/lib/libkineto.a 2025-08-14T21:28:33.3667286Z inflating: build/lib/libc10.so 2025-08-14T21:28:33.3669307Z inflating: build/lib/libtorch_global_deps.so 2025-08-14T21:28:35.9047111Z inflating: build/lib/libtorch_cpu.so 2025-08-14T21:28:35.9051172Z inflating: build/lib/libtorch.so 2025-08-14T21:28:35.9108280Z inflating: build/lib/libtorchbind_test.so 2025-08-14T21:28:35.9124005Z inflating: build/lib/libjitbackend_test.so 2025-08-14T21:28:35.9144558Z inflating: build/lib/libbackend_with_compiler.so 2025-08-14T21:28:35.9167213Z inflating: build/lib/libaoti_custom_ops.so 2025-08-14T21:28:35.9171636Z inflating: build/lib/libshm.so 2025-08-14T21:28:36.0857659Z inflating: build/lib/libtorch_python.so 2025-08-14T21:28:36.0887187Z inflating: build/lib/libnnapi_backend.so 2025-08-14T21:28:36.0889100Z creating: build/bin/ 2025-08-14T21:28:36.0892992Z creating: build/bin/CMakeFiles/ 2025-08-14T21:28:36.0896649Z inflating: build/bin/cmake_install.cmake 2025-08-14T21:28:36.0900353Z inflating: build/bin/CTestTestfile.cmake 2025-08-14T21:28:36.1281500Z inflating: build/bin/protoc-3.13.0.0 2025-08-14T21:28:36.1672994Z inflating: build/bin/protoc 2025-08-14T21:28:36.1723877Z inflating: build/bin/c10_AllocatorConfig_test 2025-08-14T21:28:36.1771451Z inflating: build/bin/c10_CompileTimeFunctionPointer_test 2025-08-14T21:28:36.1820623Z inflating: build/bin/c10_DeviceGuard_test 2025-08-14T21:28:36.1869809Z inflating: build/bin/c10_Device_test 2025-08-14T21:28:36.1925687Z inflating: build/bin/c10_DispatchKeySet_test 2025-08-14T21:28:36.1977330Z inflating: build/bin/c10_Scalar_test 2025-08-14T21:28:36.2023826Z inflating: build/bin/c10_StreamGuard_test 2025-08-14T21:28:36.2075402Z inflating: build/bin/c10_InlineDeviceGuard_test 2025-08-14T21:28:36.2124228Z inflating: build/bin/c10_SymInt_test 2025-08-14T21:28:36.2170663Z inflating: build/bin/c10_ConstexprCrc_test 2025-08-14T21:28:36.2223574Z inflating: build/bin/c10_SizesAndStrides_test 2025-08-14T21:28:36.2276927Z inflating: build/bin/c10_InlineStreamGuard_test 2025-08-14T21:28:36.2323823Z inflating: build/bin/c10_ArrayRef_test 2025-08-14T21:28:36.2389900Z inflating: build/bin/c10_cow_test 2025-08-14T21:28:36.2439921Z inflating: build/bin/c10_Bitset_test 2025-08-14T21:28:36.2487401Z inflating: build/bin/c10_DeadlockDetection_test 2025-08-14T21:28:36.2540394Z inflating: build/bin/c10_Enumerate_test 2025-08-14T21:28:36.2591227Z inflating: build/bin/c10_IntrusiveList_test 2025-08-14T21:28:36.2639402Z inflating: build/bin/c10_Half_test 2025-08-14T21:28:36.2692005Z inflating: build/bin/c10_Metaprogramming_test 2025-08-14T21:28:36.2745057Z inflating: build/bin/c10_LeftRight_test 2025-08-14T21:28:36.2795102Z inflating: build/bin/c10_NetworkFlow_test 2025-08-14T21:28:36.2842424Z inflating: build/bin/c10_Semaphore_test 2025-08-14T21:28:36.2890440Z inflating: build/bin/c10_Synchronized_test 2025-08-14T21:28:36.2939907Z inflating: build/bin/c10_TypeIndex_test 2025-08-14T21:28:36.2992800Z inflating: build/bin/c10_ThreadLocal_test 2025-08-14T21:28:36.3040930Z inflating: build/bin/c10_TypeList_test 2025-08-14T21:28:36.3087988Z inflating: build/bin/c10_TypeTraits_test 2025-08-14T21:28:36.3136543Z inflating: build/bin/c10_accumulate_test 2025-08-14T21:28:36.3189638Z inflating: build/bin/c10_bfloat16_test 2025-08-14T21:28:36.3237927Z inflating: build/bin/c10_bit_cast_test 2025-08-14T21:28:36.3291685Z inflating: build/bin/c10_complex_test 2025-08-14T21:28:36.3343559Z inflating: build/bin/c10_complex_math_test 2025-08-14T21:28:36.3389984Z inflating: build/bin/c10_error_test 2025-08-14T21:28:36.3440021Z inflating: build/bin/c10_exception_test 2025-08-14T21:28:36.3488855Z inflating: build/bin/c10_flags_test 2025-08-14T21:28:36.3537237Z inflating: build/bin/c10_generic_math_test 2025-08-14T21:28:36.3585377Z inflating: build/bin/c10_irange_test 2025-08-14T21:28:36.3729866Z inflating: build/bin/c10_intrusive_ptr_test 2025-08-14T21:28:36.3780586Z inflating: build/bin/c10_lazy_test 2025-08-14T21:28:36.3835123Z inflating: build/bin/c10_logging_test 2025-08-14T21:28:36.3904404Z inflating: build/bin/c10_optional_test 2025-08-14T21:28:36.3962496Z inflating: build/bin/c10_ordered_preserving_dict_test 2025-08-14T21:28:36.4013151Z inflating: build/bin/c10_registry_test 2025-08-14T21:28:36.4150478Z inflating: build/bin/c10_small_vector_test 2025-08-14T21:28:36.4199720Z inflating: build/bin/c10_ssize_test 2025-08-14T21:28:36.4253286Z inflating: build/bin/c10_string_util_test 2025-08-14T21:28:36.4299821Z inflating: build/bin/c10_string_view_test 2025-08-14T21:28:36.4348208Z inflating: build/bin/c10_tempfile_test 2025-08-14T21:28:36.4390019Z inflating: build/bin/c10_intrusive_ptr_benchmark 2025-08-14T21:28:36.4443329Z inflating: build/bin/c10_typeid_test 2025-08-14T21:28:36.4950567Z inflating: build/bin/vec_test_all_types_DEFAULT 2025-08-14T21:28:36.5474806Z inflating: build/bin/vec_test_all_types_AVX512 2025-08-14T21:28:36.6004906Z inflating: build/bin/vec_test_all_types_AVX2 2025-08-14T21:28:36.6055831Z inflating: build/bin/static_runtime_bench 2025-08-14T21:28:36.6277378Z inflating: build/bin/static_runtime_test 2025-08-14T21:28:36.6346226Z inflating: build/bin/Dict_test 2025-08-14T21:28:36.6396112Z inflating: build/bin/Dimname_test 2025-08-14T21:28:36.6456581Z inflating: build/bin/MaybeOwned_test 2025-08-14T21:28:36.6510633Z inflating: build/bin/NamedTensor_test 2025-08-14T21:28:36.6565475Z inflating: build/bin/apply_utils_test 2025-08-14T21:28:36.6620210Z inflating: build/bin/atest 2025-08-14T21:28:36.6681171Z inflating: build/bin/basic 2025-08-14T21:28:36.6733764Z inflating: build/bin/broadcast_test 2025-08-14T21:28:36.6782459Z inflating: build/bin/cpu_allocator_test 2025-08-14T21:28:36.6837387Z inflating: build/bin/cpu_generator_test 2025-08-14T21:28:36.6887454Z inflating: build/bin/cpu_profiling_allocator_test 2025-08-14T21:28:36.6972115Z inflating: build/bin/cpu_rng_test 2025-08-14T21:28:36.7020041Z inflating: build/bin/dlconvertor_test 2025-08-14T21:28:36.7074666Z inflating: build/bin/extension_backend_test 2025-08-14T21:28:36.7126770Z inflating: build/bin/half_test 2025-08-14T21:28:36.7215599Z inflating: build/bin/ivalue_test 2025-08-14T21:28:36.7262460Z inflating: build/bin/lazy_tensor_test 2025-08-14T21:28:36.7313208Z inflating: build/bin/math_kernel_test 2025-08-14T21:28:36.7364342Z inflating: build/bin/memory_format_test 2025-08-14T21:28:36.7415109Z inflating: build/bin/memory_overlapping_test 2025-08-14T21:28:36.7465444Z inflating: build/bin/mobile_memory_cleanup 2025-08-14T21:28:36.7518709Z inflating: build/bin/native_test 2025-08-14T21:28:36.7567201Z inflating: build/bin/operator_name_test 2025-08-14T21:28:36.7616001Z inflating: build/bin/operators_test 2025-08-14T21:28:36.7665684Z inflating: build/bin/packedtensoraccessor_test 2025-08-14T21:28:36.7728134Z inflating: build/bin/pow_test 2025-08-14T21:28:36.7782749Z inflating: build/bin/quantized_test 2025-08-14T21:28:36.7830418Z inflating: build/bin/reduce_ops_test 2025-08-14T21:28:36.7878713Z inflating: build/bin/reportMemoryUsage_test 2025-08-14T21:28:36.7932321Z inflating: build/bin/scalar_tensor_test 2025-08-14T21:28:36.7987648Z inflating: build/bin/scalar_test 2025-08-14T21:28:36.8036155Z inflating: build/bin/StorageUtils_test 2025-08-14T21:28:36.8085879Z inflating: build/bin/stride_properties_test 2025-08-14T21:28:36.8158218Z inflating: build/bin/tensor_iterator_test 2025-08-14T21:28:36.8209749Z inflating: build/bin/test_parallel 2025-08-14T21:28:36.8258308Z inflating: build/bin/thread_init_test 2025-08-14T21:28:36.8310306Z inflating: build/bin/type_ptr_test 2025-08-14T21:28:36.8365770Z inflating: build/bin/type_test 2025-08-14T21:28:36.8415806Z inflating: build/bin/undefined_tensor_test 2025-08-14T21:28:36.8463385Z inflating: build/bin/verify_api_visibility 2025-08-14T21:28:36.8528316Z inflating: build/bin/legacy_vmap_test 2025-08-14T21:28:36.8576510Z inflating: build/bin/weakref_test 2025-08-14T21:28:36.8625574Z inflating: build/bin/wrapdim_test 2025-08-14T21:28:36.8674614Z inflating: build/bin/xla_tensor_test 2025-08-14T21:28:36.8730563Z inflating: build/bin/IListRef_test 2025-08-14T21:28:36.8826261Z inflating: build/bin/List_test 2025-08-14T21:28:36.8887966Z inflating: build/bin/KernelFunction_test 2025-08-14T21:28:36.8996935Z inflating: build/bin/kernel_function_legacy_test 2025-08-14T21:28:36.9084991Z inflating: build/bin/kernel_function_test 2025-08-14T21:28:36.9197634Z inflating: build/bin/kernel_lambda_legacy_test 2025-08-14T21:28:36.9289736Z inflating: build/bin/kernel_lambda_test 2025-08-14T21:28:36.9347438Z inflating: build/bin/kernel_stackbased_test 2025-08-14T21:28:36.9434162Z inflating: build/bin/make_boxed_from_unboxed_functor_test 2025-08-14T21:28:36.9482865Z inflating: build/bin/CppSignature_test 2025-08-14T21:28:36.9534911Z inflating: build/bin/backend_fallback_test 2025-08-14T21:28:36.9581613Z inflating: build/bin/op_allowlist_test 2025-08-14T21:28:36.9852441Z inflating: build/bin/op_registration_test 2025-08-14T21:28:36.9915555Z inflating: build/bin/inline_container_test 2025-08-14T21:28:37.0877777Z inflating: build/bin/test_jit 2025-08-14T21:28:37.0927774Z inflating: build/bin/BackoffTest 2025-08-14T21:28:37.1213156Z inflating: build/bin/test_nativert 2025-08-14T21:28:37.1266728Z inflating: build/bin/TCPStoreTest 2025-08-14T21:28:37.1317661Z inflating: build/bin/FileStoreTest 2025-08-14T21:28:37.1368498Z inflating: build/bin/HashStoreTest 2025-08-14T21:28:37.1370659Z inflating: build/bin/example_allreduce 2025-08-14T21:28:37.1423388Z inflating: build/bin/test_dist_autograd 2025-08-14T21:28:37.1485529Z inflating: build/bin/ProcessGroupGlooTest 2025-08-14T21:28:37.1548216Z inflating: build/bin/test_cpp_rpc 2025-08-14T21:28:37.1552581Z inflating: build/bin/parallel_benchmark 2025-08-14T21:28:37.2535677Z inflating: build/bin/test_api 2025-08-14T21:28:37.2834589Z inflating: build/bin/test_lazy 2025-08-14T21:28:37.2836614Z inflating: build/bin/torch_shm_manager 2025-08-14T21:28:37.2839321Z creating: .additional_ci_files/ 2025-08-14T21:28:37.2905136Z inflating: .additional_ci_files/test-times.json 2025-08-14T21:28:37.3163653Z inflating: .additional_ci_files/test-class-times.json 2025-08-14T21:28:37.3247726Z ##[group]Run rm artifacts.zip 2025-08-14T21:28:37.3247933Z rm artifacts.zip 2025-08-14T21:28:37.3252327Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:37.3252552Z env: 2025-08-14T21:28:37.3252701Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:37.3252867Z ##[endgroup] 2025-08-14T21:28:37.3808141Z ##[group]Run df -H 2025-08-14T21:28:37.3808310Z df -H 2025-08-14T21:28:37.3812223Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:37.3812556Z env: 2025-08-14T21:28:37.3812707Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:37.3812876Z ##[endgroup] 2025-08-14T21:28:37.3849819Z Filesystem Size Used Avail Use% Mounted on 2025-08-14T21:28:37.3851811Z devtmpfs 4.2M 0 4.2M 0% /dev 2025-08-14T21:28:37.3852133Z tmpfs 67G 0 67G 0% /dev/shm 2025-08-14T21:28:37.3857914Z tmpfs 27G 791k 27G 1% /run 2025-08-14T21:28:37.3859321Z /dev/nvme0n1p1 215G 69G 147G 32% / 2025-08-14T21:28:37.3859588Z tmpfs 67G 13k 67G 1% /tmp 2025-08-14T21:28:37.3859811Z /dev/nvme0n1p128 11M 1.4M 9.2M 13% /boot/efi 2025-08-14T21:28:37.3879906Z Prepare all required actions 2025-08-14T21:28:37.3880578Z Getting action download info 2025-08-14T21:28:37.5362870Z ##[group]Run ./.github/actions/download-td-artifacts 2025-08-14T21:28:37.5363090Z with: 2025-08-14T21:28:37.5363222Z env: 2025-08-14T21:28:37.5363364Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:37.5363524Z ##[endgroup] 2025-08-14T21:28:37.5491976Z ##[group]Run seemethere/download-artifact-s3@v4 2025-08-14T21:28:37.5492182Z with: 2025-08-14T21:28:37.5492324Z name: td_results 2025-08-14T21:28:37.5492483Z s3-bucket: gha-artifacts 2025-08-14T21:28:37.5492643Z region: us-east-1 2025-08-14T21:28:37.5492789Z env: 2025-08-14T21:28:37.5492927Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:37.5493082Z ##[endgroup] 2025-08-14T21:28:37.8636516Z (node:47865) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-08-14T21:28:37.8638136Z 2025-08-14T21:28:37.8643385Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-08-14T21:28:37.8647590Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-08-14T21:28:37.8649744Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-08-14T21:28:37.9461770Z Found 0 objects with prefix pytorch/pytorch/16976255153/td_results/ 2025-08-14T21:28:37.9468242Z Artifact download has finished successfully 2025-08-14T21:28:37.9788726Z ##[group]Run mkdir -p .additional_ci_files 2025-08-14T21:28:37.9788974Z mkdir -p .additional_ci_files 2025-08-14T21:28:37.9789233Z mv td_results.json .additional_ci_files/td_results.json || true 2025-08-14T21:28:37.9793721Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:37.9793944Z env: 2025-08-14T21:28:37.9794092Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:37.9794252Z ##[endgroup] 2025-08-14T21:28:37.9840780Z mv: cannot stat 'td_results.json': No such file or directory 2025-08-14T21:28:37.9936784Z ##[group]Run .github/scripts/parse_ref.py 2025-08-14T21:28:37.9937020Z .github/scripts/parse_ref.py 2025-08-14T21:28:37.9940685Z shell: /usr/bin/bash -e {0} 2025-08-14T21:28:37.9940861Z env: 2025-08-14T21:28:37.9941008Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:37.9941174Z ##[endgroup] 2025-08-14T21:28:38.0691129Z Setting output branch=main 2025-08-14T21:28:38.0776165Z Prepare all required actions 2025-08-14T21:28:38.0776435Z Getting action download info 2025-08-14T21:28:38.2107787Z ##[group]Run ./.github/actions/filter-test-configs 2025-08-14T21:28:38.2108000Z with: 2025-08-14T21:28:38.2108295Z github-token: *** 2025-08-14T21:28:38.2109764Z test-matrix: {"include": [{"config": "cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_huggingface", "shard": 1, "num_shards": 1, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "inductor_torchbench_cpu_smoketest_perf", "shard": 1, "num_shards": 1, "runner": "linux.24xl.spr-metal"}]} 2025-08-14T21:28:38.2111472Z job-name: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:28:38.2111803Z env: 2025-08-14T21:28:38.2111937Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:38.2112099Z ##[endgroup] 2025-08-14T21:28:38.2239802Z ##[group]Run nick-fields/retry@v3.0.0 2025-08-14T21:28:38.2240009Z with: 2025-08-14T21:28:38.2240151Z shell: bash 2025-08-14T21:28:38.2240297Z timeout_minutes: 10 2025-08-14T21:28:38.2240448Z max_attempts: 5 2025-08-14T21:28:38.2240601Z retry_wait_seconds: 30 2025-08-14T21:28:38.2241053Z command: set -eux # PyYAML 6.0 doesn't work with MacOS x86 anymore # This must run on Python-3.7 (AmazonLinux2) so can't use request=3.32.2 python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-08-14T21:28:38.2241497Z polling_interval_seconds: 1 2025-08-14T21:28:38.2241720Z warning_on_retry: true 2025-08-14T21:28:38.2241935Z continue_on_error: false 2025-08-14T21:28:38.2242116Z env: 2025-08-14T21:28:38.2242271Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:38.2242595Z GITHUB_TOKEN: *** 2025-08-14T21:28:38.2242749Z ##[endgroup] 2025-08-14T21:28:38.3277736Z + python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-08-14T21:28:38.4889928Z Defaulting to user installation because normal site-packages is not writeable 2025-08-14T21:28:39.1134423Z Collecting requests==2.27.1 2025-08-14T21:28:39.1274298Z Downloading requests-2.27.1-py2.py3-none-any.whl (63 kB) 2025-08-14T21:28:39.3168895Z Collecting pyyaml==6.0.2 2025-08-14T21:28:39.3201925Z Downloading PyYAML-6.0.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (737 kB) 2025-08-14T21:28:39.3823905Z Requirement already satisfied: urllib3<1.27,>=1.21.1 in /usr/lib/python3.9/site-packages (from requests==2.27.1) (1.25.10) 2025-08-14T21:28:39.3831640Z Requirement already satisfied: idna<4,>=2.5 in /usr/lib/python3.9/site-packages (from requests==2.27.1) (2.10) 2025-08-14T21:28:39.6636435Z Collecting charset-normalizer~=2.0.0 2025-08-14T21:28:39.6668519Z Downloading charset_normalizer-2.0.12-py3-none-any.whl (39 kB) 2025-08-14T21:28:39.7860096Z Collecting certifi>=2017.4.17 2025-08-14T21:28:39.7898708Z Downloading certifi-2025.8.3-py3-none-any.whl (161 kB) 2025-08-14T21:28:39.8978357Z Installing collected packages: charset-normalizer, certifi, requests, pyyaml 2025-08-14T21:28:40.3061390Z Successfully installed certifi-2025.8.3 charset-normalizer-2.0.12 pyyaml-6.0.2 requests-2.27.1 2025-08-14T21:28:41.2838111Z Command completed after 1 attempt(s). 2025-08-14T21:28:41.2979287Z ##[group]Run set -x 2025-08-14T21:28:41.2979469Z set -x 2025-08-14T21:28:41.2979613Z  2025-08-14T21:28:41.2979846Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-08-14T21:28:41.2980114Z # in runner workspace 2025-08-14T21:28:41.2980499Z python3 "${GITHUB_ACTION_PATH}/../../scripts/parse_ref.py" 2025-08-14T21:28:41.2985777Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:41.2986010Z env: 2025-08-14T21:28:41.2986170Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:41.2986328Z ##[endgroup] 2025-08-14T21:28:41.3010326Z + python3 /home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/parse_ref.py 2025-08-14T21:28:41.3143018Z Setting output branch=main 2025-08-14T21:28:41.3259284Z ##[group]Run echo "Workflow: ${GITHUB_WORKFLOW}" 2025-08-14T21:28:41.3259543Z echo "Workflow: ${GITHUB_WORKFLOW}" 2025-08-14T21:28:41.3259755Z echo "Job name: ${JOB_NAME}" 2025-08-14T21:28:41.3259930Z  2025-08-14T21:28:41.3260159Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-08-14T21:28:41.3260438Z # in runner workspace 2025-08-14T21:28:41.3260709Z python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2025-08-14T21:28:41.3260984Z  --workflow "${GITHUB_WORKFLOW}" \ 2025-08-14T21:28:41.3261334Z  --job-name "${JOB_NAME}" \ 2025-08-14T21:28:41.3262874Z  --test-matrix "{"include": [{"config": "cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_huggingface", "shard": 1, "num_shards": 1, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "inductor_torchbench_cpu_smoketest_perf", "shard": 1, "num_shards": 1, "runner": "linux.24xl.spr-metal"}]}" \ 2025-08-14T21:28:41.3264450Z  --selected-test-configs "" \ 2025-08-14T21:28:41.3264663Z  --pr-number "${PR_NUMBER}" \ 2025-08-14T21:28:41.3264860Z  --tag "${TAG}" \ 2025-08-14T21:28:41.3265153Z  --event-name "${EVENT_NAME}" \ 2025-08-14T21:28:41.3265357Z  --schedule "${SCHEDULE}" \ 2025-08-14T21:28:41.3265554Z  --branch "${HEAD_BRANCH}" 2025-08-14T21:28:41.3269154Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:41.3269369Z env: 2025-08-14T21:28:41.3269510Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:41.3269936Z GITHUB_TOKEN: *** 2025-08-14T21:28:41.3270261Z JOB_NAME: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:28:41.3270593Z PR_NUMBER: 2025-08-14T21:28:41.3270734Z TAG: 2025-08-14T21:28:41.3270861Z EVENT_NAME: push 2025-08-14T21:28:41.3271010Z SCHEDULE: 2025-08-14T21:28:41.3271156Z HEAD_BRANCH: main 2025-08-14T21:28:41.3271294Z ##[endgroup] 2025-08-14T21:28:41.3292887Z Workflow: inductor 2025-08-14T21:28:41.3297530Z Job name: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:28:41.4896610Z Setting output keep-going=True 2025-08-14T21:28:41.4899907Z Setting output ci-verbose-test-logs=False 2025-08-14T21:28:41.4901963Z Setting output ci-test-showlocals=False 2025-08-14T21:28:41.4906588Z Setting output ci-no-test-timeout=False 2025-08-14T21:28:41.4908717Z Setting output ci-no-td=False 2025-08-14T21:28:41.4909053Z Setting output ci-td-distributed=False 2025-08-14T21:28:41.4913850Z Setting output is-unstable=False 2025-08-14T21:28:41.4917964Z Setting output reenabled-issues= 2025-08-14T21:28:41.4923757Z Setting output test-matrix={"include": [{"config": "cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_huggingface", "shard": 1, "num_shards": 1, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "inductor_torchbench_cpu_smoketest_perf", "shard": 1, "num_shards": 1, "runner": "linux.24xl.spr-metal"}]} 2025-08-14T21:28:41.4925391Z Setting output is-test-matrix-empty=False 2025-08-14T21:28:41.5168021Z ##[group]Run echo "Filtered matrix:" 2025-08-14T21:28:41.5168249Z echo "Filtered matrix:" 2025-08-14T21:28:41.5169757Z echo "{"include": [{"config": "cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_huggingface", "shard": 1, "num_shards": 1, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_timm", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 1, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "dynamic_cpu_inductor_torchbench", "shard": 2, "num_shards": 2, "runner": "linux.8xlarge.amx"}, {"config": "inductor_torchbench_cpu_smoketest_perf", "shard": 1, "num_shards": 1, "runner": "linux.24xl.spr-metal"}]}" 2025-08-14T21:28:41.5171370Z  2025-08-14T21:28:41.5171511Z echo 2025-08-14T21:28:41.5171687Z echo "Is the current job unstable? False" 2025-08-14T21:28:41.5171901Z  2025-08-14T21:28:41.5172051Z echo 2025-08-14T21:28:41.5172249Z echo "Is keep-going label set? True" 2025-08-14T21:28:41.5172449Z  2025-08-14T21:28:41.5172580Z echo 2025-08-14T21:28:41.5172725Z echo "Reenabled issues? " 2025-08-14T21:28:41.5176685Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:41.5176909Z env: 2025-08-14T21:28:41.5177047Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:41.5177212Z ##[endgroup] 2025-08-14T21:28:41.5198671Z Filtered matrix: 2025-08-14T21:28:41.5203525Z {include: [{config: cpu_inductor_torchbench, shard: 1, num_shards: 2, runner: linux.8xlarge.amx}, {config: cpu_inductor_torchbench, shard: 2, num_shards: 2, runner: linux.8xlarge.amx}, {config: dynamic_cpu_inductor_huggingface, shard: 1, num_shards: 1, runner: linux.8xlarge.amx}, {config: dynamic_cpu_inductor_timm, shard: 1, num_shards: 2, runner: linux.8xlarge.amx}, {config: dynamic_cpu_inductor_timm, shard: 2, num_shards: 2, runner: linux.8xlarge.amx}, {config: dynamic_cpu_inductor_torchbench, shard: 1, num_shards: 2, runner: linux.8xlarge.amx}, {config: dynamic_cpu_inductor_torchbench, shard: 2, num_shards: 2, runner: linux.8xlarge.amx}, {config: inductor_torchbench_cpu_smoketest_perf, shard: 1, num_shards: 1, runner: linux.24xl.spr-metal}]} 2025-08-14T21:28:41.5204956Z 2025-08-14T21:28:41.5205060Z Is the current job unstable? False 2025-08-14T21:28:41.5205188Z 2025-08-14T21:28:41.5205269Z Is keep-going label set? True 2025-08-14T21:28:41.5205388Z 2025-08-14T21:28:41.5205450Z Reenabled issues? 2025-08-14T21:28:41.5325713Z ##[group]Run echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-08-14T21:28:41.5326046Z echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-08-14T21:28:41.5329569Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:41.5329790Z env: 2025-08-14T21:28:41.5329927Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:41.5330094Z JOB_TIMEOUT: 240 2025-08-14T21:28:41.5330245Z ##[endgroup] 2025-08-14T21:28:41.5473692Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-08-14T21:28:41.5474013Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-08-14T21:28:41.5474268Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-08-14T21:28:41.5477644Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:28:41.5477861Z env: 2025-08-14T21:28:41.5478005Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:41.5478161Z ##[endgroup] 2025-08-14T21:28:41.5629105Z ##[group]Run set -x 2025-08-14T21:28:41.5629334Z set -x 2025-08-14T21:28:41.5629470Z  2025-08-14T21:28:41.5629634Z if [[ $TEST_CONFIG == 'multigpu' ]]; then 2025-08-14T21:28:41.5629873Z  TEST_COMMAND=.ci/pytorch/multigpu-test.sh 2025-08-14T21:28:41.5630099Z elif [[ $BUILD_ENVIRONMENT == *onnx* ]]; then 2025-08-14T21:28:41.5630315Z  TEST_COMMAND=.ci/onnx/test.sh 2025-08-14T21:28:41.5630497Z else 2025-08-14T21:28:41.5630657Z  TEST_COMMAND=.ci/pytorch/test.sh 2025-08-14T21:28:41.5630832Z fi 2025-08-14T21:28:41.5631045Z  2025-08-14T21:28:41.5631208Z # Leaving 1GB for the runner and other things 2025-08-14T21:28:41.5631536Z TOTAL_AVAILABLE_MEMORY_IN_GB=$(awk '/MemTotal/ { printf "%.3f \n", $2/1024/1024 - 1 }' /proc/meminfo) 2025-08-14T21:28:41.5632038Z # https://docs.docker.com/engine/containers/resource_constraints/#--memory-swap-details, the 3GB swap 2025-08-14T21:28:41.5632435Z # comes from https://github.com/pytorch/test-infra/pull/6058 2025-08-14T21:28:41.5632732Z TOTAL_MEMORY_WITH_SWAP=$(("${TOTAL_AVAILABLE_MEMORY_IN_GB%.*}" + 3)) 2025-08-14T21:28:41.5632972Z  2025-08-14T21:28:41.5633146Z if [[ ${BUILD_ENVIRONMENT} == *"s390x"* ]]; then 2025-08-14T21:28:41.5633346Z  SHM_OPTS= 2025-08-14T21:28:41.5633504Z  JENKINS_USER= 2025-08-14T21:28:41.5633720Z  # ensure that docker container cleanly exits in 12 hours 2025-08-14T21:28:41.5633985Z  # if for some reason cleanup action doesn't stop container 2025-08-14T21:28:41.5634220Z  # when job is cancelled 2025-08-14T21:28:41.5634410Z  DOCKER_SHELL_CMD="sleep 12h" 2025-08-14T21:28:41.5634590Z else 2025-08-14T21:28:41.5634745Z  SHM_OPTS="--shm-size=${SHM_SIZE}" 2025-08-14T21:28:41.5634950Z  JENKINS_USER="--user jenkins" 2025-08-14T21:28:41.5635141Z  DOCKER_SHELL_CMD= 2025-08-14T21:28:41.5635296Z fi 2025-08-14T21:28:41.5635426Z  2025-08-14T21:28:41.5635626Z # detached container should get cleaned up by teardown_ec2_linux 2025-08-14T21:28:41.5635920Z # TODO: Stop building test binaries as part of the build phase 2025-08-14T21:28:41.5636255Z # Used for GPU_FLAG, SHM_OPTS, JENKINS_USER and DOCKER_SHELL_CMD since that doesn't play nice 2025-08-14T21:28:41.5636550Z # shellcheck disable=SC2086,SC2090 2025-08-14T21:28:41.5636748Z container_name=$(docker run \ 2025-08-14T21:28:41.5636930Z  ${GPU_FLAG:-} \ 2025-08-14T21:28:41.5637120Z  ${SCCACHE_SERVER_PORT_DOCKER_FLAG:-} \ 2025-08-14T21:28:41.5637325Z  -e BUILD_ENVIRONMENT \ 2025-08-14T21:28:41.5637500Z  -e PR_NUMBER \ 2025-08-14T21:28:41.5637671Z  -e GITHUB_ACTIONS \ 2025-08-14T21:28:41.5637849Z  -e GITHUB_REPOSITORY \ 2025-08-14T21:28:41.5638028Z  -e GITHUB_WORKFLOW \ 2025-08-14T21:28:41.5638196Z  -e GITHUB_JOB \ 2025-08-14T21:28:41.5638360Z  -e GITHUB_RUN_ID \ 2025-08-14T21:28:41.5638531Z  -e GITHUB_RUN_NUMBER \ 2025-08-14T21:28:41.5638702Z  -e GITHUB_RUN_ATTEMPT \ 2025-08-14T21:28:41.5638880Z  -e JOB_ID \ 2025-08-14T21:28:41.5639039Z  -e JOB_NAME \ 2025-08-14T21:28:41.5639210Z  -e BASE_SHA \ 2025-08-14T21:28:41.5639358Z  -e BRANCH \ 2025-08-14T21:28:41.5639509Z  -e SHA1 \ 2025-08-14T21:28:41.5639665Z  -e AWS_DEFAULT_REGION \ 2025-08-14T21:28:41.5639838Z  -e IN_WHEEL_TEST \ 2025-08-14T21:28:41.5640008Z  -e SHARD_NUMBER \ 2025-08-14T21:28:41.5640176Z  -e TEST_CONFIG \ 2025-08-14T21:28:41.5640336Z  -e NUM_TEST_SHARDS \ 2025-08-14T21:28:41.5640514Z  -e REENABLED_ISSUES \ 2025-08-14T21:28:41.5640699Z  -e CONTINUE_THROUGH_ERROR \ 2025-08-14T21:28:41.5640958Z  -e VERBOSE_TEST_LOGS \ 2025-08-14T21:28:41.5641131Z  -e TEST_SHOWLOCALS \ 2025-08-14T21:28:41.5641303Z  -e NO_TEST_TIMEOUT \ 2025-08-14T21:28:41.5641467Z  -e NO_TD \ 2025-08-14T21:28:41.5641618Z  -e TD_DISTRIBUTED \ 2025-08-14T21:28:41.5641788Z  -e PR_LABELS \ 2025-08-14T21:28:41.5641972Z  -e MAX_JOBS="$(nproc --ignore=2)" \ 2025-08-14T21:28:41.5642165Z  -e SCCACHE_BUCKET \ 2025-08-14T21:28:41.5642336Z  -e SCCACHE_REGION \ 2025-08-14T21:28:41.5642503Z  -e XLA_CUDA \ 2025-08-14T21:28:41.5642683Z  -e XLA_CLANG_CACHE_S3_BUCKET_NAME \ 2025-08-14T21:28:41.5642931Z  -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK \ 2025-08-14T21:28:41.5643148Z  -e PYTORCH_TEST_RERUN_DISABLED_TESTS \ 2025-08-14T21:28:41.5643363Z  -e SKIP_SCCACHE_INITIALIZATION=1 \ 2025-08-14T21:28:41.5643556Z  -e HUGGING_FACE_HUB_TOKEN \ 2025-08-14T21:28:41.5643751Z  -e SCRIBE_GRAPHQL_ACCESS_TOKEN \ 2025-08-14T21:28:41.5643935Z  -e DASHBOARD_TAG \ 2025-08-14T21:28:41.5644100Z  -e ARTIFACTS_FILE_SUFFIX \ 2025-08-14T21:28:41.5644312Z  --memory="${TOTAL_AVAILABLE_MEMORY_IN_GB%.*}g" \ 2025-08-14T21:28:41.5644552Z  --memory-swap="${TOTAL_MEMORY_WITH_SWAP}g" \ 2025-08-14T21:28:41.5644795Z  --env-file="/tmp/github_env_${GITHUB_RUN_ID}" \ 2025-08-14T21:28:41.5645018Z  --security-opt seccomp=unconfined \ 2025-08-14T21:28:41.5645220Z  --cap-add=SYS_PTRACE \ 2025-08-14T21:28:41.5645400Z  --ipc=host \ 2025-08-14T21:28:41.5645558Z  ${SHM_OPTS} \ 2025-08-14T21:28:41.5645714Z  --tty \ 2025-08-14T21:28:41.5645860Z  --detach \ 2025-08-14T21:28:41.5646020Z  --name="${container_name}" \ 2025-08-14T21:28:41.5646206Z  ${JENKINS_USER} \ 2025-08-14T21:28:41.5646415Z  -v "${GITHUB_WORKSPACE}:/var/lib/jenkins/workspace" \ 2025-08-14T21:28:41.5646646Z  -w /var/lib/jenkins/workspace \ 2025-08-14T21:28:41.5646828Z  "${DOCKER_IMAGE}" \ 2025-08-14T21:28:41.5646997Z  ${DOCKER_SHELL_CMD} 2025-08-14T21:28:41.5647159Z ) 2025-08-14T21:28:41.5647335Z # Propagate download.pytorch.org IP to container 2025-08-14T21:28:41.5647714Z grep download.pytorch.org /etc/hosts | docker exec -i "${container_name}" sudo bash -c "/bin/cat >> /etc/hosts" 2025-08-14T21:28:41.5648105Z echo "DOCKER_CONTAINER_ID=${container_name}" >> "${GITHUB_ENV}" 2025-08-14T21:28:41.5648340Z  2025-08-14T21:28:41.5648499Z if [[ ${BUILD_ENVIRONMENT} == *"s390x"* ]]; then 2025-08-14T21:28:41.5648831Z  docker exec -t "${container_name}" sh -c "python3 -m pip install -r .ci/docker/requirements-ci.txt" 2025-08-14T21:28:41.5649121Z fi 2025-08-14T21:28:41.5649246Z  2025-08-14T21:28:41.5649527Z docker exec -t "${container_name}" sh -c "python3 -m pip install $(echo dist/*.whl)[opt-einsum] && ${TEST_COMMAND}" 2025-08-14T21:28:41.5653030Z shell: /usr/bin/bash -e {0} 2025-08-14T21:28:41.5653196Z env: 2025-08-14T21:28:41.5653330Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:28:41.5653532Z BUILD_ENVIRONMENT: linux-jammy-py3.9-gcc11-build 2025-08-14T21:28:41.5653741Z PR_NUMBER: 2025-08-14T21:28:41.5653889Z GITHUB_REPOSITORY: pytorch/pytorch 2025-08-14T21:28:41.5654079Z GITHUB_WORKFLOW: inductor 2025-08-14T21:28:41.5654241Z GITHUB_JOB: test 2025-08-14T21:28:41.5654384Z GITHUB_RUN_ID: 16976255153 2025-08-14T21:28:41.5654551Z GITHUB_RUN_NUMBER: 147536 2025-08-14T21:28:41.5654711Z GITHUB_RUN_ATTEMPT: 1 2025-08-14T21:28:41.5654863Z JOB_ID: 48128039107 2025-08-14T21:28:41.5655180Z JOB_NAME: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:28:41.5655510Z BRANCH: main 2025-08-14T21:28:41.5655675Z SHA1: 1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:28:41.5655948Z BASE_SHA: 1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:28:41.5656175Z TEST_CONFIG: dynamic_cpu_inductor_huggingface 2025-08-14T21:28:41.5656370Z SHARD_NUMBER: 1 2025-08-14T21:28:41.5656512Z NUM_TEST_SHARDS: 1 2025-08-14T21:28:41.5656661Z REENABLED_ISSUES: 2025-08-14T21:28:41.5656812Z CONTINUE_THROUGH_ERROR: True 2025-08-14T21:28:41.5656977Z VERBOSE_TEST_LOGS: False 2025-08-14T21:28:41.5657142Z TEST_SHOWLOCALS: False 2025-08-14T21:28:41.5657305Z NO_TEST_TIMEOUT: False 2025-08-14T21:28:41.5657458Z NO_TD: False 2025-08-14T21:28:41.5657595Z TD_DISTRIBUTED: False 2025-08-14T21:28:41.5657789Z SCCACHE_BUCKET: ossci-compiler-cache-circleci-v2 2025-08-14T21:28:41.5658053Z SCCACHE_REGION: us-east-1 2025-08-14T21:28:41.5658204Z SHM_SIZE: 1g 2025-08-14T21:28:41.5658664Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:28:41.5659142Z XLA_CUDA: 2025-08-14T21:28:41.5659358Z XLA_CLANG_CACHE_S3_BUCKET_NAME: ossci-compiler-clang-cache-circleci-xla 2025-08-14T21:28:41.5659617Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK: 0 2025-08-14T21:28:41.5659812Z PYTORCH_TEST_RERUN_DISABLED_TESTS: 0 2025-08-14T21:28:41.5659990Z DASHBOARD_TAG: 2025-08-14T21:28:41.5660276Z HUGGING_FACE_HUB_TOKEN: *** 2025-08-14T21:28:41.5660529Z SCRIBE_GRAPHQL_ACCESS_TOKEN: *** 2025-08-14T21:28:41.5660834Z ARTIFACTS_FILE_SUFFIX: test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107 2025-08-14T21:28:41.5661131Z ##[endgroup] 2025-08-14T21:28:41.5678980Z + [[ dynamic_cpu_inductor_huggingface == \m\u\l\t\i\g\p\u ]] 2025-08-14T21:28:41.5679466Z + [[ linux-jammy-py3.9-gcc11-build == *onnx* ]] 2025-08-14T21:28:41.5679821Z + TEST_COMMAND=.ci/pytorch/test.sh 2025-08-14T21:28:41.5682920Z ++ awk '/MemTotal/ { printf "%.3f \n", $2/1024/1024 - 1 }' /proc/meminfo 2025-08-14T21:28:41.5701281Z + TOTAL_AVAILABLE_MEMORY_IN_GB='122.780 ' 2025-08-14T21:28:41.5701688Z + TOTAL_MEMORY_WITH_SWAP=125 2025-08-14T21:28:41.5702005Z + [[ linux-jammy-py3.9-gcc11-build == *\s\3\9\0\x* ]] 2025-08-14T21:28:41.5702697Z + SHM_OPTS=--shm-size=1g 2025-08-14T21:28:41.5702934Z + JENKINS_USER='--user jenkins' 2025-08-14T21:28:41.5703120Z + DOCKER_SHELL_CMD= 2025-08-14T21:28:41.5713455Z +++ nproc --ignore=2 2025-08-14T21:28:41.6320031Z ++ docker run -e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e JOB_ID -e JOB_NAME -e BASE_SHA -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS -e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e VERBOSE_TEST_LOGS -e TEST_SHOWLOCALS -e NO_TEST_TIMEOUT -e NO_TD -e TD_DISTRIBUTED -e PR_LABELS -e MAX_JOBS=30 -e SCCACHE_BUCKET -e SCCACHE_REGION -e XLA_CUDA -e XLA_CLANG_CACHE_S3_BUCKET_NAME -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e SKIP_SCCACHE_INITIALIZATION=1 -e HUGGING_FACE_HUB_TOKEN -e SCRIBE_GRAPHQL_ACCESS_TOKEN -e DASHBOARD_TAG -e ARTIFACTS_FILE_SUFFIX --memory=122g --memory-swap=125g --env-file=/tmp/github_env_16976255153 --security-opt seccomp=unconfined --cap-add=SYS_PTRACE --ipc=host --shm-size=1g --tty --detach --name= --user jenkins -v /home/ec2-user/actions-runner/_work/pytorch/pytorch:/var/lib/jenkins/workspace -w /var/lib/jenkins/workspace 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:28:52.5203470Z + container_name=4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:28:52.5206214Z + docker exec -i 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 sudo bash -c '/bin/cat >> /etc/hosts' 2025-08-14T21:28:52.5210246Z + grep download.pytorch.org /etc/hosts 2025-08-14T21:28:52.6549146Z + echo DOCKER_CONTAINER_ID=4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:28:52.6549844Z + [[ linux-jammy-py3.9-gcc11-build == *\s\3\9\0\x* ]] 2025-08-14T21:28:52.6553016Z ++ echo dist/torch-2.9.0a0+git1fc683c-cp39-cp39-linux_x86_64.whl 2025-08-14T21:28:52.6558082Z + docker exec -t 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 sh -c 'python3 -m pip install dist/torch-2.9.0a0+git1fc683c-cp39-cp39-linux_x86_64.whl[opt-einsum] && .ci/pytorch/test.sh' 2025-08-14T21:28:52.9632633Z Processing ./dist/torch-2.9.0a0+git1fc683c-cp39-cp39-linux_x86_64.whl (from torch==2.9.0a0+git1fc683c) 2025-08-14T21:28:53.1598024Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (3.18.0) 2025-08-14T21:28:53.1599568Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (4.14.1) 2025-08-14T21:28:53.1605212Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (1.13.3) 2025-08-14T21:28:53.1609643Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (2.8.8) 2025-08-14T21:28:53.1613936Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (3.1.6) 2025-08-14T21:28:53.1614736Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (2025.3.0) 2025-08-14T21:28:53.1622833Z Requirement already satisfied: opt-einsum>=3.3 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (3.3.0) 2025-08-14T21:28:53.1883034Z Requirement already satisfied: numpy>=1.7 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from opt-einsum>=3.3->torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (1.22.4) 2025-08-14T21:28:53.1897260Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from sympy>=1.13.3->torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (1.3.0) 2025-08-14T21:28:53.1941659Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.9/lib/python3.9/site-packages (from jinja2->torch==2.9.0a0+git1fc683c->torch==2.9.0a0+git1fc683c) (3.0.2) 2025-08-14T21:28:53.8838290Z Installing collected packages: torch 2025-08-14T21:29:00.3723082Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts. 2025-08-14T21:29:00.3724042Z dall-e 0.1 requires torchvision, which is not installed. 2025-08-14T21:29:00.3724362Z effdet 0.4.1 requires torchvision, which is not installed. 2025-08-14T21:29:00.3724767Z pytorch-labs-segment-anything-fast 0.2 requires torchao, which is not installed. 2025-08-14T21:29:00.3725283Z pytorch-labs-segment-anything-fast 0.2 requires torchvision>=0.17.0.dev20231026, which is not installed. 2025-08-14T21:29:00.3725807Z timm 1.0.14 requires torchvision, which is not installed. 2025-08-14T21:29:00.3726198Z Successfully installed torch-2.9.0a0+git1fc683c 2025-08-14T21:29:00.4593518Z + export TERM=vt100 2025-08-14T21:29:00.4595211Z + TERM=vt100 2025-08-14T21:29:00.4595448Z ++ dirname .ci/pytorch/test.sh 2025-08-14T21:29:00.4604656Z + source .ci/pytorch/common.sh 2025-08-14T21:29:00.4606638Z +++ dirname .ci/pytorch/common.sh 2025-08-14T21:29:00.4616128Z ++ source .ci/pytorch/common_utils.sh 2025-08-14T21:29:00.4618164Z +++ declare -f -t trap_add 2025-08-14T21:29:00.4618528Z ++ set -ex -o pipefail 2025-08-14T21:29:00.4618807Z ++ [[ linux-jammy-py3.9-gcc11-build == *rocm* ]] 2025-08-14T21:29:00.4619154Z ++ BUILD_TEST_LIBTORCH=0 2025-08-14T21:29:00.4622860Z ++ dirname .ci/pytorch/test.sh 2025-08-14T21:29:00.4649708Z + source .ci/pytorch/common-build.sh 2025-08-14T21:29:00.4651834Z ++ [[ linux-jammy-py3.9-gcc11-build != *win-* ]] 2025-08-14T21:29:00.4657157Z ++++ dirname .ci/pytorch/common-build.sh 2025-08-14T21:29:00.4667580Z +++ cd .ci/pytorch 2025-08-14T21:29:00.4667926Z +++ pwd -P 2025-08-14T21:29:00.4668139Z ++ script_dir=/var/lib/jenkins/workspace/.ci/pytorch 2025-08-14T21:29:00.4668548Z ++ [[ linux-jammy-py3.9-gcc11-build == *-pch* ]] 2025-08-14T21:29:00.4669369Z ++ which sccache 2025-08-14T21:29:00.4693752Z ++ [[ -z ossci-compiler-cache-circleci-v2 ]] 2025-08-14T21:29:00.4697738Z ++ sccache --stop-server 2025-08-14T21:29:00.4718892Z ++ true 2025-08-14T21:29:00.4720936Z ++ rm -f /var/lib/jenkins/sccache_error.log 2025-08-14T21:29:00.4729869Z ++ trap_add sccache_epilogue EXIT 2025-08-14T21:29:00.4731949Z ++ trap_add_cmd=sccache_epilogue 2025-08-14T21:29:00.4732262Z ++ shift 2025-08-14T21:29:00.4736752Z ++ for trap_add_name in "$@" 2025-08-14T21:29:00.4737096Z ++++ trap -p EXIT 2025-08-14T21:29:00.4737354Z +++ eval 'extract_trap_cmd ' 2025-08-14T21:29:00.4737617Z ++++ extract_trap_cmd 2025-08-14T21:29:00.4737859Z ++++ printf '%s\n' '' 2025-08-14T21:29:00.4738113Z +++ printf '%s\n' sccache_epilogue 2025-08-14T21:29:00.4742001Z ++ trap -- ' 2025-08-14T21:29:00.4742346Z sccache_epilogue' EXIT 2025-08-14T21:29:00.4742561Z ++ [[ -n 1 ]] 2025-08-14T21:29:00.4742815Z ++ echo 'Skipping sccache server initialization, setting environment variables' 2025-08-14T21:29:00.4743152Z Skipping sccache server initialization, setting environment variables 2025-08-14T21:29:00.4743411Z ++ export SCCACHE_IDLE_TIMEOUT=0 2025-08-14T21:29:00.4743596Z ++ SCCACHE_IDLE_TIMEOUT=0 2025-08-14T21:29:00.4743804Z ++ export SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-08-14T21:29:00.4744078Z ++ SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-08-14T21:29:00.4744346Z ++ export RUST_LOG=sccache::server=error 2025-08-14T21:29:00.4744537Z ++ RUST_LOG=sccache::server=error 2025-08-14T21:29:00.4744725Z ++ sccache --zero-stats 2025-08-14T21:29:00.6337694Z Statistics zeroed. 2025-08-14T21:29:00.6341767Z ++ which ccache 2025-08-14T21:29:00.6372110Z + [[ linux-jammy-py3.9-gcc11-build != *rocm* ]] 2025-08-14T21:29:00.6372495Z + [[ linux-jammy-py3.9-gcc11-build != *s390x* ]] 2025-08-14T21:29:00.6372869Z + [[ -d /var/lib/jenkins/workspace ]] 2025-08-14T21:29:00.6373139Z ++ stat -c %u /var/lib/jenkins/workspace 2025-08-14T21:29:00.6378445Z + WORKSPACE_ORIGINAL_OWNER_ID=1000 2025-08-14T21:29:00.6378672Z + trap_add cleanup_workspace EXIT 2025-08-14T21:29:00.6379100Z + trap_add_cmd=cleanup_workspace 2025-08-14T21:29:00.6379374Z + shift 2025-08-14T21:29:00.6379552Z + for trap_add_name in "$@" 2025-08-14T21:29:00.6383866Z +++ trap -p EXIT 2025-08-14T21:29:00.6389412Z ++ eval 'extract_trap_cmd trap -- '\'' 2025-08-14T21:29:00.6394185Z sccache_epilogue'\'' EXIT' 2025-08-14T21:29:00.6399196Z +++ extract_trap_cmd trap -- ' 2025-08-14T21:29:00.6403696Z sccache_epilogue' EXIT 2025-08-14T21:29:00.6405338Z +++ printf '%s\n' ' 2025-08-14T21:29:00.6405633Z sccache_epilogue' 2025-08-14T21:29:00.6410470Z ++ printf '%s\n' cleanup_workspace 2025-08-14T21:29:00.6412078Z + trap -- ' 2025-08-14T21:29:00.6412359Z sccache_epilogue 2025-08-14T21:29:00.6416596Z cleanup_workspace' EXIT 2025-08-14T21:29:00.6418833Z + sudo chown -R jenkins /var/lib/jenkins/workspace 2025-08-14T21:29:01.0566408Z + git config --global --add safe.directory /var/lib/jenkins/workspace 2025-08-14T21:29:01.0582546Z + echo 'Environment variables:' 2025-08-14T21:29:01.0584521Z Environment variables: 2025-08-14T21:29:01.0585036Z + env 2025-08-14T21:29:01.0590454Z GITHUB_WORKSPACE=/home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-08-14T21:29:01.0594279Z CONTINUE_THROUGH_ERROR=True 2025-08-14T21:29:01.0598696Z BUILD_ENVIRONMENT=linux-jammy-py3.9-gcc11-build 2025-08-14T21:29:01.0602758Z HOSTNAME=4dd890d366a3 2025-08-14T21:29:01.0607049Z GITHUB_PATH=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/add_path_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0611257Z GITHUB_ACTION=__run_2 2025-08-14T21:29:01.0616016Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2025-08-14T21:29:01.0619626Z GITHUB_RUN_NUMBER=147536 2025-08-14T21:29:01.0619990Z TEST_CONFIG=dynamic_cpu_inductor_huggingface 2025-08-14T21:29:01.0620237Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-08-14T21:29:01.0620450Z TORCH_NVCC_FLAGS=-Xfatbin -compress-all 2025-08-14T21:29:01.0620778Z SCCACHE_IDLE_TIMEOUT=0 2025-08-14T21:29:01.0621501Z SCRIBE_GRAPHQL_ACCESS_TOKEN=*** 2025-08-14T21:29:01.0621736Z GITHUB_TRIGGERING_ACTOR=pytorchmergebot 2025-08-14T21:29:01.0621938Z GITHUB_REF_TYPE=branch 2025-08-14T21:29:01.0622102Z TORCH_CUDA_ARCH_LIST=Maxwell 2025-08-14T21:29:01.0622304Z BASE_SHA=1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:29:01.0622743Z XLA_CUDA= 2025-08-14T21:29:01.0622893Z NCCL_LIB_DIR=/usr/local/cuda/lib64/ 2025-08-14T21:29:01.0623179Z HUGGING_FACE_HUB_TOKEN=*** 2025-08-14T21:29:01.0631687Z *** 2025-08-14T21:29:01.0631878Z GITHUB_REPOSITORY_ID=65600975 2025-08-14T21:29:01.0632064Z GITHUB_ACTIONS=true 2025-08-14T21:29:01.0632267Z SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-08-14T21:29:01.0632507Z SHA1=1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:29:01.0632720Z GITHUB_SHA=1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:29:01.0633021Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/inductor.yml@refs/heads/main 2025-08-14T21:29:01.0633289Z UCC_HOME=/usr 2025-08-14T21:29:01.0633433Z VERBOSE_TEST_LOGS=False 2025-08-14T21:29:01.0633590Z GITHUB_REF=refs/heads/main 2025-08-14T21:29:01.0633749Z SHARD_NUMBER=1 2025-08-14T21:29:01.0633897Z GITHUB_REF_PROTECTED=true 2025-08-14T21:29:01.0634053Z HOME=/var/lib/jenkins 2025-08-14T21:29:01.0634235Z GITHUB_API_URL=https://api.github.com 2025-08-14T21:29:01.0634443Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-08-14T21:29:01.0634612Z UCX_COMMIT= 2025-08-14T21:29:01.0634746Z USE_SYSTEM_NCCL=1 2025-08-14T21:29:01.0634891Z NUM_TEST_SHARDS=1 2025-08-14T21:29:01.0635026Z UCX_HOME=/usr 2025-08-14T21:29:01.0635352Z GITHUB_STATE=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/save_state_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0635872Z JOB_NAME=linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:29:01.0636367Z GITHUB_ENV=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_env_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0636801Z GITHUB_EVENT_PATH=/home/ec2-user/actions-runner/_work/_temp/_github_workflow/event.json 2025-08-14T21:29:01.0637078Z GITHUB_EVENT_NAME=push 2025-08-14T21:29:01.0637235Z DASHBOARD_TAG= 2025-08-14T21:29:01.0637376Z GITHUB_RUN_ID=16976255153 2025-08-14T21:29:01.0637539Z INSTALLED_OPENBLAS= 2025-08-14T21:29:01.0637884Z GITHUB_STEP_SUMMARY=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/step_summary_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0638249Z GITHUB_ACTOR=pytorchmergebot 2025-08-14T21:29:01.0638461Z PR_NUMBER= 2025-08-14T21:29:01.0638597Z DESIRED_CUDA= 2025-08-14T21:29:01.0638737Z GITHUB_RUN_ATTEMPT=1 2025-08-14T21:29:01.0638893Z ANACONDA_PYTHON_VERSION=3.9 2025-08-14T21:29:01.0639093Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-08-14T21:29:01.0639296Z TERM=vt100 2025-08-14T21:29:01.0639424Z INSTALLED_VISION=yes 2025-08-14T21:29:01.0639573Z BRANCH=main 2025-08-14T21:29:01.0639716Z SCCACHE_REGION=us-east-1 2025-08-14T21:29:01.0639879Z OPENSSL_ROOT_DIR=/opt/openssl 2025-08-14T21:29:01.0640054Z CUDA_PATH=/usr/local/cuda 2025-08-14T21:29:01.0640352Z GITHUB_ACTION_PATH=/home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-linux 2025-08-14T21:29:01.0640678Z GITHUB_SERVER_URL=https://github.com 2025-08-14T21:29:01.0640851Z UCC_COMMIT= 2025-08-14T21:29:01.0640989Z REENABLED_ISSUES= 2025-08-14T21:29:01.0641128Z DOCS=yes 2025-08-14T21:29:01.0641249Z SHLVL=1 2025-08-14T21:29:01.0641378Z MAX_JOBS=30 2025-08-14T21:29:01.0641517Z GITHUB_ACTOR_ID=97764156 2025-08-14T21:29:01.0641717Z GITHUB_WORKFLOW_SHA=1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:29:01.0641935Z GITHUB_REF_NAME=main 2025-08-14T21:29:01.0642295Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2025-08-14T21:29:01.0642587Z GITHUB_JOB=test 2025-08-14T21:29:01.0642735Z NO_TEST_TIMEOUT=False 2025-08-14T21:29:01.0642887Z TD_DISTRIBUTED=False 2025-08-14T21:29:01.0643044Z GITHUB_REPOSITORY=pytorch/pytorch 2025-08-14T21:29:01.0643229Z GITHUB_RETENTION_DAYS=90 2025-08-14T21:29:01.0643391Z OPENSSL_DIR=/opt/openssl 2025-08-14T21:29:01.0643545Z GITHUB_ACTION_REPOSITORY= 2025-08-14T21:29:01.0643970Z PATH=/opt/cache/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/opt/conda/envs/py_3.9/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-08-14T21:29:01.0644441Z GITHUB_BASE_REF= 2025-08-14T21:29:01.0644580Z INSTALLED_ACL= 2025-08-14T21:29:01.0644840Z ARTIFACTS_FILE_SUFFIX=test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107 2025-08-14T21:29:01.0645130Z CI=true 2025-08-14T21:29:01.0645269Z GITHUB_REPOSITORY_OWNER=pytorch 2025-08-14T21:29:01.0645483Z RUST_LOG=sccache::server=error 2025-08-14T21:29:01.0645655Z JOB_ID=48128039107 2025-08-14T21:29:01.0645796Z GITHUB_HEAD_REF= 2025-08-14T21:29:01.0645930Z GITHUB_ACTION_REF= 2025-08-14T21:29:01.0646109Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2025-08-14T21:29:01.0646320Z TEST_SHOWLOCALS=False 2025-08-14T21:29:01.0646469Z GITHUB_WORKFLOW=inductor 2025-08-14T21:29:01.0646634Z DEBIAN_FRONTEND=noninteractive 2025-08-14T21:29:01.0646977Z GITHUB_OUTPUT=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_output_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0647315Z NO_TD=False 2025-08-14T21:29:01.0647450Z SKIP_SCCACHE_INITIALIZATION=1 2025-08-14T21:29:01.0647642Z NCCL_INCLUDE_DIR=/usr/local/cuda/include/ 2025-08-14T21:29:01.0647843Z _=/usr/bin/env 2025-08-14T21:29:01.0648034Z ++ python -c 'import site; print(site.getsitepackages()[0])' 2025-08-14T21:29:01.0839521Z + TORCH_INSTALL_DIR=/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch 2025-08-14T21:29:01.0843965Z + TORCH_BIN_DIR=/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/bin 2025-08-14T21:29:01.0848306Z + TORCH_LIB_DIR=/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/lib 2025-08-14T21:29:01.0852561Z + TORCH_TEST_DIR=/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/test 2025-08-14T21:29:01.0856331Z + BUILD_DIR=build 2025-08-14T21:29:01.0856697Z + BUILD_RENAMED_DIR=build_renamed 2025-08-14T21:29:01.0856983Z + BUILD_BIN_DIR=build/bin 2025-08-14T21:29:01.0857165Z + SHARD_NUMBER=1 2025-08-14T21:29:01.0857305Z + NUM_TEST_SHARDS=1 2025-08-14T21:29:01.0857482Z + export TORCH_SERIALIZATION_DEBUG=1 2025-08-14T21:29:01.0857690Z + TORCH_SERIALIZATION_DEBUG=1 2025-08-14T21:29:01.0857857Z + export VALGRIND=ON 2025-08-14T21:29:01.0858024Z + VALGRIND=ON 2025-08-14T21:29:01.0858202Z + [[ linux-jammy-py3.9-gcc11-build == *clang9* ]] 2025-08-14T21:29:01.0858439Z + [[ linux-jammy-py3.9-gcc11-build == *xpu* ]] 2025-08-14T21:29:01.0858650Z + [[ linux-jammy-py3.9-gcc11-build == *s390x* ]] 2025-08-14T21:29:01.0858841Z + [[ 0 == \1 ]] 2025-08-14T21:29:01.0858980Z + [[ True == \1 ]] 2025-08-14T21:29:01.0859146Z + [[ linux-jammy-py3.9-gcc11-build != *bazel* ]] 2025-08-14T21:29:01.0859356Z ++ realpath build/custom_test_artifacts 2025-08-14T21:29:01.0859643Z + CUSTOM_TEST_ARTIFACT_BUILD_DIR=/var/lib/jenkins/workspace/build/custom_test_artifacts 2025-08-14T21:29:01.0859917Z + [[ -n '' ]] 2025-08-14T21:29:01.0860068Z + echo 'Environment variables' 2025-08-14T21:29:01.0860244Z Environment variables 2025-08-14T21:29:01.0860382Z + env 2025-08-14T21:29:01.0878730Z GITHUB_WORKSPACE=/home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-08-14T21:29:01.0879112Z CONTINUE_THROUGH_ERROR=True 2025-08-14T21:29:01.0879358Z BUILD_ENVIRONMENT=linux-jammy-py3.9-gcc11-build 2025-08-14T21:29:01.0879567Z HOSTNAME=4dd890d366a3 2025-08-14T21:29:01.0879906Z GITHUB_PATH=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/add_path_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0880256Z GITHUB_ACTION=__run_2 2025-08-14T21:29:01.0880417Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2025-08-14T21:29:01.0880779Z GITHUB_RUN_NUMBER=147536 2025-08-14T21:29:01.0880979Z TEST_CONFIG=dynamic_cpu_inductor_huggingface 2025-08-14T21:29:01.0881189Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-08-14T21:29:01.0881388Z TORCH_NVCC_FLAGS=-Xfatbin -compress-all 2025-08-14T21:29:01.0881650Z SCCACHE_IDLE_TIMEOUT=0 2025-08-14T21:29:01.0881949Z SCRIBE_GRAPHQL_ACCESS_TOKEN=*** 2025-08-14T21:29:01.0882141Z GITHUB_TRIGGERING_ACTOR=pytorchmergebot 2025-08-14T21:29:01.0882329Z GITHUB_REF_TYPE=branch 2025-08-14T21:29:01.0882497Z TORCH_CUDA_ARCH_LIST=Maxwell 2025-08-14T21:29:01.0882699Z BASE_SHA=1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:29:01.0882893Z XLA_CUDA= 2025-08-14T21:29:01.0883099Z NCCL_LIB_DIR=/usr/local/cuda/lib64/ 2025-08-14T21:29:01.0883341Z HUGGING_FACE_HUB_TOKEN=*** 2025-08-14T21:29:01.0883543Z *** 2025-08-14T21:29:01.0883676Z GITHUB_REPOSITORY_ID=65600975 2025-08-14T21:29:01.0883847Z GITHUB_ACTIONS=true 2025-08-14T21:29:01.0884029Z SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-08-14T21:29:01.0884251Z SHA1=1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:29:01.0884469Z GITHUB_SHA=1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:29:01.0885075Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/inductor.yml@refs/heads/main 2025-08-14T21:29:01.0885350Z UCC_HOME=/usr 2025-08-14T21:29:01.0885492Z TORCH_SERIALIZATION_DEBUG=1 2025-08-14T21:29:01.0885655Z VERBOSE_TEST_LOGS=False 2025-08-14T21:29:01.0885815Z GITHUB_REF=refs/heads/main 2025-08-14T21:29:01.0885963Z SHARD_NUMBER=1 2025-08-14T21:29:01.0886107Z GITHUB_REF_PROTECTED=true 2025-08-14T21:29:01.0886264Z HOME=/var/lib/jenkins 2025-08-14T21:29:01.0886430Z GITHUB_API_URL=https://api.github.com 2025-08-14T21:29:01.0886632Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-08-14T21:29:01.0886803Z UCX_COMMIT= 2025-08-14T21:29:01.0886929Z USE_SYSTEM_NCCL=1 2025-08-14T21:29:01.0887070Z NUM_TEST_SHARDS=1 2025-08-14T21:29:01.0887212Z UCX_HOME=/usr 2025-08-14T21:29:01.0887523Z GITHUB_STATE=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/save_state_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0888029Z JOB_NAME=linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:29:01.0888520Z GITHUB_ENV=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_env_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0888948Z GITHUB_EVENT_PATH=/home/ec2-user/actions-runner/_work/_temp/_github_workflow/event.json 2025-08-14T21:29:01.0889215Z GITHUB_EVENT_NAME=push 2025-08-14T21:29:01.0889367Z DASHBOARD_TAG= 2025-08-14T21:29:01.0889509Z GITHUB_RUN_ID=16976255153 2025-08-14T21:29:01.0889657Z INSTALLED_OPENBLAS= 2025-08-14T21:29:01.0889993Z GITHUB_STEP_SUMMARY=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/step_summary_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0890358Z GITHUB_ACTOR=pytorchmergebot 2025-08-14T21:29:01.0890515Z PR_NUMBER= 2025-08-14T21:29:01.0890645Z DESIRED_CUDA= 2025-08-14T21:29:01.0890780Z GITHUB_RUN_ATTEMPT=1 2025-08-14T21:29:01.0890926Z VALGRIND=ON 2025-08-14T21:29:01.0891060Z ANACONDA_PYTHON_VERSION=3.9 2025-08-14T21:29:01.0891257Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-08-14T21:29:01.0891454Z TERM=vt100 2025-08-14T21:29:01.0891580Z INSTALLED_VISION=yes 2025-08-14T21:29:01.0891725Z BRANCH=main 2025-08-14T21:29:01.0891865Z SCCACHE_REGION=us-east-1 2025-08-14T21:29:01.0892025Z OPENSSL_ROOT_DIR=/opt/openssl 2025-08-14T21:29:01.0892195Z CUDA_PATH=/usr/local/cuda 2025-08-14T21:29:01.0892486Z GITHUB_ACTION_PATH=/home/ec2-user/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-linux 2025-08-14T21:29:01.0892795Z GITHUB_SERVER_URL=https://github.com 2025-08-14T21:29:01.0892977Z UCC_COMMIT= 2025-08-14T21:29:01.0893108Z REENABLED_ISSUES= 2025-08-14T21:29:01.0893242Z DOCS=yes 2025-08-14T21:29:01.0893371Z SHLVL=1 2025-08-14T21:29:01.0893498Z MAX_JOBS=30 2025-08-14T21:29:01.0893628Z GITHUB_ACTOR_ID=97764156 2025-08-14T21:29:01.0893828Z GITHUB_WORKFLOW_SHA=1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:29:01.0894102Z GITHUB_REF_NAME=main 2025-08-14T21:29:01.0894324Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2025-08-14T21:29:01.0894568Z GITHUB_JOB=test 2025-08-14T21:29:01.0894716Z NO_TEST_TIMEOUT=False 2025-08-14T21:29:01.0894872Z TD_DISTRIBUTED=False 2025-08-14T21:29:01.0895031Z GITHUB_REPOSITORY=pytorch/pytorch 2025-08-14T21:29:01.0895217Z GITHUB_RETENTION_DAYS=90 2025-08-14T21:29:01.0895382Z OPENSSL_DIR=/opt/openssl 2025-08-14T21:29:01.0895539Z GITHUB_ACTION_REPOSITORY= 2025-08-14T21:29:01.0895966Z PATH=/opt/cache/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/opt/conda/envs/py_3.9/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-08-14T21:29:01.0896425Z GITHUB_BASE_REF= 2025-08-14T21:29:01.0896559Z INSTALLED_ACL= 2025-08-14T21:29:01.0896825Z ARTIFACTS_FILE_SUFFIX=test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107 2025-08-14T21:29:01.0897117Z CI=true 2025-08-14T21:29:01.0897250Z GITHUB_REPOSITORY_OWNER=pytorch 2025-08-14T21:29:01.0897458Z RUST_LOG=sccache::server=error 2025-08-14T21:29:01.0897624Z JOB_ID=48128039107 2025-08-14T21:29:01.0897764Z GITHUB_HEAD_REF= 2025-08-14T21:29:01.0897898Z GITHUB_ACTION_REF= 2025-08-14T21:29:01.0898077Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2025-08-14T21:29:01.0898284Z TEST_SHOWLOCALS=False 2025-08-14T21:29:01.0898427Z GITHUB_WORKFLOW=inductor 2025-08-14T21:29:01.0898588Z DEBIAN_FRONTEND=noninteractive 2025-08-14T21:29:01.0898932Z GITHUB_OUTPUT=/home/ec2-user/actions-runner/_work/_temp/_runner_file_commands/set_output_97004b05-2bd1-47e8-980a-f3efe7af1b01 2025-08-14T21:29:01.0899263Z NO_TD=False 2025-08-14T21:29:01.0899410Z SKIP_SCCACHE_INITIALIZATION=1 2025-08-14T21:29:01.0899594Z NCCL_INCLUDE_DIR=/usr/local/cuda/include/ 2025-08-14T21:29:01.0899770Z _=/usr/bin/env 2025-08-14T21:29:01.0899916Z + echo 'Testing pytorch' 2025-08-14T21:29:01.0900068Z Testing pytorch 2025-08-14T21:29:01.0900217Z + export LANG=C.UTF-8 2025-08-14T21:29:01.0900367Z + LANG=C.UTF-8 2025-08-14T21:29:01.0900502Z + PR_NUMBER= 2025-08-14T21:29:01.0900674Z + [[ dynamic_cpu_inductor_huggingface == \d\e\f\a\u\l\t ]] 2025-08-14T21:29:01.0900935Z + [[ dynamic_cpu_inductor_huggingface == \d\i\s\t\r\i\b\u\t\e\d ]] 2025-08-14T21:29:01.0901183Z + [[ dynamic_cpu_inductor_huggingface == \s\l\o\w ]] 2025-08-14T21:29:01.0901424Z + [[ linux-jammy-py3.9-gcc11-build == *slow-gradcheck* ]] 2025-08-14T21:29:01.0901650Z + [[ linux-jammy-py3.9-gcc11-build == *cuda* ]] 2025-08-14T21:29:01.0901863Z + [[ linux-jammy-py3.9-gcc11-build == *rocm* ]] 2025-08-14T21:29:01.0902071Z + [[ linux-jammy-py3.9-gcc11-build == *xpu* ]] 2025-08-14T21:29:01.0902282Z + [[ dynamic_cpu_inductor_huggingface == *crossref* ]] 2025-08-14T21:29:01.0902507Z + [[ linux-jammy-py3.9-gcc11-build == *rocm* ]] 2025-08-14T21:29:01.0902718Z + [[ linux-jammy-py3.9-gcc11-build == *xpu* ]] 2025-08-14T21:29:01.0902933Z + [[ linux-jammy-py3.9-gcc11-build != *-bazel-* ]] 2025-08-14T21:29:01.0903131Z + pip_install ninja==1.10.2 2025-08-14T21:29:01.0903355Z + pip_install_pkg='python3 -m pip install --progress-bar off' 2025-08-14T21:29:01.0903621Z + python3 -m pip install --progress-bar off ninja==1.10.2 2025-08-14T21:29:01.4336022Z Collecting ninja==1.10.2 2025-08-14T21:29:01.4438492Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl.metadata (5.0 kB) 2025-08-14T21:29:01.4553456Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB) 2025-08-14T21:29:02.1259123Z Installing collected packages: ninja 2025-08-14T21:29:02.1260838Z Attempting uninstall: ninja 2025-08-14T21:29:02.1266622Z Found existing installation: ninja 1.11.1.3 2025-08-14T21:29:02.1282263Z Uninstalling ninja-1.11.1.3: 2025-08-14T21:29:02.1329400Z Successfully uninstalled ninja-1.11.1.3 2025-08-14T21:29:02.1773114Z Successfully installed ninja-1.10.2 2025-08-14T21:29:02.2639168Z + export PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/opt/conda/envs/py_3.9/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-08-14T21:29:02.2640544Z + PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/usr/local/nvidia/bin:/usr/local/cuda/bin:/opt/conda/envs/py_3.9/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-08-14T21:29:02.2641105Z + [[ linux-jammy-py3.9-gcc11-build == *aarch64* ]] 2025-08-14T21:29:02.2641352Z + [[ linux-jammy-py3.9-gcc11-build == *asan* ]] 2025-08-14T21:29:02.2641580Z + [[ linux-jammy-py3.9-gcc11-build == *-debug* ]] 2025-08-14T21:29:02.2641802Z + [[ linux-jammy-py3.9-gcc11-build != *-bazel-* ]] 2025-08-14T21:29:02.2642112Z + echo 'We are not in debug mode: linux-jammy-py3.9-gcc11-build. Expect the assertion to pass' 2025-08-14T21:29:02.2642750Z We are not in debug mode: linux-jammy-py3.9-gcc11-build. Expect the assertion to pass 2025-08-14T21:29:02.2643015Z + cd test 2025-08-14T21:29:02.2643239Z + python -c 'import torch; torch._C._crash_if_debug_asserts_fail(424242)' 2025-08-14T21:29:03.2734407Z + [[ dynamic_cpu_inductor_huggingface == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]] 2025-08-14T21:29:03.2736133Z + [[ dynamic_cpu_inductor_huggingface == \n\o\g\p\u\_\A\V\X\5\1\2 ]] 2025-08-14T21:29:03.2739477Z + [[ dynamic_cpu_inductor_huggingface == \l\e\g\a\c\y\_\n\v\i\d\i\a\_\d\r\i\v\e\r ]] 2025-08-14T21:29:03.2739938Z + DYNAMO_BENCHMARK_FLAGS=() 2025-08-14T21:29:03.2740829Z + [[ dynamic_cpu_inductor_huggingface == *pr_time_benchmarks* ]] 2025-08-14T21:29:03.2741238Z + [[ dynamic_cpu_inductor_huggingface == *dynamo_eager* ]] 2025-08-14T21:29:03.2741559Z + [[ dynamic_cpu_inductor_huggingface == *aot_eager* ]] 2025-08-14T21:29:03.2741842Z + [[ dynamic_cpu_inductor_huggingface == *aot_inductor* ]] 2025-08-14T21:29:03.2742180Z + [[ dynamic_cpu_inductor_huggingface == *max_autotune_inductor* ]] 2025-08-14T21:29:03.2742499Z + [[ dynamic_cpu_inductor_huggingface == *inductor* ]] 2025-08-14T21:29:03.2742766Z + [[ dynamic_cpu_inductor_huggingface != *perf* ]] 2025-08-14T21:29:03.2743001Z + DYNAMO_BENCHMARK_FLAGS+=(--inductor) 2025-08-14T21:29:03.2743229Z + [[ dynamic_cpu_inductor_huggingface == *dynamic* ]] 2025-08-14T21:29:03.2743508Z + DYNAMO_BENCHMARK_FLAGS+=(--dynamic-shapes --dynamic-batch-only) 2025-08-14T21:29:03.2743778Z + [[ dynamic_cpu_inductor_huggingface == *cpu* ]] 2025-08-14T21:29:03.2743982Z + DYNAMO_BENCHMARK_FLAGS+=(--device cpu) 2025-08-14T21:29:03.2759782Z + [[ linux-jammy-py3.9-gcc11-build == *libtorch* ]] 2025-08-14T21:29:03.2760219Z + [[ linux-jammy-py3.9-gcc11-build == *-bazel-* ]] 2025-08-14T21:29:03.2760553Z + cd test 2025-08-14T21:29:03.2761923Z + python -c 'import torch; print(torch.__config__.show())' 2025-08-14T21:29:04.1121155Z PyTorch built with: 2025-08-14T21:29:04.1123067Z - GCC 11.4 2025-08-14T21:29:04.1127836Z - C++ Version: 201703 2025-08-14T21:29:04.1130120Z - Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-08-14T21:29:04.1130668Z - Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-08-14T21:29:04.1134629Z - OpenMP 201511 (a.k.a. OpenMP 4.5) 2025-08-14T21:29:04.1136600Z - LAPACK is enabled (usually provided by MKL) 2025-08-14T21:29:04.1136946Z - NNPACK is enabled 2025-08-14T21:29:04.1141931Z - CPU capability usage: AVX512 2025-08-14T21:29:04.1144805Z - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, COMMIT_SHA=1fc683cf17c8c673044538d10266c00f92987be2, CXX_COMPILER=/opt/cache/bin/c++, CXX_FLAGS= -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOCUPTI -DLIBKINETO_NOROCTRACER -DLIBKINETO_NOXPUPTI=ON -DUSE_FBGEMM -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -DC10_NODEPRECATED -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=range-loop-construct -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-unknown-pragmas -Wno-unused-parameter -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=old-style-cast -faligned-new -Werror -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, TORCH_VERSION=2.9.0, USE_CUDA=OFF, USE_CUDNN=OFF, USE_CUSPARSELT=OFF, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=OFF, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=OFF, USE_ROCM_KERNEL_ASSERT=OFF, USE_XCCL=OFF, USE_XPU=OFF, 2025-08-14T21:29:04.1147480Z 2025-08-14T21:29:04.2777565Z + cd test 2025-08-14T21:29:04.2778080Z + python -c 'import torch; print(torch.__config__.parallel_info())' 2025-08-14T21:29:05.1238755Z ATen/Parallel: 2025-08-14T21:29:05.1239285Z at::get_num_threads() : 16 2025-08-14T21:29:05.1243265Z at::get_num_interop_threads() : 16 2025-08-14T21:29:05.1247065Z OpenMP 201511 (a.k.a. OpenMP 4.5) 2025-08-14T21:29:05.1250526Z omp_get_max_threads() : 16 2025-08-14T21:29:05.1252351Z Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-08-14T21:29:05.1252814Z mkl_get_max_threads() : 16 2025-08-14T21:29:05.1256968Z Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-08-14T21:29:05.1260445Z std::thread::hardware_concurrency() : 32 2025-08-14T21:29:05.1260734Z Environment variables: 2025-08-14T21:29:05.1260917Z OMP_NUM_THREADS : [not set] 2025-08-14T21:29:05.1261101Z MKL_NUM_THREADS : [not set] 2025-08-14T21:29:05.1261287Z ATen parallel backend: OpenMP 2025-08-14T21:29:05.1261401Z 2025-08-14T21:29:05.2920504Z + [[ dynamic_cpu_inductor_huggingface == *numpy_2* ]] 2025-08-14T21:29:05.2923973Z + [[ linux-jammy-py3.9-gcc11-build == *aarch64* ]] 2025-08-14T21:29:05.2925759Z + [[ dynamic_cpu_inductor_huggingface == *backward* ]] 2025-08-14T21:29:05.2926058Z + [[ dynamic_cpu_inductor_huggingface == *xla* ]] 2025-08-14T21:29:05.2926299Z + [[ dynamic_cpu_inductor_huggingface == *executorch* ]] 2025-08-14T21:29:05.2926559Z + [[ dynamic_cpu_inductor_huggingface == \j\i\t\_\l\e\g\a\c\y ]] 2025-08-14T21:29:05.2926811Z + [[ linux-jammy-py3.9-gcc11-build == *libtorch* ]] 2025-08-14T21:29:05.2927066Z + [[ dynamic_cpu_inductor_huggingface == distributed ]] 2025-08-14T21:29:05.2927321Z + [[ dynamic_cpu_inductor_huggingface == *operator_benchmark* ]] 2025-08-14T21:29:05.2927598Z + [[ dynamic_cpu_inductor_huggingface == *inductor_distributed* ]] 2025-08-14T21:29:05.2927865Z + [[ dynamic_cpu_inductor_huggingface == *inductor-halide* ]] 2025-08-14T21:29:05.2928139Z + [[ dynamic_cpu_inductor_huggingface == *inductor-triton-cpu* ]] 2025-08-14T21:29:05.2928429Z + [[ dynamic_cpu_inductor_huggingface == *inductor-micro-benchmark* ]] 2025-08-14T21:29:05.2928694Z + [[ dynamic_cpu_inductor_huggingface == *huggingface* ]] 2025-08-14T21:29:05.2928913Z + install_torchvision 2025-08-14T21:29:05.2929074Z + local orig_preload 2025-08-14T21:29:05.2929227Z + local commit 2025-08-14T21:29:05.2929379Z ++ get_pinned_commit vision 2025-08-14T21:29:05.2929558Z ++ cat .github/ci_commit_pins/vision.txt 2025-08-14T21:29:05.3339159Z + commit=966da7e46f65d6d49df3e31214470a4fe5cc8e66 2025-08-14T21:29:05.3339588Z + orig_preload= 2025-08-14T21:29:05.3339835Z + '[' -n '' ']' 2025-08-14T21:29:05.3340030Z + [[ linux-jammy-py3.9-gcc11-build == *cuda* ]] 2025-08-14T21:29:05.3340449Z + pip_build_and_install git+https://github.com/pytorch/vision.git@966da7e46f65d6d49df3e31214470a4fe5cc8e66 dist/vision 2025-08-14T21:29:05.3340964Z + local build_target=git+https://github.com/pytorch/vision.git@966da7e46f65d6d49df3e31214470a4fe5cc8e66 2025-08-14T21:29:05.3341286Z + local wheel_dir=dist/vision 2025-08-14T21:29:05.3341452Z + local found_whl=0 2025-08-14T21:29:05.3341613Z + for file in "${wheel_dir}"/*.whl 2025-08-14T21:29:05.3341887Z + [[ -f dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl ]] 2025-08-14T21:29:05.3342151Z + found_whl=1 2025-08-14T21:29:05.3342289Z + break 2025-08-14T21:29:05.3342419Z + '[' 1 == 0 ']' 2025-08-14T21:29:05.3342564Z + for file in "${wheel_dir}"/*.whl 2025-08-14T21:29:05.3342842Z + pip_install_whl dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl 2025-08-14T21:29:05.3343492Z + args=('dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl') 2025-08-14T21:29:05.3343768Z + local args 2025-08-14T21:29:05.3344000Z + [[ dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl == *\ * ]] 2025-08-14T21:29:05.3344281Z + for path in "${args[@]}" 2025-08-14T21:29:05.3344556Z + echo 'Installing dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl' 2025-08-14T21:29:05.3345018Z Installing dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl 2025-08-14T21:29:05.3345459Z + python3 -mpip install --no-index --no-deps dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl 2025-08-14T21:29:05.5901742Z Processing ./dist/vision/torchvision-0.22.0a0+966da7e-cp39-cp39-linux_x86_64.whl 2025-08-14T21:29:05.5966294Z Installing collected packages: torchvision 2025-08-14T21:29:06.1370220Z Successfully installed torchvision-0.22.0a0+966da7e 2025-08-14T21:29:06.1717720Z + '[' -n '' ']' 2025-08-14T21:29:06.1722156Z + id=0 2025-08-14T21:29:06.1724044Z + test_dynamo_benchmark huggingface 0 2025-08-14T21:29:06.1724374Z ++ pwd 2025-08-14T21:29:06.1724708Z + TEST_REPORTS_DIR=/var/lib/jenkins/workspace/test/test-reports 2025-08-14T21:29:06.1725021Z + local suite=huggingface 2025-08-14T21:29:06.1729497Z + shift 2025-08-14T21:29:06.1731092Z + local shard_id=0 2025-08-14T21:29:06.1731384Z + shift 2025-08-14T21:29:06.1734981Z + [[ dynamic_cpu_inductor_huggingface == *perf_compare* ]] 2025-08-14T21:29:06.1739638Z + [[ dynamic_cpu_inductor_huggingface == *perf* ]] 2025-08-14T21:29:06.1743870Z + [[ dynamic_cpu_inductor_huggingface == *cpu* ]] 2025-08-14T21:29:06.1745589Z + local dt=float32 2025-08-14T21:29:06.1745928Z + [[ dynamic_cpu_inductor_huggingface == *amp* ]] 2025-08-14T21:29:06.1750542Z + [[ dynamic_cpu_inductor_huggingface == *freezing* ]] 2025-08-14T21:29:06.1752325Z + test_single_dynamo_benchmark inference huggingface 0 --inference --float32 2025-08-14T21:29:06.1752632Z ++ pwd 2025-08-14T21:29:06.1752848Z + TEST_REPORTS_DIR=/var/lib/jenkins/workspace/test/test-reports 2025-08-14T21:29:06.1753124Z + mkdir -p /var/lib/jenkins/workspace/test/test-reports 2025-08-14T21:29:06.1753332Z + local name=inference 2025-08-14T21:29:06.1753490Z + shift 2025-08-14T21:29:06.1753634Z + local suite=huggingface 2025-08-14T21:29:06.1753788Z + shift 2025-08-14T21:29:06.1753921Z + local shard_id=0 2025-08-14T21:29:06.1754064Z + shift 2025-08-14T21:29:06.1754201Z + partition_flags=() 2025-08-14T21:29:06.1754356Z + local partition_flags 2025-08-14T21:29:06.1754511Z + [[ -n 1 ]] 2025-08-14T21:29:06.1754650Z + [[ -n 0 ]] 2025-08-14T21:29:06.1754886Z + partition_flags=(--total-partitions "$NUM_TEST_SHARDS" --partition-id "$shard_id") 2025-08-14T21:29:06.1755212Z + [[ dynamic_cpu_inductor_huggingface == *perf_compare* ]] 2025-08-14T21:29:06.1755453Z + [[ dynamic_cpu_inductor_huggingface == *perf* ]] 2025-08-14T21:29:06.1755668Z + [[ dynamic_cpu_inductor_huggingface == *_avx2* ]] 2025-08-14T21:29:06.1755897Z + [[ dynamic_cpu_inductor_huggingface == *_avx512* ]] 2025-08-14T21:29:06.1756676Z + python benchmarks/dynamo/huggingface.py --ci --accuracy --timing --explain --print-compilation-time --inductor --dynamic-shapes --dynamic-batch-only --device cpu --inference --float32 --total-partitions 1 --partition-id 0 --output /var/lib/jenkins/workspace/test/test-reports/inference_huggingface.csv 2025-08-14T21:29:09.0884768Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:29:09.0885686Z from pkg_resources import resource_filename 2025-08-14T21:29:09.4397089Z 2025-08-14T21:29:09.4434917Z config.json: 0% 0.00/694 [00:00bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8554681Z 2025-08-14T21:30:58.8554810Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8555292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8555738Z layer_outputs = layer_module( 2025-08-14T21:30:58.8556062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8556410Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8556798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8557184Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8557560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8557941Z self_outputs = self.self( 2025-08-14T21:30:58.8558312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8558722Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8559177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8559719Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8559952Z 2025-08-14T21:30:58.8560050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8560523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8560959Z layer_outputs = layer_module( 2025-08-14T21:30:58.8561280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8561613Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8561996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8562376Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8562759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8563152Z self_outputs = self.self( 2025-08-14T21:30:58.8563516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8563926Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8564379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8564920Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8565143Z 2025-08-14T21:30:58.8565240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8565769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8566213Z layer_outputs = layer_module( 2025-08-14T21:30:58.8566534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8566860Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8567242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8567628Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8568008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8568414Z self_outputs = self.self( 2025-08-14T21:30:58.8568780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8569193Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8569650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8570176Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8570404Z 2025-08-14T21:30:58.8570480Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8570679Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8570863Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8571051Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8571267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8571741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8572180Z layer_outputs = layer_module( 2025-08-14T21:30:58.8572501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8572834Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8573212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8573595Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8573984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8574362Z self_outputs = self.self( 2025-08-14T21:30:58.8574723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.8575132Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8575599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8576097Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.8576575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.8577081Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.8577274Z 2025-08-14T21:30:58.8577351Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8577569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8578038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8578483Z layer_outputs = layer_module( 2025-08-14T21:30:58.8578831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8579160Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8579544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8579923Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8580301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8580677Z self_outputs = self.self( 2025-08-14T21:30:58.8581040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.8581455Z attn_scores += diagonal_mask 2025-08-14T21:30:58.8581569Z 2025-08-14T21:30:58.8581670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8582136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8582585Z layer_outputs = layer_module( 2025-08-14T21:30:58.8582908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8583239Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8583622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8584003Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8584387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8584919Z self_outputs = self.self( 2025-08-14T21:30:58.8585292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.8585689Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.8585815Z 2025-08-14T21:30:58.8585920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8586391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8586840Z layer_outputs = layer_module( 2025-08-14T21:30:58.8587162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8587487Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8587878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8588271Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8588653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8589029Z self_outputs = self.self( 2025-08-14T21:30:58.8589398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8589823Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8590306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8590841Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.8591238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.8591567Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.8591710Z 2025-08-14T21:30:58.8591812Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8592346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8592794Z layer_outputs = layer_module( 2025-08-14T21:30:58.8593114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8593437Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8593823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8594209Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8594650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8595024Z self_outputs = self.self( 2025-08-14T21:30:58.8595391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8595822Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8596306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8596804Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.8597269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.8597703Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.8598020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.8598338Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.8598486Z 2025-08-14T21:30:58.8598584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8599059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8599495Z layer_outputs = layer_module( 2025-08-14T21:30:58.8599814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8600144Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8600526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8600903Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8601292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8601678Z self_outputs = self.self( 2025-08-14T21:30:58.8602045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8602462Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8602954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8603472Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.8603662Z 2025-08-14T21:30:58.8603763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8604225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8604671Z layer_outputs = layer_module( 2025-08-14T21:30:58.8604989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8605315Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8605730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8606115Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8606494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8606863Z self_outputs = self.self( 2025-08-14T21:30:58.8607232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8607652Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8608170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8608677Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.8608887Z 2025-08-14T21:30:58.8608985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8609455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8609897Z layer_outputs = layer_module( 2025-08-14T21:30:58.8610209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8610540Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8610923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8611304Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8611686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8612063Z self_outputs = self.self( 2025-08-14T21:30:58.8612432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.8612911Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.8613140Z 2025-08-14T21:30:58.8613235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8613705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8614151Z layer_outputs = layer_module( 2025-08-14T21:30:58.8614459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8614793Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8615178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8615563Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8615940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.8616357Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.8616771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.8617156Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8617293Z 2025-08-14T21:30:58.8617392Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8617861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8618304Z layer_outputs = layer_module( 2025-08-14T21:30:58.8618656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8618992Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8619377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8619765Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8620131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8620494Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8620911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.8621325Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.8621740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.8622134Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8622260Z 2025-08-14T21:30:58.8622360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8622821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8623263Z layer_outputs = layer_module( 2025-08-14T21:30:58.8623584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8623923Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8624299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8624762Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8625145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8625513Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8625892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.8626313Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.8626726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.8627138Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.8627497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.8627818Z return self.act(input) 2025-08-14T21:30:58.8627920Z 2025-08-14T21:30:58.8628023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8628492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8628944Z layer_outputs = layer_module( 2025-08-14T21:30:58.8629267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8629598Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8629975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8630363Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8630736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8631091Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8631503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.8631946Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.8632376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.8632765Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8632899Z 2025-08-14T21:30:58.8632994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8633467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8633946Z layer_outputs = layer_module( 2025-08-14T21:30:58.8634258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8634595Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8634985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8635365Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8635748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8636129Z self_outputs = self.self( 2025-08-14T21:30:58.8636496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.8636878Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.8637013Z 2025-08-14T21:30:58.8637108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8637581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8638022Z layer_outputs = layer_module( 2025-08-14T21:30:58.8638337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8638673Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8639060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8639440Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8639826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8640207Z self_outputs = self.self( 2025-08-14T21:30:58.8640578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8640985Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8641448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8641984Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8642206Z 2025-08-14T21:30:58.8642308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8642772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8643214Z layer_outputs = layer_module( 2025-08-14T21:30:58.8643534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8643867Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8644244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8644627Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8645044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8645418Z self_outputs = self.self( 2025-08-14T21:30:58.8645782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.8646164Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.8646287Z 2025-08-14T21:30:58.8646388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8646848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8647326Z layer_outputs = layer_module( 2025-08-14T21:30:58.8647640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8647972Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8648349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8648732Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8649113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8649482Z self_outputs = self.self( 2025-08-14T21:30:58.8649845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8650258Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8650720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8651252Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8651482Z 2025-08-14T21:30:58.8651578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8652047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8652492Z layer_outputs = layer_module( 2025-08-14T21:30:58.8652803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8653136Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8653523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8653909Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8654284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8654665Z self_outputs = self.self( 2025-08-14T21:30:58.8655030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8655427Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8655882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8656417Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8656642Z 2025-08-14T21:30:58.8656743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8657205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8657652Z layer_outputs = layer_module( 2025-08-14T21:30:58.8658002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8658341Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8658725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8659114Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8659504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8659885Z self_outputs = self.self( 2025-08-14T21:30:58.8660278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8660682Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8661139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8661672Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8661892Z 2025-08-14T21:30:58.8661965Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8662164Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8662355Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8662538Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8662750Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8663222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8663667Z layer_outputs = layer_module( 2025-08-14T21:30:58.8663980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8664313Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8664792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8665183Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8665569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8665950Z self_outputs = self.self( 2025-08-14T21:30:58.8666317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.8666729Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8667198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8667705Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.8668201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.8668684Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.8668880Z 2025-08-14T21:30:58.8668953Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8669175Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8669643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8670094Z layer_outputs = layer_module( 2025-08-14T21:30:58.8670417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8670754Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8671168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8671564Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8671946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8672323Z self_outputs = self.self( 2025-08-14T21:30:58.8672679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.8673058Z attn_scores += diagonal_mask 2025-08-14T21:30:58.8673201Z 2025-08-14T21:30:58.8673308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8673782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8674222Z layer_outputs = layer_module( 2025-08-14T21:30:58.8674545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8674884Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8675265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8675652Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8676043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8676422Z self_outputs = self.self( 2025-08-14T21:30:58.8676789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.8677182Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.8677306Z 2025-08-14T21:30:58.8677412Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8677888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8678330Z layer_outputs = layer_module( 2025-08-14T21:30:58.8678652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8678990Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8679372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8679759Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8680143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8680521Z self_outputs = self.self( 2025-08-14T21:30:58.8680887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.8681285Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.8681416Z 2025-08-14T21:30:58.8681520Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8681991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8682428Z layer_outputs = layer_module( 2025-08-14T21:30:58.8682751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8683089Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8683469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8683855Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8684277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8684812Z self_outputs = self.self( 2025-08-14T21:30:58.8685178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8685607Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8686100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8686704Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.8687088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.8687414Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.8687556Z 2025-08-14T21:30:58.8687663Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8688136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8688573Z layer_outputs = layer_module( 2025-08-14T21:30:58.8688895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8689230Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8689611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8690001Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8690381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8690759Z self_outputs = self.self( 2025-08-14T21:30:58.8691115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8691537Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8692020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8692521Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.8692982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.8693417Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.8693728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.8694044Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.8694190Z 2025-08-14T21:30:58.8694289Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8694763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8695209Z layer_outputs = layer_module( 2025-08-14T21:30:58.8695520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8695855Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8696235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8696616Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8696989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8697364Z self_outputs = self.self( 2025-08-14T21:30:58.8697773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8698207Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8698691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8699211Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.8699406Z 2025-08-14T21:30:58.8699512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8700005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8700448Z layer_outputs = layer_module( 2025-08-14T21:30:58.8700764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8701096Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8701471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8701858Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8702242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8702614Z self_outputs = self.self( 2025-08-14T21:30:58.8702970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8703392Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8703874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8704383Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.8704572Z 2025-08-14T21:30:58.8704711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8705197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8705644Z layer_outputs = layer_module( 2025-08-14T21:30:58.8705955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8706293Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8706681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8707072Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8707455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8707837Z self_outputs = self.self( 2025-08-14T21:30:58.8708208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.8708697Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.8708917Z 2025-08-14T21:30:58.8709012Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8709486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8709936Z layer_outputs = layer_module( 2025-08-14T21:30:58.8710256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8710613Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8710997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8711380Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8711755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.8712171Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.8712586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.8713021Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8713150Z 2025-08-14T21:30:58.8713245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8713727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8714177Z layer_outputs = layer_module( 2025-08-14T21:30:58.8714495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8714820Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8715208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8715599Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8715970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8716331Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8716715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.8717138Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.8717542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.8717933Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8718067Z 2025-08-14T21:30:58.8718164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8718634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8719072Z layer_outputs = layer_module( 2025-08-14T21:30:58.8719390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8719722Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8720104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8720490Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8720865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8721230Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8721607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.8722028Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.8722439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.8722861Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.8723207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.8723525Z return self.act(input) 2025-08-14T21:30:58.8723626Z 2025-08-14T21:30:58.8723758Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8724237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8724674Z layer_outputs = layer_module( 2025-08-14T21:30:58.8724993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8725327Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8725706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8726128Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8726496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8726857Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8727233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.8727666Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.8728086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.8728474Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8728599Z 2025-08-14T21:30:58.8728695Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8729163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8729610Z layer_outputs = layer_module( 2025-08-14T21:30:58.8729927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8730254Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8730639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8731022Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8731397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8731774Z self_outputs = self.self( 2025-08-14T21:30:58.8732141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.8732532Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.8732657Z 2025-08-14T21:30:58.8732751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8733224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8733666Z layer_outputs = layer_module( 2025-08-14T21:30:58.8733979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8734303Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8734685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8735068Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8735439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8735818Z self_outputs = self.self( 2025-08-14T21:30:58.8736178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8736613Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8737067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8737599Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8737831Z 2025-08-14T21:30:58.8737927Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8738398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8738870Z layer_outputs = layer_module( 2025-08-14T21:30:58.8739194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8739528Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8739917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8740297Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8740681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8741061Z self_outputs = self.self( 2025-08-14T21:30:58.8741422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.8741810Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.8741940Z 2025-08-14T21:30:58.8742037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8742507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8742947Z layer_outputs = layer_module( 2025-08-14T21:30:58.8743270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8743611Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8744000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8744379Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8744829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8745223Z self_outputs = self.self( 2025-08-14T21:30:58.8745590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8746002Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8746465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8747007Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8747230Z 2025-08-14T21:30:58.8747325Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8747799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8748247Z layer_outputs = layer_module( 2025-08-14T21:30:58.8748568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8748898Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8749285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8749670Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8750080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8750458Z self_outputs = self.self( 2025-08-14T21:30:58.8750829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8751244Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8751698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8752258Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8752486Z 2025-08-14T21:30:58.8752581Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8753052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8768222Z layer_outputs = layer_module( 2025-08-14T21:30:58.8768639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8769001Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8769424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8769821Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8770221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8770628Z self_outputs = self.self( 2025-08-14T21:30:58.8770996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8771420Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8771888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8772433Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8772666Z 2025-08-14T21:30:58.8772747Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8772952Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8773144Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8773327Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8773556Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8774039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8774494Z layer_outputs = layer_module( 2025-08-14T21:30:58.8774818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8775162Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8775554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8775939Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8776317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8776699Z self_outputs = self.self( 2025-08-14T21:30:58.8777069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.8777476Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8778048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8779422Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.8780417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.8781329Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.8781690Z 2025-08-14T21:30:58.8781821Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8782226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8783462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8784310Z layer_outputs = layer_module( 2025-08-14T21:30:58.8785114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8785738Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8786455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8787165Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8787867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8788539Z self_outputs = self.self( 2025-08-14T21:30:58.8789200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.8789917Z attn_scores += diagonal_mask 2025-08-14T21:30:58.8790122Z 2025-08-14T21:30:58.8790293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8791164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8791974Z layer_outputs = layer_module( 2025-08-14T21:30:58.8792545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8793153Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8793866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8794571Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8795274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8795981Z self_outputs = self.self( 2025-08-14T21:30:58.8796637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.8797361Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.8797588Z 2025-08-14T21:30:58.8797755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8798635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8799450Z layer_outputs = layer_module( 2025-08-14T21:30:58.8800044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8800652Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8801365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8802072Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8802779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8803740Z self_outputs = self.self( 2025-08-14T21:30:58.8804424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.8805148Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.8805387Z 2025-08-14T21:30:58.8805554Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8806432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8807256Z layer_outputs = layer_module( 2025-08-14T21:30:58.8807952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8808561Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8809286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8810000Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8810712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8811411Z self_outputs = self.self( 2025-08-14T21:30:58.8812098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8812893Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8813806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8814811Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.8815531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.8816133Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.8816395Z 2025-08-14T21:30:58.8816567Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8817468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8818290Z layer_outputs = layer_module( 2025-08-14T21:30:58.8818874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8819499Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8820212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8820920Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8821640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8822350Z self_outputs = self.self( 2025-08-14T21:30:58.8823020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8823795Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8824814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8825766Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.8826630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.8827425Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.8828091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.8828687Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.8828937Z 2025-08-14T21:30:58.8829103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8829987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8830812Z layer_outputs = layer_module( 2025-08-14T21:30:58.8831396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8832085Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8832795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8833505Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8834209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8834900Z self_outputs = self.self( 2025-08-14T21:30:58.8835572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8836361Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8837278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8838245Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.8838612Z 2025-08-14T21:30:58.8838782Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8839661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8840503Z layer_outputs = layer_module( 2025-08-14T21:30:58.8841076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8841695Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8842415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8843119Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8843810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8844512Z self_outputs = self.self( 2025-08-14T21:30:58.8845184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8845948Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8846858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8847817Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.8848163Z 2025-08-14T21:30:58.8848336Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8849220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8850046Z layer_outputs = layer_module( 2025-08-14T21:30:58.8850616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8851213Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8851916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8852719Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8853455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8854144Z self_outputs = self.self( 2025-08-14T21:30:58.8854828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.8855723Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.8856130Z 2025-08-14T21:30:58.8856312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8857260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8858077Z layer_outputs = layer_module( 2025-08-14T21:30:58.8858664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8859268Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8859969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8860674Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8861386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.8862151Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.8862917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.8863635Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8863862Z 2025-08-14T21:30:58.8864034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8865005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8865840Z layer_outputs = layer_module( 2025-08-14T21:30:58.8866422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8867040Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8867750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8868479Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8869172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8869842Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8870555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.8871342Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.8872120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.8872839Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8873069Z 2025-08-14T21:30:58.8873236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8874116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8874952Z layer_outputs = layer_module( 2025-08-14T21:30:58.8875525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8876135Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8876936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8877647Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8878319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8878993Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8879695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.8880458Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.8881263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.8882033Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.8882686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.8883249Z return self.act(input) 2025-08-14T21:30:58.8883437Z 2025-08-14T21:30:58.8883606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8884487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8885527Z layer_outputs = layer_module( 2025-08-14T21:30:58.8886074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8886669Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8887376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8888087Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.8888760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.8889416Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.8890128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.8890886Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.8891674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.8892387Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8892620Z 2025-08-14T21:30:58.8892793Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8893655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8894487Z layer_outputs = layer_module( 2025-08-14T21:30:58.8895069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8895679Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8896384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8897093Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8897808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8898511Z self_outputs = self.self( 2025-08-14T21:30:58.8899193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.8899891Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.8900107Z 2025-08-14T21:30:58.8900272Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8901279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8902103Z layer_outputs = layer_module( 2025-08-14T21:30:58.8902686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8903303Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8904003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8904933Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8905641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8906340Z self_outputs = self.self( 2025-08-14T21:30:58.8907011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8907748Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8908590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8909585Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8910006Z 2025-08-14T21:30:58.8910175Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8911067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8911902Z layer_outputs = layer_module( 2025-08-14T21:30:58.8912472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8913071Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8913782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8914489Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8915194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8915905Z self_outputs = self.self( 2025-08-14T21:30:58.8916584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.8917290Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.8917508Z 2025-08-14T21:30:58.8917676Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8918551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8919388Z layer_outputs = layer_module( 2025-08-14T21:30:58.8919949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8920545Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8921242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8921939Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8922627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8923312Z self_outputs = self.self( 2025-08-14T21:30:58.8923978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8924743Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8925675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8926674Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8927081Z 2025-08-14T21:30:58.8927258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8928129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8929034Z layer_outputs = layer_module( 2025-08-14T21:30:58.8929613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8930224Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8930929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8931637Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8932337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8933040Z self_outputs = self.self( 2025-08-14T21:30:58.8933713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8934460Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8935326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8936325Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8936726Z 2025-08-14T21:30:58.8936891Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8937778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8938615Z layer_outputs = layer_module( 2025-08-14T21:30:58.8939187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8939779Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8940475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8941195Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8941893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8942585Z self_outputs = self.self( 2025-08-14T21:30:58.8943270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.8944012Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8944948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8945961Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.8946382Z 2025-08-14T21:30:58.8946505Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8946837Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8947164Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8947491Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8947874Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8948841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8949664Z layer_outputs = layer_module( 2025-08-14T21:30:58.8950245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8950851Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8951545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8952257Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8952975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8953734Z self_outputs = self.self( 2025-08-14T21:30:58.8954418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.8955160Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.8956026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.8956945Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.8957857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.8958761Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.8959110Z 2025-08-14T21:30:58.8959236Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.8959613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8960508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8961349Z layer_outputs = layer_module( 2025-08-14T21:30:58.8961937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8962530Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8963254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8963969Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8964458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8964568Z self_outputs = self.self( 2025-08-14T21:30:58.8965069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.8965175Z attn_scores += diagonal_mask 2025-08-14T21:30:58.8965181Z 2025-08-14T21:30:58.8965358Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8965986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8966091Z layer_outputs = layer_module( 2025-08-14T21:30:58.8966479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8966602Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8967099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8967220Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8967713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8967818Z self_outputs = self.self( 2025-08-14T21:30:58.8968387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.8968519Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.8968526Z 2025-08-14T21:30:58.8968687Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8969298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8969410Z layer_outputs = layer_module( 2025-08-14T21:30:58.8969791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8969981Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8970473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8970587Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8971091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8971199Z self_outputs = self.self( 2025-08-14T21:30:58.8971675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.8971802Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.8971808Z 2025-08-14T21:30:58.8971969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8972586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8972692Z layer_outputs = layer_module( 2025-08-14T21:30:58.8973078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8973203Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8973696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8973816Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8974297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8974402Z self_outputs = self.self( 2025-08-14T21:30:58.8974898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8975084Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8975703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8975992Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.8976321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.8976481Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.8976488Z 2025-08-14T21:30:58.8976624Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8977203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8977318Z layer_outputs = layer_module( 2025-08-14T21:30:58.8977698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8977826Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8978308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8978487Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8978973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8979074Z self_outputs = self.self( 2025-08-14T21:30:58.8979568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8979748Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8980364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8980644Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.8981166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.8981319Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.8981637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.8981785Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.8981791Z 2025-08-14T21:30:58.8981960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8982995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8983111Z layer_outputs = layer_module( 2025-08-14T21:30:58.8983488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8983611Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8984094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8984211Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8984865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8984985Z self_outputs = self.self( 2025-08-14T21:30:58.8985470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8985661Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8986268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8986526Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.8986534Z 2025-08-14T21:30:58.8986707Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8987323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8987440Z layer_outputs = layer_module( 2025-08-14T21:30:58.8987805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8987920Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8988413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8988531Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8989008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8989109Z self_outputs = self.self( 2025-08-14T21:30:58.8989592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.8989935Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.8990554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.8990798Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.8990815Z 2025-08-14T21:30:58.8990983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8991601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8991795Z layer_outputs = layer_module( 2025-08-14T21:30:58.8992171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8992290Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8992793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8992906Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8993417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.8993517Z self_outputs = self.self( 2025-08-14T21:30:58.8994001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.8994316Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.8994327Z 2025-08-14T21:30:58.8994494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8995115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8995220Z layer_outputs = layer_module( 2025-08-14T21:30:58.8995599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8995726Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8996207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.8996325Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.8996824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.8997005Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.8997494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.8997620Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.8997630Z 2025-08-14T21:30:58.8997796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.8998414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.8998519Z layer_outputs = layer_module( 2025-08-14T21:30:58.8998907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.8999024Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.8999524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.8999665Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9000118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9000298Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9000796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9000973Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9001474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9001600Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9001607Z 2025-08-14T21:30:58.9001767Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9002431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9002538Z layer_outputs = layer_module( 2025-08-14T21:30:58.9002931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9003042Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9003522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9003652Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9004092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9004210Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9004698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9004868Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9005361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9005540Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9005911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9006015Z return self.act(input) 2025-08-14T21:30:58.9006021Z 2025-08-14T21:30:58.9006191Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9006800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9006901Z layer_outputs = layer_module( 2025-08-14T21:30:58.9007273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9007399Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9007889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9008025Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9008473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9008587Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9009088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9009282Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9009783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9009908Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9009913Z 2025-08-14T21:30:58.9010078Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9010756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9010863Z layer_outputs = layer_module( 2025-08-14T21:30:58.9011253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9011371Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9011859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9011984Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9012479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9012630Z self_outputs = self.self( 2025-08-14T21:30:58.9013136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.9013264Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.9013270Z 2025-08-14T21:30:58.9013438Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9014045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9014149Z layer_outputs = layer_module( 2025-08-14T21:30:58.9014532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9014647Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9015136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9015267Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9015755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9015863Z self_outputs = self.self( 2025-08-14T21:30:58.9016338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9016495Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9017100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9017404Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9017411Z 2025-08-14T21:30:58.9017574Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9018190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9018298Z layer_outputs = layer_module( 2025-08-14T21:30:58.9018690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9018802Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9019289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9019408Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9019897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9020005Z self_outputs = self.self( 2025-08-14T21:30:58.9020497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.9020616Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.9020621Z 2025-08-14T21:30:58.9020799Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9021467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9021584Z layer_outputs = layer_module( 2025-08-14T21:30:58.9021971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9022088Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9022584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9022697Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9023234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9023342Z self_outputs = self.self( 2025-08-14T21:30:58.9023838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9024005Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9024603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9025003Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9025014Z 2025-08-14T21:30:58.9025188Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9025811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9025932Z layer_outputs = layer_module( 2025-08-14T21:30:58.9026309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9026429Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9026929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9027040Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9027534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9027645Z self_outputs = self.self( 2025-08-14T21:30:58.9028139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9028306Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9028902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9029205Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9029218Z 2025-08-14T21:30:58.9029376Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9029967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9030077Z layer_outputs = layer_module( 2025-08-14T21:30:58.9030455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9030570Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9031058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9031175Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9031683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9031865Z self_outputs = self.self( 2025-08-14T21:30:58.9032352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9032523Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9033113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9033424Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9033430Z 2025-08-14T21:30:58.9033606Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9033724Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9033846Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9033958Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9034119Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9034739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9034843Z layer_outputs = layer_module( 2025-08-14T21:30:58.9035228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9035345Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9035830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9035957Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9036438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9036539Z self_outputs = self.self( 2025-08-14T21:30:58.9037030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.9037195Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9037791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9038026Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.9038590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.9038838Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.9038848Z 2025-08-14T21:30:58.9038972Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9039147Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9039776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9039884Z layer_outputs = layer_module( 2025-08-14T21:30:58.9040277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9040395Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9040900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9041010Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9041492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9041607Z self_outputs = self.self( 2025-08-14T21:30:58.9042087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.9042248Z attn_scores += diagonal_mask 2025-08-14T21:30:58.9042262Z 2025-08-14T21:30:58.9042427Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9043046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9043161Z layer_outputs = layer_module( 2025-08-14T21:30:58.9043536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9043653Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9044194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9044308Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9044799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9044906Z self_outputs = self.self( 2025-08-14T21:30:58.9045394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.9045520Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.9045526Z 2025-08-14T21:30:58.9045689Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9046310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9046415Z layer_outputs = layer_module( 2025-08-14T21:30:58.9046790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9046915Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9047384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9047494Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9047981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9048081Z self_outputs = self.self( 2025-08-14T21:30:58.9048571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.9048698Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.9048704Z 2025-08-14T21:30:58.9048870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9049489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9049593Z layer_outputs = layer_module( 2025-08-14T21:30:58.9049975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9050088Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9050566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9050685Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9051172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9051275Z self_outputs = self.self( 2025-08-14T21:30:58.9051748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9051936Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9052616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9052900Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.9053229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9053386Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9053393Z 2025-08-14T21:30:58.9053559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9054183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9054343Z layer_outputs = layer_module( 2025-08-14T21:30:58.9054725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9054851Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9055346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9055470Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9055943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9056045Z self_outputs = self.self( 2025-08-14T21:30:58.9056539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9056719Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9057339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9057563Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.9058120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.9058266Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.9058591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9058741Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9058746Z 2025-08-14T21:30:58.9058918Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9059519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9059642Z layer_outputs = layer_module( 2025-08-14T21:30:58.9060025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9060146Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9060653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9060771Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9061267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9061373Z self_outputs = self.self( 2025-08-14T21:30:58.9061848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9062039Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9062658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9062899Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9062913Z 2025-08-14T21:30:58.9063140Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9063757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9063873Z layer_outputs = layer_module( 2025-08-14T21:30:58.9064258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9064378Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9064972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9065169Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9065664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9065766Z self_outputs = self.self( 2025-08-14T21:30:58.9066256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9066453Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9067054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9067304Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9067312Z 2025-08-14T21:30:58.9067474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9068087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9068200Z layer_outputs = layer_module( 2025-08-14T21:30:58.9068586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9068713Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9069186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9069298Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9069787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9069888Z self_outputs = self.self( 2025-08-14T21:30:58.9070350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.9070654Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.9070661Z 2025-08-14T21:30:58.9070824Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9071444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9071548Z layer_outputs = layer_module( 2025-08-14T21:30:58.9071926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9072052Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9072531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9072653Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9073144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.9073323Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.9073880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.9074004Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9074011Z 2025-08-14T21:30:58.9074182Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9074802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9074906Z layer_outputs = layer_module( 2025-08-14T21:30:58.9075293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9075455Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9075940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9076072Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9076521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9076649Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9077141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9077305Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9077796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9077919Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9077930Z 2025-08-14T21:30:58.9078106Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9078726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9078838Z layer_outputs = layer_module( 2025-08-14T21:30:58.9079220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9079336Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9079836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9079959Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9080408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9080534Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9081034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9081207Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9081697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9081869Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9082234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9082340Z return self.act(input) 2025-08-14T21:30:58.9082348Z 2025-08-14T21:30:58.9082511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9083140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9083252Z layer_outputs = layer_module( 2025-08-14T21:30:58.9083635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9083751Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9084292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9084424Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9085041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9085161Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9085647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9085834Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9086876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9087001Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9087008Z 2025-08-14T21:30:58.9087170Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9087793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9087899Z layer_outputs = layer_module( 2025-08-14T21:30:58.9088289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9088404Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9088893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9089015Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9089490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9089605Z self_outputs = self.self( 2025-08-14T21:30:58.9090082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.9090205Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.9090211Z 2025-08-14T21:30:58.9090385Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9090993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9091094Z layer_outputs = layer_module( 2025-08-14T21:30:58.9091487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9091608Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9092108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9092218Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9092700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9092811Z self_outputs = self.self( 2025-08-14T21:30:58.9093291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9093458Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9094054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9094365Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9094372Z 2025-08-14T21:30:58.9094543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9095243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9095361Z layer_outputs = layer_module( 2025-08-14T21:30:58.9095738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9095853Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9096352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9096467Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9096953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9097122Z self_outputs = self.self( 2025-08-14T21:30:58.9097616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.9097739Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.9097749Z 2025-08-14T21:30:58.9097917Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9098538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9098649Z layer_outputs = layer_module( 2025-08-14T21:30:58.9099029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9099153Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9099642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9099759Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9100255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9100362Z self_outputs = self.self( 2025-08-14T21:30:58.9100853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9101025Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9101629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9101942Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9101948Z 2025-08-14T21:30:58.9102106Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9102718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9102826Z layer_outputs = layer_module( 2025-08-14T21:30:58.9103209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9103334Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9103822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9103934Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9104428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9104532Z self_outputs = self.self( 2025-08-14T21:30:58.9105110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9105279Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9105968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9106286Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9106293Z 2025-08-14T21:30:58.9106456Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9107076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9107182Z layer_outputs = layer_module( 2025-08-14T21:30:58.9107560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9107754Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9108243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9108359Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9108859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9108963Z self_outputs = self.self( 2025-08-14T21:30:58.9109449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9109607Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9110179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9110488Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9110495Z 2025-08-14T21:30:58.9110617Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9110737Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9110849Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9110966Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9111134Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9111742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9111847Z layer_outputs = layer_module( 2025-08-14T21:30:58.9112224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9112341Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9112835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9112954Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9113435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9113548Z self_outputs = self.self( 2025-08-14T21:30:58.9114033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.9114209Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9114796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9115031Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.9115611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.9115857Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.9115864Z 2025-08-14T21:30:58.9115989Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9116211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9116841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9116956Z layer_outputs = layer_module( 2025-08-14T21:30:58.9117324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9117442Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9117935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9118089Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9118585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9118687Z self_outputs = self.self( 2025-08-14T21:30:58.9119168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.9119279Z attn_scores += diagonal_mask 2025-08-14T21:30:58.9119285Z 2025-08-14T21:30:58.9119450Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9120076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9120181Z layer_outputs = layer_module( 2025-08-14T21:30:58.9120554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9120678Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9121160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9121274Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9121774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9121873Z self_outputs = self.self( 2025-08-14T21:30:58.9122348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.9122464Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.9122470Z 2025-08-14T21:30:58.9122633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9123260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9123365Z layer_outputs = layer_module( 2025-08-14T21:30:58.9123753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9123871Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9124361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9124480Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9124959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9125071Z self_outputs = self.self( 2025-08-14T21:30:58.9125557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.9125691Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.9125698Z 2025-08-14T21:30:58.9125862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9126479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9126639Z layer_outputs = layer_module( 2025-08-14T21:30:58.9127022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9127136Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9127636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9127749Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9128228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9128390Z self_outputs = self.self( 2025-08-14T21:30:58.9128884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9129090Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9129727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9130016Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.9130356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9130506Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9130512Z 2025-08-14T21:30:58.9130681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9131292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9131400Z layer_outputs = layer_module( 2025-08-14T21:30:58.9131777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9131893Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9132384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9132499Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9132979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9133089Z self_outputs = self.self( 2025-08-14T21:30:58.9133573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9133758Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9134381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9134600Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.9135150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.9135284Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.9135607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9135769Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9135774Z 2025-08-14T21:30:58.9135938Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9136555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9136663Z layer_outputs = layer_module( 2025-08-14T21:30:58.9137043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9137230Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9137720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9137835Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9138330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9138432Z self_outputs = self.self( 2025-08-14T21:30:58.9138924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9139166Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9139770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9140027Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9140033Z 2025-08-14T21:30:58.9140196Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9140811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9140919Z layer_outputs = layer_module( 2025-08-14T21:30:58.9141299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9141425Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9141917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9142038Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9142523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9142623Z self_outputs = self.self( 2025-08-14T21:30:58.9143119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9143298Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9143900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9144153Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9144163Z 2025-08-14T21:30:58.9144327Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9145017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9145136Z layer_outputs = layer_module( 2025-08-14T21:30:58.9145515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9145642Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9146109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9146226Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9146713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9146823Z self_outputs = self.self( 2025-08-14T21:30:58.9147305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.9147607Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.9147615Z 2025-08-14T21:30:58.9147864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9148482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9148588Z layer_outputs = layer_module( 2025-08-14T21:30:58.9148977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9149095Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9149589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9149752Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9150241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.9150427Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.9150920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.9151045Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9151051Z 2025-08-14T21:30:58.9151219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9151838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9151954Z layer_outputs = layer_module( 2025-08-14T21:30:58.9152331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9152449Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9152940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9153076Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9153538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9153653Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9154144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9154325Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9154823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9154956Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9154970Z 2025-08-14T21:30:58.9155129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9155746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9155859Z layer_outputs = layer_module( 2025-08-14T21:30:58.9156242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9156359Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9156856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9156982Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9157442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9157562Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9158046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9158280Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9158761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9158940Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9159284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9159387Z return self.act(input) 2025-08-14T21:30:58.9159394Z 2025-08-14T21:30:58.9159561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9160227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9160331Z layer_outputs = layer_module( 2025-08-14T21:30:58.9160712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9160833Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9161335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9161461Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9161912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9162036Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9162531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9162737Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9163228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9163356Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9163362Z 2025-08-14T21:30:58.9163533Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9164152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9164264Z layer_outputs = layer_module( 2025-08-14T21:30:58.9164646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9164758Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9165257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9165378Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9165865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9165977Z self_outputs = self.self( 2025-08-14T21:30:58.9166468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.9166601Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.9166607Z 2025-08-14T21:30:58.9166767Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9167383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9167497Z layer_outputs = layer_module( 2025-08-14T21:30:58.9167885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9168011Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9168562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9168678Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9169177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9169284Z self_outputs = self.self( 2025-08-14T21:30:58.9169767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9169932Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9170527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9170889Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9170895Z 2025-08-14T21:30:58.9171057Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9171661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9171772Z layer_outputs = layer_module( 2025-08-14T21:30:58.9172145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9172270Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9172754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9172872Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9173369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9173473Z self_outputs = self.self( 2025-08-14T21:30:58.9173972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.9174088Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.9174095Z 2025-08-14T21:30:58.9174258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9174881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9174984Z layer_outputs = layer_module( 2025-08-14T21:30:58.9175360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9175490Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9175977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9176099Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9176585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9176679Z self_outputs = self.self( 2025-08-14T21:30:58.9177160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9177315Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9177917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9178222Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9178234Z 2025-08-14T21:30:58.9178398Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9179002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9179161Z layer_outputs = layer_module( 2025-08-14T21:30:58.9179550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9179665Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9180137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9180261Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9180752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9180898Z self_outputs = self.self( 2025-08-14T21:30:58.9181372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9181525Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9182133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9182424Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9182429Z 2025-08-14T21:30:58.9182584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9183190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9183297Z layer_outputs = layer_module( 2025-08-14T21:30:58.9183688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9183805Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9184294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9184418Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9185098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9185214Z self_outputs = self.self( 2025-08-14T21:30:58.9185684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9185833Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9186439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9186745Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9186752Z 2025-08-14T21:30:58.9186882Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9187005Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9187119Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9187244Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9187399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9188008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9188123Z layer_outputs = layer_module( 2025-08-14T21:30:58.9188507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9188638Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9189122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9189238Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9189866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9189974Z self_outputs = self.self( 2025-08-14T21:30:58.9190464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.9190647Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9191253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9191489Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.9192128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.9192367Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.9192389Z 2025-08-14T21:30:58.9192516Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9192686Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9193318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9193428Z layer_outputs = layer_module( 2025-08-14T21:30:58.9193812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9193942Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9194433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9194557Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9195051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9195155Z self_outputs = self.self( 2025-08-14T21:30:58.9195645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.9195748Z attn_scores += diagonal_mask 2025-08-14T21:30:58.9195755Z 2025-08-14T21:30:58.9195920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9196542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9196640Z layer_outputs = layer_module( 2025-08-14T21:30:58.9197034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9197151Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9197638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9197757Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9198238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9198347Z self_outputs = self.self( 2025-08-14T21:30:58.9198825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.9198936Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.9198943Z 2025-08-14T21:30:58.9199108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9199720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9199832Z layer_outputs = layer_module( 2025-08-14T21:30:58.9200242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9200354Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9200846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9200957Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9201450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9201559Z self_outputs = self.self( 2025-08-14T21:30:58.9202056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.9202239Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.9202246Z 2025-08-14T21:30:58.9202404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9203016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9203129Z layer_outputs = layer_module( 2025-08-14T21:30:58.9203511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9203633Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9204116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9204228Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9204725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9204827Z self_outputs = self.self( 2025-08-14T21:30:58.9205303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9205501Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9206111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9206409Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.9206732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9206880Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9206890Z 2025-08-14T21:30:58.9207057Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9207670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9207787Z layer_outputs = layer_module( 2025-08-14T21:30:58.9208171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9208285Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9208782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9208893Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9209381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9209484Z self_outputs = self.self( 2025-08-14T21:30:58.9209965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9210160Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9210839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9211056Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.9211611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.9211744Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.9212074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9212228Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9212309Z 2025-08-14T21:30:58.9212477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9213104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9213215Z layer_outputs = layer_module( 2025-08-14T21:30:58.9213608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9213726Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9214217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9214342Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9214834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9214953Z self_outputs = self.self( 2025-08-14T21:30:58.9215435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9215618Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9216252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9216498Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9216505Z 2025-08-14T21:30:58.9216680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9217288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9217396Z layer_outputs = layer_module( 2025-08-14T21:30:58.9217784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9217906Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9218397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9218523Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9219011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9219123Z self_outputs = self.self( 2025-08-14T21:30:58.9219615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9219802Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9220426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9220674Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9220680Z 2025-08-14T21:30:58.9220854Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9221528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9221636Z layer_outputs = layer_module( 2025-08-14T21:30:58.9222031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9222150Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9222645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9222758Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9223251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9223427Z self_outputs = self.self( 2025-08-14T21:30:58.9223912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.9224223Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.9224239Z 2025-08-14T21:30:58.9224406Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9225116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9225240Z layer_outputs = layer_module( 2025-08-14T21:30:58.9225618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9225743Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9226253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9226365Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9226866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.9227046Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.9227544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.9227674Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9227680Z 2025-08-14T21:30:58.9227842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9228463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9228572Z layer_outputs = layer_module( 2025-08-14T21:30:58.9228952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9229078Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9229568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9229693Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9230147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9230261Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9230771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9230943Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9231440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9231571Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9231576Z 2025-08-14T21:30:58.9231792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9232397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9232499Z layer_outputs = layer_module( 2025-08-14T21:30:58.9232874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9232998Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9233484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9233666Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9234115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9234230Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9234724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9234889Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9235380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9235564Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9235927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9236042Z return self.act(input) 2025-08-14T21:30:58.9236051Z 2025-08-14T21:30:58.9236206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9236819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9236937Z layer_outputs = layer_module( 2025-08-14T21:30:58.9237325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9237454Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9237948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9238075Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9238532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9238641Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9239127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9239323Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9239811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9239941Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9239947Z 2025-08-14T21:30:58.9240107Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9240717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9240834Z layer_outputs = layer_module( 2025-08-14T21:30:58.9241209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9241337Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9241815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9241930Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9242500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9242619Z self_outputs = self.self( 2025-08-14T21:30:58.9243118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.9243244Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.9243250Z 2025-08-14T21:30:58.9243416Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9244039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9244189Z layer_outputs = layer_module( 2025-08-14T21:30:58.9244566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9244686Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9245185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9245304Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9245789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9245895Z self_outputs = self.self( 2025-08-14T21:30:58.9246396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9246557Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9247152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9247460Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9247470Z 2025-08-14T21:30:58.9247628Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9248257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9248360Z layer_outputs = layer_module( 2025-08-14T21:30:58.9248740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9248856Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9249331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9249458Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9249943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9250049Z self_outputs = self.self( 2025-08-14T21:30:58.9250529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.9250644Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.9250651Z 2025-08-14T21:30:58.9250817Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9251423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9251528Z layer_outputs = layer_module( 2025-08-14T21:30:58.9251916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9252028Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9252510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9252678Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9253173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9253281Z self_outputs = self.self( 2025-08-14T21:30:58.9253772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9253931Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9254543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9254894Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9254900Z 2025-08-14T21:30:58.9255060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9255676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9255781Z layer_outputs = layer_module( 2025-08-14T21:30:58.9256165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9256280Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9256779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9256889Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9257373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9257485Z self_outputs = self.self( 2025-08-14T21:30:58.9257988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9258154Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9258746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9259054Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9259061Z 2025-08-14T21:30:58.9259231Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9259858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9259976Z layer_outputs = layer_module( 2025-08-14T21:30:58.9260359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9260476Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9260982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9261099Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9261585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9261695Z self_outputs = self.self( 2025-08-14T21:30:58.9262183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9262354Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9262938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9263235Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9263294Z 2025-08-14T21:30:58.9263429Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9263546Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9263671Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9263782Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9263937Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9264567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9264765Z layer_outputs = layer_module( 2025-08-14T21:30:58.9265211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9265341Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9265824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9265954Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9266429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9266533Z self_outputs = self.self( 2025-08-14T21:30:58.9267017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.9267184Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9267793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9268030Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.9268597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.9268845Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.9268852Z 2025-08-14T21:30:58.9268974Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9269143Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9269757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9269861Z layer_outputs = layer_module( 2025-08-14T21:30:58.9270249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9270370Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9270851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9270972Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9271465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9271576Z self_outputs = self.self( 2025-08-14T21:30:58.9272053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.9272159Z attn_scores += diagonal_mask 2025-08-14T21:30:58.9272164Z 2025-08-14T21:30:58.9272334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9272954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9273076Z layer_outputs = layer_module( 2025-08-14T21:30:58.9273446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9273563Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9274127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9274243Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9274734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9274848Z self_outputs = self.self( 2025-08-14T21:30:58.9275319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.9275497Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.9275502Z 2025-08-14T21:30:58.9275663Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9276271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9276394Z layer_outputs = layer_module( 2025-08-14T21:30:58.9276780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9276908Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9277393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9277505Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9278005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9278110Z self_outputs = self.self( 2025-08-14T21:30:58.9278594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.9278725Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.9278732Z 2025-08-14T21:30:58.9278898Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9279511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9279616Z layer_outputs = layer_module( 2025-08-14T21:30:58.9279995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9280123Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9280609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9280737Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9281220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9281324Z self_outputs = self.self( 2025-08-14T21:30:58.9281829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9282017Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9282629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9282906Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.9283234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9283396Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9283403Z 2025-08-14T21:30:58.9283563Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9284242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9284351Z layer_outputs = layer_module( 2025-08-14T21:30:58.9284870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9285003Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9285505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9285620Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9286125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9286339Z self_outputs = self.self( 2025-08-14T21:30:58.9286835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9287022Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9287636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9287861Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.9288407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.9288544Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.9288863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9289016Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9289022Z 2025-08-14T21:30:58.9289181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9289775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9289876Z layer_outputs = layer_module( 2025-08-14T21:30:58.9290234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9290348Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9290828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9290937Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9291412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9291523Z self_outputs = self.self( 2025-08-14T21:30:58.9292008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9292208Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9292820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9293064Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9293071Z 2025-08-14T21:30:58.9293242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9293854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9293971Z layer_outputs = layer_module( 2025-08-14T21:30:58.9294341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9294459Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9295045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9295155Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9295647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9295751Z self_outputs = self.self( 2025-08-14T21:30:58.9296236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9296434Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9297086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9297332Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9297346Z 2025-08-14T21:30:58.9297508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9298122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9298238Z layer_outputs = layer_module( 2025-08-14T21:30:58.9298618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9298737Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9299228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9299344Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9299845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9299951Z self_outputs = self.self( 2025-08-14T21:30:58.9300442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.9300759Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.9300765Z 2025-08-14T21:30:58.9300929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9301554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9301661Z layer_outputs = layer_module( 2025-08-14T21:30:58.9302039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9302164Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9302644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9302762Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9303262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.9303437Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.9303923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.9304045Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9304051Z 2025-08-14T21:30:58.9304214Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9304959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9305070Z layer_outputs = layer_module( 2025-08-14T21:30:58.9305542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9305666Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9306155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9306293Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9306749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9306870Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9307357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9307582Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9308076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9308201Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9308208Z 2025-08-14T21:30:58.9308376Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9308997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9309101Z layer_outputs = layer_module( 2025-08-14T21:30:58.9309476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9309591Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9310085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9310214Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9310652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9310771Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9311261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9311431Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9311918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9312084Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9312450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9312558Z return self.act(input) 2025-08-14T21:30:58.9312565Z 2025-08-14T21:30:58.9312727Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9313341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9313441Z layer_outputs = layer_module( 2025-08-14T21:30:58.9313809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9313930Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9314419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9314550Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9314997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9315117Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9315613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9315855Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9316352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9316470Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9316476Z 2025-08-14T21:30:58.9316640Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9317261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9317365Z layer_outputs = layer_module( 2025-08-14T21:30:58.9317798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9317913Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9318399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9318524Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9319011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9319117Z self_outputs = self.self( 2025-08-14T21:30:58.9319612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.9319733Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.9319740Z 2025-08-14T21:30:58.9319906Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9320528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9320634Z layer_outputs = layer_module( 2025-08-14T21:30:58.9321027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9321146Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9321641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9321759Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9322238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9322351Z self_outputs = self.self( 2025-08-14T21:30:58.9322839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9323008Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9323621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9323930Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9323936Z 2025-08-14T21:30:58.9324105Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9324727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9324833Z layer_outputs = layer_module( 2025-08-14T21:30:58.9325217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9325332Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9325837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9325948Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9326480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9326590Z self_outputs = self.self( 2025-08-14T21:30:58.9327079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.9327207Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.9327213Z 2025-08-14T21:30:58.9327370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9327996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9328153Z layer_outputs = layer_module( 2025-08-14T21:30:58.9328533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9328647Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9329159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9329277Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9329753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9329855Z self_outputs = self.self( 2025-08-14T21:30:58.9330348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9330509Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9331101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9331415Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9331420Z 2025-08-14T21:30:58.9331587Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9332209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9332321Z layer_outputs = layer_module( 2025-08-14T21:30:58.9332690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9332806Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9333296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9333415Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9333921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9334024Z self_outputs = self.self( 2025-08-14T21:30:58.9334514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9334680Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9335291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9335604Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9335611Z 2025-08-14T21:30:58.9335776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9336397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9336510Z layer_outputs = layer_module( 2025-08-14T21:30:58.9336958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9337084Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9337566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9337678Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9338167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9338269Z self_outputs = self.self( 2025-08-14T21:30:58.9338760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9338969Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9339560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9339872Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9339878Z 2025-08-14T21:30:58.9340004Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9340126Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9340239Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9340354Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9340522Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9341142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9341251Z layer_outputs = layer_module( 2025-08-14T21:30:58.9341649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9341765Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9342260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9342372Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9342851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9342963Z self_outputs = self.self( 2025-08-14T21:30:58.9343458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.9343626Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9344228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9344458Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.9345141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.9345388Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.9345395Z 2025-08-14T21:30:58.9345514Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9345685Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9346297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9346415Z layer_outputs = layer_module( 2025-08-14T21:30:58.9346800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9346920Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9347493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9347611Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9348116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9348222Z self_outputs = self.self( 2025-08-14T21:30:58.9348720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.9348838Z attn_scores += diagonal_mask 2025-08-14T21:30:58.9348843Z 2025-08-14T21:30:58.9349010Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9349687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9349799Z layer_outputs = layer_module( 2025-08-14T21:30:58.9350180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9350310Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9350791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9350903Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9351396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9351495Z self_outputs = self.self( 2025-08-14T21:30:58.9351984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.9352107Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.9352114Z 2025-08-14T21:30:58.9352280Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9352905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9353011Z layer_outputs = layer_module( 2025-08-14T21:30:58.9353394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9353517Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9354008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9354131Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9354624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9354724Z self_outputs = self.self( 2025-08-14T21:30:58.9355212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.9355342Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.9355349Z 2025-08-14T21:30:58.9355529Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9356155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9356262Z layer_outputs = layer_module( 2025-08-14T21:30:58.9356656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9356772Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9357274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9357387Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9357940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9358053Z self_outputs = self.self( 2025-08-14T21:30:58.9358541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9358729Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9359360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9359651Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.9360033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9360184Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9360189Z 2025-08-14T21:30:58.9360349Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9360996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9361105Z layer_outputs = layer_module( 2025-08-14T21:30:58.9361495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9361609Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9362093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9362216Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9362696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9362806Z self_outputs = self.self( 2025-08-14T21:30:58.9363300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9363487Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9364111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9364323Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.9364870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.9365010Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.9365333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9365486Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9365493Z 2025-08-14T21:30:58.9365658Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9366275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9366380Z layer_outputs = layer_module( 2025-08-14T21:30:58.9366757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9366881Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9367373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9367492Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9367995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9368098Z self_outputs = self.self( 2025-08-14T21:30:58.9368651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9368831Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9369441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9369691Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9369698Z 2025-08-14T21:30:58.9369859Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9370467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9370627Z layer_outputs = layer_module( 2025-08-14T21:30:58.9371007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9371133Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9371614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9371728Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9372231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9372336Z self_outputs = self.self( 2025-08-14T21:30:58.9372824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9373009Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9373627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9373862Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9373868Z 2025-08-14T21:30:58.9374014Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9374631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9374735Z layer_outputs = layer_module( 2025-08-14T21:30:58.9375118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9375245Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9375739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9375862Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9376345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9376451Z self_outputs = self.self( 2025-08-14T21:30:58.9376942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.9377246Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.9377252Z 2025-08-14T21:30:58.9377417Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9378031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9378138Z layer_outputs = layer_module( 2025-08-14T21:30:58.9378515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9378629Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9379165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9379284Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9379784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.9379965Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.9380455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.9380579Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9380644Z 2025-08-14T21:30:58.9380816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9381433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9381550Z layer_outputs = layer_module( 2025-08-14T21:30:58.9381917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9382031Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9382522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9382654Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9383103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9383233Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9383727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9383905Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9384399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9384520Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9384526Z 2025-08-14T21:30:58.9384895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9385500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9385619Z layer_outputs = layer_module( 2025-08-14T21:30:58.9386002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9386123Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9386623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9386749Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9387203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9387326Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9387817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9387997Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9388484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9388662Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9389038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9389146Z return self.act(input) 2025-08-14T21:30:58.9389153Z 2025-08-14T21:30:58.9389452Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9390073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9390181Z layer_outputs = layer_module( 2025-08-14T21:30:58.9390572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9390689Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9391180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9391369Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9391816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9391937Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9392441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9392638Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9393113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9393234Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9393241Z 2025-08-14T21:30:58.9393409Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9394015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9394121Z layer_outputs = layer_module( 2025-08-14T21:30:58.9394509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9394625Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9395121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9395233Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9395713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9395825Z self_outputs = self.self( 2025-08-14T21:30:58.9396308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.9396435Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.9396448Z 2025-08-14T21:30:58.9396609Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9397218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9397334Z layer_outputs = layer_module( 2025-08-14T21:30:58.9397706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9397822Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9398314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9398430Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9398923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9399033Z self_outputs = self.self( 2025-08-14T21:30:58.9399528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9399694Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9400337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9400655Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9400662Z 2025-08-14T21:30:58.9400826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9401437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9401556Z layer_outputs = layer_module( 2025-08-14T21:30:58.9401994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9402121Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9402601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9402718Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9403210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9403311Z self_outputs = self.self( 2025-08-14T21:30:58.9403792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.9403914Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.9403920Z 2025-08-14T21:30:58.9404081Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9404712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9404815Z layer_outputs = layer_module( 2025-08-14T21:30:58.9405186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9405312Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9405783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9405898Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9406387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9406483Z self_outputs = self.self( 2025-08-14T21:30:58.9406977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9407137Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9407736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9408069Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9408075Z 2025-08-14T21:30:58.9408237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9408848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9408951Z layer_outputs = layer_module( 2025-08-14T21:30:58.9409330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9409458Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9409951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9410077Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9410613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9410716Z self_outputs = self.self( 2025-08-14T21:30:58.9411224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9411383Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9411979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9412283Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9412336Z 2025-08-14T21:30:58.9412501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9413136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9413249Z layer_outputs = layer_module( 2025-08-14T21:30:58.9413635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9413754Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9414252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9414373Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9414859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9414968Z self_outputs = self.self( 2025-08-14T21:30:58.9415465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9415620Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9416221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9416529Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9416536Z 2025-08-14T21:30:58.9416658Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9416785Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9416902Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9417017Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9417181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9417801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9417914Z layer_outputs = layer_module( 2025-08-14T21:30:58.9418291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9418403Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9418887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9418994Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9419487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9419591Z self_outputs = self.self( 2025-08-14T21:30:58.9420063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.9420243Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9420842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9421138Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.9421705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.9421945Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.9421951Z 2025-08-14T21:30:58.9422077Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9422240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9422858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9423033Z layer_outputs = layer_module( 2025-08-14T21:30:58.9423405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9423534Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9424015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9424126Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9424617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9424817Z self_outputs = self.self( 2025-08-14T21:30:58.9425315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.9425421Z attn_scores += diagonal_mask 2025-08-14T21:30:58.9425427Z 2025-08-14T21:30:58.9425583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9426210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9426314Z layer_outputs = layer_module( 2025-08-14T21:30:58.9426690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9426811Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9427299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9427420Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9427901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9428011Z self_outputs = self.self( 2025-08-14T21:30:58.9428512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.9428636Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.9428642Z 2025-08-14T21:30:58.9428815Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9429436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9429540Z layer_outputs = layer_module( 2025-08-14T21:30:58.9429926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9430044Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9430529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9430648Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9431132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9431243Z self_outputs = self.self( 2025-08-14T21:30:58.9431791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.9431922Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.9431936Z 2025-08-14T21:30:58.9432093Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9432714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9432826Z layer_outputs = layer_module( 2025-08-14T21:30:58.9433204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9433405Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9433905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9434025Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9434521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9434625Z self_outputs = self.self( 2025-08-14T21:30:58.9435119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9435316Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9435937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9436231Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.9436557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9436709Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9436716Z 2025-08-14T21:30:58.9436888Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9437505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9437620Z layer_outputs = layer_module( 2025-08-14T21:30:58.9437989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9438108Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9438594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9438704Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9439190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9439294Z self_outputs = self.self( 2025-08-14T21:30:58.9439782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9439972Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9440583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9440805Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.9441357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.9441500Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.9441831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9442038Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9442044Z 2025-08-14T21:30:58.9442206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9442832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9442940Z layer_outputs = layer_module( 2025-08-14T21:30:58.9443332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9443450Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9443985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9444105Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9444589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9444700Z self_outputs = self.self( 2025-08-14T21:30:58.9445194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9445376Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9445991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9446241Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9446251Z 2025-08-14T21:30:58.9446418Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9447051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9447161Z layer_outputs = layer_module( 2025-08-14T21:30:58.9447565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9447686Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9448171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9448295Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9448787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9448891Z self_outputs = self.self( 2025-08-14T21:30:58.9449394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9449575Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9450208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9450458Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9450464Z 2025-08-14T21:30:58.9450633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9451260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9451368Z layer_outputs = layer_module( 2025-08-14T21:30:58.9451753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9451880Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9452373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9452498Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9453033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9453145Z self_outputs = self.self( 2025-08-14T21:30:58.9453629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.9453931Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.9453939Z 2025-08-14T21:30:58.9454107Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9454765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9454874Z layer_outputs = layer_module( 2025-08-14T21:30:58.9455253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9455370Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9455864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9455979Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9456465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.9456636Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.9457109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.9457245Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9457252Z 2025-08-14T21:30:58.9457414Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9458011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9458126Z layer_outputs = layer_module( 2025-08-14T21:30:58.9458499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9458621Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9459108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9459237Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9459700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9459816Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9460328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9460505Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9461002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9461138Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9461145Z 2025-08-14T21:30:58.9461314Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9461931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9462051Z layer_outputs = layer_module( 2025-08-14T21:30:58.9462425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9462554Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9463123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9463251Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9463706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9463824Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9464322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9464492Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9465076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9465339Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9465696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9465798Z return self.act(input) 2025-08-14T21:30:58.9465809Z 2025-08-14T21:30:58.9465970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9466582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9466698Z layer_outputs = layer_module( 2025-08-14T21:30:58.9467073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9467193Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9467692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9467814Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9468277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9468396Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9468887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9469086Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9469573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9469715Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9469722Z 2025-08-14T21:30:58.9469885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9470505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9470619Z layer_outputs = layer_module( 2025-08-14T21:30:58.9471002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9471121Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9471616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9471733Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9472231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9472335Z self_outputs = self.self( 2025-08-14T21:30:58.9472827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.9472949Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.9472955Z 2025-08-14T21:30:58.9473114Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9473793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9473902Z layer_outputs = layer_module( 2025-08-14T21:30:58.9474292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9474418Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9474912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9475040Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9475592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9475699Z self_outputs = self.self( 2025-08-14T21:30:58.9476196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9476348Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9476952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9477277Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9477284Z 2025-08-14T21:30:58.9477447Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9478072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9478185Z layer_outputs = layer_module( 2025-08-14T21:30:58.9478567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9478692Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9479186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9479311Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9479794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9479898Z self_outputs = self.self( 2025-08-14T21:30:58.9480396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.9480512Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.9480523Z 2025-08-14T21:30:58.9480690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9481317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9481427Z layer_outputs = layer_module( 2025-08-14T21:30:58.9481815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9481928Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9482411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9482533Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9483023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9483139Z self_outputs = self.self( 2025-08-14T21:30:58.9483628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9483783Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9484454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9484883Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9484892Z 2025-08-14T21:30:58.9485066Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9485680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9485786Z layer_outputs = layer_module( 2025-08-14T21:30:58.9486290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9486407Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9486909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9487028Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9487524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9487642Z self_outputs = self.self( 2025-08-14T21:30:58.9488124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9488282Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9488884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9489188Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9489195Z 2025-08-14T21:30:58.9489364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9489980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9490084Z layer_outputs = layer_module( 2025-08-14T21:30:58.9490474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9490592Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9491085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9491205Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9491699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9491813Z self_outputs = self.self( 2025-08-14T21:30:58.9492292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9492461Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9493060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9493361Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9493368Z 2025-08-14T21:30:58.9493499Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9493618Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9493732Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9493856Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9494009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9494639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9494825Z layer_outputs = layer_module( 2025-08-14T21:30:58.9495211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9495337Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9495820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9495948Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9496420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9496564Z self_outputs = self.self( 2025-08-14T21:30:58.9497046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.9497214Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9497805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9498045Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.9498606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.9498847Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.9498854Z 2025-08-14T21:30:58.9498973Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9499139Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9499771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9499878Z layer_outputs = layer_module( 2025-08-14T21:30:58.9500272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9500389Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9500886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9501007Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9501491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9501606Z self_outputs = self.self( 2025-08-14T21:30:58.9502106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.9502211Z attn_scores += diagonal_mask 2025-08-14T21:30:58.9502217Z 2025-08-14T21:30:58.9502387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9502999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9503104Z layer_outputs = layer_module( 2025-08-14T21:30:58.9503499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9503617Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9504114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9504224Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9504794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9504914Z self_outputs = self.self( 2025-08-14T21:30:58.9505476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.9505599Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.9505606Z 2025-08-14T21:30:58.9505770Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9506382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9506496Z layer_outputs = layer_module( 2025-08-14T21:30:58.9506876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9506999Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9507545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9507658Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9508157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9508260Z self_outputs = self.self( 2025-08-14T21:30:58.9508749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.9508882Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.9508889Z 2025-08-14T21:30:58.9509053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9509658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9509768Z layer_outputs = layer_module( 2025-08-14T21:30:58.9510143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9510273Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9510762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9510883Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9511364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9511470Z self_outputs = self.self( 2025-08-14T21:30:58.9511959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9512144Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9512767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9513058Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.9513390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9513552Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9513559Z 2025-08-14T21:30:58.9513727Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9514357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9514474Z layer_outputs = layer_module( 2025-08-14T21:30:58.9514855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9514984Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9515476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9515592Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9516155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9516259Z self_outputs = self.self( 2025-08-14T21:30:58.9516766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9516957Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9517571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9517842Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.9518397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.9518533Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.9518869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9519022Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9519028Z 2025-08-14T21:30:58.9519203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9519810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9519916Z layer_outputs = layer_module( 2025-08-14T21:30:58.9520306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9520426Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9520912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9521025Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9521515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9521628Z self_outputs = self.self( 2025-08-14T21:30:58.9522102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9522283Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9522919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9523164Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9523171Z 2025-08-14T21:30:58.9523342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9523956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9524070Z layer_outputs = layer_module( 2025-08-14T21:30:58.9524452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9524569Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9525066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9525182Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9525668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9525782Z self_outputs = self.self( 2025-08-14T21:30:58.9526265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9526506Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9527117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9527365Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9527372Z 2025-08-14T21:30:58.9527546Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9528160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9528320Z layer_outputs = layer_module( 2025-08-14T21:30:58.9528707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9528821Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9529319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9529434Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9529926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9530037Z self_outputs = self.self( 2025-08-14T21:30:58.9530518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.9530828Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.9530838Z 2025-08-14T21:30:58.9530993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9531597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9531707Z layer_outputs = layer_module( 2025-08-14T21:30:58.9532084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9532208Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9532686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9532796Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9533294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.9533473Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.9533969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.9534095Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9534101Z 2025-08-14T21:30:58.9534272Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9534908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9535012Z layer_outputs = layer_module( 2025-08-14T21:30:58.9535403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9535519Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9536007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9536149Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9536599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9536717Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9537279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9537457Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9537964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9538087Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9538093Z 2025-08-14T21:30:58.9538253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9538867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9539033Z layer_outputs = layer_module( 2025-08-14T21:30:58.9539423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9539548Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9540036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9540171Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9540622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9540737Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9541237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9541415Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9541917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9542091Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9542451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9542562Z return self.act(input) 2025-08-14T21:30:58.9542569Z 2025-08-14T21:30:58.9542731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9543352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9543458Z layer_outputs = layer_module( 2025-08-14T21:30:58.9543843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9543975Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9544455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9544593Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9545125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9545240Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9545715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9545899Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9546384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9546519Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9546525Z 2025-08-14T21:30:58.9546692Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9547305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9547472Z layer_outputs = layer_module( 2025-08-14T21:30:58.9547852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9547980Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9548474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9548602Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9549088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9549240Z self_outputs = self.self( 2025-08-14T21:30:58.9549746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 509, in forward 2025-08-14T21:30:58.9549867Z query_vectors = self.query(hidden_states) 2025-08-14T21:30:58.9549873Z 2025-08-14T21:30:58.9550048Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9550663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9550768Z layer_outputs = layer_module( 2025-08-14T21:30:58.9551161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9551275Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9551760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9551890Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9552365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9552477Z self_outputs = self.self( 2025-08-14T21:30:58.9552968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9553123Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9553724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9554034Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9554040Z 2025-08-14T21:30:58.9554210Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9554823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9554930Z layer_outputs = layer_module( 2025-08-14T21:30:58.9555332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9555449Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9555958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9556072Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9556558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9556667Z self_outputs = self.self( 2025-08-14T21:30:58.9557153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 510, in forward 2025-08-14T21:30:58.9557273Z key_vectors = self.key(hidden_states) 2025-08-14T21:30:58.9557279Z 2025-08-14T21:30:58.9557449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9558113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9558228Z layer_outputs = layer_module( 2025-08-14T21:30:58.9558603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9558721Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9559218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9559334Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9559828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9559986Z self_outputs = self.self( 2025-08-14T21:30:58.9560480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9560651Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9561256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9561563Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9561579Z 2025-08-14T21:30:58.9561742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9562355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9562476Z layer_outputs = layer_module( 2025-08-14T21:30:58.9562855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9562971Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9563469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9563583Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9564084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9564188Z self_outputs = self.self( 2025-08-14T21:30:58.9564668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9564831Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9565429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9565737Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9565744Z 2025-08-14T21:30:58.9565905Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9566521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9566635Z layer_outputs = layer_module( 2025-08-14T21:30:58.9567009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9567131Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9567617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9567735Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9568220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9568321Z self_outputs = self.self( 2025-08-14T21:30:58.9568866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 524, in forward 2025-08-14T21:30:58.9569036Z attn_scores = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9569636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 796, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9569947Z diagonal_chunked_attention_scores = torch.einsum("bcxd,bcyd->bcxy", (query, key)) # multiply 2025-08-14T21:30:58.9569953Z 2025-08-14T21:30:58.9570080Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9570243Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9570363Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9570478Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9570653Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9571280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9571389Z layer_outputs = layer_module( 2025-08-14T21:30:58.9571784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9571903Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9572390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9572514Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9573004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9573115Z self_outputs = self.self( 2025-08-14T21:30:58.9573612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 536, in forward 2025-08-14T21:30:58.9573787Z diagonal_mask = self._sliding_chunks_query_key_matmul( 2025-08-14T21:30:58.9574404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 834, in _sliding_chunks_query_key_matmul 2025-08-14T21:30:58.9574636Z self._mask_invalid_locations(diagonal_attention_scores, window_overlap) 2025-08-14T21:30:58.9575210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 762, in _mask_invalid_locations 2025-08-14T21:30:58.9575453Z input_tensor[:, :affected_seq_len, :, : affected_seq_len + 1] = torch.full_like( 2025-08-14T21:30:58.9575464Z 2025-08-14T21:30:58.9575587Z cudagraph partition due to non gpu ops 2025-08-14T21:30:58.9575763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9576390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9576499Z layer_outputs = layer_module( 2025-08-14T21:30:58.9576875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9576990Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9577486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9577598Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9578077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9578193Z self_outputs = self.self( 2025-08-14T21:30:58.9578680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 541, in forward 2025-08-14T21:30:58.9578798Z attn_scores += diagonal_mask 2025-08-14T21:30:58.9578803Z 2025-08-14T21:30:58.9579025Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9579643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9579757Z layer_outputs = layer_module( 2025-08-14T21:30:58.9580143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9580270Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9580757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9580934Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9581424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9581528Z self_outputs = self.self( 2025-08-14T21:30:58.9582030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 579, in forward 2025-08-14T21:30:58.9582160Z attn_probs = nn.functional.softmax( 2025-08-14T21:30:58.9582166Z 2025-08-14T21:30:58.9582330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9582964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9583080Z layer_outputs = layer_module( 2025-08-14T21:30:58.9583461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9583579Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9584060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9584181Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9584853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9584954Z self_outputs = self.self( 2025-08-14T21:30:58.9585432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 511, in forward 2025-08-14T21:30:58.9585554Z value_vectors = self.value(hidden_states) 2025-08-14T21:30:58.9585560Z 2025-08-14T21:30:58.9585728Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9586340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9586453Z layer_outputs = layer_module( 2025-08-14T21:30:58.9586838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9586957Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9587445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9587563Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9588045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9588155Z self_outputs = self.self( 2025-08-14T21:30:58.9588642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9588836Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9589474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 863, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9589871Z padded_value = nn.functional.pad(value, (0, 0, window_overlap, window_overlap), value=-1) 2025-08-14T21:30:58.9590211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9590364Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9590371Z 2025-08-14T21:30:58.9590535Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9591155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9591261Z layer_outputs = layer_module( 2025-08-14T21:30:58.9591721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9591840Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9592325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9592448Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9592939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9593040Z self_outputs = self.self( 2025-08-14T21:30:58.9593535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9593717Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9594340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 876, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9594565Z chunked_attn_probs = self._pad_and_diagonalize(chunked_attn_probs) 2025-08-14T21:30:58.9595125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 699, in _pad_and_diagonalize 2025-08-14T21:30:58.9595271Z chunked_hidden_states = nn.functional.pad( 2025-08-14T21:30:58.9595591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/nn/functional.py", line 5294, in pad 2025-08-14T21:30:58.9595749Z return torch._C._nn.pad(input, pad, mode, value) 2025-08-14T21:30:58.9595755Z 2025-08-14T21:30:58.9595914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9596512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9596630Z layer_outputs = layer_module( 2025-08-14T21:30:58.9597008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9597126Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9597624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9597737Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9598227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9598333Z self_outputs = self.self( 2025-08-14T21:30:58.9598819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9599016Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9599624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9599876Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9599883Z 2025-08-14T21:30:58.9600049Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9600722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9600841Z layer_outputs = layer_module( 2025-08-14T21:30:58.9601228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9601360Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9601853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9602060Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9602569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9602675Z self_outputs = self.self( 2025-08-14T21:30:58.9603166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 613, in forward 2025-08-14T21:30:58.9603354Z attn_output = self._sliding_chunks_matmul_attn_probs_value( 2025-08-14T21:30:58.9603968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 878, in _sliding_chunks_matmul_attn_probs_value 2025-08-14T21:30:58.9604223Z context = torch.einsum("bcwd,bcdh->bcwh", (chunked_attn_probs, chunked_value)) 2025-08-14T21:30:58.9604229Z 2025-08-14T21:30:58.9604396Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9605014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9605135Z layer_outputs = layer_module( 2025-08-14T21:30:58.9605520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9605653Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9606145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9606259Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9606754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1135, in forward 2025-08-14T21:30:58.9606857Z self_outputs = self.self( 2025-08-14T21:30:58.9607349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 618, in forward 2025-08-14T21:30:58.9607658Z attn_output = attn_output.transpose(0, 1).reshape(seq_len, batch_size, embed_dim).contiguous() 2025-08-14T21:30:58.9607665Z 2025-08-14T21:30:58.9607834Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9608468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9608574Z layer_outputs = layer_module( 2025-08-14T21:30:58.9608967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9609083Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9609561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1199, in forward 2025-08-14T21:30:58.9609685Z self_attn_outputs = self.attention( 2025-08-14T21:30:58.9610174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1144, in forward 2025-08-14T21:30:58.9610353Z attn_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:30:58.9610845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1094, in forward 2025-08-14T21:30:58.9611019Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9611026Z 2025-08-14T21:30:58.9611200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9611824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9611931Z layer_outputs = layer_module( 2025-08-14T21:30:58.9612327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9612440Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9612986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9613112Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9613566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9613691Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9614185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9614365Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9614856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1160, in forward 2025-08-14T21:30:58.9614978Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9614984Z 2025-08-14T21:30:58.9615160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9615783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9615891Z layer_outputs = layer_module( 2025-08-14T21:30:58.9616270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9616383Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9616878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9617001Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9617443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9617564Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9618066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1218, in ff_chunk 2025-08-14T21:30:58.9618248Z intermediate_output = self.intermediate(attn_output) 2025-08-14T21:30:58.9618731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1161, in forward 2025-08-14T21:30:58.9618903Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:30:58.9619270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:30:58.9619372Z return self.act(input) 2025-08-14T21:30:58.9619379Z 2025-08-14T21:30:58.9619551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:30:58.9620160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1259, in torch_dynamo_resume_in_forward_at_1244 2025-08-14T21:30:58.9620267Z layer_outputs = layer_module( 2025-08-14T21:30:58.9620650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:30:58.9620766Z return super().__call__(*args, **kwargs) 2025-08-14T21:30:58.9621323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1211, in forward 2025-08-14T21:30:58.9621459Z layer_output = apply_chunking_to_forward( 2025-08-14T21:30:58.9621902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:30:58.9622024Z return forward_fn(*input_tensors) 2025-08-14T21:30:58.9622524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1219, in ff_chunk 2025-08-14T21:30:58.9622710Z layer_output = self.output(intermediate_output, attn_output) 2025-08-14T21:30:58.9623198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1174, in forward 2025-08-14T21:30:58.9623362Z hidden_states = self.dense(hidden_states) 2025-08-14T21:30:58.9623368Z 2025-08-14T21:32:01.0617917Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:01.0622062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1716, in torch_dynamo_resume_in_forward_at_1703 2025-08-14T21:32:01.0623053Z prediction_scores = self.lm_head(sequence_output) 2025-08-14T21:32:01.0623497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1333, in forward 2025-08-14T21:32:01.0623905Z x = self.dense(features) 2025-08-14T21:32:01.0624023Z 2025-08-14T21:32:01.0624129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:01.0624632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1716, in torch_dynamo_resume_in_forward_at_1703 2025-08-14T21:32:01.0625243Z prediction_scores = self.lm_head(sequence_output) 2025-08-14T21:32:01.0625646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1338, in forward 2025-08-14T21:32:01.0626035Z x = self.decoder(x) 2025-08-14T21:32:01.0626142Z 2025-08-14T21:32:01.0626237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:01.0626722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/longformer/modeling_longformer.py", line 1723, in torch_dynamo_resume_in_forward_at_1703 2025-08-14T21:32:01.0627276Z masked_lm_loss = loss_fct(prediction_scores.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:32:01.0627506Z 2025-08-14T21:32:02.4403672Z Compilation time (from dynamo_timed): 89.78628015 2025-08-14T21:32:02.4604621Z pass 2025-08-14T21:32:02.4606674Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:32:02.4607577Z TIMING: gc:0.00717 entire_frame_compile:89.78628 _recursive_pre_grad_passes:0.01772 _recursive_joint_graph_passes:0.8791 _recursive_post_grad_passes:1.62378 async_compile.wait:2.61581 code_gen:70.33277 inductor_compile:76.73429 backend_compile:85.2827 total_wall_time:89.78628 2025-08-14T21:32:02.4609136Z STATS: call_* op count: 1787 | FakeTensorMode.__torch_dispatch__:56346 | FakeTensor.__torch_dispatch__:16842 | ProxyTorchDispatchMode.__torch_dispatch__:17446 2025-08-14T21:32:02.4609618Z Dynamo produced 4 graphs covering 1787 ops with 4 graph breaks (1 unique) 2025-08-14T21:32:07.0569921Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:32:07.0571935Z from pkg_resources import resource_filename 2025-08-14T21:32:07.6036335Z 2025-08-14T21:32:10.1059923Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:32:10.1063472Z loading model: 0it [00:02, ?it/s] 2025-08-14T21:32:10.1076874Z cpu eval BartForCausalLM 2025-08-14T21:32:11.2347017Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:32:11.6987421Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:32:12.2067710Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:32:19.0119166Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0120785Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0121122Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0126238Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0128131Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0128892Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0132861Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0134731Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0135069Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0139776Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0140193Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0140478Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0145493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0149329Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0151147Z return mod(**inputs) 2025-08-14T21:32:19.0151675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0156132Z outputs = self.model.decoder( 2025-08-14T21:32:19.0158515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0162888Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0167228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0168919Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0169525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0169997Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0173651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0177487Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0179078Z 2025-08-14T21:32:19.0179363Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0179765Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0180230Z return mod(**inputs) 2025-08-14T21:32:19.0181058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0181645Z outputs = self.model.decoder( 2025-08-14T21:32:19.0182509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0183068Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0183861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0184420Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0185154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0185549Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0185950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0186320Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0186450Z 2025-08-14T21:32:19.0186554Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0187098Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0187414Z return mod(**inputs) 2025-08-14T21:32:19.0187754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0188111Z outputs = self.model.decoder( 2025-08-14T21:32:19.0188471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0188827Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0189155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0190234Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0190593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0190973Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0191343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0191705Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0191841Z 2025-08-14T21:32:19.0191918Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0192120Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0192324Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0192517Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0192738Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0193086Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0193396Z return mod(**inputs) 2025-08-14T21:32:19.0193739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0194110Z outputs = self.model.decoder( 2025-08-14T21:32:19.0194462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0194824Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0195162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0195500Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0195865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0196253Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0196640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0197015Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0197444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0197924Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0198098Z 2025-08-14T21:32:19.0198206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0198530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0198829Z return mod(**inputs) 2025-08-14T21:32:19.0199161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0199513Z outputs = self.model.decoder( 2025-08-14T21:32:19.0199876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0200228Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0200593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0200932Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0201286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0201660Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0202022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0202395Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0202802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0203263Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0203412Z 2025-08-14T21:32:19.0203510Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0203841Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0204137Z return mod(**inputs) 2025-08-14T21:32:19.0204467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0204816Z outputs = self.model.decoder( 2025-08-14T21:32:19.0205165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0205514Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0205824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0206164Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0206513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0206884Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0207246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0207608Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0207732Z 2025-08-14T21:32:19.0207835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0208164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0208457Z return mod(**inputs) 2025-08-14T21:32:19.0208788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0209143Z outputs = self.model.decoder( 2025-08-14T21:32:19.0209480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0209884Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0210213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0210548Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0210891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0211290Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0211449Z 2025-08-14T21:32:19.0211556Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0211878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0212181Z return mod(**inputs) 2025-08-14T21:32:19.0212518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0212870Z outputs = self.model.decoder( 2025-08-14T21:32:19.0213207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0213600Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0213919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0214241Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0214592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0214984Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0215340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0215688Z return self.act(input) 2025-08-14T21:32:19.0215798Z 2025-08-14T21:32:19.0215895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0216229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0216532Z return mod(**inputs) 2025-08-14T21:32:19.0216858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0217211Z outputs = self.model.decoder( 2025-08-14T21:32:19.0217556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0217900Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0218220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0218551Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0218909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0219260Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0219390Z 2025-08-14T21:32:19.0219487Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0219822Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0220116Z return mod(**inputs) 2025-08-14T21:32:19.0220450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0220802Z outputs = self.model.decoder( 2025-08-14T21:32:19.0221151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0221494Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0221814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0222151Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0222499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0222876Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0223250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0223675Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0223866Z 2025-08-14T21:32:19.0223963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0224294Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0224596Z return mod(**inputs) 2025-08-14T21:32:19.0225004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0225357Z outputs = self.model.decoder( 2025-08-14T21:32:19.0225705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0226062Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0226428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0226769Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0227129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0227505Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0227869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0228228Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0228387Z 2025-08-14T21:32:19.0228488Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0228817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0229109Z return mod(**inputs) 2025-08-14T21:32:19.0229440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0229795Z outputs = self.model.decoder( 2025-08-14T21:32:19.0230133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0230485Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0230802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0231130Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0231474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0231847Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0232216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0232564Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0232700Z 2025-08-14T21:32:19.0232774Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0232970Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0233159Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0233341Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0233554Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0233881Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0234168Z return mod(**inputs) 2025-08-14T21:32:19.0234495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0234851Z outputs = self.model.decoder( 2025-08-14T21:32:19.0235194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0235536Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0235855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0236183Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0236526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0236899Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0237266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0237636Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0238038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0238478Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0238654Z 2025-08-14T21:32:19.0238784Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0239119Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0239410Z return mod(**inputs) 2025-08-14T21:32:19.0239738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0240091Z outputs = self.model.decoder( 2025-08-14T21:32:19.0240431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0240782Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0241157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0241491Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0241836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0242213Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0242584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0242957Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0243359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0243781Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0243930Z 2025-08-14T21:32:19.0244033Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0244361Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0244657Z return mod(**inputs) 2025-08-14T21:32:19.0244987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0245343Z outputs = self.model.decoder( 2025-08-14T21:32:19.0245684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0246034Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0246355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0246679Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0247029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0247405Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0247773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0248122Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0248253Z 2025-08-14T21:32:19.0248350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0248680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0248977Z return mod(**inputs) 2025-08-14T21:32:19.0249296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0249645Z outputs = self.model.decoder( 2025-08-14T21:32:19.0249990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0250337Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0250657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0250986Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0251334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0251754Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0251921Z 2025-08-14T21:32:19.0252017Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0252351Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0252643Z return mod(**inputs) 2025-08-14T21:32:19.0252973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0253329Z outputs = self.model.decoder( 2025-08-14T21:32:19.0253673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0254054Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0254374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0254708Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0255060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0255444Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0255799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0256116Z return self.act(input) 2025-08-14T21:32:19.0256219Z 2025-08-14T21:32:19.0256314Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0256646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0256949Z return mod(**inputs) 2025-08-14T21:32:19.0257279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0257624Z outputs = self.model.decoder( 2025-08-14T21:32:19.0257969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0258321Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0258634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0258969Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0259322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0259675Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0259800Z 2025-08-14T21:32:19.0259896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0260226Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0260524Z return mod(**inputs) 2025-08-14T21:32:19.0260847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0261198Z outputs = self.model.decoder( 2025-08-14T21:32:19.0261543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0261893Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0262204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0262536Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0262892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0263269Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0263633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0264052Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0264271Z 2025-08-14T21:32:19.0264378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0264769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0265073Z return mod(**inputs) 2025-08-14T21:32:19.0265407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0265764Z outputs = self.model.decoder( 2025-08-14T21:32:19.0266106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0266511Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0266832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0267165Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0267515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0267892Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0268259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0268608Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0268741Z 2025-08-14T21:32:19.0268836Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0269168Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0269471Z return mod(**inputs) 2025-08-14T21:32:19.0269798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0270149Z outputs = self.model.decoder( 2025-08-14T21:32:19.0270502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0270847Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0271166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0271498Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0271852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0272217Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0272588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0272952Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0273081Z 2025-08-14T21:32:19.0273162Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0273353Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0273548Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0273742Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0273950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0274283Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0274585Z return mod(**inputs) 2025-08-14T21:32:19.0274906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0275263Z outputs = self.model.decoder( 2025-08-14T21:32:19.0275610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0275963Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0276277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0276608Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0276992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0277366Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0277728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0278098Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0278506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0278943Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0279150Z 2025-08-14T21:32:19.0279245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0279573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0279871Z return mod(**inputs) 2025-08-14T21:32:19.0280191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0280544Z outputs = self.model.decoder( 2025-08-14T21:32:19.0280888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0281229Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0281542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0281873Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0282223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0282588Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0282953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0283325Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0283729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0284141Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0284297Z 2025-08-14T21:32:19.0284391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0284845Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0285143Z return mod(**inputs) 2025-08-14T21:32:19.0285475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0285834Z outputs = self.model.decoder( 2025-08-14T21:32:19.0286183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0286530Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0286854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0287192Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0287537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0287911Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0288282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0288644Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0288772Z 2025-08-14T21:32:19.0288866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0289197Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0289497Z return mod(**inputs) 2025-08-14T21:32:19.0289894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0290247Z outputs = self.model.decoder( 2025-08-14T21:32:19.0290595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0290949Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0291264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0291600Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0291957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0292402Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0292561Z 2025-08-14T21:32:19.0292657Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0292994Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0293296Z return mod(**inputs) 2025-08-14T21:32:19.0293625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0293976Z outputs = self.model.decoder( 2025-08-14T21:32:19.0294323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0294675Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0294985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0295317Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0295668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0296056Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0296409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0296723Z return self.act(input) 2025-08-14T21:32:19.0296823Z 2025-08-14T21:32:19.0296927Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0297250Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0297546Z return mod(**inputs) 2025-08-14T21:32:19.0297874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0298227Z outputs = self.model.decoder( 2025-08-14T21:32:19.0298565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0298912Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0299232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0299557Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0299912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0300269Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0300395Z 2025-08-14T21:32:19.0300498Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0300820Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0301123Z return mod(**inputs) 2025-08-14T21:32:19.0301455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0301808Z outputs = self.model.decoder( 2025-08-14T21:32:19.0302145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0302529Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0302853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0303178Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0303529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0303911Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0304282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0304788Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0304991Z 2025-08-14T21:32:19.0305087Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0305419Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0305719Z return mod(**inputs) 2025-08-14T21:32:19.0306047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0306403Z outputs = self.model.decoder( 2025-08-14T21:32:19.0306750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0307097Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0307418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0307752Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0308110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0308478Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0308851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0309209Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0309333Z 2025-08-14T21:32:19.0309428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0309760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0310057Z return mod(**inputs) 2025-08-14T21:32:19.0310383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0310728Z outputs = self.model.decoder( 2025-08-14T21:32:19.0311078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0311430Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0311743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0312076Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0312429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0312803Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0313164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0313524Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0313654Z 2025-08-14T21:32:19.0313733Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0313929Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0314116Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0314306Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0314522Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0314844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0315181Z return mod(**inputs) 2025-08-14T21:32:19.0315516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0315867Z outputs = self.model.decoder( 2025-08-14T21:32:19.0316214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0316566Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0316888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0317267Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0317621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0317997Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0318370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0318741Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0319156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0319602Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0319771Z 2025-08-14T21:32:19.0319866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0320197Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0320499Z return mod(**inputs) 2025-08-14T21:32:19.0320828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0321178Z outputs = self.model.decoder( 2025-08-14T21:32:19.0321531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0321883Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0322200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0322524Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0322877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0323251Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0323610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0323986Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0324391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0324814Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0324961Z 2025-08-14T21:32:19.0325055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0325383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0325681Z return mod(**inputs) 2025-08-14T21:32:19.0326001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0326358Z outputs = self.model.decoder( 2025-08-14T21:32:19.0326702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0327060Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0327370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0327705Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0328089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0328465Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0328826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0329182Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0329306Z 2025-08-14T21:32:19.0329408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0329732Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0330072Z return mod(**inputs) 2025-08-14T21:32:19.0330402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0330759Z outputs = self.model.decoder( 2025-08-14T21:32:19.0331100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0331452Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0331771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0332096Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0332449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0332845Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0333004Z 2025-08-14T21:32:19.0333109Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0333437Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0333737Z return mod(**inputs) 2025-08-14T21:32:19.0334068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0334426Z outputs = self.model.decoder( 2025-08-14T21:32:19.0334765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0335117Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0335439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0335765Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0336119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0336519Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0336876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0337181Z return self.act(input) 2025-08-14T21:32:19.0337289Z 2025-08-14T21:32:19.0337385Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0337716Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0338008Z return mod(**inputs) 2025-08-14T21:32:19.0338339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0338688Z outputs = self.model.decoder( 2025-08-14T21:32:19.0339033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0339375Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0339699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0340030Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0340382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0340763Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0340900Z 2025-08-14T21:32:19.0340993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0341323Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0341614Z return mod(**inputs) 2025-08-14T21:32:19.0341941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0342295Z outputs = self.model.decoder( 2025-08-14T21:32:19.0342641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0343013Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0343328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0343657Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0344004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0344377Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0344807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0345241Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0345429Z 2025-08-14T21:32:19.0345525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0345859Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0346161Z return mod(**inputs) 2025-08-14T21:32:19.0346492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0346842Z outputs = self.model.decoder( 2025-08-14T21:32:19.0347191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0347545Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0347853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0348188Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0348543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0348918Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0349282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0349638Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0349762Z 2025-08-14T21:32:19.0349865Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0350188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0350488Z return mod(**inputs) 2025-08-14T21:32:19.0350814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0351168Z outputs = self.model.decoder( 2025-08-14T21:32:19.0351504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0351854Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0352171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0352502Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0352843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0353254Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0353624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0353979Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0354114Z 2025-08-14T21:32:19.0354189Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0354387Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0354578Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0354762Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0354976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0355344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0355637Z return mod(**inputs) 2025-08-14T21:32:19.0355975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0356338Z outputs = self.model.decoder( 2025-08-14T21:32:19.0356687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0357034Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0357357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0357692Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0358038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0358413Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0358787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0359158Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0359563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0360011Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0360179Z 2025-08-14T21:32:19.0360280Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0360609Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0360902Z return mod(**inputs) 2025-08-14T21:32:19.0361228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0361582Z outputs = self.model.decoder( 2025-08-14T21:32:19.0361921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0362273Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0362591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0362924Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0363271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0363641Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0364011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0364378Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0364789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0365215Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0365363Z 2025-08-14T21:32:19.0365463Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0365843Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0366151Z return mod(**inputs) 2025-08-14T21:32:19.0366501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0366864Z outputs = self.model.decoder( 2025-08-14T21:32:19.0367210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0367569Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0367891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0368254Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0368614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0368993Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0369365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0369720Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0369853Z 2025-08-14T21:32:19.0369949Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0370279Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0370570Z return mod(**inputs) 2025-08-14T21:32:19.0370899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0371253Z outputs = self.model.decoder( 2025-08-14T21:32:19.0371598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0371940Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0372260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0372593Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0372943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0373328Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0373492Z 2025-08-14T21:32:19.0373586Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0374074Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0374365Z return mod(**inputs) 2025-08-14T21:32:19.0374695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0375042Z outputs = self.model.decoder( 2025-08-14T21:32:19.0375387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0375733Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0376051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0376385Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0376725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0377114Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0377467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0377784Z return self.act(input) 2025-08-14T21:32:19.0377885Z 2025-08-14T21:32:19.0377979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0378311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0378609Z return mod(**inputs) 2025-08-14T21:32:19.0378974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0379325Z outputs = self.model.decoder( 2025-08-14T21:32:19.0379669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0380017Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0380327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0380659Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0381072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0381432Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0381557Z 2025-08-14T21:32:19.0381652Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0381991Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0382294Z return mod(**inputs) 2025-08-14T21:32:19.0382614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0382968Z outputs = self.model.decoder( 2025-08-14T21:32:19.0383312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0383661Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0383972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0384309Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0384863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0385249Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0385618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0386043Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0386232Z 2025-08-14T21:32:19.0386337Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0386661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0386964Z return mod(**inputs) 2025-08-14T21:32:19.0387293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0387649Z outputs = self.model.decoder( 2025-08-14T21:32:19.0387986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0388340Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0388661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0388987Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0389338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0389711Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0390080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0390428Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0390559Z 2025-08-14T21:32:19.0390653Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0390982Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0391279Z return mod(**inputs) 2025-08-14T21:32:19.0391652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0392008Z outputs = self.model.decoder( 2025-08-14T21:32:19.0392352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0392693Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0393009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0393339Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0393691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0394105Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0394479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0394844Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0394981Z 2025-08-14T21:32:19.0395065Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0395256Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0395452Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0395643Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0395852Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0396188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0396490Z return mod(**inputs) 2025-08-14T21:32:19.0396814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0397173Z outputs = self.model.decoder( 2025-08-14T21:32:19.0397520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0397871Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0398186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0398520Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0398873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0399239Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0399608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0399981Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0400391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0400826Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0401004Z 2025-08-14T21:32:19.0401103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0401437Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0401738Z return mod(**inputs) 2025-08-14T21:32:19.0402060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0402412Z outputs = self.model.decoder( 2025-08-14T21:32:19.0402756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0403097Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0403418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0403749Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0404101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0404492Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0404866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0405238Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0405649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0406065Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0406224Z 2025-08-14T21:32:19.0406317Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0406673Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0406961Z return mod(**inputs) 2025-08-14T21:32:19.0407287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0407639Z outputs = self.model.decoder( 2025-08-14T21:32:19.0407984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0408326Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0408642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0408971Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0409315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0409691Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0410062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0410421Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0410544Z 2025-08-14T21:32:19.0410639Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0410966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0411263Z return mod(**inputs) 2025-08-14T21:32:19.0411591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0411935Z outputs = self.model.decoder( 2025-08-14T21:32:19.0412279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0412633Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0412949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0413286Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0413637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0414034Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0414192Z 2025-08-14T21:32:19.0414287Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0414616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0414914Z return mod(**inputs) 2025-08-14T21:32:19.0415230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0415580Z outputs = self.model.decoder( 2025-08-14T21:32:19.0415923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0416275Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0416583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0416914Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0417300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0417695Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0418046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0418360Z return self.act(input) 2025-08-14T21:32:19.0418463Z 2025-08-14T21:32:19.0418566Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0418888Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0419215Z return mod(**inputs) 2025-08-14T21:32:19.0419541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0419893Z outputs = self.model.decoder( 2025-08-14T21:32:19.0420230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0420579Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0420897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0421219Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0421571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0421928Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0422052Z 2025-08-14T21:32:19.0422153Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0422477Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0422771Z return mod(**inputs) 2025-08-14T21:32:19.0423097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0423448Z outputs = self.model.decoder( 2025-08-14T21:32:19.0423783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0424125Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0424445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0424836Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0425199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0425583Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0425954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0426371Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0426571Z 2025-08-14T21:32:19.0426668Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0426999Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0427291Z return mod(**inputs) 2025-08-14T21:32:19.0427624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0427980Z outputs = self.model.decoder( 2025-08-14T21:32:19.0428329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0428681Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0429002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0429345Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0429731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0430103Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0430472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0430827Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0430948Z 2025-08-14T21:32:19.0431042Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0431370Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0431667Z return mod(**inputs) 2025-08-14T21:32:19.0432026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0432373Z outputs = self.model.decoder( 2025-08-14T21:32:19.0432721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0433072Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0433384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0433716Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0434073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0434451Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0434815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0435183Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0435311Z 2025-08-14T21:32:19.0435393Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0435588Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0435774Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0435966Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0436180Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0436503Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0436804Z return mod(**inputs) 2025-08-14T21:32:19.0437135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0437479Z outputs = self.model.decoder( 2025-08-14T21:32:19.0437827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0438181Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0438498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0438822Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0439178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0439556Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0439927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0440295Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0440706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0441151Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0441325Z 2025-08-14T21:32:19.0441420Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0441755Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0442060Z return mod(**inputs) 2025-08-14T21:32:19.0442422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0442776Z outputs = self.model.decoder( 2025-08-14T21:32:19.0443124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0443477Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0443793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0444128Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0444482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0444888Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0445251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0445626Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0446035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0446458Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0446606Z 2025-08-14T21:32:19.0446699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0447031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0447329Z return mod(**inputs) 2025-08-14T21:32:19.0447656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0448016Z outputs = self.model.decoder( 2025-08-14T21:32:19.0448362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0448714Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0449026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0449364Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0449717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0450091Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0450457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0450821Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0450948Z 2025-08-14T21:32:19.0451050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0451373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0451676Z return mod(**inputs) 2025-08-14T21:32:19.0452009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0452362Z outputs = self.model.decoder( 2025-08-14T21:32:19.0452704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0453055Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0453373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0453700Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0454052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0454452Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0454608Z 2025-08-14T21:32:19.0454710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0455070Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0455371Z return mod(**inputs) 2025-08-14T21:32:19.0455705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0456057Z outputs = self.model.decoder( 2025-08-14T21:32:19.0456393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0456743Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0457062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0457426Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0457781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0458180Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0458536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0458845Z return self.act(input) 2025-08-14T21:32:19.0458955Z 2025-08-14T21:32:19.0459052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0459386Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0459680Z return mod(**inputs) 2025-08-14T21:32:19.0460017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0460376Z outputs = self.model.decoder( 2025-08-14T21:32:19.0460725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0461069Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0461389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0461725Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0462073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0462434Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0462564Z 2025-08-14T21:32:19.0462660Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0462993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0463284Z return mod(**inputs) 2025-08-14T21:32:19.0463618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0463970Z outputs = self.model.decoder( 2025-08-14T21:32:19.0464319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0464725Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0465058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0465392Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0465737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0466114Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0466483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0466906Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0467094Z 2025-08-14T21:32:19.0467188Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0467516Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0467851Z return mod(**inputs) 2025-08-14T21:32:19.0468188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0468537Z outputs = self.model.decoder( 2025-08-14T21:32:19.0468882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0469239Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0469550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0469885Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0470270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0470639Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0471000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0471354Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0471477Z 2025-08-14T21:32:19.0471577Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0471896Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0472191Z return mod(**inputs) 2025-08-14T21:32:19.0472515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0472865Z outputs = self.model.decoder( 2025-08-14T21:32:19.0473203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0473552Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0473869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0474199Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0474543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0474916Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0475284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0475640Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0475775Z 2025-08-14T21:32:19.0475848Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0476043Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0476240Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0476424Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0476637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0476966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0477258Z return mod(**inputs) 2025-08-14T21:32:19.0477588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0477936Z outputs = self.model.decoder( 2025-08-14T21:32:19.0478271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0478616Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0478932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0479263Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0479611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0479982Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0480374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0480750Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0481154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0481597Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0481765Z 2025-08-14T21:32:19.0481868Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0482196Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0482541Z return mod(**inputs) 2025-08-14T21:32:19.0482868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0483217Z outputs = self.model.decoder( 2025-08-14T21:32:19.0483557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0483905Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0484220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0484547Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0485010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0485392Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0485761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0486135Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0486548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0486976Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0487126Z 2025-08-14T21:32:19.0487230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0487553Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0487853Z return mod(**inputs) 2025-08-14T21:32:19.0488183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0488539Z outputs = self.model.decoder( 2025-08-14T21:32:19.0488878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0489234Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0489553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0489881Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0490238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0490614Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0490984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0491335Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0491467Z 2025-08-14T21:32:19.0491561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0491898Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0492190Z return mod(**inputs) 2025-08-14T21:32:19.0492521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0492874Z outputs = self.model.decoder( 2025-08-14T21:32:19.0493275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0493620Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0493939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0494272Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0494617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0495009Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0495174Z 2025-08-14T21:32:19.0495270Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0495647Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0495938Z return mod(**inputs) 2025-08-14T21:32:19.0496266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0496619Z outputs = self.model.decoder( 2025-08-14T21:32:19.0496963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0497305Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0497620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0497951Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0498295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0498690Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0499043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0499355Z return self.act(input) 2025-08-14T21:32:19.0499457Z 2025-08-14T21:32:19.0499553Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0513039Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0513489Z return mod(**inputs) 2025-08-14T21:32:19.0513863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0514252Z outputs = self.model.decoder( 2025-08-14T21:32:19.0514624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0514998Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0515345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0515700Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0516075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0516457Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0516592Z 2025-08-14T21:32:19.0516697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0517045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0517358Z return mod(**inputs) 2025-08-14T21:32:19.0517696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0518068Z outputs = self.model.decoder( 2025-08-14T21:32:19.0518432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0518804Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0519132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0519476Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0519943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0520334Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0520724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0521173Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0521362Z 2025-08-14T21:32:19.0521469Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0521799Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0522153Z return mod(**inputs) 2025-08-14T21:32:19.0522488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0522846Z outputs = self.model.decoder( 2025-08-14T21:32:19.0523192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0523554Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0523881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0524213Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0524579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0524964Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0525354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0525706Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0525839Z 2025-08-14T21:32:19.0525934Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0526272Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0526568Z return mod(**inputs) 2025-08-14T21:32:19.0526898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0527251Z outputs = self.model.decoder( 2025-08-14T21:32:19.0527594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0527934Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0528253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0528593Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0528947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0529311Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0529682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0530045Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0530175Z 2025-08-14T21:32:19.0530250Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0530451Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0530642Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0530830Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0531039Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0531376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0531435Z return mod(**inputs) 2025-08-14T21:32:19.0531673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0531741Z outputs = self.model.decoder( 2025-08-14T21:32:19.0532003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0532078Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0532282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0532361Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0532590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0532682Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0532949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0533040Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0533318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0533446Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0533450Z 2025-08-14T21:32:19.0533545Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0533740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0533800Z return mod(**inputs) 2025-08-14T21:32:19.0534029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0534107Z outputs = self.model.decoder( 2025-08-14T21:32:19.0534339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0534412Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0534616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0534692Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0534928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0535016Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0535241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0535339Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0535606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0535718Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0535722Z 2025-08-14T21:32:19.0535816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0535999Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0536070Z return mod(**inputs) 2025-08-14T21:32:19.0536298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0536373Z outputs = self.model.decoder( 2025-08-14T21:32:19.0536600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0536666Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0536875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0536949Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0537174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0537270Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0537527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0537614Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0537617Z 2025-08-14T21:32:19.0537710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0537891Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0537959Z return mod(**inputs) 2025-08-14T21:32:19.0538188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0538262Z outputs = self.model.decoder( 2025-08-14T21:32:19.0538522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0538588Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0538797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0538869Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0539092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0539210Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0539214Z 2025-08-14T21:32:19.0539305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0539495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0539552Z return mod(**inputs) 2025-08-14T21:32:19.0539778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0539857Z outputs = self.model.decoder( 2025-08-14T21:32:19.0540086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0540159Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0540359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0540430Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0540659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0540766Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0540957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0541034Z return self.act(input) 2025-08-14T21:32:19.0541038Z 2025-08-14T21:32:19.0541132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0541318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0541377Z return mod(**inputs) 2025-08-14T21:32:19.0541605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0541682Z outputs = self.model.decoder( 2025-08-14T21:32:19.0541910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0541974Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0542181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0542252Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0542486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0542565Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0542568Z 2025-08-14T21:32:19.0542660Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0542890Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0542952Z return mod(**inputs) 2025-08-14T21:32:19.0543194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0543263Z outputs = self.model.decoder( 2025-08-14T21:32:19.0543493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0543565Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0543765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0543866Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0544099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0544189Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0544423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0544563Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0544566Z 2025-08-14T21:32:19.0544747Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0544947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0545006Z return mod(**inputs) 2025-08-14T21:32:19.0545244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0545313Z outputs = self.model.decoder( 2025-08-14T21:32:19.0545541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0545614Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0545818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0545890Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0546122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0546211Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0546442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0546515Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0546519Z 2025-08-14T21:32:19.0546621Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0546808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0546868Z return mod(**inputs) 2025-08-14T21:32:19.0547105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0547171Z outputs = self.model.decoder( 2025-08-14T21:32:19.0547396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0547468Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0547666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0547743Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0547969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0548061Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0548292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0548369Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0548372Z 2025-08-14T21:32:19.0548484Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0548559Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0548629Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0548707Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0548801Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0548983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0549049Z return mod(**inputs) 2025-08-14T21:32:19.0549280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0549385Z outputs = self.model.decoder( 2025-08-14T21:32:19.0549613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0549677Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0549888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0549959Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0550183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0550279Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0550501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0550596Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0550864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0550987Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0550991Z 2025-08-14T21:32:19.0551092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0551273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0551340Z return mod(**inputs) 2025-08-14T21:32:19.0551568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0551633Z outputs = self.model.decoder( 2025-08-14T21:32:19.0551866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0551931Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0552129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0552210Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0552433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0552530Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0552752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0552839Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0553112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0553210Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0553213Z 2025-08-14T21:32:19.0553311Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0553493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0553553Z return mod(**inputs) 2025-08-14T21:32:19.0553786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0553910Z outputs = self.model.decoder( 2025-08-14T21:32:19.0554144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0554214Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0554416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0554493Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0554720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0554809Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0555067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0555142Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0555145Z 2025-08-14T21:32:19.0555237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0555428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0555486Z return mod(**inputs) 2025-08-14T21:32:19.0555720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0555785Z outputs = self.model.decoder( 2025-08-14T21:32:19.0556012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0556083Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0556284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0556360Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0556585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0556694Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0556698Z 2025-08-14T21:32:19.0556800Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0556981Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0557040Z return mod(**inputs) 2025-08-14T21:32:19.0557274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0557340Z outputs = self.model.decoder( 2025-08-14T21:32:19.0557571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0557639Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0557840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0557919Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0558147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0558255Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0558455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0558518Z return self.act(input) 2025-08-14T21:32:19.0558521Z 2025-08-14T21:32:19.0558624Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0558802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0558863Z return mod(**inputs) 2025-08-14T21:32:19.0559095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0559159Z outputs = self.model.decoder( 2025-08-14T21:32:19.0559421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0559486Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0559690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0559768Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0559992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0560064Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0560075Z 2025-08-14T21:32:19.0560168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0560387Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0560454Z return mod(**inputs) 2025-08-14T21:32:19.0560684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0560751Z outputs = self.model.decoder( 2025-08-14T21:32:19.0560990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0561053Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0561263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0561335Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0561564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0561664Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0561893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0562033Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0562036Z 2025-08-14T21:32:19.0562137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0562322Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0562388Z return mod(**inputs) 2025-08-14T21:32:19.0562620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0562686Z outputs = self.model.decoder( 2025-08-14T21:32:19.0562923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0562989Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0563194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0563270Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0563501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0563596Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0563825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0563897Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0563900Z 2025-08-14T21:32:19.0564000Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0564183Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0564248Z return mod(**inputs) 2025-08-14T21:32:19.0564482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0564548Z outputs = self.model.decoder( 2025-08-14T21:32:19.0564787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0564877Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0565079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0565154Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0565377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0565472Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0565693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0565797Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0565801Z 2025-08-14T21:32:19.0565882Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0565953Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0566026Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0566096Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0566191Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0566377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0566434Z return mod(**inputs) 2025-08-14T21:32:19.0566663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0566736Z outputs = self.model.decoder( 2025-08-14T21:32:19.0566964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0567037Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0567239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0567310Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0567545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0567634Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0567859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0567955Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0568223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0568355Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0568361Z 2025-08-14T21:32:19.0568454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0568634Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0568698Z return mod(**inputs) 2025-08-14T21:32:19.0568931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0568998Z outputs = self.model.decoder( 2025-08-14T21:32:19.0569235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0569301Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0569510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0569581Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0569808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0569907Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0570131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0570253Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0570521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0570621Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0570624Z 2025-08-14T21:32:19.0570724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0570907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0570964Z return mod(**inputs) 2025-08-14T21:32:19.0571204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0571301Z outputs = self.model.decoder( 2025-08-14T21:32:19.0571535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0571599Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0571802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0571880Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0572102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0572197Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0572419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0572492Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0572498Z 2025-08-14T21:32:19.0572597Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0572776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0572835Z return mod(**inputs) 2025-08-14T21:32:19.0573071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0573137Z outputs = self.model.decoder( 2025-08-14T21:32:19.0573368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0573432Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0573631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0573708Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0573929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0574047Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0574050Z 2025-08-14T21:32:19.0574142Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0574324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0574389Z return mod(**inputs) 2025-08-14T21:32:19.0574615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0574682Z outputs = self.model.decoder( 2025-08-14T21:32:19.0574917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0574980Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0575186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0575258Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0575481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0575593Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0575814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0575879Z return self.act(input) 2025-08-14T21:32:19.0575890Z 2025-08-14T21:32:19.0575983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0576162Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0576230Z return mod(**inputs) 2025-08-14T21:32:19.0576456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0576520Z outputs = self.model.decoder( 2025-08-14T21:32:19.0576781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0576844Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0577055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0577126Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0577353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0577434Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0577438Z 2025-08-14T21:32:19.0577528Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0577710Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0577773Z return mod(**inputs) 2025-08-14T21:32:19.0578000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0578074Z outputs = self.model.decoder( 2025-08-14T21:32:19.0578300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0578365Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0578573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0578643Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0578867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0578961Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0579186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:19.0579336Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:19.0579340Z 2025-08-14T21:32:19.0579430Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0579613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0579677Z return mod(**inputs) 2025-08-14T21:32:19.0579907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0579980Z outputs = self.model.decoder( 2025-08-14T21:32:19.0580209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0580273Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0580481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0580552Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0580781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0580876Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0581129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:19.0581210Z key_states = self.k_proj(current_states) 2025-08-14T21:32:19.0581213Z 2025-08-14T21:32:19.0581306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0581489Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0581555Z return mod(**inputs) 2025-08-14T21:32:19.0581787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0581861Z outputs = self.model.decoder( 2025-08-14T21:32:19.0582090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0582190Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0582399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0582471Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0582692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0582787Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0583009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:19.0583093Z value_states = self.v_proj(current_states) 2025-08-14T21:32:19.0583096Z 2025-08-14T21:32:19.0583166Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0583237Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0583317Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0583386Z cudagraph partition due to non gpu ops 2025-08-14T21:32:19.0583479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0583665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0583726Z return mod(**inputs) 2025-08-14T21:32:19.0583960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0584027Z outputs = self.model.decoder( 2025-08-14T21:32:19.0584253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0584327Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0584535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0584817Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0585074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0585166Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0585409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0585501Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0585776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:19.0585912Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:19.0585917Z 2025-08-14T21:32:19.0586013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0586210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0586274Z return mod(**inputs) 2025-08-14T21:32:19.0586510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0586589Z outputs = self.model.decoder( 2025-08-14T21:32:19.0586900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0586980Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0587193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0587266Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0587505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0587595Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0587829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:19.0587974Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:19.0588252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:19.0588366Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:19.0588369Z 2025-08-14T21:32:19.0588465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0588651Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0588719Z return mod(**inputs) 2025-08-14T21:32:19.0588957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0589027Z outputs = self.model.decoder( 2025-08-14T21:32:19.0589269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0589339Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0589557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0589631Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0589864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:19.0589962Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:19.0590193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:19.0590277Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:19.0590281Z 2025-08-14T21:32:19.0590375Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0590561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0590634Z return mod(**inputs) 2025-08-14T21:32:19.0590866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0590935Z outputs = self.model.decoder( 2025-08-14T21:32:19.0591176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0591241Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0591456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0591527Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0591761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0591877Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0591880Z 2025-08-14T21:32:19.0591975Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0592171Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0592231Z return mod(**inputs) 2025-08-14T21:32:19.0592466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0592570Z outputs = self.model.decoder( 2025-08-14T21:32:19.0592808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0592873Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0593088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0593161Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0593398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:19.0593537Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:19.0593737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:19.0593808Z return self.act(input) 2025-08-14T21:32:19.0593811Z 2025-08-14T21:32:19.0593908Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0594091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0594156Z return mod(**inputs) 2025-08-14T21:32:19.0594388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1901, in forward 2025-08-14T21:32:19.0594461Z outputs = self.model.decoder( 2025-08-14T21:32:19.0594693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:19.0594757Z layer_outputs = decoder_layer( 2025-08-14T21:32:19.0594970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:19.0595040Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:19.0595274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:19.0595352Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:19.0595356Z 2025-08-14T21:32:19.0595449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0595639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0595699Z return mod(**inputs) 2025-08-14T21:32:19.0595931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1917, in forward 2025-08-14T21:32:19.0596010Z logits = self.lm_head(outputs[0]) 2025-08-14T21:32:19.0596014Z 2025-08-14T21:32:19.0596106Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:19.0596299Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:19.0596358Z return mod(**inputs) 2025-08-14T21:32:19.0596592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1923, in forward 2025-08-14T21:32:19.0596739Z loss = loss_fct(logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:32:19.0596743Z 2025-08-14T21:32:27.5497571Z Compilation time (from dynamo_timed): 13.484638692 2025-08-14T21:32:27.5754964Z pass 2025-08-14T21:32:27.5755455Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:32:27.5756323Z TIMING: _recursive_pre_grad_passes:0.00669 _recursive_joint_graph_passes:0.54969 _recursive_post_grad_passes:0.0787 async_compile.wait:0.59025 code_gen:7.27837 inductor_compile:8.44848 backend_compile:11.45014 gc:0.00062 entire_frame_compile:13.48464 total_wall_time:13.48464 2025-08-14T21:32:27.5757361Z STATS: call_* op count: 372 | FakeTensorMode.__torch_dispatch__:13198 | FakeTensor.__torch_dispatch__:4868 | ProxyTorchDispatchMode.__torch_dispatch__:4813 2025-08-14T21:32:27.5757853Z Dynamo produced 1 graphs covering 372 ops with 0 graph breaks (0 unique) 2025-08-14T21:32:31.5877829Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:32:31.5878790Z from pkg_resources import resource_filename 2025-08-14T21:32:32.1428057Z 2025-08-14T21:32:36.7407280Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:32:36.7410919Z loading model: 0it [00:04, ?it/s] 2025-08-14T21:32:36.7435599Z cpu eval BartForConditionalGeneration 2025-08-14T21:32:39.1759962Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:32:40.1026676Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:32:41.0483630Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:32:55.9499478Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9502953Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9505723Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9511018Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9512983Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9513327Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9518585Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9518823Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9519030Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9519224Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9530787Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9535307Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9539258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9539809Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9540268Z return mod(**inputs) 2025-08-14T21:32:55.9540662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9541065Z outputs = self.model( 2025-08-14T21:32:55.9541417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9541784Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9542147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9542513Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9542843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9543188Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9543548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9543928Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9544297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9544849Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9545058Z 2025-08-14T21:32:55.9545166Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9545529Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9545857Z return mod(**inputs) 2025-08-14T21:32:55.9546213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9546580Z outputs = self.model( 2025-08-14T21:32:55.9546950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9547643Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9548038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9548404Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9548740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9549079Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9549444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9549944Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9550325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9550686Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9550821Z 2025-08-14T21:32:55.9550925Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9551263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9551576Z return mod(**inputs) 2025-08-14T21:32:55.9551912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9552538Z outputs = self.model( 2025-08-14T21:32:55.9552879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9553234Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9553596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9553957Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9554289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9554628Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9554991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9555370Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9555742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9556106Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9556244Z 2025-08-14T21:32:55.9556322Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9556524Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9556713Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9556905Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9557123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9557458Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9557773Z return mod(**inputs) 2025-08-14T21:32:55.9558129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9558487Z outputs = self.model( 2025-08-14T21:32:55.9558816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9559183Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9559553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9559917Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9560242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9560571Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9560998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9561363Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9561722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9562093Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9562506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9562945Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9563125Z 2025-08-14T21:32:55.9563257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9563599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9563906Z return mod(**inputs) 2025-08-14T21:32:55.9564241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9564597Z outputs = self.model( 2025-08-14T21:32:55.9564934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9565331Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9565690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9566061Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9566408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9566770Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9567138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9567518Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9567901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9568281Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9568697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9569125Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9569276Z 2025-08-14T21:32:55.9569375Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9569708Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9570013Z return mod(**inputs) 2025-08-14T21:32:55.9570348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9570694Z outputs = self.model( 2025-08-14T21:32:55.9571034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9571391Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9571736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9572093Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9572416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9572755Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9573115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9573490Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9573857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9574254Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9574381Z 2025-08-14T21:32:55.9574477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9574805Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9575101Z return mod(**inputs) 2025-08-14T21:32:55.9575421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9575766Z outputs = self.model( 2025-08-14T21:32:55.9576091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9576477Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9576816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9577166Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9577497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9577827Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9578193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9578607Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9578768Z 2025-08-14T21:32:55.9578871Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9579192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9579495Z return mod(**inputs) 2025-08-14T21:32:55.9579831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9580175Z outputs = self.model( 2025-08-14T21:32:55.9580507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9580860Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9581210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9581574Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9581914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9582260Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9582622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9583022Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9583412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9583737Z return self.act(input) 2025-08-14T21:32:55.9583841Z 2025-08-14T21:32:55.9583941Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9584280Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9584854Z return mod(**inputs) 2025-08-14T21:32:55.9585214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9585590Z outputs = self.model( 2025-08-14T21:32:55.9585954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9586323Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9586682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9587048Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9587382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9587794Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9588152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9588523Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9588653Z 2025-08-14T21:32:55.9588761Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9589098Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9589398Z return mod(**inputs) 2025-08-14T21:32:55.9589738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9590156Z outputs = self.model( 2025-08-14T21:32:55.9590498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9590867Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9591234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9591600Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9591948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9592297Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9592668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9593045Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9593434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9593876Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9594074Z 2025-08-14T21:32:55.9594186Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9594523Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9594842Z return mod(**inputs) 2025-08-14T21:32:55.9595195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9595566Z outputs = self.model( 2025-08-14T21:32:55.9595916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9596290Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9596660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9597023Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9597350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9597757Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9598121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9598488Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9598859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9599219Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9599345Z 2025-08-14T21:32:55.9599445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9599784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9600094Z return mod(**inputs) 2025-08-14T21:32:55.9600433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9600780Z outputs = self.model( 2025-08-14T21:32:55.9601152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9601506Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9601845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9602192Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9602508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9602845Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9603240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9603616Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9603990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9604372Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9604508Z 2025-08-14T21:32:55.9604584Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9604783Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9604978Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9605165Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9605389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9605731Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9606036Z return mod(**inputs) 2025-08-14T21:32:55.9606377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9606721Z outputs = self.model( 2025-08-14T21:32:55.9607050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9607397Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9607741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9608090Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9608410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9608736Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9609089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9609460Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9609821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9610194Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9610605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9611045Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9611212Z 2025-08-14T21:32:55.9611308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9611641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9611943Z return mod(**inputs) 2025-08-14T21:32:55.9612277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9612618Z outputs = self.model( 2025-08-14T21:32:55.9612950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9613297Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9613675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9614034Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9614362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9614703Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9615058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9615434Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9615814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9616216Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9616635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9617068Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9617223Z 2025-08-14T21:32:55.9617330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9617658Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9617964Z return mod(**inputs) 2025-08-14T21:32:55.9618304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9618659Z outputs = self.model( 2025-08-14T21:32:55.9618989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9619353Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9619703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9620052Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9620386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9620743Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9621106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9621475Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9621848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9622216Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9622342Z 2025-08-14T21:32:55.9622451Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9622780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9623087Z return mod(**inputs) 2025-08-14T21:32:55.9623425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9623776Z outputs = self.model( 2025-08-14T21:32:55.9624117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9624479Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9624945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9625356Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9625706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9626083Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9626461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9626894Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9627067Z 2025-08-14T21:32:55.9627204Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9627551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9627852Z return mod(**inputs) 2025-08-14T21:32:55.9628187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9628541Z outputs = self.model( 2025-08-14T21:32:55.9628869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9629233Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9629627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9629987Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9630306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9630651Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9631014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9631423Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9631781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9632104Z return self.act(input) 2025-08-14T21:32:55.9632206Z 2025-08-14T21:32:55.9632310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9632646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9632952Z return mod(**inputs) 2025-08-14T21:32:55.9633291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9633653Z outputs = self.model( 2025-08-14T21:32:55.9633978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9634339Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9634693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9635042Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9635378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9635718Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9636083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9636444Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9636579Z 2025-08-14T21:32:55.9636675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9637016Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9637313Z return mod(**inputs) 2025-08-14T21:32:55.9637650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9638004Z outputs = self.model( 2025-08-14T21:32:55.9638340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9638696Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9639059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9639421Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9639746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9640074Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9640466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9640854Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9641222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9641659Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9641861Z 2025-08-14T21:32:55.9641958Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9642298Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9642633Z return mod(**inputs) 2025-08-14T21:32:55.9642976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9643339Z outputs = self.model( 2025-08-14T21:32:55.9643676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9644039Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9644395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9644756Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9645075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9645420Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9645788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9646202Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9646570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9646948Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9647073Z 2025-08-14T21:32:55.9647176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9647500Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9647801Z return mod(**inputs) 2025-08-14T21:32:55.9648135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9648481Z outputs = self.model( 2025-08-14T21:32:55.9648807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9649163Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9649510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9649855Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9650177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9650512Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9650865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9651224Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9651588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9651950Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9652083Z 2025-08-14T21:32:55.9652164Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9652355Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9652548Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9652739Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9652951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9654258Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9654568Z return mod(**inputs) 2025-08-14T21:32:55.9654894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9655248Z outputs = self.model( 2025-08-14T21:32:55.9655581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9655937Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9656278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9656687Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9657013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9657345Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9657689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9658056Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9658418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9658783Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9659193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9659637Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9659805Z 2025-08-14T21:32:55.9659909Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9660235Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9660537Z return mod(**inputs) 2025-08-14T21:32:55.9660872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9661212Z outputs = self.model( 2025-08-14T21:32:55.9661545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9661897Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9662244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9662584Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9662907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9663240Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9663589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9663947Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9664308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9664785Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9665238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9665702Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9665871Z 2025-08-14T21:32:55.9665971Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9666328Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9666625Z return mod(**inputs) 2025-08-14T21:32:55.9666962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9667394Z outputs = self.model( 2025-08-14T21:32:55.9667748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9668135Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9668510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9668893Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9669232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9669625Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9670009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9670408Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9670798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9671188Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9671323Z 2025-08-14T21:32:55.9671434Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9671787Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9672110Z return mod(**inputs) 2025-08-14T21:32:55.9672468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9672845Z outputs = self.model( 2025-08-14T21:32:55.9673199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9673582Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9673959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9674332Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9674678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9675011Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9675359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9675748Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9675914Z 2025-08-14T21:32:55.9676011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9676344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9676640Z return mod(**inputs) 2025-08-14T21:32:55.9676965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9677314Z outputs = self.model( 2025-08-14T21:32:55.9677646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9677991Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9678339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9678691Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9679010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9679337Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9679692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9680083Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9680431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9680773Z return self.act(input) 2025-08-14T21:32:55.9680885Z 2025-08-14T21:32:55.9680981Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9681306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9681594Z return mod(**inputs) 2025-08-14T21:32:55.9681921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9682267Z outputs = self.model( 2025-08-14T21:32:55.9682592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9682975Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9683321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9683671Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9683983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9684318Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9684835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9685215Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9685346Z 2025-08-14T21:32:55.9685444Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9685782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9686096Z return mod(**inputs) 2025-08-14T21:32:55.9686432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9686782Z outputs = self.model( 2025-08-14T21:32:55.9687116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9687472Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9687809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9688160Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9688483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9688807Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9689163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9689536Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9689896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9690307Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9690504Z 2025-08-14T21:32:55.9690600Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9690930Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9691225Z return mod(**inputs) 2025-08-14T21:32:55.9691545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9691891Z outputs = self.model( 2025-08-14T21:32:55.9692221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9692568Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9692911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9693260Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9693651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9693978Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9694330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9694697Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9695050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9695407Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9695538Z 2025-08-14T21:32:55.9695682Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9696018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9696312Z return mod(**inputs) 2025-08-14T21:32:55.9696644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9696995Z outputs = self.model( 2025-08-14T21:32:55.9697323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9697668Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9698014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9698368Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9698679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9699011Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9699360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9699723Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9700081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9700441Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9700569Z 2025-08-14T21:32:55.9700650Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9700839Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9701034Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9701224Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9701440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9701765Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9702068Z return mod(**inputs) 2025-08-14T21:32:55.9702401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9702745Z outputs = self.model( 2025-08-14T21:32:55.9703079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9703437Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9703785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9704129Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9704457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9704867Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9705240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9705640Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9706030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9706474Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9706884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9707377Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9707556Z 2025-08-14T21:32:55.9707666Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9708018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9708327Z return mod(**inputs) 2025-08-14T21:32:55.9708675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9709084Z outputs = self.model( 2025-08-14T21:32:55.9709428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9709802Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9710172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9710543Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9710871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9711222Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9711591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9711971Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9712357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9712748Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9713176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9713615Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9713777Z 2025-08-14T21:32:55.9713877Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9714226Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9714559Z return mod(**inputs) 2025-08-14T21:32:55.9714884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9715230Z outputs = self.model( 2025-08-14T21:32:55.9715563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9715906Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9716247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9716599Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9716920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9717246Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9717598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9717968Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9718321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9718685Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9718818Z 2025-08-14T21:32:55.9718914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9719239Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9719531Z return mod(**inputs) 2025-08-14T21:32:55.9719893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9720245Z outputs = self.model( 2025-08-14T21:32:55.9720577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9720922Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9721275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9721623Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9721964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9722295Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9722648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9723043Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9723202Z 2025-08-14T21:32:55.9723297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9723626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9723923Z return mod(**inputs) 2025-08-14T21:32:55.9724244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9724589Z outputs = self.model( 2025-08-14T21:32:55.9724918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9725272Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9725608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9725958Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9726278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9726612Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9726961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9727352Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9727707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9728013Z return self.act(input) 2025-08-14T21:32:55.9728124Z 2025-08-14T21:32:55.9728219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9728547Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9728844Z return mod(**inputs) 2025-08-14T21:32:55.9729166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9729512Z outputs = self.model( 2025-08-14T21:32:55.9729839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9730182Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9730525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9730870Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9731186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9731509Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9731857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9732212Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9732336Z 2025-08-14T21:32:55.9732473Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9732798Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9733094Z return mod(**inputs) 2025-08-14T21:32:55.9733423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9733766Z outputs = self.model( 2025-08-14T21:32:55.9734099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9734491Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9734840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9735183Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9735505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9735841Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9736187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9736557Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9736923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9737345Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9737531Z 2025-08-14T21:32:55.9737630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9737962Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9738262Z return mod(**inputs) 2025-08-14T21:32:55.9738588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9738938Z outputs = self.model( 2025-08-14T21:32:55.9739272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9739624Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9739962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9740313Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9740634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9740969Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9741316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9741687Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9742054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9742407Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9742541Z 2025-08-14T21:32:55.9742638Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9742970Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9743269Z return mod(**inputs) 2025-08-14T21:32:55.9743592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9743938Z outputs = self.model( 2025-08-14T21:32:55.9744275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9744628Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9745134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9745504Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9745834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9746182Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9746547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9746976Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9747354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9747752Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9747891Z 2025-08-14T21:32:55.9747967Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9748169Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9748366Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9748569Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9748796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9749134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9749438Z return mod(**inputs) 2025-08-14T21:32:55.9749776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9750134Z outputs = self.model( 2025-08-14T21:32:55.9750469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9750837Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9751195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9751551Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9751884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9752226Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9752589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9752960Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9753333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9753717Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9754137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9754587Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9754768Z 2025-08-14T21:32:55.9754864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9755210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9755509Z return mod(**inputs) 2025-08-14T21:32:55.9755848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9756206Z outputs = self.model( 2025-08-14T21:32:55.9756544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9756902Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9757255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9757617Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9757942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9758275Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9758718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9759094Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9759457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9759835Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9760252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9760680Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9760857Z 2025-08-14T21:32:55.9760953Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9761283Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9761580Z return mod(**inputs) 2025-08-14T21:32:55.9761905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9762253Z outputs = self.model( 2025-08-14T21:32:55.9762583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9762935Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9763275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9763625Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9763943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9764280Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9764628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9764996Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9765361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9765713Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9765848Z 2025-08-14T21:32:55.9765946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9766287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9766601Z return mod(**inputs) 2025-08-14T21:32:55.9766925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9767279Z outputs = self.model( 2025-08-14T21:32:55.9767613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9767961Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9768311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9768664Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9768989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9769317Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9769671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9770069Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9770231Z 2025-08-14T21:32:55.9770333Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9770658Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9770958Z return mod(**inputs) 2025-08-14T21:32:55.9771316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9771657Z outputs = self.model( 2025-08-14T21:32:55.9771987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9772337Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9772680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9773022Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9773341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9773704Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9774051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9774449Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9774812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9775128Z return self.act(input) 2025-08-14T21:32:55.9775231Z 2025-08-14T21:32:55.9775326Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9775662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9775963Z return mod(**inputs) 2025-08-14T21:32:55.9776288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9776639Z outputs = self.model( 2025-08-14T21:32:55.9776971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9777322Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9777660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9778010Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9778331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9778664Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9779010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9779368Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9779495Z 2025-08-14T21:32:55.9779597Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9779924Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9780225Z return mod(**inputs) 2025-08-14T21:32:55.9780561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9780914Z outputs = self.model( 2025-08-14T21:32:55.9781242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9781596Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9781945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9782289Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9782612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9782945Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9783305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9783667Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9784121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9784732Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9784935Z 2025-08-14T21:32:55.9785043Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9785378Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9785695Z return mod(**inputs) 2025-08-14T21:32:55.9786048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9786410Z outputs = self.model( 2025-08-14T21:32:55.9786819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9787171Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9787517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9787862Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9788181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9788512Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9788857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9789226Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9789593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9789958Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9790082Z 2025-08-14T21:32:55.9790178Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9790511Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9790812Z return mod(**inputs) 2025-08-14T21:32:55.9791140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9791481Z outputs = self.model( 2025-08-14T21:32:55.9791810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9792160Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9792494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9792840Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9793160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9793485Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9793825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9794189Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9794550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9794902Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9795039Z 2025-08-14T21:32:55.9795114Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9795310Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9795501Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9795685Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9795902Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9796234Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9796524Z return mod(**inputs) 2025-08-14T21:32:55.9796853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9797246Z outputs = self.model( 2025-08-14T21:32:55.9797581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9797928Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9798274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9798623Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9798937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9799317Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9799669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9800038Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9800397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9800772Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9801182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9801621Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9801789Z 2025-08-14T21:32:55.9801885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9802219Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9802522Z return mod(**inputs) 2025-08-14T21:32:55.9802846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9803193Z outputs = self.model( 2025-08-14T21:32:55.9803526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9803878Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9804214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9804562Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9804892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9805226Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9805595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9805966Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9806330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9806693Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9807100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9807522Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9807670Z 2025-08-14T21:32:55.9807771Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9808096Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9808394Z return mod(**inputs) 2025-08-14T21:32:55.9808724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9809068Z outputs = self.model( 2025-08-14T21:32:55.9809396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9809747Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9810122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9810466Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9810785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9811119Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9811464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9811829Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9812193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9812580Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9812706Z 2025-08-14T21:32:55.9812802Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9813134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9813436Z return mod(**inputs) 2025-08-14T21:32:55.9813764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9814105Z outputs = self.model( 2025-08-14T21:32:55.9814432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9814783Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9815121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9815474Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9815795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9816127Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9816474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9816869Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9817025Z 2025-08-14T21:32:55.9817126Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9817447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9817748Z return mod(**inputs) 2025-08-14T21:32:55.9818076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9818420Z outputs = self.model( 2025-08-14T21:32:55.9818741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9819091Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9819437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9819788Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9820097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9820426Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9820777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9821159Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9821513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9821828Z return self.act(input) 2025-08-14T21:32:55.9821927Z 2025-08-14T21:32:55.9822028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9822353Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9822680Z return mod(**inputs) 2025-08-14T21:32:55.9823015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9823356Z outputs = self.model( 2025-08-14T21:32:55.9823686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9824037Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9824419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9824871Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9825205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9825548Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9825919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9826276Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9826412Z 2025-08-14T21:32:55.9826509Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9826853Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9827154Z return mod(**inputs) 2025-08-14T21:32:55.9827493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9827855Z outputs = self.model( 2025-08-14T21:32:55.9828195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9828552Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9828908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9829273Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9829594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9829939Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9830303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9830683Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9831054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9831488Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9831684Z 2025-08-14T21:32:55.9831790Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9832123Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9832436Z return mod(**inputs) 2025-08-14T21:32:55.9832777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9833133Z outputs = self.model( 2025-08-14T21:32:55.9833463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9833823Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9834179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9834540Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9834865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9835204Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9835568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9835969Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9836346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9836716Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9836843Z 2025-08-14T21:32:55.9836946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9837281Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9837587Z return mod(**inputs) 2025-08-14T21:32:55.9837932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9838298Z outputs = self.model( 2025-08-14T21:32:55.9838629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9838979Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9839325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9839668Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9839985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9840314Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9840655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9841022Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9841386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9841746Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9841876Z 2025-08-14T21:32:55.9841949Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9842148Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9842341Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9842522Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9842738Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9843069Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9843369Z return mod(**inputs) 2025-08-14T21:32:55.9843694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9844040Z outputs = self.model( 2025-08-14T21:32:55.9844374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9844718Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9845072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9845433Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9845765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9846097Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9846457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9846834Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9847215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9847594Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9848003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9848443Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9848609Z 2025-08-14T21:32:55.9848733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9849065Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9849366Z return mod(**inputs) 2025-08-14T21:32:55.9849701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9850046Z outputs = self.model( 2025-08-14T21:32:55.9850381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9850778Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9851115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9851462Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9851789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9852120Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9852462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9852829Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9853188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9853559Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9853959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9854389Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9854536Z 2025-08-14T21:32:55.9854640Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9854967Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9855266Z return mod(**inputs) 2025-08-14T21:32:55.9855597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9855942Z outputs = self.model( 2025-08-14T21:32:55.9856261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9856610Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9856953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9857302Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9857614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9857944Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9858295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9858653Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9859015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9859371Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9859496Z 2025-08-14T21:32:55.9859598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9859921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9860227Z return mod(**inputs) 2025-08-14T21:32:55.9860559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9860896Z outputs = self.model( 2025-08-14T21:32:55.9861254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9861614Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9861962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9862309Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9862632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9862965Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9863312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9863738Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9863908Z 2025-08-14T21:32:55.9864004Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9864338Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9864639Z return mod(**inputs) 2025-08-14T21:32:55.9865049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9865411Z outputs = self.model( 2025-08-14T21:32:55.9865759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9866105Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9866453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9866809Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9867129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9867465Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9867822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9868218Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9868572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9868886Z return self.act(input) 2025-08-14T21:32:55.9868990Z 2025-08-14T21:32:55.9869093Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9869419Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9869721Z return mod(**inputs) 2025-08-14T21:32:55.9870049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9870401Z outputs = self.model( 2025-08-14T21:32:55.9870726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9871078Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9871422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9871769Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9872082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9872410Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9872759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9873107Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9873244Z 2025-08-14T21:32:55.9873340Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9873671Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9873967Z return mod(**inputs) 2025-08-14T21:32:55.9874320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9874674Z outputs = self.model( 2025-08-14T21:32:55.9875006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9875354Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9875703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9876052Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9876373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9876729Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9877081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9877453Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9877807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9878227Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9878422Z 2025-08-14T21:32:55.9878518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9878848Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9879140Z return mod(**inputs) 2025-08-14T21:32:55.9879471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9879823Z outputs = self.model( 2025-08-14T21:32:55.9880151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9880493Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9880837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9881187Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9881497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9881827Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9882176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9882543Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9882901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9883255Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9883379Z 2025-08-14T21:32:55.9883480Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9883812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9884104Z return mod(**inputs) 2025-08-14T21:32:55.9884431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9884898Z outputs = self.model( 2025-08-14T21:32:55.9885228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9885592Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9885950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9886319Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9886660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9886742Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9887034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9887121Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9887354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9887432Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9887436Z 2025-08-14T21:32:55.9887515Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9887584Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9887653Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9887775Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9887875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9888060Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9888127Z return mod(**inputs) 2025-08-14T21:32:55.9888358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9888432Z outputs = self.model( 2025-08-14T21:32:55.9888658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9888725Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9888958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9889025Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9889230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9889309Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9889532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9889625Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9889847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9889937Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9890210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9890329Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9890332Z 2025-08-14T21:32:55.9890431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9890617Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9890676Z return mod(**inputs) 2025-08-14T21:32:55.9890910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9890974Z outputs = self.model( 2025-08-14T21:32:55.9891200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9891275Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9891500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9891572Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9891774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9891845Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9892077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9892160Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9892421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9892520Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9892788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9892896Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9892899Z 2025-08-14T21:32:55.9892994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9893177Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9893277Z return mod(**inputs) 2025-08-14T21:32:55.9893506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9893574Z outputs = self.model( 2025-08-14T21:32:55.9893804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9893874Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9894105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9894171Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9894371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9894449Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9894671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9894762Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9894984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9895058Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9895061Z 2025-08-14T21:32:55.9895164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9895346Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9895412Z return mod(**inputs) 2025-08-14T21:32:55.9895639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9895701Z outputs = self.model( 2025-08-14T21:32:55.9895935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9896001Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9896228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9896301Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9896500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9896579Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9896802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9896914Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9896917Z 2025-08-14T21:32:55.9897018Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9897196Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9897255Z return mod(**inputs) 2025-08-14T21:32:55.9897493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9897553Z outputs = self.model( 2025-08-14T21:32:55.9897786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9897880Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9898105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9898177Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9898375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9898453Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9898676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9898782Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9899018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9899083Z return self.act(input) 2025-08-14T21:32:55.9899086Z 2025-08-14T21:32:55.9899179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9899369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9899427Z return mod(**inputs) 2025-08-14T21:32:55.9899662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9899723Z outputs = self.model( 2025-08-14T21:32:55.9899952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9900027Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9900251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9900319Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9900527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9900599Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9900833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9900905Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9900908Z 2025-08-14T21:32:55.9901000Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9901186Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9901245Z return mod(**inputs) 2025-08-14T21:32:55.9901478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9901541Z outputs = self.model( 2025-08-14T21:32:55.9901765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9901837Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9902064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9902127Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9902335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9902405Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9902635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9902718Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9902940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9903089Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9903092Z 2025-08-14T21:32:55.9903184Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9903400Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9903461Z return mod(**inputs) 2025-08-14T21:32:55.9903688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9903756Z outputs = self.model( 2025-08-14T21:32:55.9903987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9904053Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9904286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9904382Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9904595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9904723Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9904958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9905049Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9905275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9905346Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9905357Z 2025-08-14T21:32:55.9905449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9905629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9905698Z return mod(**inputs) 2025-08-14T21:32:55.9905926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9905987Z outputs = self.model( 2025-08-14T21:32:55.9906222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9906287Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9906519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9906585Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9906786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9906866Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9907091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9907185Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9907420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9907499Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9907504Z 2025-08-14T21:32:55.9907588Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9907661Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9907731Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9907807Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9907899Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9908080Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9908147Z return mod(**inputs) 2025-08-14T21:32:55.9908373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9908443Z outputs = self.model( 2025-08-14T21:32:55.9908668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9908733Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9909002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9909070Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9909270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9909350Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9909570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9909659Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9909910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9910000Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9910275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9910399Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9910402Z 2025-08-14T21:32:55.9910503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9910683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9910743Z return mod(**inputs) 2025-08-14T21:32:55.9910977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9911038Z outputs = self.model( 2025-08-14T21:32:55.9911263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9911339Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9911563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9911640Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9911841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9911912Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9912141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9912223Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9912454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9912548Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9912811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9912920Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9912923Z 2025-08-14T21:32:55.9913020Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9913201Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9913267Z return mod(**inputs) 2025-08-14T21:32:55.9913494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9913561Z outputs = self.model( 2025-08-14T21:32:55.9913786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9913851Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9914086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9914151Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9914349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9914494Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9914720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9914806Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9915028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9915100Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9915103Z 2025-08-14T21:32:55.9915202Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9915416Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9915479Z return mod(**inputs) 2025-08-14T21:32:55.9915706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9915767Z outputs = self.model( 2025-08-14T21:32:55.9916002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9916068Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9916291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9916365Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9916565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9916641Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9916867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9916977Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9916980Z 2025-08-14T21:32:55.9917081Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9917265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9917330Z return mod(**inputs) 2025-08-14T21:32:55.9917556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9917617Z outputs = self.model( 2025-08-14T21:32:55.9917848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9917915Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9918139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9918214Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9918434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9918510Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9918736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9918844Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9919045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9919109Z return self.act(input) 2025-08-14T21:32:55.9919112Z 2025-08-14T21:32:55.9919205Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9919394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9919454Z return mod(**inputs) 2025-08-14T21:32:55.9919690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9919751Z outputs = self.model( 2025-08-14T21:32:55.9920005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9920085Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9920318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9920393Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9920598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9920673Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9920910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9921013Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9921016Z 2025-08-14T21:32:55.9921110Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9921300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9921360Z return mod(**inputs) 2025-08-14T21:32:55.9921595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9921656Z outputs = self.model( 2025-08-14T21:32:55.9921883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9921956Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9922179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9922247Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9922454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9922525Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9922759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9922843Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9923069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9923214Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9923218Z 2025-08-14T21:32:55.9923309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9923498Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9923561Z return mod(**inputs) 2025-08-14T21:32:55.9923789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9923858Z outputs = self.model( 2025-08-14T21:32:55.9924088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9924155Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9924386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9924450Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9924657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9924728Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9924949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9925042Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9925264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9925347Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9925350Z 2025-08-14T21:32:55.9925472Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9925657Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9925724Z return mod(**inputs) 2025-08-14T21:32:55.9925953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9926015Z outputs = self.model( 2025-08-14T21:32:55.9926252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9926348Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9926581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9926647Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9926847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9926925Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9927150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9927229Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9927459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9927535Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9927539Z 2025-08-14T21:32:55.9927618Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9927692Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9927760Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9927835Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9927927Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9928110Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9928176Z return mod(**inputs) 2025-08-14T21:32:55.9928401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9928467Z outputs = self.model( 2025-08-14T21:32:55.9928693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9928758Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9928987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9929054Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9929260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9929329Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9929552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9929641Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9929863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9929952Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9930222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9930341Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9930346Z 2025-08-14T21:32:55.9930445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9930625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9930684Z return mod(**inputs) 2025-08-14T21:32:55.9930943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9931007Z outputs = self.model( 2025-08-14T21:32:55.9931238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9931311Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9931538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9931610Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9931811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9931922Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9932156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9932240Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9932471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9932561Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9932827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9932935Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9932939Z 2025-08-14T21:32:55.9933031Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9933225Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9933285Z return mod(**inputs) 2025-08-14T21:32:55.9933515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9933587Z outputs = self.model( 2025-08-14T21:32:55.9933820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9933885Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9934119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9934183Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9934390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9934461Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9934689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9934777Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9934999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9935075Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9935079Z 2025-08-14T21:32:55.9935179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9935360Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9935426Z return mod(**inputs) 2025-08-14T21:32:55.9935655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9935715Z outputs = self.model( 2025-08-14T21:32:55.9935950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9936019Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9936242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9936316Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9936546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9936629Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9936850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9936960Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9936963Z 2025-08-14T21:32:55.9937064Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9937248Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9937346Z return mod(**inputs) 2025-08-14T21:32:55.9937579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9937640Z outputs = self.model( 2025-08-14T21:32:55.9937882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9937946Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9938169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9938242Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9938441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9938521Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9938744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9938855Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9939057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9939120Z return self.act(input) 2025-08-14T21:32:55.9939126Z 2025-08-14T21:32:55.9939230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9939410Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9939468Z return mod(**inputs) 2025-08-14T21:32:55.9939701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9939760Z outputs = self.model( 2025-08-14T21:32:55.9939986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9940061Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9940290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9940360Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9940562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9940633Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9940863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9940934Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9940938Z 2025-08-14T21:32:55.9941028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9941217Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9941275Z return mod(**inputs) 2025-08-14T21:32:55.9941514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9941573Z outputs = self.model( 2025-08-14T21:32:55.9941797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9941897Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9942125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9942197Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9942396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9942468Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9942697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9942811Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9943035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9943180Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9943184Z 2025-08-14T21:32:55.9943279Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9943464Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9943522Z return mod(**inputs) 2025-08-14T21:32:55.9943750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9943820Z outputs = self.model( 2025-08-14T21:32:55.9944048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9944112Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9944344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9944408Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9944615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9944749Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9944982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9945073Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9945297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9945379Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9945382Z 2025-08-14T21:32:55.9945476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9945662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9945731Z return mod(**inputs) 2025-08-14T21:32:55.9945962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9946025Z outputs = self.model( 2025-08-14T21:32:55.9946266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9946333Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9946568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9946633Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9946838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9946917Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9947147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9947238Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9947495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9947575Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9947579Z 2025-08-14T21:32:55.9947658Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9947729Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9947798Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9947873Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9947964Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9948152Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9948241Z return mod(**inputs) 2025-08-14T21:32:55.9948469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9948538Z outputs = self.model( 2025-08-14T21:32:55.9948766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9948833Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9949064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9949128Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9949334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9949405Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9949626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9949718Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9949946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9950035Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9950312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9950433Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9950436Z 2025-08-14T21:32:55.9950536Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9950718Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9950776Z return mod(**inputs) 2025-08-14T21:32:55.9951012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9951076Z outputs = self.model( 2025-08-14T21:32:55.9951310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9951378Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9951608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9951681Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9951879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9951950Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9952183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9952263Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9952494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9952584Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9952846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9952978Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9952981Z 2025-08-14T21:32:55.9953077Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9953267Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9953325Z return mod(**inputs) 2025-08-14T21:32:55.9953554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9953620Z outputs = self.model( 2025-08-14T21:32:55.9953848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9953951Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9954185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9954250Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9954459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9954531Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9954758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9954846Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9955071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9955144Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9955156Z 2025-08-14T21:32:55.9955249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9955431Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9955497Z return mod(**inputs) 2025-08-14T21:32:55.9955731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9955790Z outputs = self.model( 2025-08-14T21:32:55.9956027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9956091Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9956323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9956388Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9956591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9956671Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9956897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9957005Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9957011Z 2025-08-14T21:32:55.9957111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9957294Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9957357Z return mod(**inputs) 2025-08-14T21:32:55.9957588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9957648Z outputs = self.model( 2025-08-14T21:32:55.9957884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9957952Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9958178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9958251Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9958485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9958567Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9958794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9958902Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9959105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9959169Z return self.act(input) 2025-08-14T21:32:55.9959172Z 2025-08-14T21:32:55.9959272Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9959493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9959552Z return mod(**inputs) 2025-08-14T21:32:55.9959785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9959847Z outputs = self.model( 2025-08-14T21:32:55.9960074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9960149Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9960382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9960454Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9960653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9960727Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9960957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9961029Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9961033Z 2025-08-14T21:32:55.9961136Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9961313Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9961371Z return mod(**inputs) 2025-08-14T21:32:55.9961606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9961665Z outputs = self.model( 2025-08-14T21:32:55.9961891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9961965Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9962193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9962266Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9962466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9962537Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9962765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9962846Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9963066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9963209Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9963213Z 2025-08-14T21:32:55.9963306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9963495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9963553Z return mod(**inputs) 2025-08-14T21:32:55.9963780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9963877Z outputs = self.model( 2025-08-14T21:32:55.9964106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9964180Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9964404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9964470Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9964678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9964749Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9965000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9965089Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9965315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9965393Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9965396Z 2025-08-14T21:32:55.9965489Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9965669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9965736Z return mod(**inputs) 2025-08-14T21:32:55.9965961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9966028Z outputs = self.model( 2025-08-14T21:32:55.9966255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9966324Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9966555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9966621Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9966820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9966896Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9967119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9967206Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9967430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9967509Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9967513Z 2025-08-14T21:32:55.9967591Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9967661Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9967730Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9967806Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9967901Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9968089Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9968147Z return mod(**inputs) 2025-08-14T21:32:55.9968375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9968442Z outputs = self.model( 2025-08-14T21:32:55.9968670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9968735Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9968969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9969033Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9969240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9969341Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9969566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9969654Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9969877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9969966Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9970241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9970391Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9970394Z 2025-08-14T21:32:55.9970495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9970680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9970739Z return mod(**inputs) 2025-08-14T21:32:55.9970977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9971037Z outputs = self.model( 2025-08-14T21:32:55.9971272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9971339Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9971565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9971643Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9971845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9971916Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9972151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9972232Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9972465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9972553Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9972819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9972926Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9972932Z 2025-08-14T21:32:55.9973026Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9973218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9973278Z return mod(**inputs) 2025-08-14T21:32:55.9973509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9973580Z outputs = self.model( 2025-08-14T21:32:55.9973810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9973875Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9974109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9974174Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9974381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9974455Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9974678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 312, in forward 2025-08-14T21:32:55.9974766Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:32:55.9975027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9975110Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9975114Z 2025-08-14T21:32:55.9975206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9975388Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9975452Z return mod(**inputs) 2025-08-14T21:32:55.9975679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9975769Z outputs = self.model( 2025-08-14T21:32:55.9976007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9976073Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9976308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9976375Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9976578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9976656Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9976880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9976990Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9977000Z 2025-08-14T21:32:55.9977093Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9977278Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9977342Z return mod(**inputs) 2025-08-14T21:32:55.9977573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9977637Z outputs = self.model( 2025-08-14T21:32:55.9977874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9977939Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9978171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9978235Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9978437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9978519Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9978745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 323, in forward 2025-08-14T21:32:55.9978851Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:55.9979053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:55.9979116Z return self.act(input) 2025-08-14T21:32:55.9979119Z 2025-08-14T21:32:55.9979219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9979398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9979456Z return mod(**inputs) 2025-08-14T21:32:55.9979690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9979749Z outputs = self.model( 2025-08-14T21:32:55.9979981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1270, in forward 2025-08-14T21:32:55.9980053Z encoder_outputs = self.encoder( 2025-08-14T21:32:55.9980279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 869, in forward 2025-08-14T21:32:55.9980377Z layer_outputs = encoder_layer( 2025-08-14T21:32:55.9980578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9980649Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9980880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 325, in forward 2025-08-14T21:32:55.9980953Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:55.9980956Z 2025-08-14T21:32:55.9981056Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9981236Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9981332Z return mod(**inputs) 2025-08-14T21:32:55.9981567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9981627Z outputs = self.model( 2025-08-14T21:32:55.9981858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:55.9981932Z decoder_outputs = self.decoder( 2025-08-14T21:32:55.9982158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:55.9982228Z layer_outputs = decoder_layer( 2025-08-14T21:32:55.9982429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9982501Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9982731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:55.9982826Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:55.9983049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9983196Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9983200Z 2025-08-14T21:32:55.9983785Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9983975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9984034Z return mod(**inputs) 2025-08-14T21:32:55.9984263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9984332Z outputs = self.model( 2025-08-14T21:32:55.9984727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:55.9984820Z decoder_outputs = self.decoder( 2025-08-14T21:32:55.9985051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:55.9985119Z layer_outputs = decoder_layer( 2025-08-14T21:32:55.9985329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9985399Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9985635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:55.9985728Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:55.9985955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:55.9986036Z key_states = self.k_proj(current_states) 2025-08-14T21:32:55.9986042Z 2025-08-14T21:32:55.9986137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9986324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9986394Z return mod(**inputs) 2025-08-14T21:32:55.9986703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9986778Z outputs = self.model( 2025-08-14T21:32:55.9987010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:55.9987076Z decoder_outputs = self.decoder( 2025-08-14T21:32:55.9987313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:55.9987377Z layer_outputs = decoder_layer( 2025-08-14T21:32:55.9987577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9987693Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9987917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:55.9988019Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:55.9988247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:55.9988325Z value_states = self.v_proj(current_states) 2025-08-14T21:32:55.9988328Z 2025-08-14T21:32:55.9988410Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9988482Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9988562Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9988632Z cudagraph partition due to non gpu ops 2025-08-14T21:32:55.9988724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9988918Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9988977Z return mod(**inputs) 2025-08-14T21:32:55.9989203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9989275Z outputs = self.model( 2025-08-14T21:32:55.9989501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:55.9989574Z decoder_outputs = self.decoder( 2025-08-14T21:32:55.9989800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:55.9989865Z layer_outputs = decoder_layer( 2025-08-14T21:32:55.9990074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9990144Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9990370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:55.9990465Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:55.9990691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9990787Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9991052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:55.9991171Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:55.9991174Z 2025-08-14T21:32:55.9991273Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9991451Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9991521Z return mod(**inputs) 2025-08-14T21:32:55.9991747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9991809Z outputs = self.model( 2025-08-14T21:32:55.9992069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:55.9992138Z decoder_outputs = self.decoder( 2025-08-14T21:32:55.9992370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:55.9992443Z layer_outputs = decoder_layer( 2025-08-14T21:32:55.9992647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9992724Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9992953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:55.9993071Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:55.9993306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:55.9993391Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:55.9993662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:55.9993767Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:55.9993770Z 2025-08-14T21:32:55.9993862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9994052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9994111Z return mod(**inputs) 2025-08-14T21:32:55.9994341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9994413Z outputs = self.model( 2025-08-14T21:32:55.9994640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:55.9994712Z decoder_outputs = self.decoder( 2025-08-14T21:32:55.9994944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:55.9995010Z layer_outputs = decoder_layer( 2025-08-14T21:32:55.9995219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9995289Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9995516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:55.9995609Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:55.9995834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:55.9995918Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:55.9995922Z 2025-08-14T21:32:55.9996015Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9996200Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9996266Z return mod(**inputs) 2025-08-14T21:32:55.9996496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9996565Z outputs = self.model( 2025-08-14T21:32:55.9996795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:55.9996859Z decoder_outputs = self.decoder( 2025-08-14T21:32:55.9997093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:55.9997160Z layer_outputs = decoder_layer( 2025-08-14T21:32:55.9997363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:55.9997441Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:55.9997712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:55.9997823Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:55.9998049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:55.9998187Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:55.9998190Z 2025-08-14T21:32:55.9998291Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:55.9998472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:55.9999089Z return mod(**inputs) 2025-08-14T21:32:55.9999330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:55.9999393Z outputs = self.model( 2025-08-14T21:32:55.9999641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:55.9999710Z decoder_outputs = self.decoder( 2025-08-14T21:32:55.9999944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0000018Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0000227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0000308Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0000542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0000644Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0000884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0000957Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0000963Z 2025-08-14T21:32:56.0001060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0001256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0001315Z return mod(**inputs) 2025-08-14T21:32:56.0001565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0001627Z outputs = self.model( 2025-08-14T21:32:56.0001864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0001943Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0002180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0002252Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0002461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0002531Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0002772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0002868Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0003096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0003183Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0003186Z 2025-08-14T21:32:56.0003260Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0003338Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0003407Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0003475Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0003575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0003788Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0003849Z return mod(**inputs) 2025-08-14T21:32:56.0004084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0004145Z outputs = self.model( 2025-08-14T21:32:56.0004383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0004448Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0004677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0004781Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0004987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0005057Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0005297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0005396Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0005636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0005723Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0005992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0006124Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0006128Z 2025-08-14T21:32:56.0006221Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0006410Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0006469Z return mod(**inputs) 2025-08-14T21:32:56.0006707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0006773Z outputs = self.model( 2025-08-14T21:32:56.0007006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0007073Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0007313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0007378Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0007591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0007661Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0007890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0007996Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0008224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0008318Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0008589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0008684Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0008687Z 2025-08-14T21:32:56.0008786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0008973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0009032Z return mod(**inputs) 2025-08-14T21:32:56.0009272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0009361Z outputs = self.model( 2025-08-14T21:32:56.0009595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0009664Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0009894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0009966Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0010168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0010320Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0010554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0010651Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0010885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0010959Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0010963Z 2025-08-14T21:32:56.0011056Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0011246Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0011304Z return mod(**inputs) 2025-08-14T21:32:56.0011539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0011600Z outputs = self.model( 2025-08-14T21:32:56.0011832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0011907Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0012139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0012208Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0012421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0012494Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0012729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0012840Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0012843Z 2025-08-14T21:32:56.0012935Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0013129Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0013191Z return mod(**inputs) 2025-08-14T21:32:56.0013427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0013488Z outputs = self.model( 2025-08-14T21:32:56.0013718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0013792Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0014022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0014086Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0014305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0014376Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0014614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0014723Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0014947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0015019Z return self.act(input) 2025-08-14T21:32:56.0015023Z 2025-08-14T21:32:56.0015116Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0015295Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0015360Z return mod(**inputs) 2025-08-14T21:32:56.0015586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0015654Z outputs = self.model( 2025-08-14T21:32:56.0015880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0015987Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0016224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0016288Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0016498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0016569Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0016792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0016872Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0016875Z 2025-08-14T21:32:56.0016968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0017148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0017217Z return mod(**inputs) 2025-08-14T21:32:56.0017443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0017512Z outputs = self.model( 2025-08-14T21:32:56.0017736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0017802Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0018033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0018097Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0018296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0018373Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0018596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0018696Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0018919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0019059Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0019062Z 2025-08-14T21:32:56.0019161Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0019342Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0019408Z return mod(**inputs) 2025-08-14T21:32:56.0019633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0019693Z outputs = self.model( 2025-08-14T21:32:56.0019928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0019996Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0020221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0020293Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0020520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0020601Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0020827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0020915Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0021147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0021219Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0021251Z 2025-08-14T21:32:56.0021351Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0021532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0021590Z return mod(**inputs) 2025-08-14T21:32:56.0021823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0021886Z outputs = self.model( 2025-08-14T21:32:56.0022110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0022183Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0022409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0022479Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0022680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0022753Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0022987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0023074Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0023300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0023386Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0023390Z 2025-08-14T21:32:56.0023461Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0023540Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0023609Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0023678Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0023778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0023959Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0024022Z return mod(**inputs) 2025-08-14T21:32:56.0024256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0024318Z outputs = self.model( 2025-08-14T21:32:56.0024560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0024627Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0024932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0025013Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0025220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0025293Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0025534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0025630Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0025866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0025997Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0026267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0026397Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0026400Z 2025-08-14T21:32:56.0026493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0026681Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0026742Z return mod(**inputs) 2025-08-14T21:32:56.0027008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0027079Z outputs = self.model( 2025-08-14T21:32:56.0027312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0027384Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0027624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0027692Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0027903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0027974Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0028201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0028297Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0028528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0028622Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0028889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0028988Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0028991Z 2025-08-14T21:32:56.0029092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0029275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0029335Z return mod(**inputs) 2025-08-14T21:32:56.0029574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0029635Z outputs = self.model( 2025-08-14T21:32:56.0029877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0029943Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0030175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0030247Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0030450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0030528Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0030757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0030843Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0031081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0031159Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0031162Z 2025-08-14T21:32:56.0031254Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0031445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0035550Z return mod(**inputs) 2025-08-14T21:32:56.0035806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0035871Z outputs = self.model( 2025-08-14T21:32:56.0036102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0036178Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0036408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0036475Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0036710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0036782Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0037016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0037147Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0037371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0037516Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0037520Z 2025-08-14T21:32:56.0037613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0037793Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0037860Z return mod(**inputs) 2025-08-14T21:32:56.0038090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0038157Z outputs = self.model( 2025-08-14T21:32:56.0038382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0038453Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0038684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0038746Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0038946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0039025Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0039247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0039350Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0039574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0039645Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0039648Z 2025-08-14T21:32:56.0039754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0039937Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0039994Z return mod(**inputs) 2025-08-14T21:32:56.0040225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0040285Z outputs = self.model( 2025-08-14T21:32:56.0040515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0040581Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0040808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0040880Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0041110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0041248Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0041475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0041572Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0041803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0041881Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0041885Z 2025-08-14T21:32:56.0041956Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0042053Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0042122Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0042197Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0042292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0042478Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0042547Z return mod(**inputs) 2025-08-14T21:32:56.0042777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0042838Z outputs = self.model( 2025-08-14T21:32:56.0043075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0043143Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0043377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0043445Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0043648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0043726Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0043956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0044056Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0044293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0044383Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0044658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0044779Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0044784Z 2025-08-14T21:32:56.0044879Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0045070Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0045130Z return mod(**inputs) 2025-08-14T21:32:56.0045372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0045433Z outputs = self.model( 2025-08-14T21:32:56.0045663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0045737Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0045964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0046030Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0046240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0046312Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0046545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0046672Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0046921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0047017Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0047280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0047381Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0047384Z 2025-08-14T21:32:56.0047477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0047675Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0047742Z return mod(**inputs) 2025-08-14T21:32:56.0047970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0048031Z outputs = self.model( 2025-08-14T21:32:56.0048269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0048335Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0048568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0048631Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0048832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0048910Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0049134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0049236Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0049462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0049538Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0049541Z 2025-08-14T21:32:56.0049639Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0049822Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0049879Z return mod(**inputs) 2025-08-14T21:32:56.0050112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0050172Z outputs = self.model( 2025-08-14T21:32:56.0050405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0050472Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0050694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0050766Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0050970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0051040Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0051269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0051376Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0051380Z 2025-08-14T21:32:56.0051476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0051658Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0051717Z return mod(**inputs) 2025-08-14T21:32:56.0051950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0052010Z outputs = self.model( 2025-08-14T21:32:56.0052269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0052356Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0052586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0052658Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0052860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0052932Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0053167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0053292Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0053494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0053559Z return self.act(input) 2025-08-14T21:32:56.0053564Z 2025-08-14T21:32:56.0053656Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0053844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0053903Z return mod(**inputs) 2025-08-14T21:32:56.0054129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0054197Z outputs = self.model( 2025-08-14T21:32:56.0054422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0054495Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0054719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0054783Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0054992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0055063Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0055294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0055365Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0055369Z 2025-08-14T21:32:56.0055460Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0055649Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0055707Z return mod(**inputs) 2025-08-14T21:32:56.0055937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0056004Z outputs = self.model( 2025-08-14T21:32:56.0056232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0056305Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0056531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0056596Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0056804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0056873Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0057096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0057195Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0057420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0057565Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0057611Z 2025-08-14T21:32:56.0057708Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0057891Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0057958Z return mod(**inputs) 2025-08-14T21:32:56.0058187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0058254Z outputs = self.model( 2025-08-14T21:32:56.0058481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0058562Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0058797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0058862Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0059065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0059143Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0059368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0059465Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0059687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0059759Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0059762Z 2025-08-14T21:32:56.0059861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0060043Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0060108Z return mod(**inputs) 2025-08-14T21:32:56.0060335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0060400Z outputs = self.model( 2025-08-14T21:32:56.0060633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0060697Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0060922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0060991Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0061192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0061272Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0061495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0061586Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0061822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0061904Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0061907Z 2025-08-14T21:32:56.0061986Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0062057Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0062124Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0062199Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0062292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0062475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0062543Z return mod(**inputs) 2025-08-14T21:32:56.0062771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0062831Z outputs = self.model( 2025-08-14T21:32:56.0063099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0063182Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0063420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0063484Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0063686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0063764Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0063991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0064105Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0064334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0064427Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0064797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0064929Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0064932Z 2025-08-14T21:32:56.0065027Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0065221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0065283Z return mod(**inputs) 2025-08-14T21:32:56.0065529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0065594Z outputs = self.model( 2025-08-14T21:32:56.0065840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0065917Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0066148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0066223Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0066427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0066499Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0066736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0066826Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0067051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0067147Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0067416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0067526Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0067529Z 2025-08-14T21:32:56.0067622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0067803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0067870Z return mod(**inputs) 2025-08-14T21:32:56.0068099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0068165Z outputs = self.model( 2025-08-14T21:32:56.0068393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0068460Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0068696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0068792Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0069010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0069091Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0069314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0069408Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0069630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0069723Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0069726Z 2025-08-14T21:32:56.0069826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0070008Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0070068Z return mod(**inputs) 2025-08-14T21:32:56.0070310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0070371Z outputs = self.model( 2025-08-14T21:32:56.0070607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0070673Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0070901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0070974Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0071177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0071257Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0071480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0071582Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0071827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0071965Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0071968Z 2025-08-14T21:32:56.0072061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0072252Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0072311Z return mod(**inputs) 2025-08-14T21:32:56.0072545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0072606Z outputs = self.model( 2025-08-14T21:32:56.0072832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0072908Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0073134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0073205Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0073407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0073479Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0073711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0073810Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0074038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0074116Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0074120Z 2025-08-14T21:32:56.0074241Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0074451Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0074512Z return mod(**inputs) 2025-08-14T21:32:56.0074740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0074807Z outputs = self.model( 2025-08-14T21:32:56.0075035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0075102Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0075351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0075415Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0075623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0075697Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0075920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0076024Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0076249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0076330Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0076334Z 2025-08-14T21:32:56.0076404Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0076476Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0076556Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0076623Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0076715Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0076903Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0076966Z return mod(**inputs) 2025-08-14T21:32:56.0077201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0077259Z outputs = self.model( 2025-08-14T21:32:56.0077485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0077558Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0077784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0077849Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0078053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0078123Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0078357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0078451Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0078673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0078766Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0079029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0079148Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0079159Z 2025-08-14T21:32:56.0079251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0079431Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0079495Z return mod(**inputs) 2025-08-14T21:32:56.0079748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0079826Z outputs = self.model( 2025-08-14T21:32:56.0080062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0080128Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0080362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0080426Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0080625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0080722Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0080946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0081041Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0081273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0081363Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0081635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0081731Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0081734Z 2025-08-14T21:32:56.0081825Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0082015Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0082077Z return mod(**inputs) 2025-08-14T21:32:56.0082312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0082373Z outputs = self.model( 2025-08-14T21:32:56.0082602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0082677Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0082902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0082966Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0083174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0083245Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0083475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0083572Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0083796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0083880Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0083884Z 2025-08-14T21:32:56.0083977Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0084165Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0084224Z return mod(**inputs) 2025-08-14T21:32:56.0084453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0084521Z outputs = self.model( 2025-08-14T21:32:56.0084866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0084943Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0085220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0085287Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0085577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0085682Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0085915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0086034Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0086037Z 2025-08-14T21:32:56.0086131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0086320Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0086428Z return mod(**inputs) 2025-08-14T21:32:56.0086663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0086733Z outputs = self.model( 2025-08-14T21:32:56.0086967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0087037Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0087276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0087342Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0087554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0087627Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0087856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0087973Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0088171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0088235Z return self.act(input) 2025-08-14T21:32:56.0088238Z 2025-08-14T21:32:56.0088342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0088526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0088591Z return mod(**inputs) 2025-08-14T21:32:56.0088825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0088885Z outputs = self.model( 2025-08-14T21:32:56.0089125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0089192Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0089422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0089495Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0089703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0089785Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0090012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0090085Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0090088Z 2025-08-14T21:32:56.0090189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0090373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0090440Z return mod(**inputs) 2025-08-14T21:32:56.0090671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0090734Z outputs = self.model( 2025-08-14T21:32:56.0090971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0091066Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0091334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0091406Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0091603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0091680Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0091904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0091992Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0092243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0092380Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0092384Z 2025-08-14T21:32:56.0092485Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0092666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0092723Z return mod(**inputs) 2025-08-14T21:32:56.0092957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0093017Z outputs = self.model( 2025-08-14T21:32:56.0093242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0093313Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0093541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0093612Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0093812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0093887Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0094119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0094207Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0094430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0094510Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0094513Z 2025-08-14T21:32:56.0094606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0094793Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0094852Z return mod(**inputs) 2025-08-14T21:32:56.0095076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0095143Z outputs = self.model( 2025-08-14T21:32:56.0095373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0095448Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0095674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0095737Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0095944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0096014Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0096238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0096334Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0096586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0096689Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0096693Z 2025-08-14T21:32:56.0096767Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0096837Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0096914Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0096982Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0097074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0097265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0097324Z return mod(**inputs) 2025-08-14T21:32:56.0097577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0097638Z outputs = self.model( 2025-08-14T21:32:56.0097864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0097942Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0098169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0098234Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0098442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0098513Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0098744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0098832Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0099056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0099151Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0099418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0099546Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0099550Z 2025-08-14T21:32:56.0099642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0099825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0099892Z return mod(**inputs) 2025-08-14T21:32:56.0100119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0100179Z outputs = self.model( 2025-08-14T21:32:56.0100412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0100479Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0100712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0100780Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0100979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0101056Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0101280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0101374Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0101598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0101688Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0101957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0102082Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0102100Z 2025-08-14T21:32:56.0102196Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0102388Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0102449Z return mod(**inputs) 2025-08-14T21:32:56.0102689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0102752Z outputs = self.model( 2025-08-14T21:32:56.0102978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0103084Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0103310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0103374Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0103582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0103654Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0103883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0103971Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0104193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0104274Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0104278Z 2025-08-14T21:32:56.0104369Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0104558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0104618Z return mod(**inputs) 2025-08-14T21:32:56.0104943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0105019Z outputs = self.model( 2025-08-14T21:32:56.0105249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0105318Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0105549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0105614Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0105826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0105899Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0106123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0106229Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0106459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0106605Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0106608Z 2025-08-14T21:32:56.0106703Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0106882Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0106949Z return mod(**inputs) 2025-08-14T21:32:56.0107175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0107240Z outputs = self.model( 2025-08-14T21:32:56.0107475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0107541Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0107807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0107891Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0108098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0108176Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0108410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0108508Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0108754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0108826Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0108830Z 2025-08-14T21:32:56.0108930Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0109117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0109177Z return mod(**inputs) 2025-08-14T21:32:56.0109419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0109478Z outputs = self.model( 2025-08-14T21:32:56.0109720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0109786Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0110018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0110093Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0110296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0110367Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0110605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0110704Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0110940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0111019Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0111023Z 2025-08-14T21:32:56.0111094Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0111175Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0111244Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0111313Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0111416Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0111601Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0111668Z return mod(**inputs) 2025-08-14T21:32:56.0111908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0111969Z outputs = self.model( 2025-08-14T21:32:56.0112208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0112275Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0112506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0112581Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0112787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0112864Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0113096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0113243Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0113474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0113563Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0113833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0113954Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0113957Z 2025-08-14T21:32:56.0114050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0114256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0114316Z return mod(**inputs) 2025-08-14T21:32:56.0114546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0114616Z outputs = self.model( 2025-08-14T21:32:56.0114843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0114916Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0115144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0115208Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0115414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0115487Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0115718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0115814Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0116039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0116136Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0116399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0116494Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0116505Z 2025-08-14T21:32:56.0116597Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0116778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0116846Z return mod(**inputs) 2025-08-14T21:32:56.0117073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0117134Z outputs = self.model( 2025-08-14T21:32:56.0117370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0117438Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0117670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0117734Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0117934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0118010Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0118234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0118332Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0118563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0118636Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0118681Z 2025-08-14T21:32:56.0118782Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0118966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0119025Z return mod(**inputs) 2025-08-14T21:32:56.0119261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0119320Z outputs = self.model( 2025-08-14T21:32:56.0119548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0119641Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0119870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0119942Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0120146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0120219Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0120450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0120560Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0120563Z 2025-08-14T21:32:56.0120664Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0120845Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0120903Z return mod(**inputs) 2025-08-14T21:32:56.0121142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0121202Z outputs = self.model( 2025-08-14T21:32:56.0121430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0121509Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0121734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0121804Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0122002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0122075Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0122306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0122417Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0122621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0122687Z return self.act(input) 2025-08-14T21:32:56.0122690Z 2025-08-14T21:32:56.0122786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0122975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0123033Z return mod(**inputs) 2025-08-14T21:32:56.0123260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0123328Z outputs = self.model( 2025-08-14T21:32:56.0123556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0123630Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0123859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0123923Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0124131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0124227Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0124481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0124563Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0124567Z 2025-08-14T21:32:56.0124660Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0124849Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0124907Z return mod(**inputs) 2025-08-14T21:32:56.0125133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0125221Z outputs = self.model( 2025-08-14T21:32:56.0125447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0125527Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0125756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0125823Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0126029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0126100Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0126323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0126421Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0126646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0126788Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0126791Z 2025-08-14T21:32:56.0126882Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0127067Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0127133Z return mod(**inputs) 2025-08-14T21:32:56.0127358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0127426Z outputs = self.model( 2025-08-14T21:32:56.0127651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0127717Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0127949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0128015Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0128216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0128298Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0128522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0128617Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0128839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0128911Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0128914Z 2025-08-14T21:32:56.0129013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0129192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0129254Z return mod(**inputs) 2025-08-14T21:32:56.0129486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0129546Z outputs = self.model( 2025-08-14T21:32:56.0129805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0129902Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0130131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0130204Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0130404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0130483Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0130708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0130815Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0131050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0131128Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0131133Z 2025-08-14T21:32:56.0131205Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0131282Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0131351Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0131426Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0131516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0131696Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0131760Z return mod(**inputs) 2025-08-14T21:32:56.0131988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0132049Z outputs = self.model( 2025-08-14T21:32:56.0132281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0132348Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0132581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0132646Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0132847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0132925Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0133149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0133236Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0133467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0133554Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0133831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0133952Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0133955Z 2025-08-14T21:32:56.0134045Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0134231Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0134290Z return mod(**inputs) 2025-08-14T21:32:56.0134526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0134585Z outputs = self.model( 2025-08-14T21:32:56.0134812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0134884Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0135141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0135220Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0135427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0135498Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0135728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0135816Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0136039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0136151Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0136416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0136520Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0136527Z 2025-08-14T21:32:56.0136619Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0136800Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0136869Z return mod(**inputs) 2025-08-14T21:32:56.0137093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0137154Z outputs = self.model( 2025-08-14T21:32:56.0137387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0137455Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0137688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0137755Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0137957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0138037Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0138260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0138348Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0138580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0138653Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0138656Z 2025-08-14T21:32:56.0138756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0138935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0138993Z return mod(**inputs) 2025-08-14T21:32:56.0139228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0139290Z outputs = self.model( 2025-08-14T21:32:56.0139524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0139589Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0139814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0139885Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0140083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0140155Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0140384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0140479Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0140739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0140897Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0140900Z 2025-08-14T21:32:56.0140993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0141181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0141239Z return mod(**inputs) 2025-08-14T21:32:56.0141475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0141556Z outputs = self.model( 2025-08-14T21:32:56.0141783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0141856Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0142084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0142150Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0142359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0142430Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0142662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0142758Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0142983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0143067Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0143070Z 2025-08-14T21:32:56.0143162Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0143352Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0143414Z return mod(**inputs) 2025-08-14T21:32:56.0143639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0143706Z outputs = self.model( 2025-08-14T21:32:56.0143933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0143998Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0144230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0144297Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0144505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0144575Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0144887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0145001Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0145233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0145312Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0145323Z 2025-08-14T21:32:56.0145398Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0145473Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0145552Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0145626Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0145724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0145915Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0145977Z return mod(**inputs) 2025-08-14T21:32:56.0146264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0146355Z outputs = self.model( 2025-08-14T21:32:56.0146586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0146660Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0146889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0146952Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0147166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0147253Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0147476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0147582Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0147805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0147899Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0148161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0148279Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0148283Z 2025-08-14T21:32:56.0148380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0148559Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0148627Z return mod(**inputs) 2025-08-14T21:32:56.0148852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0148913Z outputs = self.model( 2025-08-14T21:32:56.0149151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0149217Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0149443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0149515Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0149715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0149791Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0150017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0150113Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0150345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0150434Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0150704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0150798Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0150802Z 2025-08-14T21:32:56.0150893Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0151077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0151137Z return mod(**inputs) 2025-08-14T21:32:56.0151364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0151435Z outputs = self.model( 2025-08-14T21:32:56.0151664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0151781Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0152008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0152073Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0152283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0152356Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0152592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0152705Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0152930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0153011Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0153015Z 2025-08-14T21:32:56.0153109Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0153292Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0153359Z return mod(**inputs) 2025-08-14T21:32:56.0153585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0153653Z outputs = self.model( 2025-08-14T21:32:56.0153878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0153943Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0154176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0154241Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0154441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0154522Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0154748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0154865Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0154868Z 2025-08-14T21:32:56.0154961Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0155142Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0155208Z return mod(**inputs) 2025-08-14T21:32:56.0155436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0155505Z outputs = self.model( 2025-08-14T21:32:56.0155732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0155799Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0156034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0156100Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0156303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0156380Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0156604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0156718Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0156916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0156980Z return self.act(input) 2025-08-14T21:32:56.0156983Z 2025-08-14T21:32:56.0157087Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0157299Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0157385Z return mod(**inputs) 2025-08-14T21:32:56.0157614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0157675Z outputs = self.model( 2025-08-14T21:32:56.0157913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0157977Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0158205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0158295Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0158502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0158580Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0158812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0158887Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0158890Z 2025-08-14T21:32:56.0158992Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0159175Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0159234Z return mod(**inputs) 2025-08-14T21:32:56.0159471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0159532Z outputs = self.model( 2025-08-14T21:32:56.0159769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0159835Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0160068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0160141Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0160345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0160416Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0160649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0160738Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0160972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0161112Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0161116Z 2025-08-14T21:32:56.0161209Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0161400Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0161461Z return mod(**inputs) 2025-08-14T21:32:56.0161701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0161761Z outputs = self.model( 2025-08-14T21:32:56.0161991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0162067Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0162297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0162366Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0162578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0162649Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0162914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0163020Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0163247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0163328Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0163331Z 2025-08-14T21:32:56.0163423Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0163611Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0163689Z return mod(**inputs) 2025-08-14T21:32:56.0163921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0163989Z outputs = self.model( 2025-08-14T21:32:56.0164222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0164289Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0164527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0164591Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0164802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0164872Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0165097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0165196Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0165424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0165510Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0165516Z 2025-08-14T21:32:56.0165590Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0165664Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0165742Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0165810Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0165903Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0166094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0166152Z return mod(**inputs) 2025-08-14T21:32:56.0166381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0166449Z outputs = self.model( 2025-08-14T21:32:56.0166679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0166752Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0166981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0167046Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0167256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0167328Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0167562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0167650Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0167875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0167968Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0168273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0168410Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0168421Z 2025-08-14T21:32:56.0168513Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0168697Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0168763Z return mod(**inputs) 2025-08-14T21:32:56.0168990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0169050Z outputs = self.model( 2025-08-14T21:32:56.0169285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0169368Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0169602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0169669Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0169874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0169950Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0170177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0170264Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0170499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0170588Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0170859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0170957Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0170960Z 2025-08-14T21:32:56.0171057Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0171246Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0171305Z return mod(**inputs) 2025-08-14T21:32:56.0171542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0171603Z outputs = self.model( 2025-08-14T21:32:56.0171830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0171903Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0172129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0172192Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0172399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0172474Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0172707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0172795Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0173020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0173100Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0173103Z 2025-08-14T21:32:56.0173194Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0173378Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0173444Z return mod(**inputs) 2025-08-14T21:32:56.0173671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0173762Z outputs = self.model( 2025-08-14T21:32:56.0174009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0174075Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0174308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0174372Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0174581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0174651Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0174894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0175002Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0175229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0175368Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0175379Z 2025-08-14T21:32:56.0175473Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0175653Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0175718Z return mod(**inputs) 2025-08-14T21:32:56.0175951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0176012Z outputs = self.model( 2025-08-14T21:32:56.0176247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0176313Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0176548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0176614Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0176815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0176893Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0177117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0177215Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0177447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0177521Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0177525Z 2025-08-14T21:32:56.0177622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0177803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0177864Z return mod(**inputs) 2025-08-14T21:32:56.0178102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0178162Z outputs = self.model( 2025-08-14T21:32:56.0178390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0178464Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0178689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0178763Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0178965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0179036Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0179336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0179449Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0179680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0179760Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0179763Z 2025-08-14T21:32:56.0179833Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0179911Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0179978Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0180046Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0180163Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0180344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0180410Z return mod(**inputs) 2025-08-14T21:32:56.0180640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0180701Z outputs = self.model( 2025-08-14T21:32:56.0180935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0181001Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0181227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0181299Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0181501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0181582Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0181809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0181905Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0182139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0182230Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0182497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0182623Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0182627Z 2025-08-14T21:32:56.0182718Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0182907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0182967Z return mod(**inputs) 2025-08-14T21:32:56.0183195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0183263Z outputs = self.model( 2025-08-14T21:32:56.0183491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0183566Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0183794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0183859Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0184069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0184139Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0184363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0184467Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0184849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0185015Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0185362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0185462Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0185466Z 2025-08-14T21:32:56.0185574Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0185766Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0185833Z return mod(**inputs) 2025-08-14T21:32:56.0186080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0186173Z outputs = self.model( 2025-08-14T21:32:56.0186408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0201947Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0202339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0202421Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0202660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0202745Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0202989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0203108Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0203344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0203430Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0203436Z 2025-08-14T21:32:56.0203542Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0203741Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0203815Z return mod(**inputs) 2025-08-14T21:32:56.0204050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0204125Z outputs = self.model( 2025-08-14T21:32:56.0204357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0204428Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0204664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0204735Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0204943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0205033Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0205262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0205385Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0205390Z 2025-08-14T21:32:56.0205489Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0205677Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0205748Z return mod(**inputs) 2025-08-14T21:32:56.0205979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0206045Z outputs = self.model( 2025-08-14T21:32:56.0206280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0206351Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0206681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0206780Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0206993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0207076Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0207312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0207432Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0207657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0207723Z return self.act(input) 2025-08-14T21:32:56.0207726Z 2025-08-14T21:32:56.0207835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0208023Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0208085Z return mod(**inputs) 2025-08-14T21:32:56.0208325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0208391Z outputs = self.model( 2025-08-14T21:32:56.0208627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0208696Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0208922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0209000Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0209198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0209271Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0209504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0209582Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0209586Z 2025-08-14T21:32:56.0209689Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0209870Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0209929Z return mod(**inputs) 2025-08-14T21:32:56.0210162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0210223Z outputs = self.model( 2025-08-14T21:32:56.0210455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0210521Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0210749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0210824Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0211023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0211094Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0211326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0211420Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0211649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0211790Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0211794Z 2025-08-14T21:32:56.0211888Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0212109Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0212185Z return mod(**inputs) 2025-08-14T21:32:56.0212420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0212481Z outputs = self.model( 2025-08-14T21:32:56.0212707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0212781Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0213006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0213088Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0213304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0213376Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0213620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0213714Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0213947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0214030Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0214034Z 2025-08-14T21:32:56.0214132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0214318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0214387Z return mod(**inputs) 2025-08-14T21:32:56.0214625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0214694Z outputs = self.model( 2025-08-14T21:32:56.0214929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0215000Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0215244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0215310Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0215527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0215598Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0215831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0215933Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0216166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0216244Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0216248Z 2025-08-14T21:32:56.0216336Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0216412Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0216489Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0216558Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0216654Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0216850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0216910Z return mod(**inputs) 2025-08-14T21:32:56.0217146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0217216Z outputs = self.model( 2025-08-14T21:32:56.0217450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0217522Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0217813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0217896Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0218113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0218186Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0218417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0218514Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0218745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0218862Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0219129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0219259Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0219264Z 2025-08-14T21:32:56.0219367Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0219548Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0219617Z return mod(**inputs) 2025-08-14T21:32:56.0219844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0219906Z outputs = self.model( 2025-08-14T21:32:56.0220143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0220213Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0220438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0220513Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0220716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0220800Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0221023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0221114Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0221350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0221441Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0221719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0221823Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0221826Z 2025-08-14T21:32:56.0221922Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0222115Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0222175Z return mod(**inputs) 2025-08-14T21:32:56.0222403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0222474Z outputs = self.model( 2025-08-14T21:32:56.0222701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0222775Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0222999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0223069Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0223275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0223374Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0223616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0223713Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0223937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0224020Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0224023Z 2025-08-14T21:32:56.0224116Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0224297Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0224382Z return mod(**inputs) 2025-08-14T21:32:56.0224616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0224772Z outputs = self.model( 2025-08-14T21:32:56.0225014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0225083Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0225325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0225391Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0225592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0225672Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0225902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0226010Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0226238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0226385Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0226389Z 2025-08-14T21:32:56.0226494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0226677Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0226746Z return mod(**inputs) 2025-08-14T21:32:56.0226980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0227043Z outputs = self.model( 2025-08-14T21:32:56.0227281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0227349Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0227580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0227657Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0227864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0227946Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0228175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0228275Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0228512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0228588Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0228591Z 2025-08-14T21:32:56.0228694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0228878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0228936Z return mod(**inputs) 2025-08-14T21:32:56.0229209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0229286Z outputs = self.model( 2025-08-14T21:32:56.0229514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0229589Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0229816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0229888Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0230104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0230176Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0230406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0230505Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0230730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0230816Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0230819Z 2025-08-14T21:32:56.0230893Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0230973Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0231041Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0231110Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0231211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0231396Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0231454Z return mod(**inputs) 2025-08-14T21:32:56.0231688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0231754Z outputs = self.model( 2025-08-14T21:32:56.0231988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0232054Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0232281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0232354Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0232553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0232625Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0232859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0232953Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0233190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0233282Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0233553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0233676Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0233679Z 2025-08-14T21:32:56.0233772Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0233959Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0234019Z return mod(**inputs) 2025-08-14T21:32:56.0234253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0234313Z outputs = self.model( 2025-08-14T21:32:56.0234565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0234922Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0235153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0235220Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0235431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0235502Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0235740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0235856Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0236083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0236181Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0236450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0236557Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0236560Z 2025-08-14T21:32:56.0236652Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0236834Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0236903Z return mod(**inputs) 2025-08-14T21:32:56.0237131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0237195Z outputs = self.model( 2025-08-14T21:32:56.0237428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0237494Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0237729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0237795Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0237996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0238073Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0238295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0238390Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0238620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0238693Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0238697Z 2025-08-14T21:32:56.0238795Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0238980Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0239039Z return mod(**inputs) 2025-08-14T21:32:56.0239270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0239330Z outputs = self.model( 2025-08-14T21:32:56.0239561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0239625Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0239849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0239920Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0240118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0240188Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0240456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0240583Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0240587Z 2025-08-14T21:32:56.0240687Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0240867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0240926Z return mod(**inputs) 2025-08-14T21:32:56.0241161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0241238Z outputs = self.model( 2025-08-14T21:32:56.0241475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0241541Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0241770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0241843Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0242042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0242112Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0242343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0242449Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0242651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0242715Z return self.act(input) 2025-08-14T21:32:56.0242718Z 2025-08-14T21:32:56.0242814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0243001Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0243064Z return mod(**inputs) 2025-08-14T21:32:56.0243291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0243359Z outputs = self.model( 2025-08-14T21:32:56.0243585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0243660Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0243885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0243951Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0244161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0244232Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0244465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0244540Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0244543Z 2025-08-14T21:32:56.0244636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0244822Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0244880Z return mod(**inputs) 2025-08-14T21:32:56.0245107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0245176Z outputs = self.model( 2025-08-14T21:32:56.0245400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0245475Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0245702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0245793Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0246017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0246087Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0246312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0246411Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0246635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0246798Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0246802Z 2025-08-14T21:32:56.0246894Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0247075Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0247145Z return mod(**inputs) 2025-08-14T21:32:56.0247376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0247442Z outputs = self.model( 2025-08-14T21:32:56.0247670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0247736Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0247973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0248037Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0248242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0248321Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0248545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0248646Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0248873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0248946Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0248949Z 2025-08-14T21:32:56.0249048Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0249230Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0249295Z return mod(**inputs) 2025-08-14T21:32:56.0249523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0249585Z outputs = self.model( 2025-08-14T21:32:56.0249820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0249888Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0250114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0250186Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0250387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0250465Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0250689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0250777Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0251010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0251088Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0251091Z 2025-08-14T21:32:56.0251171Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0251289Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0251361Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0251437Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0251530Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0251713Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0251780Z return mod(**inputs) 2025-08-14T21:32:56.0252006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0252067Z outputs = self.model( 2025-08-14T21:32:56.0252317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0252382Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0252617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0252682Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0252880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0252959Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0253182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0253276Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0253496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0253585Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0253853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0253974Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0253981Z 2025-08-14T21:32:56.0254073Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0254260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0254318Z return mod(**inputs) 2025-08-14T21:32:56.0254552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0254611Z outputs = self.model( 2025-08-14T21:32:56.0254836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0254910Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0255133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0255204Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0255405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0255479Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0255707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0255795Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0256018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0256113Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0256375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0256481Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0256485Z 2025-08-14T21:32:56.0256577Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0256784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0256868Z return mod(**inputs) 2025-08-14T21:32:56.0257102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0257163Z outputs = self.model( 2025-08-14T21:32:56.0257397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0257463Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0257699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0257787Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0257991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0258071Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0258306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0258403Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0258633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0258709Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0258712Z 2025-08-14T21:32:56.0258812Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0258996Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0259057Z return mod(**inputs) 2025-08-14T21:32:56.0259298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0259358Z outputs = self.model( 2025-08-14T21:32:56.0259601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0259667Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0259898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0259968Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0260173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0260252Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0260483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0260583Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0260822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0260967Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0260971Z 2025-08-14T21:32:56.0261065Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0261254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0261313Z return mod(**inputs) 2025-08-14T21:32:56.0261553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0261612Z outputs = self.model( 2025-08-14T21:32:56.0261843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0261920Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0262149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0262213Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0262450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0262541Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0262774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0262871Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0263094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0263172Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0263189Z 2025-08-14T21:32:56.0263283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0263470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0263528Z return mod(**inputs) 2025-08-14T21:32:56.0263757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0263827Z outputs = self.model( 2025-08-14T21:32:56.0264052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0264119Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0264355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0264419Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0264627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0264773Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0265004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0265111Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0265339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0265424Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0265428Z 2025-08-14T21:32:56.0265500Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0265573Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0265653Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0265723Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0265818Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0266005Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0266067Z return mod(**inputs) 2025-08-14T21:32:56.0266303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0266365Z outputs = self.model( 2025-08-14T21:32:56.0266596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0266670Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0266897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0266961Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0267168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0267239Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0267472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0267569Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0267796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0267937Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0268208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0268332Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0268344Z 2025-08-14T21:32:56.0268436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0268616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0268683Z return mod(**inputs) 2025-08-14T21:32:56.0268929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0268990Z outputs = self.model( 2025-08-14T21:32:56.0269224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0269295Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0269531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0269595Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0269795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0269874Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0270096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0270194Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0270427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0270515Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0270789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0270888Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0270892Z 2025-08-14T21:32:56.0270984Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0271175Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0271234Z return mod(**inputs) 2025-08-14T21:32:56.0271472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0271534Z outputs = self.model( 2025-08-14T21:32:56.0271762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0271836Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0272065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0272131Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0272339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0272409Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0272639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0272733Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0272957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0273040Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0273044Z 2025-08-14T21:32:56.0273134Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0273320Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0273426Z return mod(**inputs) 2025-08-14T21:32:56.0273657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0273725Z outputs = self.model( 2025-08-14T21:32:56.0273952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0274018Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0274252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0274362Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0274571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0274641Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0274865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0274984Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0274987Z 2025-08-14T21:32:56.0275081Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0275260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0275324Z return mod(**inputs) 2025-08-14T21:32:56.0275549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0275615Z outputs = self.model( 2025-08-14T21:32:56.0275842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0275908Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0276140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0276208Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0276415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0276486Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0276711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0276824Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0277018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0277082Z return self.act(input) 2025-08-14T21:32:56.0277085Z 2025-08-14T21:32:56.0277187Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0277366Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0277430Z return mod(**inputs) 2025-08-14T21:32:56.0277663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0277722Z outputs = self.model( 2025-08-14T21:32:56.0277953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0278017Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0278241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0278311Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0278512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0278588Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0278810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0278932Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0278936Z 2025-08-14T21:32:56.0279037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0279219Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0279284Z return mod(**inputs) 2025-08-14T21:32:56.0279510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0279569Z outputs = self.model( 2025-08-14T21:32:56.0279800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0279881Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0280112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0280183Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0280387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0280467Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0280694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0280782Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0281017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0281154Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0281159Z 2025-08-14T21:32:56.0281261Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0281441Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0281499Z return mod(**inputs) 2025-08-14T21:32:56.0281740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0281802Z outputs = self.model( 2025-08-14T21:32:56.0282030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0282103Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0282330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0282402Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0282602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0282674Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0282913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0283004Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0283232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0283312Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0283315Z 2025-08-14T21:32:56.0283408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0283595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0283654Z return mod(**inputs) 2025-08-14T21:32:56.0283886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0283956Z outputs = self.model( 2025-08-14T21:32:56.0284186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0284257Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0284515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0284786Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0285008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0285080Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0285308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0285409Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0285684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0285772Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0285775Z 2025-08-14T21:32:56.0285849Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0285923Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0286006Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0286075Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0286169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0286357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0286415Z return mod(**inputs) 2025-08-14T21:32:56.0286647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0286708Z outputs = self.model( 2025-08-14T21:32:56.0286933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0287011Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0287239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0287315Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0287517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0287589Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0287823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0287911Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0288136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0288235Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0288502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0288630Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0288633Z 2025-08-14T21:32:56.0288730Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0288913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0288980Z return mod(**inputs) 2025-08-14T21:32:56.0289208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0289276Z outputs = self.model( 2025-08-14T21:32:56.0289502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0289569Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0289803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0289867Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0290111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0290213Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0290436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0290529Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0290752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0290838Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0291110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0291227Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0291230Z 2025-08-14T21:32:56.0291331Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0291517Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0291577Z return mod(**inputs) 2025-08-14T21:32:56.0291815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0291876Z outputs = self.model( 2025-08-14T21:32:56.0292104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0292178Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0292406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0292479Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0292680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0292752Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0292988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0293078Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0293301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0293383Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0293386Z 2025-08-14T21:32:56.0293478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0293666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0293725Z return mod(**inputs) 2025-08-14T21:32:56.0293952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0294020Z outputs = self.model( 2025-08-14T21:32:56.0294249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0294324Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0294551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0294616Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0294822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0294895Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0295119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0295226Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0295451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0295596Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0295645Z 2025-08-14T21:32:56.0295742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0295922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0295988Z return mod(**inputs) 2025-08-14T21:32:56.0296214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0296282Z outputs = self.model( 2025-08-14T21:32:56.0296507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0296588Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0296824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0296887Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0297093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0297173Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0297397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0297501Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0297725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0297796Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0297800Z 2025-08-14T21:32:56.0297900Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0298081Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0298140Z return mod(**inputs) 2025-08-14T21:32:56.0298377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0298438Z outputs = self.model( 2025-08-14T21:32:56.0298675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0298740Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0298969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0299040Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0299244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0299323Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0299550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0299646Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0299880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0299959Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0299962Z 2025-08-14T21:32:56.0300032Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0300111Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0300178Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0300252Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0300344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0300527Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0300596Z return mod(**inputs) 2025-08-14T21:32:56.0300825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0300886Z outputs = self.model( 2025-08-14T21:32:56.0301162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0301244Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0301480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0301541Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0301744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0301819Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0302048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0302161Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0302392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0302483Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0302754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0302874Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0302878Z 2025-08-14T21:32:56.0302967Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0303153Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0303210Z return mod(**inputs) 2025-08-14T21:32:56.0303440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0303500Z outputs = self.model( 2025-08-14T21:32:56.0303726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0303795Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0304025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0304090Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0304297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0304366Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0304596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0304749Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0304984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0305077Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0305346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0305451Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0305454Z 2025-08-14T21:32:56.0305545Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0305726Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0305791Z return mod(**inputs) 2025-08-14T21:32:56.0306018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0306077Z outputs = self.model( 2025-08-14T21:32:56.0306313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0306380Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0306609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0306723Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0306925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0306998Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0307218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0307308Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0307536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0307625Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0307629Z 2025-08-14T21:32:56.0307722Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0307901Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0307960Z return mod(**inputs) 2025-08-14T21:32:56.0308201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0308260Z outputs = self.model( 2025-08-14T21:32:56.0308493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0308558Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0308780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0308849Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0309049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0309119Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0309348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0309462Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0309466Z 2025-08-14T21:32:56.0309564Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0309745Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0309803Z return mod(**inputs) 2025-08-14T21:32:56.0310034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0310102Z outputs = self.model( 2025-08-14T21:32:56.0310327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0310402Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0310628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0310693Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0310906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0310974Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0311204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0311309Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0311502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0311571Z return self.act(input) 2025-08-14T21:32:56.0311575Z 2025-08-14T21:32:56.0311668Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0311852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0311911Z return mod(**inputs) 2025-08-14T21:32:56.0312164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0312243Z outputs = self.model( 2025-08-14T21:32:56.0312469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0312532Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0312765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0312829Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0313032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0313117Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0313343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0313423Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0313430Z 2025-08-14T21:32:56.0313522Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0313701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0313767Z return mod(**inputs) 2025-08-14T21:32:56.0313994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0314058Z outputs = self.model( 2025-08-14T21:32:56.0314286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0314352Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0314587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0314649Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0314856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0314928Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0315152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0315247Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0315474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0315611Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0315623Z 2025-08-14T21:32:56.0315715Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0315894Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0315956Z return mod(**inputs) 2025-08-14T21:32:56.0316184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0316244Z outputs = self.model( 2025-08-14T21:32:56.0316481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0316544Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0316777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0316839Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0317040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0317116Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0317340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0317428Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0317699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0317775Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0317778Z 2025-08-14T21:32:56.0317875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0318056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0318114Z return mod(**inputs) 2025-08-14T21:32:56.0318345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0318420Z outputs = self.model( 2025-08-14T21:32:56.0318647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0318717Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0318944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0319016Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0319215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0319284Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0319513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0319600Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0319827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0319905Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0319908Z 2025-08-14T21:32:56.0319979Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0320054Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0320124Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0320193Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0320292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0320469Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0320533Z return mod(**inputs) 2025-08-14T21:32:56.0320759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0320818Z outputs = self.model( 2025-08-14T21:32:56.0321054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0321121Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0321347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0321419Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0321621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0321698Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0321920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0322007Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0322237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0322325Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0322592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0322720Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0322724Z 2025-08-14T21:32:56.0322850Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0323060Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0323120Z return mod(**inputs) 2025-08-14T21:32:56.0323347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0323413Z outputs = self.model( 2025-08-14T21:32:56.0323640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0323711Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0323954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0324018Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0324222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0324295Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0324521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0324615Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0324836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0324929Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0325194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0325292Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0325296Z 2025-08-14T21:32:56.0325394Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0325570Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0325637Z return mod(**inputs) 2025-08-14T21:32:56.0325864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0325924Z outputs = self.model( 2025-08-14T21:32:56.0326156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0326222Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0326447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0326518Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0326723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0326798Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0327023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0327110Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0327338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0327410Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0327413Z 2025-08-14T21:32:56.0327511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0327689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0327746Z return mod(**inputs) 2025-08-14T21:32:56.0327981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0328041Z outputs = self.model( 2025-08-14T21:32:56.0328267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0328364Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0328607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0328676Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0328882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0328953Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0329185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0329283Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0329525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0329667Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0329671Z 2025-08-14T21:32:56.0329766Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0329951Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0330010Z return mod(**inputs) 2025-08-14T21:32:56.0330237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0330303Z outputs = self.model( 2025-08-14T21:32:56.0330529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0330599Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0330826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0330889Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0331096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0331169Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0331394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0331498Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0331723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0331800Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0331803Z 2025-08-14T21:32:56.0331894Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0332077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0332143Z return mod(**inputs) 2025-08-14T21:32:56.0332372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0332439Z outputs = self.model( 2025-08-14T21:32:56.0332667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0332731Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0332962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0333025Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0333225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0333302Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0333527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0333629Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0333886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0333979Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0333983Z 2025-08-14T21:32:56.0334064Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0334136Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0334205Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0334283Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0334375Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0334563Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0334638Z return mod(**inputs) 2025-08-14T21:32:56.0334867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0334932Z outputs = self.model( 2025-08-14T21:32:56.0335163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0335231Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0335467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0335533Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0335746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0335818Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0336043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0336149Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0336372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0336462Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0336742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0336860Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0336864Z 2025-08-14T21:32:56.0336963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0337145Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0337202Z return mod(**inputs) 2025-08-14T21:32:56.0337436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0337496Z outputs = self.model( 2025-08-14T21:32:56.0337729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0337794Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0338024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0338095Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0338296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0338367Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0338597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0338693Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0338920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0339009Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0339275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0339420Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0339423Z 2025-08-14T21:32:56.0339516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0339705Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0339763Z return mod(**inputs) 2025-08-14T21:32:56.0339992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0340059Z outputs = self.model( 2025-08-14T21:32:56.0340283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0340371Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0340611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0340676Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0340892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0340963Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0341192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0341294Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0341522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0341601Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0341606Z 2025-08-14T21:32:56.0341698Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0341880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0341944Z return mod(**inputs) 2025-08-14T21:32:56.0342178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0342238Z outputs = self.model( 2025-08-14T21:32:56.0342473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0342536Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0342773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0342837Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0343039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0343115Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0343341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0343451Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0343461Z 2025-08-14T21:32:56.0343552Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0343734Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0343796Z return mod(**inputs) 2025-08-14T21:32:56.0344023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0344082Z outputs = self.model( 2025-08-14T21:32:56.0344316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0344381Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0344614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0344738Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0344978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0345072Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0345299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0345406Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0345607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0345667Z return self.act(input) 2025-08-14T21:32:56.0345671Z 2025-08-14T21:32:56.0345785Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0345964Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0346023Z return mod(**inputs) 2025-08-14T21:32:56.0346264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0346325Z outputs = self.model( 2025-08-14T21:32:56.0346562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0346626Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0346851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0346922Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0347121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0347194Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0347425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0347496Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0347500Z 2025-08-14T21:32:56.0347599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0347780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0347838Z return mod(**inputs) 2025-08-14T21:32:56.0348072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0348130Z outputs = self.model( 2025-08-14T21:32:56.0348358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0348428Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0348656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0348725Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0348925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0348999Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0349229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0349316Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0349548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0349685Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0349688Z 2025-08-14T21:32:56.0349780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0349971Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0350028Z return mod(**inputs) 2025-08-14T21:32:56.0350256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0350369Z outputs = self.model( 2025-08-14T21:32:56.0350597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0350668Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0350889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0350950Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0351158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0351244Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0351468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0351563Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0351788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0351870Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0351873Z 2025-08-14T21:32:56.0351961Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0352139Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0352203Z return mod(**inputs) 2025-08-14T21:32:56.0352425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0352489Z outputs = self.model( 2025-08-14T21:32:56.0352714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0352779Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0353010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0353078Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0353279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0353353Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0353574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0353669Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0353888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0353966Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0353969Z 2025-08-14T21:32:56.0354046Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0354116Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0354183Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0354260Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0354350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0354534Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0354591Z return mod(**inputs) 2025-08-14T21:32:56.0354816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0354882Z outputs = self.model( 2025-08-14T21:32:56.0355107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0355172Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0355400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0355463Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0355697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0355781Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0356007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0356099Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0356321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0356412Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0356679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0356813Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0356817Z 2025-08-14T21:32:56.0356914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0357097Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0357157Z return mod(**inputs) 2025-08-14T21:32:56.0357395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0357455Z outputs = self.model( 2025-08-14T21:32:56.0357689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0357754Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0357981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0358051Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0358251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0358327Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0358554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0358641Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0358873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0358957Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0359221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0359324Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0359329Z 2025-08-14T21:32:56.0359419Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0359605Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0359665Z return mod(**inputs) 2025-08-14T21:32:56.0359898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0359967Z outputs = self.model( 2025-08-14T21:32:56.0360194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0360265Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0360494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0360556Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0360766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0360836Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0361062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0361187Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0361463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0361544Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0361547Z 2025-08-14T21:32:56.0361636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0361812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0361876Z return mod(**inputs) 2025-08-14T21:32:56.0362101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0362177Z outputs = self.model( 2025-08-14T21:32:56.0362408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0362473Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0362709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0362775Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0362971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0363049Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0363272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0363378Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0363604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0363739Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0363743Z 2025-08-14T21:32:56.0363840Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0364026Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0364085Z return mod(**inputs) 2025-08-14T21:32:56.0364318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0364377Z outputs = self.model( 2025-08-14T21:32:56.0364607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0364673Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0364899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0364971Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0365169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0365248Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0365471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0365567Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0365798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0365870Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0365873Z 2025-08-14T21:32:56.0365964Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0366148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0366209Z return mod(**inputs) 2025-08-14T21:32:56.0366438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0366498Z outputs = self.model( 2025-08-14T21:32:56.0366749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0366834Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0367059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0367123Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0367329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0367398Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0367644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0367738Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0367959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0368047Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0368050Z 2025-08-14T21:32:56.0368121Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0368199Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0368266Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0368335Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0368433Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0368610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0368666Z return mod(**inputs) 2025-08-14T21:32:56.0368900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0368959Z outputs = self.model( 2025-08-14T21:32:56.0369190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0369258Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0369482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0369552Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0369748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0369818Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0370046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0370141Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0370370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0370456Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0370725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0370853Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0370857Z 2025-08-14T21:32:56.0370948Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0371133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0371191Z return mod(**inputs) 2025-08-14T21:32:56.0371415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0371483Z outputs = self.model( 2025-08-14T21:32:56.0371707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0371772Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0372028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0372105Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0372312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0372381Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0372607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0372707Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0372932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0373035Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0373305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0373402Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0373406Z 2025-08-14T21:32:56.0373507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0373685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0373743Z return mod(**inputs) 2025-08-14T21:32:56.0373976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0374036Z outputs = self.model( 2025-08-14T21:32:56.0374267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0374333Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0374558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0374628Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0374829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0374900Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0375128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0375223Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0375450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0375521Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0375526Z 2025-08-14T21:32:56.0375616Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0375803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0375860Z return mod(**inputs) 2025-08-14T21:32:56.0376094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0376157Z outputs = self.model( 2025-08-14T21:32:56.0376384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0376456Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0376681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0376744Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0376949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0377021Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0377251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0377358Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0377401Z 2025-08-14T21:32:56.0377497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0377687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0377747Z return mod(**inputs) 2025-08-14T21:32:56.0377974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0378041Z outputs = self.model( 2025-08-14T21:32:56.0378267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0378357Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0378589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0378654Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0378869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0378943Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0379180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0379289Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0379490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0379559Z return self.act(input) 2025-08-14T21:32:56.0379562Z 2025-08-14T21:32:56.0379654Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0379839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0379904Z return mod(**inputs) 2025-08-14T21:32:56.0380137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0380208Z outputs = self.model( 2025-08-14T21:32:56.0380443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0380508Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0380746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0380809Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0381013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0381092Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0381323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0381402Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0381406Z 2025-08-14T21:32:56.0381498Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0381685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0381751Z return mod(**inputs) 2025-08-14T21:32:56.0381984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0382047Z outputs = self.model( 2025-08-14T21:32:56.0382280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0382345Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0382585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0382648Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0382854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0382964Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0383205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0383301Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0383525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0383661Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0383665Z 2025-08-14T21:32:56.0383764Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0383974Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0384038Z return mod(**inputs) 2025-08-14T21:32:56.0384265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0384325Z outputs = self.model( 2025-08-14T21:32:56.0384752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0384833Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0385062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0385134Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0385334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0385415Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0385658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0385750Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0386056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0386130Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0386134Z 2025-08-14T21:32:56.0386235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0386415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0386474Z return mod(**inputs) 2025-08-14T21:32:56.0386709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0386768Z outputs = self.model( 2025-08-14T21:32:56.0387000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0387076Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0387306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0387381Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0387584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0387653Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0387888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0387975Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0388199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0388281Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0388284Z 2025-08-14T21:32:56.0388357Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0388433Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0388501Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0388568Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0388722Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0388924Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0388982Z return mod(**inputs) 2025-08-14T21:32:56.0389220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0389280Z outputs = self.model( 2025-08-14T21:32:56.0389515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0389582Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0389840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0389914Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0390116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0390192Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0390424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0390512Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0390745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0390834Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0391095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0391223Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0391227Z 2025-08-14T21:32:56.0391318Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0391504Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0391564Z return mod(**inputs) 2025-08-14T21:32:56.0391789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0391859Z outputs = self.model( 2025-08-14T21:32:56.0392082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0392147Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0392378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0392441Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0392647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0392718Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0392943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0393037Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0393261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0393352Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0393613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0393709Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0393714Z 2025-08-14T21:32:56.0393813Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0393995Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0394052Z return mod(**inputs) 2025-08-14T21:32:56.0394313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0394389Z outputs = self.model( 2025-08-14T21:32:56.0394629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0394694Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0394924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0394993Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0395199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0395294Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0395517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 413, in forward 2025-08-14T21:32:56.0395602Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:32:56.0395836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0395911Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0395914Z 2025-08-14T21:32:56.0396006Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0396195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0396254Z return mod(**inputs) 2025-08-14T21:32:56.0396487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0396547Z outputs = self.model( 2025-08-14T21:32:56.0396772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0396843Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0397068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0397134Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0397338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0397406Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0397636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0397732Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0397954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 216, in forward 2025-08-14T21:32:56.0398100Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:32:56.0398103Z 2025-08-14T21:32:56.0398196Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0398383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0398441Z return mod(**inputs) 2025-08-14T21:32:56.0398669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0398734Z outputs = self.model( 2025-08-14T21:32:56.0398961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0399026Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0399258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0399323Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0399529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0399599Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0399850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0399972Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0400193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 235, in forward 2025-08-14T21:32:56.0400272Z key_states = self.k_proj(current_states) 2025-08-14T21:32:56.0400275Z 2025-08-14T21:32:56.0400366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0400542Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0400624Z return mod(**inputs) 2025-08-14T21:32:56.0400851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0400910Z outputs = self.model( 2025-08-14T21:32:56.0401146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0401211Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0401442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0401505Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0401703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0401781Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0402000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0402095Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0402325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 236, in forward 2025-08-14T21:32:56.0402404Z value_states = self.v_proj(current_states) 2025-08-14T21:32:56.0402408Z 2025-08-14T21:32:56.0402488Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0402559Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0402628Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0402702Z cudagraph partition due to non gpu ops 2025-08-14T21:32:56.0402795Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0402974Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0403041Z return mod(**inputs) 2025-08-14T21:32:56.0403267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0403336Z outputs = self.model( 2025-08-14T21:32:56.0403563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0403627Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0403864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0403927Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0404134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0404204Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0404426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0404528Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0404752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0404840Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0405145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:32:56.0405280Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:32:56.0405284Z 2025-08-14T21:32:56.0405380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0405559Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0405617Z return mod(**inputs) 2025-08-14T21:32:56.0405849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0405909Z outputs = self.model( 2025-08-14T21:32:56.0406161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0406228Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0406452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0406526Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0406725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0406795Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0407027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0407123Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0407355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 254, in forward 2025-08-14T21:32:56.0407443Z attn_output, attn_weights = attention_interface( 2025-08-14T21:32:56.0407707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:32:56.0407810Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:32:56.0407813Z 2025-08-14T21:32:56.0407910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0408098Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0408158Z return mod(**inputs) 2025-08-14T21:32:56.0408384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0408451Z outputs = self.model( 2025-08-14T21:32:56.0408677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0408744Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0408976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0409041Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0409251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0409323Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0409546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 430, in forward 2025-08-14T21:32:56.0409648Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:32:56.0409869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 268, in forward 2025-08-14T21:32:56.0409941Z attn_output = self.out_proj(attn_output) 2025-08-14T21:32:56.0409949Z 2025-08-14T21:32:56.0410041Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0410224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0410287Z return mod(**inputs) 2025-08-14T21:32:56.0410512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0410628Z outputs = self.model( 2025-08-14T21:32:56.0410860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0410923Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0411156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0411219Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0411428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0411527Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0411765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0411878Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0411881Z 2025-08-14T21:32:56.0411986Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0412180Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0412247Z return mod(**inputs) 2025-08-14T21:32:56.0412483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0412544Z outputs = self.model( 2025-08-14T21:32:56.0412788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0412855Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0413095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0413168Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0413377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0413464Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0413701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 445, in forward 2025-08-14T21:32:56.0413812Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:32:56.0414021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:32:56.0414086Z return self.act(input) 2025-08-14T21:32:56.0414090Z 2025-08-14T21:32:56.0414194Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0414385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0414446Z return mod(**inputs) 2025-08-14T21:32:56.0414689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1471, in forward 2025-08-14T21:32:56.0414752Z outputs = self.model( 2025-08-14T21:32:56.0414994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1288, in forward 2025-08-14T21:32:56.0415071Z decoder_outputs = self.decoder( 2025-08-14T21:32:56.0415309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1115, in forward 2025-08-14T21:32:56.0415384Z layer_outputs = decoder_layer( 2025-08-14T21:32:56.0415595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:32:56.0415670Z return super().__call__(*args, **kwargs) 2025-08-14T21:32:56.0415915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 447, in forward 2025-08-14T21:32:56.0415991Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:32:56.0415995Z 2025-08-14T21:32:56.0416098Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0416314Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0416393Z return mod(**inputs) 2025-08-14T21:32:56.0416652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1490, in forward 2025-08-14T21:32:56.0416728Z lm_logits = self.lm_head(outputs[0]) 2025-08-14T21:32:56.0416732Z 2025-08-14T21:32:56.0416828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:32:56.0417029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:32:56.0417090Z return mod(**inputs) 2025-08-14T21:32:56.0417356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bart/modeling_bart.py", line 1497, in forward 2025-08-14T21:32:56.0417519Z masked_lm_loss = loss_fct(lm_logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:32:56.0417523Z 2025-08-14T21:33:07.0950624Z Compilation time (from dynamo_timed): 24.061986546 2025-08-14T21:33:07.1035255Z pass 2025-08-14T21:33:07.1037145Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:07.1037998Z TIMING: _recursive_pre_grad_passes:0.01204 _recursive_joint_graph_passes:1.01418 _recursive_post_grad_passes:0.16018 async_compile.wait:0.75497 code_gen:9.56839 inductor_compile:12.25856 backend_compile:18.8269 gc:0.0004 entire_frame_compile:24.06199 total_wall_time:24.06199 2025-08-14T21:33:07.1041977Z STATS: call_* op count: 980 | FakeTensorMode.__torch_dispatch__:33505 | FakeTensor.__torch_dispatch__:11921 | ProxyTorchDispatchMode.__torch_dispatch__:12370 2025-08-14T21:33:07.1046380Z Dynamo produced 1 graphs covering 980 ops with 0 graph breaks (0 unique) 2025-08-14T21:33:11.4725414Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:33:11.4726274Z from pkg_resources import resource_filename 2025-08-14T21:33:12.0083035Z 2025-08-14T21:33:13.2580330Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:33:13.2581710Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:33:13.2593389Z cpu eval BertForMaskedLM 2025-08-14T21:33:13.6527463Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:13.8319015Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:14.0090178Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:20.7252252Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7256370Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7260868Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7262658Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7263011Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7267628Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7272005Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7276274Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7276667Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7276946Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7277229Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7277576Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7278370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7278880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7282942Z return mod(**inputs) 2025-08-14T21:33:20.7287885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7288767Z outputs = self.bert( 2025-08-14T21:33:20.7289696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7290220Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7290716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7291112Z layer_outputs = layer_module( 2025-08-14T21:33:20.7291447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7291794Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7292367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7292736Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7293104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7293461Z return func(*args, **kwargs) 2025-08-14T21:33:20.7293800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7294151Z self_outputs = self.self( 2025-08-14T21:33:20.7294491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7294843Z return func(*args, **kwargs) 2025-08-14T21:33:20.7295192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7295716Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7295965Z 2025-08-14T21:33:20.7296076Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7296418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7296726Z return mod(**inputs) 2025-08-14T21:33:20.7297070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7297430Z outputs = self.bert( 2025-08-14T21:33:20.7297760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7298119Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7298470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7298821Z layer_outputs = layer_module( 2025-08-14T21:33:20.7299134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7299467Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7299822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7300174Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7300526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7300865Z return func(*args, **kwargs) 2025-08-14T21:33:20.7301199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7301540Z self_outputs = self.self( 2025-08-14T21:33:20.7301869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7302210Z return func(*args, **kwargs) 2025-08-14T21:33:20.7302540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7302900Z self.key(current_states) 2025-08-14T21:33:20.7303015Z 2025-08-14T21:33:20.7303201Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7303545Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7303847Z return mod(**inputs) 2025-08-14T21:33:20.7304188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7305027Z outputs = self.bert( 2025-08-14T21:33:20.7305364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7305776Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7306125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7306477Z layer_outputs = layer_module( 2025-08-14T21:33:20.7306796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7307134Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7307484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7307840Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7308185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7308529Z return func(*args, **kwargs) 2025-08-14T21:33:20.7308867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7309209Z self_outputs = self.self( 2025-08-14T21:33:20.7309540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7309875Z return func(*args, **kwargs) 2025-08-14T21:33:20.7310239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7310590Z self.value(current_states) 2025-08-14T21:33:20.7310700Z 2025-08-14T21:33:20.7310784Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7311002Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7311336Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7311637Z return mod(**inputs) 2025-08-14T21:33:20.7311965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7312306Z outputs = self.bert( 2025-08-14T21:33:20.7312633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7312989Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7313330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7313679Z layer_outputs = layer_module( 2025-08-14T21:33:20.7313996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7314328Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7314674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7315031Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7315385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7315728Z return func(*args, **kwargs) 2025-08-14T21:33:20.7316056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7316401Z self_outputs = self.self( 2025-08-14T21:33:20.7316761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7317113Z return func(*args, **kwargs) 2025-08-14T21:33:20.7317448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7317855Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7318027Z 2025-08-14T21:33:20.7318131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7318456Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7318777Z return mod(**inputs) 2025-08-14T21:33:20.7319104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7319440Z outputs = self.bert( 2025-08-14T21:33:20.7319766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7320118Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7320459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7320797Z layer_outputs = layer_module( 2025-08-14T21:33:20.7321109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7321438Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7321780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7322138Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7322484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7322822Z return func(*args, **kwargs) 2025-08-14T21:33:20.7323151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7323555Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7323952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7324310Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7324436Z 2025-08-14T21:33:20.7324532Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7324861Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7325162Z return mod(**inputs) 2025-08-14T21:33:20.7325482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7325829Z outputs = self.bert( 2025-08-14T21:33:20.7326159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7326515Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7326848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7327194Z layer_outputs = layer_module( 2025-08-14T21:33:20.7327511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7327833Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7328185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7328545Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7328915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7329304Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7329706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7330134Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7330533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7330891Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7331028Z 2025-08-14T21:33:20.7331130Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7331470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7331779Z return mod(**inputs) 2025-08-14T21:33:20.7332109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7332453Z outputs = self.bert( 2025-08-14T21:33:20.7332782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7333131Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7333477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7333823Z layer_outputs = layer_module( 2025-08-14T21:33:20.7334136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7334469Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7334820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7335176Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7335541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7335909Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7336287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7336712Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7337099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7337484Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7337834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7338143Z return self.act(input) 2025-08-14T21:33:20.7338253Z 2025-08-14T21:33:20.7338349Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7338682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7338984Z return mod(**inputs) 2025-08-14T21:33:20.7339310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7339657Z outputs = self.bert( 2025-08-14T21:33:20.7339985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7340337Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7340676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7341025Z layer_outputs = layer_module( 2025-08-14T21:33:20.7341346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7341673Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7342030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7342442Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7342813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7343167Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7343541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7343972Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7344368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7344822Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7344961Z 2025-08-14T21:33:20.7345060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7345405Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7345705Z return mod(**inputs) 2025-08-14T21:33:20.7346037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7346391Z outputs = self.bert( 2025-08-14T21:33:20.7346723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7347074Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7347421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7347778Z layer_outputs = layer_module( 2025-08-14T21:33:20.7348091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7348428Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7348784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7349144Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7349492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7349834Z return func(*args, **kwargs) 2025-08-14T21:33:20.7350170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7350512Z self_outputs = self.self( 2025-08-14T21:33:20.7350845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7351189Z return func(*args, **kwargs) 2025-08-14T21:33:20.7351527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7352004Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7352254Z 2025-08-14T21:33:20.7352348Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7352680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7352977Z return mod(**inputs) 2025-08-14T21:33:20.7353304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7353651Z outputs = self.bert( 2025-08-14T21:33:20.7353979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7354331Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7354677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7355027Z layer_outputs = layer_module( 2025-08-14T21:33:20.7355381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7355732Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7356086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7356444Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7356784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7357125Z return func(*args, **kwargs) 2025-08-14T21:33:20.7357462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7357830Z self_outputs = self.self( 2025-08-14T21:33:20.7358154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7358499Z return func(*args, **kwargs) 2025-08-14T21:33:20.7358845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7359194Z self.key(current_states) 2025-08-14T21:33:20.7359298Z 2025-08-14T21:33:20.7359393Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7359728Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7360026Z return mod(**inputs) 2025-08-14T21:33:20.7360345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7360692Z outputs = self.bert( 2025-08-14T21:33:20.7361019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7361372Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7361711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7362062Z layer_outputs = layer_module( 2025-08-14T21:33:20.7362382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7362708Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7363064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7363423Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7363773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7364107Z return func(*args, **kwargs) 2025-08-14T21:33:20.7364445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7364793Z self_outputs = self.self( 2025-08-14T21:33:20.7365119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7365461Z return func(*args, **kwargs) 2025-08-14T21:33:20.7365799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7366146Z self.value(current_states) 2025-08-14T21:33:20.7366255Z 2025-08-14T21:33:20.7366329Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7366552Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7366888Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7367184Z return mod(**inputs) 2025-08-14T21:33:20.7367513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7367861Z outputs = self.bert( 2025-08-14T21:33:20.7368221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7368581Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7368928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7369284Z layer_outputs = layer_module( 2025-08-14T21:33:20.7369602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7369940Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7370298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7370674Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7371017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7371358Z return func(*args, **kwargs) 2025-08-14T21:33:20.7371701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7372049Z self_outputs = self.self( 2025-08-14T21:33:20.7372372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7372710Z return func(*args, **kwargs) 2025-08-14T21:33:20.7373047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7373441Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7373619Z 2025-08-14T21:33:20.7373724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7374061Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7374363Z return mod(**inputs) 2025-08-14T21:33:20.7374686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7375036Z outputs = self.bert( 2025-08-14T21:33:20.7375360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7375703Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7376047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7376391Z layer_outputs = layer_module( 2025-08-14T21:33:20.7376708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7377033Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7377386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7377744Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7378097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7378427Z return func(*args, **kwargs) 2025-08-14T21:33:20.7378765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7379163Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7379555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7379921Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7380055Z 2025-08-14T21:33:20.7380150Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7380482Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7380775Z return mod(**inputs) 2025-08-14T21:33:20.7381140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7381507Z outputs = self.bert( 2025-08-14T21:33:20.7381832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7382186Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7382533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7382884Z layer_outputs = layer_module( 2025-08-14T21:33:20.7383197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7383548Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7383897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7384259Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7385031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7385413Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7385800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7386219Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7386618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7386991Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7387120Z 2025-08-14T21:33:20.7387230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7387562Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7387874Z return mod(**inputs) 2025-08-14T21:33:20.7388215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7388562Z outputs = self.bert( 2025-08-14T21:33:20.7388898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7389258Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7389610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7389956Z layer_outputs = layer_module( 2025-08-14T21:33:20.7390287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7390625Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7390980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7391341Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7391717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7392084Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7392461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7392883Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7393282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7393675Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7394024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7394341Z return self.act(input) 2025-08-14T21:33:20.7394445Z 2025-08-14T21:33:20.7394647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7394990Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7395286Z return mod(**inputs) 2025-08-14T21:33:20.7395618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7395972Z outputs = self.bert( 2025-08-14T21:33:20.7396295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7396681Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7397030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7397379Z layer_outputs = layer_module( 2025-08-14T21:33:20.7397691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7398024Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7398372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7398725Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7399093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7399451Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7399825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7400252Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7400654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7401018Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7401147Z 2025-08-14T21:33:20.7401249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7401577Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7401877Z return mod(**inputs) 2025-08-14T21:33:20.7402205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7402542Z outputs = self.bert( 2025-08-14T21:33:20.7402870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7403223Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7403568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7403909Z layer_outputs = layer_module( 2025-08-14T21:33:20.7404235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7404572Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7404921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7405280Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7405637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7405981Z return func(*args, **kwargs) 2025-08-14T21:33:20.7406314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7406662Z self_outputs = self.self( 2025-08-14T21:33:20.7406996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7407330Z return func(*args, **kwargs) 2025-08-14T21:33:20.7407748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7408233Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7408480Z 2025-08-14T21:33:20.7408584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7408913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7409217Z return mod(**inputs) 2025-08-14T21:33:20.7409559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7409922Z outputs = self.bert( 2025-08-14T21:33:20.7410241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7410594Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7410945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7411289Z layer_outputs = layer_module( 2025-08-14T21:33:20.7411609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7411946Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7412299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7412649Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7412999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7413340Z return func(*args, **kwargs) 2025-08-14T21:33:20.7413671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7414020Z self_outputs = self.self( 2025-08-14T21:33:20.7414353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7414688Z return func(*args, **kwargs) 2025-08-14T21:33:20.7415016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7415363Z self.key(current_states) 2025-08-14T21:33:20.7415467Z 2025-08-14T21:33:20.7415570Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7415900Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7416192Z return mod(**inputs) 2025-08-14T21:33:20.7416521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7416866Z outputs = self.bert( 2025-08-14T21:33:20.7417184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7417541Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7417885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7418230Z layer_outputs = layer_module( 2025-08-14T21:33:20.7418540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7418872Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7419225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7419576Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7419925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7420265Z return func(*args, **kwargs) 2025-08-14T21:33:20.7420651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7420994Z self_outputs = self.self( 2025-08-14T21:33:20.7421325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7421669Z return func(*args, **kwargs) 2025-08-14T21:33:20.7421998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7422348Z self.value(current_states) 2025-08-14T21:33:20.7422481Z 2025-08-14T21:33:20.7422558Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7422780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7423104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7423402Z return mod(**inputs) 2025-08-14T21:33:20.7423732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7424072Z outputs = self.bert( 2025-08-14T21:33:20.7424395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7424816Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7425168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7425511Z layer_outputs = layer_module( 2025-08-14T21:33:20.7425834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7426168Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7426522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7426876Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7427228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7427568Z return func(*args, **kwargs) 2025-08-14T21:33:20.7427900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7428250Z self_outputs = self.self( 2025-08-14T21:33:20.7428589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7428930Z return func(*args, **kwargs) 2025-08-14T21:33:20.7429260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7429660Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7429829Z 2025-08-14T21:33:20.7429932Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7430258Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7430554Z return mod(**inputs) 2025-08-14T21:33:20.7430877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7431216Z outputs = self.bert( 2025-08-14T21:33:20.7431531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7431881Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7432223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7432559Z layer_outputs = layer_module( 2025-08-14T21:33:20.7432875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7433237Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7433602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7433952Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7434300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7434639Z return func(*args, **kwargs) 2025-08-14T21:33:20.7434972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7435386Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7435785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7436142Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7436273Z 2025-08-14T21:33:20.7436373Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7436709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7437011Z return mod(**inputs) 2025-08-14T21:33:20.7437340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7437682Z outputs = self.bert( 2025-08-14T21:33:20.7438011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7438371Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7438711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7439059Z layer_outputs = layer_module( 2025-08-14T21:33:20.7439377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7439711Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7440058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7440419Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7440791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7441156Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7441525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7441949Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7442341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7442693Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7442828Z 2025-08-14T21:33:20.7442926Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7443258Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7443581Z return mod(**inputs) 2025-08-14T21:33:20.7443900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7444247Z outputs = self.bert( 2025-08-14T21:33:20.7444574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7444918Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7445265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7445614Z layer_outputs = layer_module( 2025-08-14T21:33:20.7445960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7446306Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7446668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7447039Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7447415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7447779Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7448160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7448598Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7448985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7449381Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7449739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7450059Z return self.act(input) 2025-08-14T21:33:20.7450164Z 2025-08-14T21:33:20.7450262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7450601Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7450909Z return mod(**inputs) 2025-08-14T21:33:20.7451242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7451590Z outputs = self.bert( 2025-08-14T21:33:20.7451923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7452281Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7452628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7452985Z layer_outputs = layer_module( 2025-08-14T21:33:20.7453311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7453649Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7453998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7454365Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7454738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7455101Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7455482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7455925Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7456335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7456692Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7456826Z 2025-08-14T21:33:20.7456925Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7457265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7457569Z return mod(**inputs) 2025-08-14T21:33:20.7457895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7458248Z outputs = self.bert( 2025-08-14T21:33:20.7458579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7458929Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7459318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7459681Z layer_outputs = layer_module( 2025-08-14T21:33:20.7460002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7460327Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7460681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7461041Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7461408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7461754Z return func(*args, **kwargs) 2025-08-14T21:33:20.7462094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7462445Z self_outputs = self.self( 2025-08-14T21:33:20.7462773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7463119Z return func(*args, **kwargs) 2025-08-14T21:33:20.7463465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7463941Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7464185Z 2025-08-14T21:33:20.7464282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7464620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7464991Z return mod(**inputs) 2025-08-14T21:33:20.7465317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7465667Z outputs = self.bert( 2025-08-14T21:33:20.7466006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7466364Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7466706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7467058Z layer_outputs = layer_module( 2025-08-14T21:33:20.7467382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7467712Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7468073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7468437Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7468789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7469128Z return func(*args, **kwargs) 2025-08-14T21:33:20.7469469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7469820Z self_outputs = self.self( 2025-08-14T21:33:20.7470148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7470494Z return func(*args, **kwargs) 2025-08-14T21:33:20.7470833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7471185Z self.key(current_states) 2025-08-14T21:33:20.7471291Z 2025-08-14T21:33:20.7471386Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7471722Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7472019Z return mod(**inputs) 2025-08-14T21:33:20.7472375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7472751Z outputs = self.bert( 2025-08-14T21:33:20.7473079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7473431Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7473770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7474118Z layer_outputs = layer_module( 2025-08-14T21:33:20.7474434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7474784Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7475130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7475490Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7475840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7476176Z return func(*args, **kwargs) 2025-08-14T21:33:20.7476515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7476863Z self_outputs = self.self( 2025-08-14T21:33:20.7477192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7477526Z return func(*args, **kwargs) 2025-08-14T21:33:20.7477866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7478215Z self.value(current_states) 2025-08-14T21:33:20.7478322Z 2025-08-14T21:33:20.7478396Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7478621Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7478957Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7479257Z return mod(**inputs) 2025-08-14T21:33:20.7479580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7479925Z outputs = self.bert( 2025-08-14T21:33:20.7480250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7480591Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7480940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7481289Z layer_outputs = layer_module( 2025-08-14T21:33:20.7481603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7481929Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7482280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7482638Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7482987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7483321Z return func(*args, **kwargs) 2025-08-14T21:33:20.7483658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7484009Z self_outputs = self.self( 2025-08-14T21:33:20.7484332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7484806Z return func(*args, **kwargs) 2025-08-14T21:33:20.7485213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7485646Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7485814Z 2025-08-14T21:33:20.7485912Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7486246Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7486548Z return mod(**inputs) 2025-08-14T21:33:20.7486876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7487230Z outputs = self.bert( 2025-08-14T21:33:20.7487587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7487941Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7488275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7488624Z layer_outputs = layer_module( 2025-08-14T21:33:20.7488938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7489267Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7489612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7489967Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7490313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7490649Z return func(*args, **kwargs) 2025-08-14T21:33:20.7490984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7491382Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7491778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7492127Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7492258Z 2025-08-14T21:33:20.7492352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7492680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7492972Z return mod(**inputs) 2025-08-14T21:33:20.7493300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7493644Z outputs = self.bert( 2025-08-14T21:33:20.7493967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7494306Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7494653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7494999Z layer_outputs = layer_module( 2025-08-14T21:33:20.7495304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7495630Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7495977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7496330Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7496687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7497050Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7497424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7497839Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7498269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7498634Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7498760Z 2025-08-14T21:33:20.7498863Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7499192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7499494Z return mod(**inputs) 2025-08-14T21:33:20.7499824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7500205Z outputs = self.bert( 2025-08-14T21:33:20.7500526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7500878Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7501227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7501576Z layer_outputs = layer_module( 2025-08-14T21:33:20.7501890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7502220Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7502581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7502934Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7503302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7503666Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7504045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7504461Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7504912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7505307Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7505654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7505970Z return self.act(input) 2025-08-14T21:33:20.7506081Z 2025-08-14T21:33:20.7506177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7506513Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7506810Z return mod(**inputs) 2025-08-14T21:33:20.7507143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7507493Z outputs = self.bert( 2025-08-14T21:33:20.7507827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7508176Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7508525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7508878Z layer_outputs = layer_module( 2025-08-14T21:33:20.7509203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7509547Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7509910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7510279Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7510654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7511047Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7511436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7511852Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7512249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7512608Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7512732Z 2025-08-14T21:33:20.7512834Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7513177Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7513476Z return mod(**inputs) 2025-08-14T21:33:20.7513803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7514144Z outputs = self.bert( 2025-08-14T21:33:20.7514470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7514822Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7515164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7515506Z layer_outputs = layer_module( 2025-08-14T21:33:20.7515825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7516155Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7516508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7516859Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7517206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7517555Z return func(*args, **kwargs) 2025-08-14T21:33:20.7517886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7518233Z self_outputs = self.self( 2025-08-14T21:33:20.7518565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7518953Z return func(*args, **kwargs) 2025-08-14T21:33:20.7519297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7519787Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7520025Z 2025-08-14T21:33:20.7520128Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7520462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7520756Z return mod(**inputs) 2025-08-14T21:33:20.7521084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7521434Z outputs = self.bert( 2025-08-14T21:33:20.7521760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7522118Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7522469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7522827Z layer_outputs = layer_module( 2025-08-14T21:33:20.7523145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7523483Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7523870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7524250Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7524615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7524968Z return func(*args, **kwargs) 2025-08-14T21:33:20.7525317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7525665Z self_outputs = self.self( 2025-08-14T21:33:20.7526004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7526373Z return func(*args, **kwargs) 2025-08-14T21:33:20.7526716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7527080Z self.key(current_states) 2025-08-14T21:33:20.7527198Z 2025-08-14T21:33:20.7527300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7527654Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7527960Z return mod(**inputs) 2025-08-14T21:33:20.7528301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7528662Z outputs = self.bert( 2025-08-14T21:33:20.7529006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7529372Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7529737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7530101Z layer_outputs = layer_module( 2025-08-14T21:33:20.7530428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7530778Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7531152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7531526Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7531889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7532246Z return func(*args, **kwargs) 2025-08-14T21:33:20.7532599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7532953Z self_outputs = self.self( 2025-08-14T21:33:20.7533303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7533651Z return func(*args, **kwargs) 2025-08-14T21:33:20.7533996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7534349Z self.value(current_states) 2025-08-14T21:33:20.7534469Z 2025-08-14T21:33:20.7534547Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7534779Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7535112Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7535421Z return mod(**inputs) 2025-08-14T21:33:20.7535757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7536113Z outputs = self.bert( 2025-08-14T21:33:20.7536438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7536795Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7537178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7537549Z layer_outputs = layer_module( 2025-08-14T21:33:20.7537863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7538193Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7538544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7538893Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7539244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7539606Z return func(*args, **kwargs) 2025-08-14T21:33:20.7539950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7540296Z self_outputs = self.self( 2025-08-14T21:33:20.7540632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7540980Z return func(*args, **kwargs) 2025-08-14T21:33:20.7541316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7541726Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7541910Z 2025-08-14T21:33:20.7542010Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7542349Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7542650Z return mod(**inputs) 2025-08-14T21:33:20.7542984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7543337Z outputs = self.bert( 2025-08-14T21:33:20.7543667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7544027Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7544376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7544802Z layer_outputs = layer_module( 2025-08-14T21:33:20.7545117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7545453Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7545809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7546173Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7546519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7546860Z return func(*args, **kwargs) 2025-08-14T21:33:20.7547198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7547598Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7547994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7548350Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7548478Z 2025-08-14T21:33:20.7548583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7548908Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7549209Z return mod(**inputs) 2025-08-14T21:33:20.7549537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7549875Z outputs = self.bert( 2025-08-14T21:33:20.7551327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7551725Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7552078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7552424Z layer_outputs = layer_module( 2025-08-14T21:33:20.7552750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7553082Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7553438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7553817Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7554194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7554563Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7554940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7555365Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7555756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7556113Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7556239Z 2025-08-14T21:33:20.7556334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7556667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7556970Z return mod(**inputs) 2025-08-14T21:33:20.7557291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7557635Z outputs = self.bert( 2025-08-14T21:33:20.7557963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7558317Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7558657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7559007Z layer_outputs = layer_module( 2025-08-14T21:33:20.7559324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7559652Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7559997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7560357Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7560722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7561082Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7561457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7561873Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7562258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7562638Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7562985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7563301Z return self.act(input) 2025-08-14T21:33:20.7563401Z 2025-08-14T21:33:20.7563501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7563827Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7564125Z return mod(**inputs) 2025-08-14T21:33:20.7564479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7564835Z outputs = self.bert( 2025-08-14T21:33:20.7565164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7565518Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7565864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7566206Z layer_outputs = layer_module( 2025-08-14T21:33:20.7566547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7566885Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7567233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7567600Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7567976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7568347Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7568719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7569149Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7569550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7569914Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7570040Z 2025-08-14T21:33:20.7570135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7570467Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7570773Z return mod(**inputs) 2025-08-14T21:33:20.7571098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7571446Z outputs = self.bert( 2025-08-14T21:33:20.7571774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7572128Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7572469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7572820Z layer_outputs = layer_module( 2025-08-14T21:33:20.7573144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7573473Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7573828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7574187Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7574542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7574879Z return func(*args, **kwargs) 2025-08-14T21:33:20.7575218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7575563Z self_outputs = self.self( 2025-08-14T21:33:20.7575889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7576234Z return func(*args, **kwargs) 2025-08-14T21:33:20.7576575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7577051Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7577342Z 2025-08-14T21:33:20.7577440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7577775Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7578073Z return mod(**inputs) 2025-08-14T21:33:20.7578406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7578743Z outputs = self.bert( 2025-08-14T21:33:20.7579069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7579436Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7579778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7580125Z layer_outputs = layer_module( 2025-08-14T21:33:20.7580448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7580783Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7581129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7581486Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7581836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7582168Z return func(*args, **kwargs) 2025-08-14T21:33:20.7582510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7582858Z self_outputs = self.self( 2025-08-14T21:33:20.7583190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7583521Z return func(*args, **kwargs) 2025-08-14T21:33:20.7583866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7584217Z self.key(current_states) 2025-08-14T21:33:20.7584324Z 2025-08-14T21:33:20.7584428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7584972Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7585288Z return mod(**inputs) 2025-08-14T21:33:20.7585634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7585994Z outputs = self.bert( 2025-08-14T21:33:20.7586339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7586697Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7587050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7587406Z layer_outputs = layer_module( 2025-08-14T21:33:20.7587739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7588085Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7588443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7588817Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7589180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7589535Z return func(*args, **kwargs) 2025-08-14T21:33:20.7589875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7590233Z self_outputs = self.self( 2025-08-14T21:33:20.7590632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7591000Z return func(*args, **kwargs) 2025-08-14T21:33:20.7591348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7591712Z self.value(current_states) 2025-08-14T21:33:20.7591825Z 2025-08-14T21:33:20.7591911Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7592140Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7592485Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7592823Z return mod(**inputs) 2025-08-14T21:33:20.7593156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7593511Z outputs = self.bert( 2025-08-14T21:33:20.7593849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7594216Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7594564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7594922Z layer_outputs = layer_module( 2025-08-14T21:33:20.7595246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7595586Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7595940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7596311Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7596676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7597022Z return func(*args, **kwargs) 2025-08-14T21:33:20.7597370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7597730Z self_outputs = self.self( 2025-08-14T21:33:20.7598070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7598412Z return func(*args, **kwargs) 2025-08-14T21:33:20.7598758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7599175Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7599394Z 2025-08-14T21:33:20.7599491Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7599822Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7600125Z return mod(**inputs) 2025-08-14T21:33:20.7600457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7600802Z outputs = self.bert( 2025-08-14T21:33:20.7601128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7601482Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7601820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7602173Z layer_outputs = layer_module( 2025-08-14T21:33:20.7602488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7602820Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7603166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7603523Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7603911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7604267Z return func(*args, **kwargs) 2025-08-14T21:33:20.7604595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7604997Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7605393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7605743Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7605921Z 2025-08-14T21:33:20.7606018Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7606349Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7606648Z return mod(**inputs) 2025-08-14T21:33:20.7606971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7607318Z outputs = self.bert( 2025-08-14T21:33:20.7607646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7607990Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7608334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7608680Z layer_outputs = layer_module( 2025-08-14T21:33:20.7608996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7609320Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7609670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7610029Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7610399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7610754Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7611128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7611546Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7611928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7612289Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7612421Z 2025-08-14T21:33:20.7612518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7612848Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7613142Z return mod(**inputs) 2025-08-14T21:33:20.7613471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7613820Z outputs = self.bert( 2025-08-14T21:33:20.7614139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7614496Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7614842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7615191Z layer_outputs = layer_module( 2025-08-14T21:33:20.7615503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7615838Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7616194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7616583Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7616972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7617344Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7617728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7618146Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7618542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7618943Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7619299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7619605Z return self.act(input) 2025-08-14T21:33:20.7619716Z 2025-08-14T21:33:20.7619816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7620154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7620455Z return mod(**inputs) 2025-08-14T21:33:20.7620779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7621127Z outputs = self.bert( 2025-08-14T21:33:20.7621452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7621796Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7622143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7622490Z layer_outputs = layer_module( 2025-08-14T21:33:20.7622811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7623140Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7623495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7623856Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7624218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7624577Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7625021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7625464Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7625863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7626226Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7626359Z 2025-08-14T21:33:20.7626463Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7626800Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7627099Z return mod(**inputs) 2025-08-14T21:33:20.7627433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7627784Z outputs = self.bert( 2025-08-14T21:33:20.7628105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7628465Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7628816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7629172Z layer_outputs = layer_module( 2025-08-14T21:33:20.7629518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7629869Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7630219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7630571Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7630928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7631271Z return func(*args, **kwargs) 2025-08-14T21:33:20.7631614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7631974Z self_outputs = self.self( 2025-08-14T21:33:20.7632312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7632664Z return func(*args, **kwargs) 2025-08-14T21:33:20.7633000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7633488Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7633739Z 2025-08-14T21:33:20.7633836Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7634170Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7634463Z return mod(**inputs) 2025-08-14T21:33:20.7634797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7635144Z outputs = self.bert( 2025-08-14T21:33:20.7635473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7635819Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7636171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7636523Z layer_outputs = layer_module( 2025-08-14T21:33:20.7636833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7637170Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7637524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7637882Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7638226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7638299Z return func(*args, **kwargs) 2025-08-14T21:33:20.7638523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7638587Z self_outputs = self.self( 2025-08-14T21:33:20.7638821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7638883Z return func(*args, **kwargs) 2025-08-14T21:33:20.7639117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7639182Z self.key(current_states) 2025-08-14T21:33:20.7639185Z 2025-08-14T21:33:20.7639281Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7639474Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7639534Z return mod(**inputs) 2025-08-14T21:33:20.7639769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7639830Z outputs = self.bert( 2025-08-14T21:33:20.7640126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7640219Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7640450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7640515Z layer_outputs = layer_module( 2025-08-14T21:33:20.7640729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7640801Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7641036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7641128Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7641352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7641425Z return func(*args, **kwargs) 2025-08-14T21:33:20.7641653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7641717Z self_outputs = self.self( 2025-08-14T21:33:20.7641946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7642007Z return func(*args, **kwargs) 2025-08-14T21:33:20.7642239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7642306Z self.value(current_states) 2025-08-14T21:33:20.7642309Z 2025-08-14T21:33:20.7642386Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7642488Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7642671Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7642730Z return mod(**inputs) 2025-08-14T21:33:20.7642970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7643029Z outputs = self.bert( 2025-08-14T21:33:20.7643265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7643331Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7643559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7643631Z layer_outputs = layer_module( 2025-08-14T21:33:20.7643833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7643912Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7644136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7644210Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7644441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7644503Z return func(*args, **kwargs) 2025-08-14T21:33:20.7644727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7644798Z self_outputs = self.self( 2025-08-14T21:33:20.7645015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7645084Z return func(*args, **kwargs) 2025-08-14T21:33:20.7645311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7645432Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7645437Z 2025-08-14T21:33:20.7645538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7645776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7645838Z return mod(**inputs) 2025-08-14T21:33:20.7646076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7646135Z outputs = self.bert( 2025-08-14T21:33:20.7646375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7646442Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7646667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7646756Z layer_outputs = layer_module( 2025-08-14T21:33:20.7646958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7647037Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7647264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7647339Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7647568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7647630Z return func(*args, **kwargs) 2025-08-14T21:33:20.7647854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7647981Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7648206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7648290Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7648294Z 2025-08-14T21:33:20.7648385Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7648571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7648638Z return mod(**inputs) 2025-08-14T21:33:20.7648866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7648933Z outputs = self.bert( 2025-08-14T21:33:20.7649159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7649225Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7649454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7649520Z layer_outputs = layer_module( 2025-08-14T21:33:20.7649720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7649800Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7650025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7650109Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7650348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7650417Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7650678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7650790Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7651012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7651094Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7651098Z 2025-08-14T21:33:20.7651219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7651425Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7651484Z return mod(**inputs) 2025-08-14T21:33:20.7651711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7651778Z outputs = self.bert( 2025-08-14T21:33:20.7652007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7652080Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7652323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7652386Z layer_outputs = layer_module( 2025-08-14T21:33:20.7652596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7652671Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7652896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7652981Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7653220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7653296Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7653553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7653662Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7653895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7653999Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7654208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7654270Z return self.act(input) 2025-08-14T21:33:20.7654273Z 2025-08-14T21:33:20.7654366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7654555Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7654612Z return mod(**inputs) 2025-08-14T21:33:20.7654842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7654909Z outputs = self.bert( 2025-08-14T21:33:20.7655138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7655211Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7655442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7655507Z layer_outputs = layer_module( 2025-08-14T21:33:20.7655717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7655787Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7656012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7656094Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7656330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7656407Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7656662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7656782Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7657064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7657140Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7657144Z 2025-08-14T21:33:20.7657243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7657428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7657487Z return mod(**inputs) 2025-08-14T21:33:20.7657723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7657800Z outputs = self.bert( 2025-08-14T21:33:20.7658027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7658098Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7658329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7658399Z layer_outputs = layer_module( 2025-08-14T21:33:20.7658600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7658671Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7658900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7658974Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7659195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7659266Z return func(*args, **kwargs) 2025-08-14T21:33:20.7659491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7659563Z self_outputs = self.self( 2025-08-14T21:33:20.7659788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7659851Z return func(*args, **kwargs) 2025-08-14T21:33:20.7660085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7660276Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7660279Z 2025-08-14T21:33:20.7660382Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7660567Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7660627Z return mod(**inputs) 2025-08-14T21:33:20.7660864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7660922Z outputs = self.bert( 2025-08-14T21:33:20.7661154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7661229Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7661457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7661531Z layer_outputs = layer_module( 2025-08-14T21:33:20.7661733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7661804Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7662037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7662109Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7662337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7662428Z return func(*args, **kwargs) 2025-08-14T21:33:20.7662670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7662739Z self_outputs = self.self( 2025-08-14T21:33:20.7662959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7663019Z return func(*args, **kwargs) 2025-08-14T21:33:20.7663250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7663315Z self.key(current_states) 2025-08-14T21:33:20.7663340Z 2025-08-14T21:33:20.7663443Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7663625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7663684Z return mod(**inputs) 2025-08-14T21:33:20.7663922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7663984Z outputs = self.bert( 2025-08-14T21:33:20.7664211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7664287Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7664511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7664582Z layer_outputs = layer_module( 2025-08-14T21:33:20.7664851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7664932Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7665167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7665242Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7665471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7665535Z return func(*args, **kwargs) 2025-08-14T21:33:20.7665759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7665828Z self_outputs = self.self( 2025-08-14T21:33:20.7666048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7666109Z return func(*args, **kwargs) 2025-08-14T21:33:20.7666341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7666406Z self.value(current_states) 2025-08-14T21:33:20.7666410Z 2025-08-14T21:33:20.7666492Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7666588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7666773Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7666840Z return mod(**inputs) 2025-08-14T21:33:20.7667071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7667131Z outputs = self.bert( 2025-08-14T21:33:20.7667367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7667433Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7667665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7667731Z layer_outputs = layer_module( 2025-08-14T21:33:20.7667933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7668045Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7668296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7668368Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7668598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7668659Z return func(*args, **kwargs) 2025-08-14T21:33:20.7668893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7668954Z self_outputs = self.self( 2025-08-14T21:33:20.7669193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7669261Z return func(*args, **kwargs) 2025-08-14T21:33:20.7669488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7669617Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7669621Z 2025-08-14T21:33:20.7669714Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7669895Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7669960Z return mod(**inputs) 2025-08-14T21:33:20.7670190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7670248Z outputs = self.bert( 2025-08-14T21:33:20.7670486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7670555Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7670785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7670849Z layer_outputs = layer_module( 2025-08-14T21:33:20.7671052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7671129Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7671357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7671430Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7671660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7671725Z return func(*args, **kwargs) 2025-08-14T21:33:20.7671960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7672077Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7672306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7672391Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7672394Z 2025-08-14T21:33:20.7672490Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7672680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7672738Z return mod(**inputs) 2025-08-14T21:33:20.7672967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7673035Z outputs = self.bert( 2025-08-14T21:33:20.7673265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7673331Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7673566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7673673Z layer_outputs = layer_module( 2025-08-14T21:33:20.7673885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7673956Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7674180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7674264Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7674503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7674596Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7674849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7674959Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7675192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7675268Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7675272Z 2025-08-14T21:33:20.7675364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7675555Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7675612Z return mod(**inputs) 2025-08-14T21:33:20.7675846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7675908Z outputs = self.bert( 2025-08-14T21:33:20.7676133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7676207Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7676432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7676497Z layer_outputs = layer_module( 2025-08-14T21:33:20.7676705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7676775Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7677004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7677078Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7677313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7677391Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7677643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7677757Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7677987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7678091Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7678294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7678358Z return self.act(input) 2025-08-14T21:33:20.7678362Z 2025-08-14T21:33:20.7678453Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7678643Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7678703Z return mod(**inputs) 2025-08-14T21:33:20.7678938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7678996Z outputs = self.bert( 2025-08-14T21:33:20.7679252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7679345Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7679573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7679644Z layer_outputs = layer_module( 2025-08-14T21:33:20.7679849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7679921Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7680154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7680247Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7680483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7680560Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7680820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7680950Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7681174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7681247Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7681250Z 2025-08-14T21:33:20.7681352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7681535Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7681604Z return mod(**inputs) 2025-08-14T21:33:20.7681833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7681892Z outputs = self.bert( 2025-08-14T21:33:20.7682131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7682199Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7682425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7682495Z layer_outputs = layer_module( 2025-08-14T21:33:20.7682697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7682773Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7682998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7683072Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7683298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7683363Z return func(*args, **kwargs) 2025-08-14T21:33:20.7683589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7683658Z self_outputs = self.self( 2025-08-14T21:33:20.7683880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7683948Z return func(*args, **kwargs) 2025-08-14T21:33:20.7684174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7684367Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7684372Z 2025-08-14T21:33:20.7684472Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7684837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7684911Z return mod(**inputs) 2025-08-14T21:33:20.7685251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7685313Z outputs = self.bert( 2025-08-14T21:33:20.7685556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7685623Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7685850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7685925Z layer_outputs = layer_module( 2025-08-14T21:33:20.7686155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7686242Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7686465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7686543Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7686772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7686834Z return func(*args, **kwargs) 2025-08-14T21:33:20.7687060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7687131Z self_outputs = self.self( 2025-08-14T21:33:20.7687350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7687421Z return func(*args, **kwargs) 2025-08-14T21:33:20.7687644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7687709Z self.key(current_states) 2025-08-14T21:33:20.7687712Z 2025-08-14T21:33:20.7687814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7688003Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7688071Z return mod(**inputs) 2025-08-14T21:33:20.7688298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7688356Z outputs = self.bert( 2025-08-14T21:33:20.7688590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7688657Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7688888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7688961Z layer_outputs = layer_module( 2025-08-14T21:33:20.7689162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7689243Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7689467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7689541Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7689768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7689829Z return func(*args, **kwargs) 2025-08-14T21:33:20.7690053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7690123Z self_outputs = self.self( 2025-08-14T21:33:20.7690344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7690414Z return func(*args, **kwargs) 2025-08-14T21:33:20.7690663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7690748Z self.value(current_states) 2025-08-14T21:33:20.7690752Z 2025-08-14T21:33:20.7690834Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7690930Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7691118Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7691178Z return mod(**inputs) 2025-08-14T21:33:20.7691409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7691478Z outputs = self.bert( 2025-08-14T21:33:20.7691723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7691790Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7692021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7692088Z layer_outputs = layer_module( 2025-08-14T21:33:20.7692296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7692366Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7692587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7692670Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7692886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7692948Z return func(*args, **kwargs) 2025-08-14T21:33:20.7693178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7693239Z self_outputs = self.self( 2025-08-14T21:33:20.7693468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7693532Z return func(*args, **kwargs) 2025-08-14T21:33:20.7693755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7693884Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7693887Z 2025-08-14T21:33:20.7693978Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7694164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7694223Z return mod(**inputs) 2025-08-14T21:33:20.7694452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7694516Z outputs = self.bert( 2025-08-14T21:33:20.7694739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7694808Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7695040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7695103Z layer_outputs = layer_module( 2025-08-14T21:33:20.7695308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7695378Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7695598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7695679Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7695895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7695956Z return func(*args, **kwargs) 2025-08-14T21:33:20.7696213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7696345Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7696576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7696655Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7696658Z 2025-08-14T21:33:20.7696751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7696941Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7697016Z return mod(**inputs) 2025-08-14T21:33:20.7697251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7697310Z outputs = self.bert( 2025-08-14T21:33:20.7697537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7697610Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7697834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7697898Z layer_outputs = layer_module( 2025-08-14T21:33:20.7698105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7698176Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7698408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7698487Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7698725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7698804Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7699059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7699173Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7699403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7699478Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7699481Z 2025-08-14T21:33:20.7699580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7699763Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7699824Z return mod(**inputs) 2025-08-14T21:33:20.7700059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7700118Z outputs = self.bert( 2025-08-14T21:33:20.7700353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7700421Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7700646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7700720Z layer_outputs = layer_module( 2025-08-14T21:33:20.7700920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7700991Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7701223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7701300Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7701545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7701615Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7701914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7702035Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7702263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7702374Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7702569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7702633Z return self.act(input) 2025-08-14T21:33:20.7702652Z 2025-08-14T21:33:20.7702756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7702939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7702996Z return mod(**inputs) 2025-08-14T21:33:20.7703233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7703293Z outputs = self.bert( 2025-08-14T21:33:20.7703527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7703593Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7703815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7703886Z layer_outputs = layer_module( 2025-08-14T21:33:20.7704087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7704160Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7704392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7704469Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7704780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7704855Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7705111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7705240Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7705467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7705553Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7705558Z 2025-08-14T21:33:20.7705653Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7705837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7705906Z return mod(**inputs) 2025-08-14T21:33:20.7706143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7706203Z outputs = self.bert( 2025-08-14T21:33:20.7706441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7706508Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7706741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7706806Z layer_outputs = layer_module( 2025-08-14T21:33:20.7707010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7707089Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7707316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7707455Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7707679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7707745Z return func(*args, **kwargs) 2025-08-14T21:33:20.7707976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7708038Z self_outputs = self.self( 2025-08-14T21:33:20.7708257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7708343Z return func(*args, **kwargs) 2025-08-14T21:33:20.7708566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7708765Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7708768Z 2025-08-14T21:33:20.7708866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7709050Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7709114Z return mod(**inputs) 2025-08-14T21:33:20.7709343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7709401Z outputs = self.bert( 2025-08-14T21:33:20.7709634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7709699Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7709932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7709995Z layer_outputs = layer_module( 2025-08-14T21:33:20.7710197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7710278Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7710503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7710582Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7710801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7710862Z return func(*args, **kwargs) 2025-08-14T21:33:20.7711093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7711155Z self_outputs = self.self( 2025-08-14T21:33:20.7711376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7711444Z return func(*args, **kwargs) 2025-08-14T21:33:20.7711670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7711742Z self.key(current_states) 2025-08-14T21:33:20.7711745Z 2025-08-14T21:33:20.7711838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7712020Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7712085Z return mod(**inputs) 2025-08-14T21:33:20.7712314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7712372Z outputs = self.bert( 2025-08-14T21:33:20.7712611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7712676Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7712906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7713040Z layer_outputs = layer_module( 2025-08-14T21:33:20.7713243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7713322Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7713544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7713627Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7713845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7713925Z return func(*args, **kwargs) 2025-08-14T21:33:20.7714158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7714221Z self_outputs = self.self( 2025-08-14T21:33:20.7714445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7714516Z return func(*args, **kwargs) 2025-08-14T21:33:20.7714743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7714816Z self.value(current_states) 2025-08-14T21:33:20.7714819Z 2025-08-14T21:33:20.7714893Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7714989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7715182Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7715244Z return mod(**inputs) 2025-08-14T21:33:20.7715476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7715543Z outputs = self.bert( 2025-08-14T21:33:20.7715773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7715851Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7716079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7716143Z layer_outputs = layer_module( 2025-08-14T21:33:20.7716354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7716426Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7716651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7716734Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7716957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7717026Z return func(*args, **kwargs) 2025-08-14T21:33:20.7717257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7717322Z self_outputs = self.self( 2025-08-14T21:33:20.7717552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7717612Z return func(*args, **kwargs) 2025-08-14T21:33:20.7717849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7717971Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7717974Z 2025-08-14T21:33:20.7718070Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7718261Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7718319Z return mod(**inputs) 2025-08-14T21:33:20.7718577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7718658Z outputs = self.bert( 2025-08-14T21:33:20.7718890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7718962Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7719188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7719251Z layer_outputs = layer_module( 2025-08-14T21:33:20.7719460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7719549Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7719773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7719855Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7720078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7720147Z return func(*args, **kwargs) 2025-08-14T21:33:20.7720371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7720488Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7720719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7720794Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7720798Z 2025-08-14T21:33:20.7720896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7721078Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7721138Z return mod(**inputs) 2025-08-14T21:33:20.7721374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7721432Z outputs = self.bert( 2025-08-14T21:33:20.7721657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7721730Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7721953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7722022Z layer_outputs = layer_module( 2025-08-14T21:33:20.7722221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7722292Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7722524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7722601Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7722845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7722916Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7723167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7723284Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7723506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7723582Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7723587Z 2025-08-14T21:33:20.7723686Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7723866Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7723930Z return mod(**inputs) 2025-08-14T21:33:20.7724185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7724258Z outputs = self.bert( 2025-08-14T21:33:20.7724492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7724556Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7724780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7724849Z layer_outputs = layer_module( 2025-08-14T21:33:20.7725050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7725142Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7725365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7725440Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7725692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7725762Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7726019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7726128Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7726349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7726462Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7726656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7726719Z return self.act(input) 2025-08-14T21:33:20.7726730Z 2025-08-14T21:33:20.7726822Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7727007Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7727096Z return mod(**inputs) 2025-08-14T21:33:20.7727321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7727380Z outputs = self.bert( 2025-08-14T21:33:20.7727613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7727678Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7727907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7727973Z layer_outputs = layer_module( 2025-08-14T21:33:20.7728171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7728250Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7728475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7728550Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7728794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7728863Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7729121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7729243Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7729466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7729547Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7729550Z 2025-08-14T21:33:20.7729680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7729889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7729948Z return mod(**inputs) 2025-08-14T21:33:20.7730175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7730243Z outputs = self.bert( 2025-08-14T21:33:20.7730471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7730536Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7730781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7730845Z layer_outputs = layer_module( 2025-08-14T21:33:20.7731053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7731128Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7731348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7731430Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7731650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7731714Z return func(*args, **kwargs) 2025-08-14T21:33:20.7731943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7732008Z self_outputs = self.self( 2025-08-14T21:33:20.7732234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7732295Z return func(*args, **kwargs) 2025-08-14T21:33:20.7732522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7732721Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7732724Z 2025-08-14T21:33:20.7732817Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7733003Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7733062Z return mod(**inputs) 2025-08-14T21:33:20.7733288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7733356Z outputs = self.bert( 2025-08-14T21:33:20.7733582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7733647Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7733881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7733946Z layer_outputs = layer_module( 2025-08-14T21:33:20.7734152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7734223Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7734445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7734525Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7734741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7734804Z return func(*args, **kwargs) 2025-08-14T21:33:20.7735032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7735095Z self_outputs = self.self( 2025-08-14T21:33:20.7735350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7735424Z return func(*args, **kwargs) 2025-08-14T21:33:20.7735648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7735720Z self.key(current_states) 2025-08-14T21:33:20.7735723Z 2025-08-14T21:33:20.7735815Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7736002Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7736078Z return mod(**inputs) 2025-08-14T21:33:20.7736306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7736371Z outputs = self.bert( 2025-08-14T21:33:20.7736599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7736667Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7736898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7736961Z layer_outputs = layer_module( 2025-08-14T21:33:20.7737170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7737242Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7737464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7737547Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7737768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7737828Z return func(*args, **kwargs) 2025-08-14T21:33:20.7738061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7738126Z self_outputs = self.self( 2025-08-14T21:33:20.7738354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7738414Z return func(*args, **kwargs) 2025-08-14T21:33:20.7738637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7738710Z self.value(current_states) 2025-08-14T21:33:20.7738714Z 2025-08-14T21:33:20.7738787Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7738889Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7739068Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7739126Z return mod(**inputs) 2025-08-14T21:33:20.7739364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7739425Z outputs = self.bert( 2025-08-14T21:33:20.7739652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7739725Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7739949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7740018Z layer_outputs = layer_module( 2025-08-14T21:33:20.7740219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7740290Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7740520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7740593Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7740841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7740928Z return func(*args, **kwargs) 2025-08-14T21:33:20.7741151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7741221Z self_outputs = self.self( 2025-08-14T21:33:20.7741438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7741499Z return func(*args, **kwargs) 2025-08-14T21:33:20.7741730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7741873Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7741876Z 2025-08-14T21:33:20.7741976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7742158Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7742218Z return mod(**inputs) 2025-08-14T21:33:20.7742455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7742515Z outputs = self.bert( 2025-08-14T21:33:20.7742743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7742817Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7743050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7743125Z layer_outputs = layer_module( 2025-08-14T21:33:20.7743327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7743397Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7743630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7743704Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7743923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7743992Z return func(*args, **kwargs) 2025-08-14T21:33:20.7744216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7744339Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7744563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7744700Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7744706Z 2025-08-14T21:33:20.7744817Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7745005Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7745076Z return mod(**inputs) 2025-08-14T21:33:20.7745303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7745364Z outputs = self.bert( 2025-08-14T21:33:20.7745599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7745668Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7745891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7745965Z layer_outputs = layer_module( 2025-08-14T21:33:20.7746168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7746248Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7746529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7746610Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7746856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7746927Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7747182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7747302Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7747553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7747635Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7747638Z 2025-08-14T21:33:20.7747733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7747920Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7747988Z return mod(**inputs) 2025-08-14T21:33:20.7748217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7748283Z outputs = self.bert( 2025-08-14T21:33:20.7748513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7748580Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7748812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7748879Z layer_outputs = layer_module( 2025-08-14T21:33:20.7749079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7749158Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7749383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7749468Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7749705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7749774Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7750035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7750147Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7750381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7750483Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7750680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7750754Z return self.act(input) 2025-08-14T21:33:20.7750757Z 2025-08-14T21:33:20.7750852Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7751032Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7751099Z return mod(**inputs) 2025-08-14T21:33:20.7751327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7751392Z outputs = self.bert( 2025-08-14T21:33:20.7751623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7751689Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7751921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7752030Z layer_outputs = layer_module( 2025-08-14T21:33:20.7752238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7752314Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7752540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7752622Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7752862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7752946Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7753204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7753323Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7753558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7753634Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7753637Z 2025-08-14T21:33:20.7753731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7753921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7753980Z return mod(**inputs) 2025-08-14T21:33:20.7754209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7754275Z outputs = self.bert( 2025-08-14T21:33:20.7754503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7754577Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7754804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7754872Z layer_outputs = layer_module( 2025-08-14T21:33:20.7755080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7755152Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7755382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7755459Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7755679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7755750Z return func(*args, **kwargs) 2025-08-14T21:33:20.7755974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7756038Z self_outputs = self.self( 2025-08-14T21:33:20.7756269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7756332Z return func(*args, **kwargs) 2025-08-14T21:33:20.7756564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:20.7756755Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:20.7756758Z 2025-08-14T21:33:20.7756853Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7757046Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7757106Z return mod(**inputs) 2025-08-14T21:33:20.7757341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7757400Z outputs = self.bert( 2025-08-14T21:33:20.7757659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7757757Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7757981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7758044Z layer_outputs = layer_module( 2025-08-14T21:33:20.7758252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7758322Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7758553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7758645Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7758865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7758935Z return func(*args, **kwargs) 2025-08-14T21:33:20.7759167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7759229Z self_outputs = self.self( 2025-08-14T21:33:20.7759456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7759516Z return func(*args, **kwargs) 2025-08-14T21:33:20.7759749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:20.7759813Z self.key(current_states) 2025-08-14T21:33:20.7759818Z 2025-08-14T21:33:20.7759911Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7760100Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7760160Z return mod(**inputs) 2025-08-14T21:33:20.7760391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7760459Z outputs = self.bert( 2025-08-14T21:33:20.7760688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7760762Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7760986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7761050Z layer_outputs = layer_module( 2025-08-14T21:33:20.7761258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7761331Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7761565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7761639Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7761860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7761930Z return func(*args, **kwargs) 2025-08-14T21:33:20.7762155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7762217Z self_outputs = self.self( 2025-08-14T21:33:20.7762444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7762506Z return func(*args, **kwargs) 2025-08-14T21:33:20.7762739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:20.7762806Z self.value(current_states) 2025-08-14T21:33:20.7762809Z 2025-08-14T21:33:20.7762883Z cudagraph partition due to non gpu ops 2025-08-14T21:33:20.7762984Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7763193Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7763267Z return mod(**inputs) 2025-08-14T21:33:20.7763504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7763563Z outputs = self.bert( 2025-08-14T21:33:20.7763798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7763866Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7764090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7764177Z layer_outputs = layer_module( 2025-08-14T21:33:20.7764377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7764447Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7764678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7764750Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7764975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7765036Z return func(*args, **kwargs) 2025-08-14T21:33:20.7765259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:20.7765327Z self_outputs = self.self( 2025-08-14T21:33:20.7765547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7765614Z return func(*args, **kwargs) 2025-08-14T21:33:20.7765836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:20.7765961Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:20.7765964Z 2025-08-14T21:33:20.7766063Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7766246Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7766304Z return mod(**inputs) 2025-08-14T21:33:20.7766541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7766598Z outputs = self.bert( 2025-08-14T21:33:20.7766835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7766903Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7767129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7767201Z layer_outputs = layer_module( 2025-08-14T21:33:20.7767406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7767476Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7767709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:20.7767781Z self_attention_outputs = self.attention( 2025-08-14T21:33:20.7768008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:20.7768070Z return func(*args, **kwargs) 2025-08-14T21:33:20.7768296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:20.7768421Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:20.7768675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:20.7768781Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7768784Z 2025-08-14T21:33:20.7768875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7769054Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7769121Z return mod(**inputs) 2025-08-14T21:33:20.7769347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7769407Z outputs = self.bert( 2025-08-14T21:33:20.7769641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7769726Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7769962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7770030Z layer_outputs = layer_module( 2025-08-14T21:33:20.7770235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7770313Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7770539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7770623Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7770863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7770933Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7771200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7771310Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7771538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:20.7771621Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7771624Z 2025-08-14T21:33:20.7771718Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7771909Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7771968Z return mod(**inputs) 2025-08-14T21:33:20.7772201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7772269Z outputs = self.bert( 2025-08-14T21:33:20.7772501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7772577Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7772804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7772873Z layer_outputs = layer_module( 2025-08-14T21:33:20.7773081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7773151Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7773377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7773462Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7773700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7773777Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7774033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:20.7774141Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:20.7774404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:20.7774522Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:20.7774717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:20.7774789Z return self.act(input) 2025-08-14T21:33:20.7774792Z 2025-08-14T21:33:20.7774886Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7775077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7775152Z return mod(**inputs) 2025-08-14T21:33:20.7775377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1308, in forward 2025-08-14T21:33:20.7775443Z outputs = self.bert( 2025-08-14T21:33:20.7775671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:20.7775744Z encoder_outputs = self.encoder( 2025-08-14T21:33:20.7775969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:20.7776033Z layer_outputs = layer_module( 2025-08-14T21:33:20.7776239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:20.7776309Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:20.7776530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:20.7776615Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:20.7776849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:20.7776922Z return forward_fn(*input_tensors) 2025-08-14T21:33:20.7777176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:20.7777298Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:20.7777528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:20.7777600Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7777603Z 2025-08-14T21:33:20.7777701Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7777880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7777940Z return mod(**inputs) 2025-08-14T21:33:20.7778173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1323, in forward 2025-08-14T21:33:20.7778258Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:33:20.7778483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 780, in forward 2025-08-14T21:33:20.7778593Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:33:20.7778818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 769, in forward 2025-08-14T21:33:20.7778909Z hidden_states = self.transform(hidden_states) 2025-08-14T21:33:20.7779130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 745, in forward 2025-08-14T21:33:20.7779201Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:20.7779206Z 2025-08-14T21:33:20.7779306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7779488Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7779554Z return mod(**inputs) 2025-08-14T21:33:20.7779807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1323, in forward 2025-08-14T21:33:20.7779905Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:33:20.7780139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 780, in forward 2025-08-14T21:33:20.7780239Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:33:20.7780463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 770, in forward 2025-08-14T21:33:20.7780552Z hidden_states = self.decoder(hidden_states) 2025-08-14T21:33:20.7780556Z 2025-08-14T21:33:20.7780668Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:20.7780854Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:20.7780913Z return mod(**inputs) 2025-08-14T21:33:20.7781142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1328, in forward 2025-08-14T21:33:20.7781323Z masked_lm_loss = loss_fct(prediction_scores.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:33:20.7781327Z 2025-08-14T21:33:28.3167318Z Compilation time (from dynamo_timed): 13.221636075 2025-08-14T21:33:28.3239575Z pass 2025-08-14T21:33:28.3244010Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:28.3248340Z TIMING: _recursive_pre_grad_passes:0.006 _recursive_joint_graph_passes:0.54181 _recursive_post_grad_passes:0.07076 async_compile.wait:0.63402 code_gen:6.52397 inductor_compile:7.60022 backend_compile:10.45044 gc:0.00031 entire_frame_compile:13.22164 total_wall_time:13.22164 2025-08-14T21:33:28.3249754Z STATS: call_* op count: 289 | FakeTensorMode.__torch_dispatch__:12337 | FakeTensor.__torch_dispatch__:4686 | ProxyTorchDispatchMode.__torch_dispatch__:4495 2025-08-14T21:33:28.3250266Z Dynamo produced 1 graphs covering 289 ops with 0 graph breaks (0 unique) 2025-08-14T21:33:32.2941710Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:33:32.2942685Z from pkg_resources import resource_filename 2025-08-14T21:33:32.8153278Z 2025-08-14T21:33:33.8315555Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:33:33.8319799Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:33:33.8330169Z cpu eval BertForQuestionAnswering 2025-08-14T21:33:34.1712498Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:34.3147522Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:34.4552958Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:41.1505277Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1508673Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1512485Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1516920Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1521416Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1526033Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1530402Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1534967Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1535359Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1535672Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1536118Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1536313Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1536537Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1537218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1539270Z return mod(**inputs) 2025-08-14T21:33:41.1539694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1540181Z outputs = self.bert( 2025-08-14T21:33:41.1540581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1540948Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1541307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1541785Z layer_outputs = layer_module( 2025-08-14T21:33:41.1553966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1554340Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1554738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1555123Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1555480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1555841Z return func(*args, **kwargs) 2025-08-14T21:33:41.1556189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1556536Z self_outputs = self.self( 2025-08-14T21:33:41.1556875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1557225Z return func(*args, **kwargs) 2025-08-14T21:33:41.1557571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1558051Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1558316Z 2025-08-14T21:33:41.1558423Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1558780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1559089Z return mod(**inputs) 2025-08-14T21:33:41.1559418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1559769Z outputs = self.bert( 2025-08-14T21:33:41.1560106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1560464Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1560836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1561203Z layer_outputs = layer_module( 2025-08-14T21:33:41.1561532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1561861Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1562213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1562571Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1562915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1563260Z return func(*args, **kwargs) 2025-08-14T21:33:41.1563602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1563951Z self_outputs = self.self( 2025-08-14T21:33:41.1564277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1564729Z return func(*args, **kwargs) 2025-08-14T21:33:41.1565113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1565464Z self.key(current_states) 2025-08-14T21:33:41.1565571Z 2025-08-14T21:33:41.1565674Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1566016Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1566322Z return mod(**inputs) 2025-08-14T21:33:41.1566650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1567039Z outputs = self.bert( 2025-08-14T21:33:41.1567374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1567739Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1568088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1568447Z layer_outputs = layer_module( 2025-08-14T21:33:41.1568773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1569105Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1569461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1569824Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1570176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1570519Z return func(*args, **kwargs) 2025-08-14T21:33:41.1570864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1571216Z self_outputs = self.self( 2025-08-14T21:33:41.1571545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1571888Z return func(*args, **kwargs) 2025-08-14T21:33:41.1572226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1572584Z self.value(current_states) 2025-08-14T21:33:41.1572694Z 2025-08-14T21:33:41.1572774Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1573001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1573341Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1573648Z return mod(**inputs) 2025-08-14T21:33:41.1573971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1574319Z outputs = self.bert( 2025-08-14T21:33:41.1574651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1575005Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1575358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1575713Z layer_outputs = layer_module( 2025-08-14T21:33:41.1576036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1576366Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1576723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1577089Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1577435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1577821Z return func(*args, **kwargs) 2025-08-14T21:33:41.1578206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1578556Z self_outputs = self.self( 2025-08-14T21:33:41.1578880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1579221Z return func(*args, **kwargs) 2025-08-14T21:33:41.1579560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1579958Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1580155Z 2025-08-14T21:33:41.1580257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1580592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1580893Z return mod(**inputs) 2025-08-14T21:33:41.1581221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1581575Z outputs = self.bert( 2025-08-14T21:33:41.1581902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1582262Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1582612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1582961Z layer_outputs = layer_module( 2025-08-14T21:33:41.1583276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1583612Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1583962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1584327Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1584915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1585269Z return func(*args, **kwargs) 2025-08-14T21:33:41.1585608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1586004Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1586405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1586772Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1586903Z 2025-08-14T21:33:41.1587009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1587335Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1587637Z return mod(**inputs) 2025-08-14T21:33:41.1587972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1588314Z outputs = self.bert( 2025-08-14T21:33:41.1588644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1588998Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1589346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1589689Z layer_outputs = layer_module( 2025-08-14T21:33:41.1590005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1590338Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1590688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1591134Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1591539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1591909Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1592280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1592706Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1593098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1593490Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1593617Z 2025-08-14T21:33:41.1593714Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1594049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1594354Z return mod(**inputs) 2025-08-14T21:33:41.1594677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1595020Z outputs = self.bert( 2025-08-14T21:33:41.1595346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1595697Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1596031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1596379Z layer_outputs = layer_module( 2025-08-14T21:33:41.1596733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1597073Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1597429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1597803Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1598182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1598549Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1598937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1599373Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1599773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1600168Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1600529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1600850Z return self.act(input) 2025-08-14T21:33:41.1600968Z 2025-08-14T21:33:41.1601071Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1601397Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1601695Z return mod(**inputs) 2025-08-14T21:33:41.1602024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1602360Z outputs = self.bert( 2025-08-14T21:33:41.1602683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1603034Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1603380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1603722Z layer_outputs = layer_module( 2025-08-14T21:33:41.1604074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1604425Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1604773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1605138Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1605511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1605872Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1606245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1606711Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1607113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1607814Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1607944Z 2025-08-14T21:33:41.1608040Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1608377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1608681Z return mod(**inputs) 2025-08-14T21:33:41.1609004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1609353Z outputs = self.bert( 2025-08-14T21:33:41.1609682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1610036Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1610378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1610727Z layer_outputs = layer_module( 2025-08-14T21:33:41.1611053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1611382Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1611736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1612096Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1612448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1612784Z return func(*args, **kwargs) 2025-08-14T21:33:41.1613125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1613475Z self_outputs = self.self( 2025-08-14T21:33:41.1613811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1614147Z return func(*args, **kwargs) 2025-08-14T21:33:41.1614491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1615142Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1615392Z 2025-08-14T21:33:41.1615491Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1615831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1616138Z return mod(**inputs) 2025-08-14T21:33:41.1616473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1616821Z outputs = self.bert( 2025-08-14T21:33:41.1617155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1617512Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1617913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1618268Z layer_outputs = layer_module( 2025-08-14T21:33:41.1618587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1618925Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1619274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1619638Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1620016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1620357Z return func(*args, **kwargs) 2025-08-14T21:33:41.1620688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1621043Z self_outputs = self.self( 2025-08-14T21:33:41.1621373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1621705Z return func(*args, **kwargs) 2025-08-14T21:33:41.1622043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1622392Z self.key(current_states) 2025-08-14T21:33:41.1622497Z 2025-08-14T21:33:41.1622602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1622931Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1623232Z return mod(**inputs) 2025-08-14T21:33:41.1623560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1623902Z outputs = self.bert( 2025-08-14T21:33:41.1624230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1624586Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1624995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1625342Z layer_outputs = layer_module( 2025-08-14T21:33:41.1625661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1625991Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1626337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1626697Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1627050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1627395Z return func(*args, **kwargs) 2025-08-14T21:33:41.1627731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1628081Z self_outputs = self.self( 2025-08-14T21:33:41.1628414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1628757Z return func(*args, **kwargs) 2025-08-14T21:33:41.1629090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1629442Z self.value(current_states) 2025-08-14T21:33:41.1629553Z 2025-08-14T21:33:41.1629635Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1629850Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1630188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1630492Z return mod(**inputs) 2025-08-14T21:33:41.1630922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1631264Z outputs = self.bert( 2025-08-14T21:33:41.1631594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1631950Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1632292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1632644Z layer_outputs = layer_module( 2025-08-14T21:33:41.1632981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1633316Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1633661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1634030Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1634382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1634717Z return func(*args, **kwargs) 2025-08-14T21:33:41.1635054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1635401Z self_outputs = self.self( 2025-08-14T21:33:41.1635732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1636067Z return func(*args, **kwargs) 2025-08-14T21:33:41.1636406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1636811Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1636981Z 2025-08-14T21:33:41.1637087Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1637419Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1637717Z return mod(**inputs) 2025-08-14T21:33:41.1638049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1638391Z outputs = self.bert( 2025-08-14T21:33:41.1638720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1639074Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1639422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1639765Z layer_outputs = layer_module( 2025-08-14T21:33:41.1640081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1640416Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1640762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1641122Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1641477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1641816Z return func(*args, **kwargs) 2025-08-14T21:33:41.1642144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1642556Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1642956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1643310Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1643447Z 2025-08-14T21:33:41.1643571Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1643927Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1644236Z return mod(**inputs) 2025-08-14T21:33:41.1644567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1644922Z outputs = self.bert( 2025-08-14T21:33:41.1645257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1645610Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1645972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1646318Z layer_outputs = layer_module( 2025-08-14T21:33:41.1646635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1646963Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1647316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1647675Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1648039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1648405Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1648779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1649200Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1649585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1649945Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1650076Z 2025-08-14T21:33:41.1650182Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1650518Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1650811Z return mod(**inputs) 2025-08-14T21:33:41.1651137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1651480Z outputs = self.bert( 2025-08-14T21:33:41.1651799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1652155Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1652498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1652845Z layer_outputs = layer_module( 2025-08-14T21:33:41.1653158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1653492Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1653843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1654193Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1654560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1654932Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1655303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1655713Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1656102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1656519Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1656897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1657214Z return self.act(input) 2025-08-14T21:33:41.1657327Z 2025-08-14T21:33:41.1657425Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1657765Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1658069Z return mod(**inputs) 2025-08-14T21:33:41.1658411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1658777Z outputs = self.bert( 2025-08-14T21:33:41.1659107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1659453Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1659805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1660158Z layer_outputs = layer_module( 2025-08-14T21:33:41.1660474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1660812Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1661168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1661529Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1661891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1662257Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1662632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1663067Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1663462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1663822Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1663947Z 2025-08-14T21:33:41.1664050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1664375Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1664750Z return mod(**inputs) 2025-08-14T21:33:41.1665085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1665437Z outputs = self.bert( 2025-08-14T21:33:41.1665760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1666120Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1666476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1666824Z layer_outputs = layer_module( 2025-08-14T21:33:41.1667148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1667490Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1667850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1668211Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1668571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1668922Z return func(*args, **kwargs) 2025-08-14T21:33:41.1669262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1670591Z self_outputs = self.self( 2025-08-14T21:33:41.1670930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1671279Z return func(*args, **kwargs) 2025-08-14T21:33:41.1671623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1672104Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1672346Z 2025-08-14T21:33:41.1672455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1672817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1673131Z return mod(**inputs) 2025-08-14T21:33:41.1673470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1673828Z outputs = self.bert( 2025-08-14T21:33:41.1674162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1674520Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1674878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1675230Z layer_outputs = layer_module( 2025-08-14T21:33:41.1675560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1675902Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1676266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1676625Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1676988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1677342Z return func(*args, **kwargs) 2025-08-14T21:33:41.1677681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1678037Z self_outputs = self.self( 2025-08-14T21:33:41.1678381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1678729Z return func(*args, **kwargs) 2025-08-14T21:33:41.1679068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1679427Z self.key(current_states) 2025-08-14T21:33:41.1679537Z 2025-08-14T21:33:41.1679644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1679986Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1680286Z return mod(**inputs) 2025-08-14T21:33:41.1680629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1680982Z outputs = self.bert( 2025-08-14T21:33:41.1681310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1681669Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1682026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1682385Z layer_outputs = layer_module( 2025-08-14T21:33:41.1682707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1683045Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1683406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1683818Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1684172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1684512Z return func(*args, **kwargs) 2025-08-14T21:33:41.1684979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1685327Z self_outputs = self.self( 2025-08-14T21:33:41.1685667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1686059Z return func(*args, **kwargs) 2025-08-14T21:33:41.1686393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1686745Z self.value(current_states) 2025-08-14T21:33:41.1686862Z 2025-08-14T21:33:41.1686938Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1687166Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1687497Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1687799Z return mod(**inputs) 2025-08-14T21:33:41.1688133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1688474Z outputs = self.bert( 2025-08-14T21:33:41.1688812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1689173Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1689529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1689870Z layer_outputs = layer_module( 2025-08-14T21:33:41.1690192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1690532Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1690880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1691240Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1691592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1691937Z return func(*args, **kwargs) 2025-08-14T21:33:41.1692267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1692617Z self_outputs = self.self( 2025-08-14T21:33:41.1692948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1693284Z return func(*args, **kwargs) 2025-08-14T21:33:41.1693617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1694026Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1694195Z 2025-08-14T21:33:41.1694300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1694628Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1694928Z return mod(**inputs) 2025-08-14T21:33:41.1695259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1695605Z outputs = self.bert( 2025-08-14T21:33:41.1695928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1696281Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1696677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1697050Z layer_outputs = layer_module( 2025-08-14T21:33:41.1697369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1697701Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1698057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1698413Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1698764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1699122Z return func(*args, **kwargs) 2025-08-14T21:33:41.1699462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1699863Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1700266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1700631Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1700758Z 2025-08-14T21:33:41.1700855Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1701192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1701494Z return mod(**inputs) 2025-08-14T21:33:41.1701822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1702159Z outputs = self.bert( 2025-08-14T21:33:41.1702489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1702845Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1703188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1703540Z layer_outputs = layer_module( 2025-08-14T21:33:41.1703858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1704190Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1704536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1704965Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1705341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1705715Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1706088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1706511Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1706915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1707268Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1707406Z 2025-08-14T21:33:41.1707503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1707840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1708144Z return mod(**inputs) 2025-08-14T21:33:41.1708469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1708824Z outputs = self.bert( 2025-08-14T21:33:41.1709156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1709503Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1709885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1710253Z layer_outputs = layer_module( 2025-08-14T21:33:41.1710571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1710896Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1711250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1711611Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1711983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1712358Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1712739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1713166Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1713553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1713941Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1714294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1714611Z return self.act(input) 2025-08-14T21:33:41.1714715Z 2025-08-14T21:33:41.1714811Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1715147Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1715448Z return mod(**inputs) 2025-08-14T21:33:41.1715773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1716121Z outputs = self.bert( 2025-08-14T21:33:41.1716453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1716810Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1717151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1717503Z layer_outputs = layer_module( 2025-08-14T21:33:41.1717826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1718161Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1718511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1718880Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1719255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1719618Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1720002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1720436Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1720840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1721192Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1721326Z 2025-08-14T21:33:41.1721422Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1721760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1722060Z return mod(**inputs) 2025-08-14T21:33:41.1722383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1722727Z outputs = self.bert( 2025-08-14T21:33:41.1723103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1723455Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1723803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1724155Z layer_outputs = layer_module( 2025-08-14T21:33:41.1724475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1724803Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1725188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1725548Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1725900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1726246Z return func(*args, **kwargs) 2025-08-14T21:33:41.1726584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1726929Z self_outputs = self.self( 2025-08-14T21:33:41.1727253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1727596Z return func(*args, **kwargs) 2025-08-14T21:33:41.1727934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1728403Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1728652Z 2025-08-14T21:33:41.1728749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1729084Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1729388Z return mod(**inputs) 2025-08-14T21:33:41.1729713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1730063Z outputs = self.bert( 2025-08-14T21:33:41.1730394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1730746Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1731086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1731440Z layer_outputs = layer_module( 2025-08-14T21:33:41.1731757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1732082Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1732438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1732799Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1733154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1733489Z return func(*args, **kwargs) 2025-08-14T21:33:41.1733828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1734177Z self_outputs = self.self( 2025-08-14T21:33:41.1734502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1734852Z return func(*args, **kwargs) 2025-08-14T21:33:41.1735188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1735540Z self.key(current_states) 2025-08-14T21:33:41.1735647Z 2025-08-14T21:33:41.1735783Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1736139Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1736446Z return mod(**inputs) 2025-08-14T21:33:41.1736779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1737122Z outputs = self.bert( 2025-08-14T21:33:41.1737456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1737812Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1738168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1738520Z layer_outputs = layer_module( 2025-08-14T21:33:41.1738839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1739177Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1739523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1739883Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1740233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1740566Z return func(*args, **kwargs) 2025-08-14T21:33:41.1740901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1741249Z self_outputs = self.self( 2025-08-14T21:33:41.1741576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1741909Z return func(*args, **kwargs) 2025-08-14T21:33:41.1742246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1742596Z self.value(current_states) 2025-08-14T21:33:41.1742703Z 2025-08-14T21:33:41.1742777Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1742996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1743326Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1743624Z return mod(**inputs) 2025-08-14T21:33:41.1743943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1744290Z outputs = self.bert( 2025-08-14T21:33:41.1744702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1745065Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1745416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1745772Z layer_outputs = layer_module( 2025-08-14T21:33:41.1746094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1746422Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1746779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1747146Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1747501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1747842Z return func(*args, **kwargs) 2025-08-14T21:33:41.1748185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1748537Z self_outputs = self.self( 2025-08-14T21:33:41.1748890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1749253Z return func(*args, **kwargs) 2025-08-14T21:33:41.1749596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1750002Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1750174Z 2025-08-14T21:33:41.1750271Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1750607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1750937Z return mod(**inputs) 2025-08-14T21:33:41.1751263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1751617Z outputs = self.bert( 2025-08-14T21:33:41.1751948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1752304Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1752645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1752996Z layer_outputs = layer_module( 2025-08-14T21:33:41.1753319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1753647Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1754003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1754369Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1754722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1755061Z return func(*args, **kwargs) 2025-08-14T21:33:41.1755403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1755809Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1756212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1756570Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1756707Z 2025-08-14T21:33:41.1756803Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1757141Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1757439Z return mod(**inputs) 2025-08-14T21:33:41.1757775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1758169Z outputs = self.bert( 2025-08-14T21:33:41.1758510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1758868Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1759225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1759587Z layer_outputs = layer_module( 2025-08-14T21:33:41.1759910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1760255Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1760618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1760994Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1761368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1761746Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1762195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1762629Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1763026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1763397Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1763525Z 2025-08-14T21:33:41.1763629Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1763963Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1764286Z return mod(**inputs) 2025-08-14T21:33:41.1764623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1764978Z outputs = self.bert( 2025-08-14T21:33:41.1765310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1765671Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1766027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1766388Z layer_outputs = layer_module( 2025-08-14T21:33:41.1766706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1767047Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1767413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1767780Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1768158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1768533Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1768923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1769344Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1769743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1770140Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1770493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1770818Z return self.act(input) 2025-08-14T21:33:41.1770931Z 2025-08-14T21:33:41.1771030Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1771375Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1771677Z return mod(**inputs) 2025-08-14T21:33:41.1772017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1772376Z outputs = self.bert( 2025-08-14T21:33:41.1772713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1773133Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1773478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1773828Z layer_outputs = layer_module( 2025-08-14T21:33:41.1774140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1774472Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1774824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1775243Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1775608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1775970Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1776343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1776765Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1777166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1777541Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1777665Z 2025-08-14T21:33:41.1777768Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1778093Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1778396Z return mod(**inputs) 2025-08-14T21:33:41.1778729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1779075Z outputs = self.bert( 2025-08-14T21:33:41.1779396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1779749Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1780094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1780440Z layer_outputs = layer_module( 2025-08-14T21:33:41.1780758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1781091Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1781448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1781804Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1782155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1782498Z return func(*args, **kwargs) 2025-08-14T21:33:41.1782830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1783181Z self_outputs = self.self( 2025-08-14T21:33:41.1783514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1783857Z return func(*args, **kwargs) 2025-08-14T21:33:41.1784187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1784862Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1785122Z 2025-08-14T21:33:41.1785235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1785588Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1785906Z return mod(**inputs) 2025-08-14T21:33:41.1786265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1786666Z outputs = self.bert( 2025-08-14T21:33:41.1786990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1787351Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1787698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1788051Z layer_outputs = layer_module( 2025-08-14T21:33:41.1788437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1788800Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1789165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1789522Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1789879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1790230Z return func(*args, **kwargs) 2025-08-14T21:33:41.1790579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1790952Z self_outputs = self.self( 2025-08-14T21:33:41.1791281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1791622Z return func(*args, **kwargs) 2025-08-14T21:33:41.1791957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1792305Z self.key(current_states) 2025-08-14T21:33:41.1792418Z 2025-08-14T21:33:41.1792514Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1792848Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1793142Z return mod(**inputs) 2025-08-14T21:33:41.1793470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1793818Z outputs = self.bert( 2025-08-14T21:33:41.1794144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1794492Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1794843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1795194Z layer_outputs = layer_module( 2025-08-14T21:33:41.1795508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1795842Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1796195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1796553Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1796898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1797241Z return func(*args, **kwargs) 2025-08-14T21:33:41.1797579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1797924Z self_outputs = self.self( 2025-08-14T21:33:41.1798257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1798602Z return func(*args, **kwargs) 2025-08-14T21:33:41.1798940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1799282Z self.value(current_states) 2025-08-14T21:33:41.1799399Z 2025-08-14T21:33:41.1799474Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1799701Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1800025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1800330Z return mod(**inputs) 2025-08-14T21:33:41.1800666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1801014Z outputs = self.bert( 2025-08-14T21:33:41.1801369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1801734Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1802080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1802418Z layer_outputs = layer_module( 2025-08-14T21:33:41.1802736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1803067Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1803419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1803792Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1804146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1804490Z return func(*args, **kwargs) 2025-08-14T21:33:41.1804833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1805180Z self_outputs = self.self( 2025-08-14T21:33:41.1805516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1805859Z return func(*args, **kwargs) 2025-08-14T21:33:41.1806189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1806599Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1806779Z 2025-08-14T21:33:41.1806875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1807212Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1807505Z return mod(**inputs) 2025-08-14T21:33:41.1807839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1808189Z outputs = self.bert( 2025-08-14T21:33:41.1808509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1808863Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1809211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1809557Z layer_outputs = layer_module( 2025-08-14T21:33:41.1809870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1810205Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1810559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1810919Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1811270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1811613Z return func(*args, **kwargs) 2025-08-14T21:33:41.1811954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1812349Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1812748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1813110Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1813241Z 2025-08-14T21:33:41.1813346Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1813671Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1813975Z return mod(**inputs) 2025-08-14T21:33:41.1814335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1814692Z outputs = self.bert( 2025-08-14T21:33:41.1815019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1815378Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1815729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1816071Z layer_outputs = layer_module( 2025-08-14T21:33:41.1816394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1816747Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1817093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1817458Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1817832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1818195Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1818568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1818989Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1819380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1819743Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1819870Z 2025-08-14T21:33:41.1819967Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1820301Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1820605Z return mod(**inputs) 2025-08-14T21:33:41.1820932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1821281Z outputs = self.bert( 2025-08-14T21:33:41.1821609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1821967Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1822308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1822660Z layer_outputs = layer_module( 2025-08-14T21:33:41.1822986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1823316Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1823665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1824032Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1824401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1824827Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1825212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1825641Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1826041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1826429Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1826786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1827106Z return self.act(input) 2025-08-14T21:33:41.1827210Z 2025-08-14T21:33:41.1827383Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1827711Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1828010Z return mod(**inputs) 2025-08-14T21:33:41.1828336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1828674Z outputs = self.bert( 2025-08-14T21:33:41.1829002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1829351Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1829752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1830098Z layer_outputs = layer_module( 2025-08-14T21:33:41.1830418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1830758Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1831106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1831468Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1831840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1832207Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1832580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1833016Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1833426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1833787Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1833917Z 2025-08-14T21:33:41.1834014Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1834348Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1834647Z return mod(**inputs) 2025-08-14T21:33:41.1834968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1835315Z outputs = self.bert( 2025-08-14T21:33:41.1835645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1836004Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1836347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1836701Z layer_outputs = layer_module( 2025-08-14T21:33:41.1837023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1837354Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1837710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1838070Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1838425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1838762Z return func(*args, **kwargs) 2025-08-14T21:33:41.1839102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1839455Z self_outputs = self.self( 2025-08-14T21:33:41.1839779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1840122Z return func(*args, **kwargs) 2025-08-14T21:33:41.1840490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1840981Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1841225Z 2025-08-14T21:33:41.1841319Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1841653Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1841952Z return mod(**inputs) 2025-08-14T21:33:41.1842284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1842637Z outputs = self.bert( 2025-08-14T21:33:41.1842968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1843326Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1843670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1844026Z layer_outputs = layer_module( 2025-08-14T21:33:41.1844347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1844681Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1845033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1845398Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1845751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1846087Z return func(*args, **kwargs) 2025-08-14T21:33:41.1846431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1846784Z self_outputs = self.self( 2025-08-14T21:33:41.1847121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1847459Z return func(*args, **kwargs) 2025-08-14T21:33:41.1847801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1848156Z self.key(current_states) 2025-08-14T21:33:41.1848264Z 2025-08-14T21:33:41.1848370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1848699Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1849004Z return mod(**inputs) 2025-08-14T21:33:41.1849337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1849679Z outputs = self.bert( 2025-08-14T21:33:41.1850012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1850369Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1850720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1851062Z layer_outputs = layer_module( 2025-08-14T21:33:41.1851383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1851718Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1852066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1852428Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1852779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1853120Z return func(*args, **kwargs) 2025-08-14T21:33:41.1853502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1853853Z self_outputs = self.self( 2025-08-14T21:33:41.1854183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1854516Z return func(*args, **kwargs) 2025-08-14T21:33:41.1854860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1855208Z self.value(current_states) 2025-08-14T21:33:41.1855335Z 2025-08-14T21:33:41.1855422Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1855648Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1855989Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1856299Z return mod(**inputs) 2025-08-14T21:33:41.1856635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1856994Z outputs = self.bert( 2025-08-14T21:33:41.1857332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1857694Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1858043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1858401Z layer_outputs = layer_module( 2025-08-14T21:33:41.1858732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1859074Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1859429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1859802Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1860162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1860508Z return func(*args, **kwargs) 2025-08-14T21:33:41.1860856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1861213Z self_outputs = self.self( 2025-08-14T21:33:41.1861551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1861897Z return func(*args, **kwargs) 2025-08-14T21:33:41.1862245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1862660Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1862836Z 2025-08-14T21:33:41.1862939Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1863288Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1863600Z return mod(**inputs) 2025-08-14T21:33:41.1863939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1864289Z outputs = self.bert( 2025-08-14T21:33:41.1864694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1865071Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1865415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1865774Z layer_outputs = layer_module( 2025-08-14T21:33:41.1866098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1866464Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1866829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1867192Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1867547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1867889Z return func(*args, **kwargs) 2025-08-14T21:33:41.1868221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1868620Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1869041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1869395Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1869532Z 2025-08-14T21:33:41.1869632Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1869970Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1870270Z return mod(**inputs) 2025-08-14T21:33:41.1870591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1870940Z outputs = self.bert( 2025-08-14T21:33:41.1871270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1871613Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1871960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1872303Z layer_outputs = layer_module( 2025-08-14T21:33:41.1872624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1872948Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1873312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1873680Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1874061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1874424Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1874810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1875239Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1875631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1875998Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1876132Z 2025-08-14T21:33:41.1876231Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1876571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1876869Z return mod(**inputs) 2025-08-14T21:33:41.1877205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1877559Z outputs = self.bert( 2025-08-14T21:33:41.1877886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1878244Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1878604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1878959Z layer_outputs = layer_module( 2025-08-14T21:33:41.1879276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1879671Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1880029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1880389Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1880753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1881118Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1881493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1881926Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1882319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1882706Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1883066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1883379Z return self.act(input) 2025-08-14T21:33:41.1883492Z 2025-08-14T21:33:41.1883589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1883922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1884225Z return mod(**inputs) 2025-08-14T21:33:41.1884556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1885029Z outputs = self.bert( 2025-08-14T21:33:41.1885375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1885736Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1886102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1886474Z layer_outputs = layer_module( 2025-08-14T21:33:41.1886796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1887125Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1887492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1887867Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1888244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1888623Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1889015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1889460Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1889869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1890237Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1890366Z 2025-08-14T21:33:41.1890474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1890817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1891119Z return mod(**inputs) 2025-08-14T21:33:41.1891455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1891815Z outputs = self.bert( 2025-08-14T21:33:41.1892146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1892513Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1892935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1893321Z layer_outputs = layer_module( 2025-08-14T21:33:41.1893639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1893978Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1894342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1894708Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1895077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1895462Z return func(*args, **kwargs) 2025-08-14T21:33:41.1895811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1896161Z self_outputs = self.self( 2025-08-14T21:33:41.1896510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1896865Z return func(*args, **kwargs) 2025-08-14T21:33:41.1897205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1897695Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1897951Z 2025-08-14T21:33:41.1898049Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1898398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1898698Z return mod(**inputs) 2025-08-14T21:33:41.1899043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1899387Z outputs = self.bert( 2025-08-14T21:33:41.1899719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1900066Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1900413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1900765Z layer_outputs = layer_module( 2025-08-14T21:33:41.1901077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1901413Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1901765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1902121Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1902461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1902538Z return func(*args, **kwargs) 2025-08-14T21:33:41.1902766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1902830Z self_outputs = self.self( 2025-08-14T21:33:41.1903055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1903119Z return func(*args, **kwargs) 2025-08-14T21:33:41.1903348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1903416Z self.key(current_states) 2025-08-14T21:33:41.1903420Z 2025-08-14T21:33:41.1903515Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1903706Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1903767Z return mod(**inputs) 2025-08-14T21:33:41.1904030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1904105Z outputs = self.bert( 2025-08-14T21:33:41.1904334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1904408Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1904683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1904755Z layer_outputs = layer_module( 2025-08-14T21:33:41.1904964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1905054Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1905285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1905362Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1905581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1905652Z return func(*args, **kwargs) 2025-08-14T21:33:41.1905872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1905936Z self_outputs = self.self( 2025-08-14T21:33:41.1906162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1906224Z return func(*args, **kwargs) 2025-08-14T21:33:41.1906453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1906519Z self.value(current_states) 2025-08-14T21:33:41.1906523Z 2025-08-14T21:33:41.1906598Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1906703Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1906886Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1906946Z return mod(**inputs) 2025-08-14T21:33:41.1907181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1907242Z outputs = self.bert( 2025-08-14T21:33:41.1907477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1907545Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1907771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1907845Z layer_outputs = layer_module( 2025-08-14T21:33:41.1908044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1908123Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1908349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1908422Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1908646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1908710Z return func(*args, **kwargs) 2025-08-14T21:33:41.1908935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1909008Z self_outputs = self.self( 2025-08-14T21:33:41.1909224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1909294Z return func(*args, **kwargs) 2025-08-14T21:33:41.1909545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1909683Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1909687Z 2025-08-14T21:33:41.1909790Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1909975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1910034Z return mod(**inputs) 2025-08-14T21:33:41.1910272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1910333Z outputs = self.bert( 2025-08-14T21:33:41.1910597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1910664Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1910892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1910969Z layer_outputs = layer_module( 2025-08-14T21:33:41.1911173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1911251Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1911477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1911551Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1911780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1911845Z return func(*args, **kwargs) 2025-08-14T21:33:41.1912071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1912199Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1912428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1912514Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1912517Z 2025-08-14T21:33:41.1912611Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1912791Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1912857Z return mod(**inputs) 2025-08-14T21:33:41.1913087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1913148Z outputs = self.bert( 2025-08-14T21:33:41.1913386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1913453Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1913688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1913757Z layer_outputs = layer_module( 2025-08-14T21:33:41.1913960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1914037Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1914260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1914344Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1914584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1914658Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1914922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1915034Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1915286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1915387Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1915390Z 2025-08-14T21:33:41.1915486Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1915678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1915738Z return mod(**inputs) 2025-08-14T21:33:41.1915966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1916053Z outputs = self.bert( 2025-08-14T21:33:41.1916287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1916363Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1916595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1916662Z layer_outputs = layer_module( 2025-08-14T21:33:41.1916876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1916949Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1917176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1917262Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1917502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1917582Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1917838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1917949Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1918189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1918295Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1918496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1918560Z return self.act(input) 2025-08-14T21:33:41.1918563Z 2025-08-14T21:33:41.1918656Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1918846Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1918907Z return mod(**inputs) 2025-08-14T21:33:41.1919137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1919204Z outputs = self.bert( 2025-08-14T21:33:41.1919435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1919511Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1919737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1919801Z layer_outputs = layer_module( 2025-08-14T21:33:41.1920012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1920083Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1920309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1920393Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1920632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1920709Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1921005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1921127Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1921361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1921432Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1921436Z 2025-08-14T21:33:41.1921538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1921718Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1921793Z return mod(**inputs) 2025-08-14T21:33:41.1922029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1922090Z outputs = self.bert( 2025-08-14T21:33:41.1922323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1922400Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1922626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1922699Z layer_outputs = layer_module( 2025-08-14T21:33:41.1922902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1922974Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1923213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1923287Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1923508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1923580Z return func(*args, **kwargs) 2025-08-14T21:33:41.1923808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1923880Z self_outputs = self.self( 2025-08-14T21:33:41.1924101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1924164Z return func(*args, **kwargs) 2025-08-14T21:33:41.1924399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1924590Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1924594Z 2025-08-14T21:33:41.1924697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1924881Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1924948Z return mod(**inputs) 2025-08-14T21:33:41.1925191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1925251Z outputs = self.bert( 2025-08-14T21:33:41.1925482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1925557Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1925783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1925857Z layer_outputs = layer_module( 2025-08-14T21:33:41.1926064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1926142Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1926409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1926501Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1926733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1926797Z return func(*args, **kwargs) 2025-08-14T21:33:41.1927021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1927094Z self_outputs = self.self( 2025-08-14T21:33:41.1927312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1927398Z return func(*args, **kwargs) 2025-08-14T21:33:41.1927632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1927697Z self.key(current_states) 2025-08-14T21:33:41.1927700Z 2025-08-14T21:33:41.1927803Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1927988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1928048Z return mod(**inputs) 2025-08-14T21:33:41.1928288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1928346Z outputs = self.bert( 2025-08-14T21:33:41.1928576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1928649Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1928877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1928948Z layer_outputs = layer_module( 2025-08-14T21:33:41.1929154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1929228Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1929464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1929537Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1929766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1929827Z return func(*args, **kwargs) 2025-08-14T21:33:41.1930052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1930123Z self_outputs = self.self( 2025-08-14T21:33:41.1930347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1930409Z return func(*args, **kwargs) 2025-08-14T21:33:41.1930647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1930716Z self.value(current_states) 2025-08-14T21:33:41.1930719Z 2025-08-14T21:33:41.1930801Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1930895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1931077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1931144Z return mod(**inputs) 2025-08-14T21:33:41.1931377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1931437Z outputs = self.bert( 2025-08-14T21:33:41.1931678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1931744Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1931978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1932095Z layer_outputs = layer_module( 2025-08-14T21:33:41.1932299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1932378Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1932601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1932676Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1932903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1932982Z return func(*args, **kwargs) 2025-08-14T21:33:41.1933215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1933278Z self_outputs = self.self( 2025-08-14T21:33:41.1933499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1933570Z return func(*args, **kwargs) 2025-08-14T21:33:41.1933799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1933926Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1933930Z 2025-08-14T21:33:41.1934023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1934204Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1934270Z return mod(**inputs) 2025-08-14T21:33:41.1934504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1934563Z outputs = self.bert( 2025-08-14T21:33:41.1934801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1934871Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1935105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1935170Z layer_outputs = layer_module( 2025-08-14T21:33:41.1935372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1935449Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1935672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1935747Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1935975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1936036Z return func(*args, **kwargs) 2025-08-14T21:33:41.1936272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1936392Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1936617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1936699Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1936702Z 2025-08-14T21:33:41.1936795Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1936983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1937043Z return mod(**inputs) 2025-08-14T21:33:41.1937273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1937341Z outputs = self.bert( 2025-08-14T21:33:41.1937600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1937684Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1937916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1937982Z layer_outputs = layer_module( 2025-08-14T21:33:41.1938191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1938261Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1938485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1938588Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1938829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1938908Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1939167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1939280Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1939513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1939588Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1939591Z 2025-08-14T21:33:41.1939685Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1939877Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1939938Z return mod(**inputs) 2025-08-14T21:33:41.1940173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1940232Z outputs = self.bert( 2025-08-14T21:33:41.1940463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1940538Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1940764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1940829Z layer_outputs = layer_module( 2025-08-14T21:33:41.1941038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1941111Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1941345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1941422Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1941659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1941735Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1941993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1942111Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1942336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1942439Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1942643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1942707Z return self.act(input) 2025-08-14T21:33:41.1942710Z 2025-08-14T21:33:41.1942805Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1942995Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1943054Z return mod(**inputs) 2025-08-14T21:33:41.1943316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1943394Z outputs = self.bert( 2025-08-14T21:33:41.1943623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1943696Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1943921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1943992Z layer_outputs = layer_module( 2025-08-14T21:33:41.1944193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1944283Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1944513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1944593Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1944904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1944986Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1945242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1945373Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1945600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1945679Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1945682Z 2025-08-14T21:33:41.1945785Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1945968Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1946037Z return mod(**inputs) 2025-08-14T21:33:41.1946271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1946332Z outputs = self.bert( 2025-08-14T21:33:41.1946568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1946636Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1946861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1946933Z layer_outputs = layer_module( 2025-08-14T21:33:41.1947137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1947217Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1947443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1947522Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1947753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1947816Z return func(*args, **kwargs) 2025-08-14T21:33:41.1948040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1948113Z self_outputs = self.self( 2025-08-14T21:33:41.1948335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1948407Z return func(*args, **kwargs) 2025-08-14T21:33:41.1948632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1948820Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1948824Z 2025-08-14T21:33:41.1948973Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1949155Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1949222Z return mod(**inputs) 2025-08-14T21:33:41.1949448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1949507Z outputs = self.bert( 2025-08-14T21:33:41.1949739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1949825Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1950049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1950122Z layer_outputs = layer_module( 2025-08-14T21:33:41.1950325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1950405Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1950630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1950704Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1950930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1950994Z return func(*args, **kwargs) 2025-08-14T21:33:41.1951218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1951290Z self_outputs = self.self( 2025-08-14T21:33:41.1951508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1951579Z return func(*args, **kwargs) 2025-08-14T21:33:41.1951807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1951873Z self.key(current_states) 2025-08-14T21:33:41.1951876Z 2025-08-14T21:33:41.1951977Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1952160Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1952227Z return mod(**inputs) 2025-08-14T21:33:41.1952455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1952516Z outputs = self.bert( 2025-08-14T21:33:41.1952755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1952823Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1953049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1953126Z layer_outputs = layer_module( 2025-08-14T21:33:41.1953326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1953405Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1953627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1953704Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1953930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1953994Z return func(*args, **kwargs) 2025-08-14T21:33:41.1954220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1954294Z self_outputs = self.self( 2025-08-14T21:33:41.1954547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1954634Z return func(*args, **kwargs) 2025-08-14T21:33:41.1954864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1954930Z self.value(current_states) 2025-08-14T21:33:41.1954934Z 2025-08-14T21:33:41.1955016Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1955107Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1955291Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1955377Z return mod(**inputs) 2025-08-14T21:33:41.1955605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1955673Z outputs = self.bert( 2025-08-14T21:33:41.1955901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1955969Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1956200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1956263Z layer_outputs = layer_module( 2025-08-14T21:33:41.1956471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1956542Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1956765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1956848Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1957066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1957129Z return func(*args, **kwargs) 2025-08-14T21:33:41.1957361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1957426Z self_outputs = self.self( 2025-08-14T21:33:41.1957651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1957714Z return func(*args, **kwargs) 2025-08-14T21:33:41.1957937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1958064Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1958068Z 2025-08-14T21:33:41.1958162Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1958350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1958410Z return mod(**inputs) 2025-08-14T21:33:41.1958641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1958708Z outputs = self.bert( 2025-08-14T21:33:41.1958936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1959003Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1959233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1959297Z layer_outputs = layer_module( 2025-08-14T21:33:41.1959505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1959578Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1959799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1959881Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1960125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1960232Z return func(*args, **kwargs) 2025-08-14T21:33:41.1960467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1960585Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1960814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1960889Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1960906Z 2025-08-14T21:33:41.1961002Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1961191Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1961249Z return mod(**inputs) 2025-08-14T21:33:41.1961487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1961549Z outputs = self.bert( 2025-08-14T21:33:41.1961776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1961850Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1962075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1962139Z layer_outputs = layer_module( 2025-08-14T21:33:41.1962348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1962421Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1962654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1962730Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1962973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1963053Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1963310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1963421Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1963654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1963728Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1963733Z 2025-08-14T21:33:41.1963833Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1964019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1964079Z return mod(**inputs) 2025-08-14T21:33:41.1964319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1964381Z outputs = self.bert( 2025-08-14T21:33:41.1964618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1964684Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1964907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1964980Z layer_outputs = layer_module( 2025-08-14T21:33:41.1965183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1965255Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1965487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1965614Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1965880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1965952Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1966208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1966326Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1966554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1966683Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1966878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1966942Z return self.act(input) 2025-08-14T21:33:41.1966946Z 2025-08-14T21:33:41.1967050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1967233Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1967292Z return mod(**inputs) 2025-08-14T21:33:41.1967529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1967589Z outputs = self.bert( 2025-08-14T21:33:41.1967824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1967891Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1968117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1968187Z layer_outputs = layer_module( 2025-08-14T21:33:41.1968389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1968462Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1968692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1968767Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1969011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1969081Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1969332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1969462Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1969687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1969769Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1969772Z 2025-08-14T21:33:41.1969869Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1970051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1970120Z return mod(**inputs) 2025-08-14T21:33:41.1970347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1970407Z outputs = self.bert( 2025-08-14T21:33:41.1970643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1970710Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1970942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1971006Z layer_outputs = layer_module( 2025-08-14T21:33:41.1971233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1971327Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1971553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1971629Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1971858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1971921Z return func(*args, **kwargs) 2025-08-14T21:33:41.1972152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1972234Z self_outputs = self.self( 2025-08-14T21:33:41.1972456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1972528Z return func(*args, **kwargs) 2025-08-14T21:33:41.1972756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1972954Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1972957Z 2025-08-14T21:33:41.1973053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1973235Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1973303Z return mod(**inputs) 2025-08-14T21:33:41.1973533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1973596Z outputs = self.bert( 2025-08-14T21:33:41.1973832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1973898Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1974133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1974199Z layer_outputs = layer_module( 2025-08-14T21:33:41.1974398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1974475Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1974697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1974780Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1974997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1975062Z return func(*args, **kwargs) 2025-08-14T21:33:41.1975291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1975354Z self_outputs = self.self( 2025-08-14T21:33:41.1975576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1975646Z return func(*args, **kwargs) 2025-08-14T21:33:41.1975868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.1975940Z self.key(current_states) 2025-08-14T21:33:41.1975943Z 2025-08-14T21:33:41.1976036Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1976216Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1976284Z return mod(**inputs) 2025-08-14T21:33:41.1976511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1976570Z outputs = self.bert( 2025-08-14T21:33:41.1976840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1976922Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1977156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1977219Z layer_outputs = layer_module( 2025-08-14T21:33:41.1977419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1977498Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1977720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1977819Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1978037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1978099Z return func(*args, **kwargs) 2025-08-14T21:33:41.1978329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1978394Z self_outputs = self.self( 2025-08-14T21:33:41.1978612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1978681Z return func(*args, **kwargs) 2025-08-14T21:33:41.1978903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.1978976Z self.value(current_states) 2025-08-14T21:33:41.1978980Z 2025-08-14T21:33:41.1979053Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.1979149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1979339Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1979398Z return mod(**inputs) 2025-08-14T21:33:41.1979626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1979696Z outputs = self.bert( 2025-08-14T21:33:41.1979922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1979998Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1980220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1980285Z layer_outputs = layer_module( 2025-08-14T21:33:41.1980493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1980573Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1980796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1980877Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1981100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1981170Z return func(*args, **kwargs) 2025-08-14T21:33:41.1981392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1981455Z self_outputs = self.self( 2025-08-14T21:33:41.1981679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1981741Z return func(*args, **kwargs) 2025-08-14T21:33:41.1981972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.1982093Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.1982097Z 2025-08-14T21:33:41.1982189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1982406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1982483Z return mod(**inputs) 2025-08-14T21:33:41.1982717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1982784Z outputs = self.bert( 2025-08-14T21:33:41.1983017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1983092Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1983322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1983407Z layer_outputs = layer_module( 2025-08-14T21:33:41.1983620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1983692Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1983922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1984007Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1984230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1984302Z return func(*args, **kwargs) 2025-08-14T21:33:41.1984537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.1984844Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.1985100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.1985183Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1985187Z 2025-08-14T21:33:41.1985296Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1985493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1985558Z return mod(**inputs) 2025-08-14T21:33:41.1985862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1985926Z outputs = self.bert( 2025-08-14T21:33:41.1986164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1986241Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1986476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1986556Z layer_outputs = layer_module( 2025-08-14T21:33:41.1986769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1986845Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1987097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1987179Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1987442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1987519Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1987791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1987918Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1988157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.1988238Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1988241Z 2025-08-14T21:33:41.1988408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1988628Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1988698Z return mod(**inputs) 2025-08-14T21:33:41.1988936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1989000Z outputs = self.bert( 2025-08-14T21:33:41.1989245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1989315Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1989575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1989651Z layer_outputs = layer_module( 2025-08-14T21:33:41.1989863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1989947Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1990186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1990265Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1990524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1990597Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1990872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.1990991Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.1991228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.1991346Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.1991553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.1991621Z return self.act(input) 2025-08-14T21:33:41.1991632Z 2025-08-14T21:33:41.1991730Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1991921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1991990Z return mod(**inputs) 2025-08-14T21:33:41.1992232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1992296Z outputs = self.bert( 2025-08-14T21:33:41.1992545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1992615Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1992857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1992929Z layer_outputs = layer_module( 2025-08-14T21:33:41.1993142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1993226Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1993461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.1993539Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.1993794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.1993868Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.1994141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.1994268Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.1994556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.1994646Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.1994649Z 2025-08-14T21:33:41.1994749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1994948Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1995011Z return mod(**inputs) 2025-08-14T21:33:41.1995251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1995346Z outputs = self.bert( 2025-08-14T21:33:41.1995591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1995660Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1995918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1995983Z layer_outputs = layer_module( 2025-08-14T21:33:41.1996195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1996266Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1996492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1996574Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1996798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1996863Z return func(*args, **kwargs) 2025-08-14T21:33:41.1997098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.1997162Z self_outputs = self.self( 2025-08-14T21:33:41.1997394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.1997460Z return func(*args, **kwargs) 2025-08-14T21:33:41.1997690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.1997888Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.1997891Z 2025-08-14T21:33:41.1997986Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.1998177Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.1998238Z return mod(**inputs) 2025-08-14T21:33:41.1998470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.1998537Z outputs = self.bert( 2025-08-14T21:33:41.1998773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.1998841Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.1999079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.1999143Z layer_outputs = layer_module( 2025-08-14T21:33:41.1999356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.1999426Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.1999654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.1999737Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.1999959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2000048Z return func(*args, **kwargs) 2025-08-14T21:33:41.2000295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.2000359Z self_outputs = self.self( 2025-08-14T21:33:41.2000584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2000646Z return func(*args, **kwargs) 2025-08-14T21:33:41.2000867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.2000939Z self.key(current_states) 2025-08-14T21:33:41.2000958Z 2025-08-14T21:33:41.2001053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2001240Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2001299Z return mod(**inputs) 2025-08-14T21:33:41.2001529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2001596Z outputs = self.bert( 2025-08-14T21:33:41.2001823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2001889Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2002118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2002182Z layer_outputs = layer_module( 2025-08-14T21:33:41.2002389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2002460Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2002683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.2002762Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.2002985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2003047Z return func(*args, **kwargs) 2025-08-14T21:33:41.2003275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.2003336Z self_outputs = self.self( 2025-08-14T21:33:41.2003561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2003624Z return func(*args, **kwargs) 2025-08-14T21:33:41.2003848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.2003920Z self.value(current_states) 2025-08-14T21:33:41.2003924Z 2025-08-14T21:33:41.2003997Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.2004096Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2004282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2004343Z return mod(**inputs) 2025-08-14T21:33:41.2004575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2004638Z outputs = self.bert( 2025-08-14T21:33:41.2004864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2004937Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2005161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2005232Z layer_outputs = layer_module( 2025-08-14T21:33:41.2005432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2005503Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2005783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.2005858Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.2006076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2006147Z return func(*args, **kwargs) 2025-08-14T21:33:41.2006368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.2006437Z self_outputs = self.self( 2025-08-14T21:33:41.2006674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2006736Z return func(*args, **kwargs) 2025-08-14T21:33:41.2006970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.2007095Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.2007098Z 2025-08-14T21:33:41.2007199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2007379Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2007439Z return mod(**inputs) 2025-08-14T21:33:41.2007676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2007734Z outputs = self.bert( 2025-08-14T21:33:41.2007961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2008039Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2008262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2008332Z layer_outputs = layer_module( 2025-08-14T21:33:41.2008538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2008609Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2008841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.2008915Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.2009135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2009207Z return func(*args, **kwargs) 2025-08-14T21:33:41.2009432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.2009559Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.2009787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.2009865Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.2009868Z 2025-08-14T21:33:41.2009968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2010147Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2010214Z return mod(**inputs) 2025-08-14T21:33:41.2010443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2010503Z outputs = self.bert( 2025-08-14T21:33:41.2010738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2010805Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2011027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2011130Z layer_outputs = layer_module( 2025-08-14T21:33:41.2011347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2011425Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2011649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.2011725Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.2011971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.2012060Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.2012314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.2012431Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.2012658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.2012740Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.2012743Z 2025-08-14T21:33:41.2012834Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2013018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2013084Z return mod(**inputs) 2025-08-14T21:33:41.2013313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2013379Z outputs = self.bert( 2025-08-14T21:33:41.2013609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2013672Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2013904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2013971Z layer_outputs = layer_module( 2025-08-14T21:33:41.2014171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2014275Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2014496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.2014579Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.2014816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.2014886Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.2015146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.2015253Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.2015484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.2015590Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.2015783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.2015852Z return self.act(input) 2025-08-14T21:33:41.2015855Z 2025-08-14T21:33:41.2015947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2016128Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2016195Z return mod(**inputs) 2025-08-14T21:33:41.2016421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2016486Z outputs = self.bert( 2025-08-14T21:33:41.2016741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2016830Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2017064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2017128Z layer_outputs = layer_module( 2025-08-14T21:33:41.2017329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2017408Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2017633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.2017737Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.2017974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.2018043Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.2018308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.2018430Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.2018664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.2018737Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.2018740Z 2025-08-14T21:33:41.2018831Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2019024Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2019085Z return mod(**inputs) 2025-08-14T21:33:41.2019314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2019382Z outputs = self.bert( 2025-08-14T21:33:41.2019614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2019689Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2019914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2019978Z layer_outputs = layer_module( 2025-08-14T21:33:41.2020187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2020258Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2020490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.2020566Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.2020787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2020859Z return func(*args, **kwargs) 2025-08-14T21:33:41.2021087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.2021151Z self_outputs = self.self( 2025-08-14T21:33:41.2021381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2021446Z return func(*args, **kwargs) 2025-08-14T21:33:41.2021679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 374, in forward 2025-08-14T21:33:41.2021869Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:33:41.2021874Z 2025-08-14T21:33:41.2021967Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2022158Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2022219Z return mod(**inputs) 2025-08-14T21:33:41.2022684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2022757Z outputs = self.bert( 2025-08-14T21:33:41.2022989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2023068Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2023296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2023362Z layer_outputs = layer_module( 2025-08-14T21:33:41.2023593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2023668Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2023902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.2023982Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.2024206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2024280Z return func(*args, **kwargs) 2025-08-14T21:33:41.2024505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.2024571Z self_outputs = self.self( 2025-08-14T21:33:41.2024863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2024935Z return func(*args, **kwargs) 2025-08-14T21:33:41.2025168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 402, in forward 2025-08-14T21:33:41.2025232Z self.key(current_states) 2025-08-14T21:33:41.2025236Z 2025-08-14T21:33:41.2025331Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2025525Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2025585Z return mod(**inputs) 2025-08-14T21:33:41.2025814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2025882Z outputs = self.bert( 2025-08-14T21:33:41.2026110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2026186Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2026410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2026480Z layer_outputs = layer_module( 2025-08-14T21:33:41.2026690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2026762Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2026998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.2027073Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.2027293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2027364Z return func(*args, **kwargs) 2025-08-14T21:33:41.2027588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.2027651Z self_outputs = self.self( 2025-08-14T21:33:41.2027881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2027944Z return func(*args, **kwargs) 2025-08-14T21:33:41.2028176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 407, in forward 2025-08-14T21:33:41.2028292Z self.value(current_states) 2025-08-14T21:33:41.2028296Z 2025-08-14T21:33:41.2028373Z cudagraph partition due to non gpu ops 2025-08-14T21:33:41.2028476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2028661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2028719Z return mod(**inputs) 2025-08-14T21:33:41.2028960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2029019Z outputs = self.bert( 2025-08-14T21:33:41.2029272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2029340Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2029564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2029637Z layer_outputs = layer_module( 2025-08-14T21:33:41.2029839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2029909Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2030136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.2030210Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.2030434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2030498Z return func(*args, **kwargs) 2025-08-14T21:33:41.2030722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 514, in forward 2025-08-14T21:33:41.2030791Z self_outputs = self.self( 2025-08-14T21:33:41.2031011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2031081Z return func(*args, **kwargs) 2025-08-14T21:33:41.2031308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 438, in forward 2025-08-14T21:33:41.2031430Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:33:41.2031433Z 2025-08-14T21:33:41.2031532Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2031711Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2031771Z return mod(**inputs) 2025-08-14T21:33:41.2032009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2032070Z outputs = self.bert( 2025-08-14T21:33:41.2032303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2032373Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2032598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2032671Z layer_outputs = layer_module( 2025-08-14T21:33:41.2032869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2032941Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2033171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 584, in forward 2025-08-14T21:33:41.2033246Z self_attention_outputs = self.attention( 2025-08-14T21:33:41.2033474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:33:41.2033536Z return func(*args, **kwargs) 2025-08-14T21:33:41.2033788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 524, in forward 2025-08-14T21:33:41.2033929Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:33:41.2034157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 461, in forward 2025-08-14T21:33:41.2034239Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.2034243Z 2025-08-14T21:33:41.2034335Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2034517Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2034583Z return mod(**inputs) 2025-08-14T21:33:41.2034831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2034890Z outputs = self.bert( 2025-08-14T21:33:41.2035129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2035201Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2035439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2035503Z layer_outputs = layer_module( 2025-08-14T21:33:41.2035707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2035787Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2036020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.2036108Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.2036359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.2036429Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.2036699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.2036812Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.2037043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 539, in forward 2025-08-14T21:33:41.2037125Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.2037128Z 2025-08-14T21:33:41.2037221Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2037415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2037477Z return mod(**inputs) 2025-08-14T21:33:41.2037713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2037783Z outputs = self.bert( 2025-08-14T21:33:41.2038021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2038098Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2038326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2038391Z layer_outputs = layer_module( 2025-08-14T21:33:41.2038602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2038673Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2038902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.2038987Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.2039227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.2039302Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.2039599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 622, in feed_forward_chunk 2025-08-14T21:33:41.2039725Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:33:41.2039957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 540, in forward 2025-08-14T21:33:41.2040059Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:33:41.2040253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:33:41.2040323Z return self.act(input) 2025-08-14T21:33:41.2040340Z 2025-08-14T21:33:41.2040436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2040627Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2040687Z return mod(**inputs) 2025-08-14T21:33:41.2040918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1767, in forward 2025-08-14T21:33:41.2040988Z outputs = self.bert( 2025-08-14T21:33:41.2041219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1028, in forward 2025-08-14T21:33:41.2041291Z encoder_outputs = self.encoder( 2025-08-14T21:33:41.2041518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 675, in forward 2025-08-14T21:33:41.2041584Z layer_outputs = layer_module( 2025-08-14T21:33:41.2041792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:33:41.2041865Z return super().__call__(*args, **kwargs) 2025-08-14T21:33:41.2042089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 614, in forward 2025-08-14T21:33:41.2042172Z layer_output = apply_chunking_to_forward( 2025-08-14T21:33:41.2042414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:33:41.2042490Z return forward_fn(*input_tensors) 2025-08-14T21:33:41.2042747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 623, in feed_forward_chunk 2025-08-14T21:33:41.2042868Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:33:41.2043103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 552, in forward 2025-08-14T21:33:41.2043178Z hidden_states = self.dense(hidden_states) 2025-08-14T21:33:41.2043182Z 2025-08-14T21:33:41.2043281Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2043465Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2043523Z return mod(**inputs) 2025-08-14T21:33:41.2043760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1781, in forward 2025-08-14T21:33:41.2043838Z logits = self.qa_outputs(sequence_output) 2025-08-14T21:33:41.2043841Z 2025-08-14T21:33:41.2043934Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2044124Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2044183Z return mod(**inputs) 2025-08-14T21:33:41.2044420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1799, in forward 2025-08-14T21:33:41.2044519Z start_loss = loss_fct(start_logits, start_positions) 2025-08-14T21:33:41.2044522Z 2025-08-14T21:33:41.2044613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:33:41.2044799Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:33:41.2044858Z return mod(**inputs) 2025-08-14T21:33:41.2045137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/bert/modeling_bert.py", line 1800, in forward 2025-08-14T21:33:41.2045225Z end_loss = loss_fct(end_logits, end_positions) 2025-08-14T21:33:41.2045229Z 2025-08-14T21:33:47.9220845Z Compilation time (from dynamo_timed): 12.428889979 2025-08-14T21:33:47.9228994Z pass 2025-08-14T21:33:47.9232665Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:33:47.9237392Z TIMING: _recursive_pre_grad_passes:0.00593 _recursive_joint_graph_passes:0.33869 _recursive_post_grad_passes:0.07803 async_compile.wait:0.00197 code_gen:5.77529 inductor_compile:6.86232 backend_compile:9.7036 gc:0.00012 entire_frame_compile:12.42889 total_wall_time:12.42889 2025-08-14T21:33:47.9238828Z STATS: call_* op count: 296 | FakeTensorMode.__torch_dispatch__:12371 | FakeTensor.__torch_dispatch__:4710 | ProxyTorchDispatchMode.__torch_dispatch__:4531 2025-08-14T21:33:47.9239358Z Dynamo produced 1 graphs covering 296 ops with 0 graph breaks (0 unique) 2025-08-14T21:33:52.0490030Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:33:52.0490879Z from pkg_resources import resource_filename 2025-08-14T21:33:52.5901143Z 2025-08-14T21:34:10.1151487Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:34:10.1155398Z loading model: 0it [00:17, ?it/s] 2025-08-14T21:34:10.1181975Z cpu eval BlenderbotForCausalLM 2025-08-14T21:34:10.3014468Z Compilation time (from dynamo_timed): 0 2025-08-14T21:34:10.3018382Z pass_due_to_skip 2025-08-14T21:34:10.3022837Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:10.3027105Z TIMING: total_wall_time:0 2025-08-14T21:34:10.3031193Z STATS: call_* op count: 0 2025-08-14T21:34:10.3033539Z Dynamo produced 0 graphs covering 0 ops with 0 graph breaks (0 unique) 2025-08-14T21:34:14.1187445Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:34:14.1189380Z from pkg_resources import resource_filename 2025-08-14T21:34:14.6644816Z 2025-08-14T21:34:15.4936808Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:34:15.4938352Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:34:15.4945338Z cpu eval BlenderbotSmallForCausalLM 2025-08-14T21:34:15.6302338Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:15.6719589Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:15.7112263Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:20.6872139Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6874134Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6879221Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6883972Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6884372Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6884893Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6885181Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6885399Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6885627Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6885993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6886316Z return mod(**inputs) 2025-08-14T21:34:20.6887070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6887571Z outputs = self.model.decoder( 2025-08-14T21:34:20.6887983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6888393Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6888715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6889066Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6889533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6890040Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6890576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:20.6891056Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:20.6891248Z 2025-08-14T21:34:20.6891354Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6891687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6891994Z return mod(**inputs) 2025-08-14T21:34:20.6892383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6892823Z outputs = self.model.decoder( 2025-08-14T21:34:20.6893355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6893755Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6894078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6894407Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6894813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6895237Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6895659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:20.6896057Z key_states = self.k_proj(current_states) 2025-08-14T21:34:20.6896193Z 2025-08-14T21:34:20.6896287Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6896626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6896963Z return mod(**inputs) 2025-08-14T21:34:20.6897477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6897890Z outputs = self.model.decoder( 2025-08-14T21:34:20.6898292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6898695Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6899021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6899356Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6899764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6900186Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6900661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:20.6901286Z value_states = self.v_proj(current_states) 2025-08-14T21:34:20.6901418Z 2025-08-14T21:34:20.6901501Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6901689Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6901879Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6902063Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6902268Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6902592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6902918Z return mod(**inputs) 2025-08-14T21:34:20.6903400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6903809Z outputs = self.model.decoder( 2025-08-14T21:34:20.6904220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6904715Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6905051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6905395Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6905819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6906312Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6906728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.6907157Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.6907571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:20.6908031Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:20.6908205Z 2025-08-14T21:34:20.6908302Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6908639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6908937Z return mod(**inputs) 2025-08-14T21:34:20.6909321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6909739Z outputs = self.model.decoder( 2025-08-14T21:34:20.6910174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6910584Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6910911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6911246Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6911660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6912095Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6912526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.6912950Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.6913369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:20.6913797Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:20.6913951Z 2025-08-14T21:34:20.6914055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6914426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6914789Z return mod(**inputs) 2025-08-14T21:34:20.6915190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6915607Z outputs = self.model.decoder( 2025-08-14T21:34:20.6916025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6916446Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6916791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6917126Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6917541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6917981Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6918408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:20.6918824Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:20.6918967Z 2025-08-14T21:34:20.6919061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6919385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6919674Z return mod(**inputs) 2025-08-14T21:34:20.6920054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6920456Z outputs = self.model.decoder( 2025-08-14T21:34:20.6920852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6921247Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6921566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6921893Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6922297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.6922735Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.6922902Z 2025-08-14T21:34:20.6922996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6923323Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6923609Z return mod(**inputs) 2025-08-14T21:34:20.6923993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6924398Z outputs = self.model.decoder( 2025-08-14T21:34:20.6924791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6925194Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6925520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6925858Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6926274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.6926711Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.6927069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:20.6927382Z return self.act(input) 2025-08-14T21:34:20.6927537Z 2025-08-14T21:34:20.6927635Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6927967Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6928270Z return mod(**inputs) 2025-08-14T21:34:20.6928653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6929053Z outputs = self.model.decoder( 2025-08-14T21:34:20.6929454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6929876Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6930194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6930518Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6930925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:20.6931338Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:20.6931464Z 2025-08-14T21:34:20.6931559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6931890Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6932187Z return mod(**inputs) 2025-08-14T21:34:20.6932568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6932968Z outputs = self.model.decoder( 2025-08-14T21:34:20.6933365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6933768Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6934085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6934409Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6934810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6935235Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6935648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:20.6936120Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:20.6936315Z 2025-08-14T21:34:20.6936409Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6936736Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6937026Z return mod(**inputs) 2025-08-14T21:34:20.6937413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6937821Z outputs = self.model.decoder( 2025-08-14T21:34:20.6938218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6938615Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6938932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6939266Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6939662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6940087Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6940547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:20.6940983Z key_states = self.k_proj(current_states) 2025-08-14T21:34:20.6941109Z 2025-08-14T21:34:20.6941203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6941533Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6941834Z return mod(**inputs) 2025-08-14T21:34:20.6942220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6942639Z outputs = self.model.decoder( 2025-08-14T21:34:20.6943035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6943437Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6943748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6944083Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6944487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6945013Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6945451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:20.6945888Z value_states = self.v_proj(current_states) 2025-08-14T21:34:20.6946032Z 2025-08-14T21:34:20.6946109Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6946311Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6946498Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6946696Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6946926Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6947267Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6947581Z return mod(**inputs) 2025-08-14T21:34:20.6947982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6948405Z outputs = self.model.decoder( 2025-08-14T21:34:20.6948812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6949234Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6949567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6949905Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6950325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6950773Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6951208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.6951640Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.6952060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:20.6952518Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:20.6952697Z 2025-08-14T21:34:20.6952801Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6953134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6953441Z return mod(**inputs) 2025-08-14T21:34:20.6953877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6954307Z outputs = self.model.decoder( 2025-08-14T21:34:20.6954712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6955125Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6955453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6955786Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6956219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6956657Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6957096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.6957527Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.6957943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:20.6958377Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:20.6958528Z 2025-08-14T21:34:20.6958638Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6958961Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6959258Z return mod(**inputs) 2025-08-14T21:34:20.6959641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6960038Z outputs = self.model.decoder( 2025-08-14T21:34:20.6960435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6960835Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6961154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6961478Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6961881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6962306Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6962719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:20.6963130Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:20.6963260Z 2025-08-14T21:34:20.6963354Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6963683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6963974Z return mod(**inputs) 2025-08-14T21:34:20.6964355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6964757Z outputs = self.model.decoder( 2025-08-14T21:34:20.6965152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6965547Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6965867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6966198Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6966590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.6967099Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.6967267Z 2025-08-14T21:34:20.6967362Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6967687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6967972Z return mod(**inputs) 2025-08-14T21:34:20.6968353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6968756Z outputs = self.model.decoder( 2025-08-14T21:34:20.6969168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6969560Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6969877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6970210Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6970602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.6971040Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.6971394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:20.6971706Z return self.act(input) 2025-08-14T21:34:20.6971807Z 2025-08-14T21:34:20.6971900Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6972226Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6972520Z return mod(**inputs) 2025-08-14T21:34:20.6972900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6973299Z outputs = self.model.decoder( 2025-08-14T21:34:20.6973698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6974100Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6974408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6974745Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6975147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:20.6975556Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:20.6975683Z 2025-08-14T21:34:20.6975775Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6976101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6976401Z return mod(**inputs) 2025-08-14T21:34:20.6976774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6977179Z outputs = self.model.decoder( 2025-08-14T21:34:20.6977572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6977987Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6978297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6978632Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6979036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6979461Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6979906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:20.6980399Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:20.6980587Z 2025-08-14T21:34:20.6980690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6981018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6981309Z return mod(**inputs) 2025-08-14T21:34:20.6981693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6982119Z outputs = self.model.decoder( 2025-08-14T21:34:20.6982508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6982912Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6983232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6983565Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6983964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6984387Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6985019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:20.6985443Z key_states = self.k_proj(current_states) 2025-08-14T21:34:20.6985569Z 2025-08-14T21:34:20.6985663Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6985996Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6986298Z return mod(**inputs) 2025-08-14T21:34:20.6986680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6987094Z outputs = self.model.decoder( 2025-08-14T21:34:20.6987496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6987902Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6988216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6988553Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6988962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6989389Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6989810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:20.6990225Z value_states = self.v_proj(current_states) 2025-08-14T21:34:20.6990356Z 2025-08-14T21:34:20.6990438Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6990629Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6990824Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6991015Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.6991230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6991558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6991858Z return mod(**inputs) 2025-08-14T21:34:20.6992246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6992646Z outputs = self.model.decoder( 2025-08-14T21:34:20.6993129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6993564Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6993887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6994214Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.6994623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.6995048Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.6995495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.6995931Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.6996349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:20.6996798Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:20.6996971Z 2025-08-14T21:34:20.6997069Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.6997404Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.6997708Z return mod(**inputs) 2025-08-14T21:34:20.6998096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.6998500Z outputs = self.model.decoder( 2025-08-14T21:34:20.6998904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.6999310Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.6999637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.6999970Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7000377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7000807Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7001224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7001651Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7002062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:20.7002482Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:20.7002632Z 2025-08-14T21:34:20.7002733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7003066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7003366Z return mod(**inputs) 2025-08-14T21:34:20.7003754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7004158Z outputs = self.model.decoder( 2025-08-14T21:34:20.7004558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7004969Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7005298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7005642Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7006104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7006546Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7007023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:20.7007445Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:20.7007573Z 2025-08-14T21:34:20.7007677Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7008012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7008334Z return mod(**inputs) 2025-08-14T21:34:20.7008730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7009154Z outputs = self.model.decoder( 2025-08-14T21:34:20.7009559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7009977Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7010308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7010651Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7011059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7011524Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7011688Z 2025-08-14T21:34:20.7011792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7012131Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7012434Z return mod(**inputs) 2025-08-14T21:34:20.7012832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7013254Z outputs = self.model.decoder( 2025-08-14T21:34:20.7013658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7014078Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7014408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7014751Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7015164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7015626Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7015994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:20.7016321Z return self.act(input) 2025-08-14T21:34:20.7016427Z 2025-08-14T21:34:20.7016524Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7016864Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7017175Z return mod(**inputs) 2025-08-14T21:34:20.7017567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7017988Z outputs = self.model.decoder( 2025-08-14T21:34:20.7018400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7018817Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7019147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7019481Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7019954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:20.7020367Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:20.7020493Z 2025-08-14T21:34:20.7020586Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7020916Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7021212Z return mod(**inputs) 2025-08-14T21:34:20.7021586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7022008Z outputs = self.model.decoder( 2025-08-14T21:34:20.7022406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7022808Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7023123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7023456Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7023858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7024275Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7024774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:20.7025273Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:20.7025469Z 2025-08-14T21:34:20.7025575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7025906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7026229Z return mod(**inputs) 2025-08-14T21:34:20.7026615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7027024Z outputs = self.model.decoder( 2025-08-14T21:34:20.7027418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7027825Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7028151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7028489Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7028887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7029315Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7029741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:20.7030143Z key_states = self.k_proj(current_states) 2025-08-14T21:34:20.7030274Z 2025-08-14T21:34:20.7030367Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7030693Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7030990Z return mod(**inputs) 2025-08-14T21:34:20.7031367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7031776Z outputs = self.model.decoder( 2025-08-14T21:34:20.7032173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7032577Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7032930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7033278Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7033683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7034103Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7034523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:20.7034956Z value_states = self.v_proj(current_states) 2025-08-14T21:34:20.7035087Z 2025-08-14T21:34:20.7035167Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7035356Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7035549Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7035741Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7035951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7036282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7036579Z return mod(**inputs) 2025-08-14T21:34:20.7036963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7037362Z outputs = self.model.decoder( 2025-08-14T21:34:20.7037759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7038164Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7038477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7038806Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7039212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7039637Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7040048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7040473Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7040884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:20.7041323Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:20.7041494Z 2025-08-14T21:34:20.7041589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7041920Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7042218Z return mod(**inputs) 2025-08-14T21:34:20.7042599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7043009Z outputs = self.model.decoder( 2025-08-14T21:34:20.7043406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7043810Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7044124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7044460Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7044867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7045291Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7045739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7046183Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7046588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:20.7047016Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:20.7047163Z 2025-08-14T21:34:20.7047255Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7047583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7047906Z return mod(**inputs) 2025-08-14T21:34:20.7048287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7048701Z outputs = self.model.decoder( 2025-08-14T21:34:20.7049106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7049517Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7049833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7050170Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7050580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7051010Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7051431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:20.7051847Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:20.7051974Z 2025-08-14T21:34:20.7052075Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7052405Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7052707Z return mod(**inputs) 2025-08-14T21:34:20.7053097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7053504Z outputs = self.model.decoder( 2025-08-14T21:34:20.7053899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7054303Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7054647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7054989Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7055403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7055866Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7056029Z 2025-08-14T21:34:20.7056133Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7056466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7056774Z return mod(**inputs) 2025-08-14T21:34:20.7057175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7057595Z outputs = self.model.decoder( 2025-08-14T21:34:20.7058005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7058435Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7058793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7059145Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7059542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7059985Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7060342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:20.7060646Z return self.act(input) 2025-08-14T21:34:20.7060755Z 2025-08-14T21:34:20.7060849Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7061211Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7061506Z return mod(**inputs) 2025-08-14T21:34:20.7061881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7062290Z outputs = self.model.decoder( 2025-08-14T21:34:20.7062684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7063074Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7063390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7063720Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7064121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:20.7064532Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:20.7064735Z 2025-08-14T21:34:20.7064837Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7065180Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7065495Z return mod(**inputs) 2025-08-14T21:34:20.7065883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7066305Z outputs = self.model.decoder( 2025-08-14T21:34:20.7066703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7067100Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7067422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7067761Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7068171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7068594Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7069025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:20.7069502Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:20.7069690Z 2025-08-14T21:34:20.7069793Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7070118Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7070414Z return mod(**inputs) 2025-08-14T21:34:20.7070798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7071205Z outputs = self.model.decoder( 2025-08-14T21:34:20.7071597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7072035Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7072369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7072692Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7073093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7073516Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7073932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:20.7074383Z key_states = self.k_proj(current_states) 2025-08-14T21:34:20.7074512Z 2025-08-14T21:34:20.7074606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7074934Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7075234Z return mod(**inputs) 2025-08-14T21:34:20.7075615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7076020Z outputs = self.model.decoder( 2025-08-14T21:34:20.7076419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7076815Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7077137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7077476Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7077883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7078304Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7078731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:20.7079149Z value_states = self.v_proj(current_states) 2025-08-14T21:34:20.7079277Z 2025-08-14T21:34:20.7079356Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7079546Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7079737Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7079926Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7080131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7080461Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7080760Z return mod(**inputs) 2025-08-14T21:34:20.7081136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7081546Z outputs = self.model.decoder( 2025-08-14T21:34:20.7081950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7082350Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7082667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7083001Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7083406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7083832Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7084250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7084820Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7085302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:20.7085782Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:20.7085977Z 2025-08-14T21:34:20.7086076Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7086415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7086723Z return mod(**inputs) 2025-08-14T21:34:20.7087108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7087567Z outputs = self.model.decoder( 2025-08-14T21:34:20.7087965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7088363Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7088675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7089005Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7089408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7089824Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7090247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7090672Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7091073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:20.7091480Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:20.7091635Z 2025-08-14T21:34:20.7091731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7092060Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7092351Z return mod(**inputs) 2025-08-14T21:34:20.7092721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7093123Z outputs = self.model.decoder( 2025-08-14T21:34:20.7093517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7093920Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7094232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7094564Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7094967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7095387Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7095806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:20.7096214Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:20.7096338Z 2025-08-14T21:34:20.7096437Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7096759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7097057Z return mod(**inputs) 2025-08-14T21:34:20.7097436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7097841Z outputs = self.model.decoder( 2025-08-14T21:34:20.7098263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7098684Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7099006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7099341Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7099752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7100204Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7100381Z 2025-08-14T21:34:20.7100483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7100808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7101113Z return mod(**inputs) 2025-08-14T21:34:20.7101509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7101921Z outputs = self.model.decoder( 2025-08-14T21:34:20.7102314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7102723Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7103048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7103464Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7103878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7104328Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7104757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:20.7105084Z return self.act(input) 2025-08-14T21:34:20.7105196Z 2025-08-14T21:34:20.7105292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7105633Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7105985Z return mod(**inputs) 2025-08-14T21:34:20.7106364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7106772Z outputs = self.model.decoder( 2025-08-14T21:34:20.7107177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7107588Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7107919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7108265Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7108687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:20.7109099Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:20.7109234Z 2025-08-14T21:34:20.7109332Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7109673Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7109980Z return mod(**inputs) 2025-08-14T21:34:20.7110365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7110784Z outputs = self.model.decoder( 2025-08-14T21:34:20.7111196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7111637Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7112007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7112359Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7112794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7113236Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7113681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:20.7114186Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:20.7114378Z 2025-08-14T21:34:20.7114483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7114821Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7115132Z return mod(**inputs) 2025-08-14T21:34:20.7115532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7115941Z outputs = self.model.decoder( 2025-08-14T21:34:20.7116350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7116762Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7117089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7117427Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7117839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7118342Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7118767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:20.7119171Z key_states = self.k_proj(current_states) 2025-08-14T21:34:20.7119303Z 2025-08-14T21:34:20.7119399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7119728Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7120020Z return mod(**inputs) 2025-08-14T21:34:20.7120402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7120807Z outputs = self.model.decoder( 2025-08-14T21:34:20.7121202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7121598Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7121920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7122252Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7122655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7123068Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7123487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:20.7123900Z value_states = self.v_proj(current_states) 2025-08-14T21:34:20.7124029Z 2025-08-14T21:34:20.7124101Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7124296Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7124487Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7124705Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7124928Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7125258Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7125555Z return mod(**inputs) 2025-08-14T21:34:20.7125933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7126339Z outputs = self.model.decoder( 2025-08-14T21:34:20.7126737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7127155Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7127468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7127799Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7128204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7128623Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7129043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7129471Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7129878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:20.7130312Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:20.7130488Z 2025-08-14T21:34:20.7130581Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7130904Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7131202Z return mod(**inputs) 2025-08-14T21:34:20.7131579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7132000Z outputs = self.model.decoder( 2025-08-14T21:34:20.7132625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7133083Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7133414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7133768Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7134193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7134633Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7135090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7135598Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7136023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:20.7136458Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:20.7136623Z 2025-08-14T21:34:20.7136721Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7137065Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7137383Z return mod(**inputs) 2025-08-14T21:34:20.7137780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7138206Z outputs = self.model.decoder( 2025-08-14T21:34:20.7138687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7139109Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7139446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7139816Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7140263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7140722Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7141166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:20.7141601Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:20.7141732Z 2025-08-14T21:34:20.7141841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7142181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7142497Z return mod(**inputs) 2025-08-14T21:34:20.7142901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7143325Z outputs = self.model.decoder( 2025-08-14T21:34:20.7143750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7144183Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7144788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7145309Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7145810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7146300Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7146470Z 2025-08-14T21:34:20.7146579Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7146922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7147236Z return mod(**inputs) 2025-08-14T21:34:20.7147640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7148108Z outputs = self.model.decoder( 2025-08-14T21:34:20.7148511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7148921Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7149242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7149571Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7149979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7150425Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7150784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:20.7151092Z return self.act(input) 2025-08-14T21:34:20.7151200Z 2025-08-14T21:34:20.7151297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7151625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7151911Z return mod(**inputs) 2025-08-14T21:34:20.7152331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7152764Z outputs = self.model.decoder( 2025-08-14T21:34:20.7153163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7153557Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7153874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7154205Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7154601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:20.7155024Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:20.7155157Z 2025-08-14T21:34:20.7155251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7155584Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7155877Z return mod(**inputs) 2025-08-14T21:34:20.7156261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7156666Z outputs = self.model.decoder( 2025-08-14T21:34:20.7157061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7157454Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7157771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7158102Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7158494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7158921Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7159342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:20.7159811Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:20.7159998Z 2025-08-14T21:34:20.7160092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7160419Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7160711Z return mod(**inputs) 2025-08-14T21:34:20.7161090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7161487Z outputs = self.model.decoder( 2025-08-14T21:34:20.7161883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7162286Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7162605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7162929Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7163329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7163752Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7164165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:20.7164573Z key_states = self.k_proj(current_states) 2025-08-14T21:34:20.7164704Z 2025-08-14T21:34:20.7164798Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7165167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7165476Z return mod(**inputs) 2025-08-14T21:34:20.7165861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7166266Z outputs = self.model.decoder( 2025-08-14T21:34:20.7166662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7167060Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7167377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7167727Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7168123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7168552Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7168978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:20.7169397Z value_states = self.v_proj(current_states) 2025-08-14T21:34:20.7169525Z 2025-08-14T21:34:20.7169600Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7169798Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7169990Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7170171Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7170387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7170717Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7171016Z return mod(**inputs) 2025-08-14T21:34:20.7171397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7171805Z outputs = self.model.decoder( 2025-08-14T21:34:20.7172205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7172601Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7172921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7173253Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7173657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7174076Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7174496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7174914Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7175321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:20.7175754Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:20.7175928Z 2025-08-14T21:34:20.7176024Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7176352Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7176650Z return mod(**inputs) 2025-08-14T21:34:20.7177027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7177436Z outputs = self.model.decoder( 2025-08-14T21:34:20.7177833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7178256Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7178592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7178922Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7179324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7179739Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7180157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7181422Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7181825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:20.7182236Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:20.7182395Z 2025-08-14T21:34:20.7182490Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7182819Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7183112Z return mod(**inputs) 2025-08-14T21:34:20.7183497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7183900Z outputs = self.model.decoder( 2025-08-14T21:34:20.7184299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7184972Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7185331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7185698Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7186173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7186597Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7187027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:20.7187442Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:20.7187568Z 2025-08-14T21:34:20.7187671Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7187994Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7188297Z return mod(**inputs) 2025-08-14T21:34:20.7188680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7189077Z outputs = self.model.decoder( 2025-08-14T21:34:20.7189475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7189877Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7190196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7190521Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7190927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7191371Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7191529Z 2025-08-14T21:34:20.7191631Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7191950Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7192248Z return mod(**inputs) 2025-08-14T21:34:20.7192703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7193132Z outputs = self.model.decoder( 2025-08-14T21:34:20.7193532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7193930Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7194249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7194573Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7195000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7195442Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7195794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:20.7196105Z return self.act(input) 2025-08-14T21:34:20.7196272Z 2025-08-14T21:34:20.7196406Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7196799Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7197282Z return mod(**inputs) 2025-08-14T21:34:20.7197744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7198241Z outputs = self.model.decoder( 2025-08-14T21:34:20.7198710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7199183Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7199552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7199954Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7211300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:20.7211753Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:20.7211889Z 2025-08-14T21:34:20.7212012Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7212354Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7212745Z return mod(**inputs) 2025-08-14T21:34:20.7213213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7213633Z outputs = self.model.decoder( 2025-08-14T21:34:20.7214041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7214455Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7214783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7215115Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7215528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7215960Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7216392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:20.7216861Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:20.7217059Z 2025-08-14T21:34:20.7217157Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7217606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7217941Z return mod(**inputs) 2025-08-14T21:34:20.7218327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7218741Z outputs = self.model.decoder( 2025-08-14T21:34:20.7219145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7219545Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7219874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7220249Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7220662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7221090Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7221524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:20.7221942Z key_states = self.k_proj(current_states) 2025-08-14T21:34:20.7222072Z 2025-08-14T21:34:20.7222177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7222508Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7222818Z return mod(**inputs) 2025-08-14T21:34:20.7223209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7223618Z outputs = self.model.decoder( 2025-08-14T21:34:20.7224027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7224432Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7224843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7225174Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7225582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7226009Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7226431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:20.7226845Z value_states = self.v_proj(current_states) 2025-08-14T21:34:20.7226984Z 2025-08-14T21:34:20.7227061Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7227266Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7227453Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7227652Z cudagraph partition due to non gpu ops 2025-08-14T21:34:20.7227872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7228207Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7228500Z return mod(**inputs) 2025-08-14T21:34:20.7228888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7229293Z outputs = self.model.decoder( 2025-08-14T21:34:20.7229684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7230089Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7230411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7230745Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7231199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7231629Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7232053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7232483Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7232887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:20.7233351Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:20.7233522Z 2025-08-14T21:34:20.7233626Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7233948Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7234253Z return mod(**inputs) 2025-08-14T21:34:20.7234636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7235041Z outputs = self.model.decoder( 2025-08-14T21:34:20.7235429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7235832Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7236154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7236484Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7236879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7237302Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7237726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:20.7238141Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:20.7238546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:20.7238967Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:20.7239114Z 2025-08-14T21:34:20.7239215Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7239536Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7239834Z return mod(**inputs) 2025-08-14T21:34:20.7240217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7240624Z outputs = self.model.decoder( 2025-08-14T21:34:20.7241015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7241415Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7241734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7242055Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7242456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:20.7242882Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:20.7243299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:20.7243698Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:20.7243832Z 2025-08-14T21:34:20.7243989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7244324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7244626Z return mod(**inputs) 2025-08-14T21:34:20.7245002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7245409Z outputs = self.model.decoder( 2025-08-14T21:34:20.7245814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7246230Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7246552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7246887Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7247301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7247747Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7247915Z 2025-08-14T21:34:20.7248012Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7248343Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7248640Z return mod(**inputs) 2025-08-14T21:34:20.7249018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7249431Z outputs = self.model.decoder( 2025-08-14T21:34:20.7249832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7250230Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7250565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7250901Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7251297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:20.7251742Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:20.7252097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:20.7252412Z return self.act(input) 2025-08-14T21:34:20.7252516Z 2025-08-14T21:34:20.7252610Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7252940Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7253236Z return mod(**inputs) 2025-08-14T21:34:20.7253615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1512, in forward 2025-08-14T21:34:20.7254024Z outputs = self.model.decoder( 2025-08-14T21:34:20.7254421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:20.7254824Z layer_outputs = decoder_layer( 2025-08-14T21:34:20.7255135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:20.7255469Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:20.7255877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:20.7256282Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:20.7256415Z 2025-08-14T21:34:20.7256508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7256866Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7257179Z return mod(**inputs) 2025-08-14T21:34:20.7257550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1528, in forward 2025-08-14T21:34:20.7258128Z logits = self.lm_head(outputs[0]) 2025-08-14T21:34:20.7258252Z 2025-08-14T21:34:20.7258356Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:20.7258685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:20.7258977Z return mod(**inputs) 2025-08-14T21:34:20.7259377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1534, in forward 2025-08-14T21:34:20.7259845Z loss = loss_fct(logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:34:20.7260025Z 2025-08-14T21:34:27.4272069Z Compilation time (from dynamo_timed): 10.827482017 2025-08-14T21:34:27.4295407Z pass 2025-08-14T21:34:27.4299309Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:27.4303996Z TIMING: _recursive_pre_grad_passes:0.00552 _recursive_joint_graph_passes:0.25896 _recursive_post_grad_passes:0.25793 async_compile.wait:0.65218 code_gen:6.47631 inductor_compile:7.46747 backend_compile:9.42217 gc:0.00084 entire_frame_compile:10.82748 total_wall_time:10.82748 2025-08-14T21:34:27.4304990Z STATS: call_* op count: 252 | FakeTensorMode.__torch_dispatch__:9096 | FakeTensor.__torch_dispatch__:3327 | ProxyTorchDispatchMode.__torch_dispatch__:3279 2025-08-14T21:34:27.4305479Z Dynamo produced 1 graphs covering 252 ops with 0 graph breaks (0 unique) 2025-08-14T21:34:31.6047369Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:34:31.6048415Z from pkg_resources import resource_filename 2025-08-14T21:34:32.4281316Z 2025-08-14T21:34:33.4007017Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:34:33.4008703Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:34:33.4020832Z cpu eval BlenderbotSmallForConditionalGeneration 2025-08-14T21:34:33.6271848Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:33.7071122Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:33.7823106Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:44.2570822Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2572484Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2572820Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2577917Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2579899Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2580238Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2584536Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2586540Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2586905Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2590235Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2594635Z return mod(**inputs) 2025-08-14T21:34:44.2599240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2600976Z outputs = self.model( 2025-08-14T21:34:44.2601611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2602497Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2606749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2610719Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2615428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2617271Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2617918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2618690Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2619168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.2619670Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.2619875Z 2025-08-14T21:34:44.2619980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2622342Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2622832Z return mod(**inputs) 2025-08-14T21:34:44.2623313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2623745Z outputs = self.model( 2025-08-14T21:34:44.2624161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2624807Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2625238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2625655Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2625998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2626344Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2626754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2627190Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2627622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.2628046Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.2628174Z 2025-08-14T21:34:44.2628279Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2628626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2628940Z return mod(**inputs) 2025-08-14T21:34:44.2629346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2629761Z outputs = self.model( 2025-08-14T21:34:44.2630168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2630581Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2630983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2631402Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2631735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2632069Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2633067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2633523Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2633942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.2634358Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.2634490Z 2025-08-14T21:34:44.2634569Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2634769Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2634964Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2635173Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2635390Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2635725Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2636029Z return mod(**inputs) 2025-08-14T21:34:44.2636414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2636817Z outputs = self.model( 2025-08-14T21:34:44.2637208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2637656Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2638059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2638473Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2638801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2639137Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2639548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2639978Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2640405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2640837Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2641243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.2641693Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.2641869Z 2025-08-14T21:34:44.2641976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2642315Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2642609Z return mod(**inputs) 2025-08-14T21:34:44.2643015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2643428Z outputs = self.model( 2025-08-14T21:34:44.2643809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2644221Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2644625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2645031Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2645354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2645691Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2646154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2646593Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2647003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2647426Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2647828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.2648247Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.2648404Z 2025-08-14T21:34:44.2648519Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2648851Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2649149Z return mod(**inputs) 2025-08-14T21:34:44.2649530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2649955Z outputs = self.model( 2025-08-14T21:34:44.2650339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2650744Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2651136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2651540Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2651866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2652196Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2652602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2653022Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2653439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.2653845Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.2653982Z 2025-08-14T21:34:44.2654077Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2654411Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2654709Z return mod(**inputs) 2025-08-14T21:34:44.2655084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2655485Z outputs = self.model( 2025-08-14T21:34:44.2655865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2656266Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2656663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2657064Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2657385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2657711Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2658117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2658572Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2658733Z 2025-08-14T21:34:44.2658836Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2659160Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2659511Z return mod(**inputs) 2025-08-14T21:34:44.2659895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2660287Z outputs = self.model( 2025-08-14T21:34:44.2660677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2661086Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2661488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2661907Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2662239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2662607Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2663013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2663469Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2663842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.2664163Z return self.act(input) 2025-08-14T21:34:44.2664270Z 2025-08-14T21:34:44.2664371Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2664785Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2665099Z return mod(**inputs) 2025-08-14T21:34:44.2665474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2665884Z outputs = self.model( 2025-08-14T21:34:44.2666279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2666691Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2667085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2667494Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2667822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2668150Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2668560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 309, in forward 2025-08-14T21:34:44.2668969Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.2669095Z 2025-08-14T21:34:44.2669198Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2669526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2669832Z return mod(**inputs) 2025-08-14T21:34:44.2670220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2670628Z outputs = self.model( 2025-08-14T21:34:44.2671009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2671421Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2671836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2672248Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2672566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2672943Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2673385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2673797Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2674214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.2674689Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.2674880Z 2025-08-14T21:34:44.2675009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2675332Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2675631Z return mod(**inputs) 2025-08-14T21:34:44.2676020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2676427Z outputs = self.model( 2025-08-14T21:34:44.2676802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2677208Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2677602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2677997Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2678316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2678649Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2679052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2679464Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2679879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.2680283Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.2680406Z 2025-08-14T21:34:44.2680508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2680830Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2681128Z return mod(**inputs) 2025-08-14T21:34:44.2681506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2681898Z outputs = self.model( 2025-08-14T21:34:44.2682279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2682683Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2683078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2683469Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2683787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2684114Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2684507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2685126Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2685549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.2685966Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.2686096Z 2025-08-14T21:34:44.2686294Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2686500Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2686693Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2686884Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2687095Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2687432Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2687734Z return mod(**inputs) 2025-08-14T21:34:44.2688109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2688546Z outputs = self.model( 2025-08-14T21:34:44.2688936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2689344Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2689739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2690146Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2690468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2690796Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2691205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2691632Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2692050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2692470Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2692884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.2693330Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.2693504Z 2025-08-14T21:34:44.2693609Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2693937Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2694241Z return mod(**inputs) 2025-08-14T21:34:44.2694630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2695027Z outputs = self.model( 2025-08-14T21:34:44.2695414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2695821Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2696224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2696622Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2696947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2697279Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2697684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2698097Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2698520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2698949Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2699390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.2699821Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.2699977Z 2025-08-14T21:34:44.2700071Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2700400Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2700692Z return mod(**inputs) 2025-08-14T21:34:44.2701077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2701495Z outputs = self.model( 2025-08-14T21:34:44.2701881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2702283Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2702687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2703092Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2703406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2703739Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2704146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2704567Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2705041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.2705463Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.2705595Z 2025-08-14T21:34:44.2705690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2706021Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2706319Z return mod(**inputs) 2025-08-14T21:34:44.2706703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2707104Z outputs = self.model( 2025-08-14T21:34:44.2707482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2707892Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2708293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2708700Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2709016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2709351Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2709761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2710209Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2710367Z 2025-08-14T21:34:44.2710463Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2710792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2711090Z return mod(**inputs) 2025-08-14T21:34:44.2711464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2711867Z outputs = self.model( 2025-08-14T21:34:44.2712251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2712713Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2713104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2713512Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2713831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2714164Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2714558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2715026Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2715387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.2715692Z return self.act(input) 2025-08-14T21:34:44.2715800Z 2025-08-14T21:34:44.2715899Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2716229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2716526Z return mod(**inputs) 2025-08-14T21:34:44.2716900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2717297Z outputs = self.model( 2025-08-14T21:34:44.2717679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2718084Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2718472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2718874Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2719197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2719523Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2719928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 309, in forward 2025-08-14T21:34:44.2720338Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.2720462Z 2025-08-14T21:34:44.2720562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2720882Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2721181Z return mod(**inputs) 2025-08-14T21:34:44.2721559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2721955Z outputs = self.model( 2025-08-14T21:34:44.2722329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2722734Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2723131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2723527Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2723846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2724175Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2724577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2724992Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2725454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.2725950Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.2726142Z 2025-08-14T21:34:44.2726244Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2726568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2726869Z return mod(**inputs) 2025-08-14T21:34:44.2727252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2727644Z outputs = self.model( 2025-08-14T21:34:44.2728051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2728455Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2728860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2729261Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2729590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2729926Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2730340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2730754Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2731175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.2731587Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.2731710Z 2025-08-14T21:34:44.2731813Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2732140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2732443Z return mod(**inputs) 2025-08-14T21:34:44.2732824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2733216Z outputs = self.model( 2025-08-14T21:34:44.2733600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2734010Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2734409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2734807Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2735131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2735466Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2735866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2736286Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2736703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.2737117Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.2737247Z 2025-08-14T21:34:44.2737321Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2737520Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2737715Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2737894Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2738107Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2738471Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2738787Z return mod(**inputs) 2025-08-14T21:34:44.2739162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2739565Z outputs = self.model( 2025-08-14T21:34:44.2739949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2740357Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2740748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2741170Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2741487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2741810Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2742217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2742637Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2743050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2743467Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2743874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.2744313Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.2744484Z 2025-08-14T21:34:44.2744587Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2744988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2745301Z return mod(**inputs) 2025-08-14T21:34:44.2745695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2746099Z outputs = self.model( 2025-08-14T21:34:44.2746494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2746912Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2747323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2747731Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2748064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2748406Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2748826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2749252Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2749675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2750107Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2750513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.2750941Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.2751097Z 2025-08-14T21:34:44.2751192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2751526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2751823Z return mod(**inputs) 2025-08-14T21:34:44.2752257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2752657Z outputs = self.model( 2025-08-14T21:34:44.2753038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2753438Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2753833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2754256Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2754570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2754902Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2755306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2755727Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2756133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.2756538Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.2756660Z 2025-08-14T21:34:44.2756763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2757088Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2757376Z return mod(**inputs) 2025-08-14T21:34:44.2757754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2758150Z outputs = self.model( 2025-08-14T21:34:44.2758526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2758931Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2759324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2759724Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2760037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2760367Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2760766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2761208Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2761368Z 2025-08-14T21:34:44.2761463Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2761792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2762090Z return mod(**inputs) 2025-08-14T21:34:44.2762460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2762863Z outputs = self.model( 2025-08-14T21:34:44.2763244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2763646Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2764035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2764441Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2764763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2765125Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2765540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2765984Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2766342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.2766654Z return self.act(input) 2025-08-14T21:34:44.2766764Z 2025-08-14T21:34:44.2766859Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2767207Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2767504Z return mod(**inputs) 2025-08-14T21:34:44.2767876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2768277Z outputs = self.model( 2025-08-14T21:34:44.2768661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2769065Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2769452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2769854Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2770170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2770493Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2770895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 309, in forward 2025-08-14T21:34:44.2771302Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.2771427Z 2025-08-14T21:34:44.2771530Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2771851Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2772146Z return mod(**inputs) 2025-08-14T21:34:44.2772521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2772911Z outputs = self.model( 2025-08-14T21:34:44.2773292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2773698Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2774091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2774482Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2774802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2775133Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2775534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2775942Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2776355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.2776824Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.2777014Z 2025-08-14T21:34:44.2777115Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2777436Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2777734Z return mod(**inputs) 2025-08-14T21:34:44.2778159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2778569Z outputs = self.model( 2025-08-14T21:34:44.2778949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2779355Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2779749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2780144Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2780482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2780816Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2781216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2781641Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2782057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.2782462Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.2782586Z 2025-08-14T21:34:44.2782680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2783011Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2783309Z return mod(**inputs) 2025-08-14T21:34:44.2783695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2784085Z outputs = self.model( 2025-08-14T21:34:44.2784467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2785074Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2785478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2785878Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2786201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2786540Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2786944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2787376Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2787794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.2788218Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.2788347Z 2025-08-14T21:34:44.2788421Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2788622Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2788818Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2788998Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2789210Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2789541Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2789843Z return mod(**inputs) 2025-08-14T21:34:44.2790227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2790627Z outputs = self.model( 2025-08-14T21:34:44.2791086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2791514Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2791910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2792314Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2792639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2792968Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2793375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2793832Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2794252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2794674Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2795082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.2795526Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.2795694Z 2025-08-14T21:34:44.2795795Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2796120Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2796422Z return mod(**inputs) 2025-08-14T21:34:44.2796808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2797206Z outputs = self.model( 2025-08-14T21:34:44.2797589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2797995Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2798394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2798801Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2799126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2799458Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2799855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2800276Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2800691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2801114Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2801515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.2801937Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.2802093Z 2025-08-14T21:34:44.2802188Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2802518Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2802808Z return mod(**inputs) 2025-08-14T21:34:44.2803189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2803592Z outputs = self.model( 2025-08-14T21:34:44.2803964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2804369Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2804820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2805227Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2805546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2805882Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2806289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2806728Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2807139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.2807547Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.2807672Z 2025-08-14T21:34:44.2807775Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2808106Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2808399Z return mod(**inputs) 2025-08-14T21:34:44.2808782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2809187Z outputs = self.model( 2025-08-14T21:34:44.2809562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2809971Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2810371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2810775Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2811092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2811427Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2811834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2812279Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2812437Z 2025-08-14T21:34:44.2812532Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2812863Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2813163Z return mod(**inputs) 2025-08-14T21:34:44.2813536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2813935Z outputs = self.model( 2025-08-14T21:34:44.2814322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2814731Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2815126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2815531Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2815852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2816182Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2816585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2817028Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2817387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.2817745Z return self.act(input) 2025-08-14T21:34:44.2817856Z 2025-08-14T21:34:44.2817953Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2818284Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2818586Z return mod(**inputs) 2025-08-14T21:34:44.2818959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2819359Z outputs = self.model( 2025-08-14T21:34:44.2819743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2820170Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2820570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2820976Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2821300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2821629Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2822036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 309, in forward 2025-08-14T21:34:44.2822446Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.2822574Z 2025-08-14T21:34:44.2822677Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2823000Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2823300Z return mod(**inputs) 2025-08-14T21:34:44.2823683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2824075Z outputs = self.model( 2025-08-14T21:34:44.2824470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2824947Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2825350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2825750Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2826076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2826419Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2826827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2827244Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2827674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.2828156Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.2828345Z 2025-08-14T21:34:44.2828440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2828776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2829079Z return mod(**inputs) 2025-08-14T21:34:44.2829466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2829866Z outputs = self.model( 2025-08-14T21:34:44.2830253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2830662Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2831098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2831515Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2831849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2832192Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2832597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2833028Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2833466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.2833875Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.2834001Z 2025-08-14T21:34:44.2834099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2834428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2834728Z return mod(**inputs) 2025-08-14T21:34:44.2835114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2835507Z outputs = self.model( 2025-08-14T21:34:44.2835887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2836290Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2836683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2837082Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2837404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2837739Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2838134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2838553Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2838968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.2839379Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.2839512Z 2025-08-14T21:34:44.2839585Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2839780Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2839972Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2840153Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2840367Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2840702Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2840999Z return mod(**inputs) 2025-08-14T21:34:44.2841370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2841769Z outputs = self.model( 2025-08-14T21:34:44.2842150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2842546Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2842944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2843343Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2843663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2844042Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2844449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2844868Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2845274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2845696Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2846103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.2846561Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.2846729Z 2025-08-14T21:34:44.2846823Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2847157Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2847458Z return mod(**inputs) 2025-08-14T21:34:44.2847843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2848237Z outputs = self.model( 2025-08-14T21:34:44.2848621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2849027Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2849424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2849822Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2850142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2850478Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2850880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2851301Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2852114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2852543Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2852942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.2853363Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.2853517Z 2025-08-14T21:34:44.2853613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2853945Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2854238Z return mod(**inputs) 2025-08-14T21:34:44.2854618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2855014Z outputs = self.model( 2025-08-14T21:34:44.2855387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2855791Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2856186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2856587Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2856899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2857233Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2857696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2858122Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2858532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.2858943Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.2859068Z 2025-08-14T21:34:44.2859169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2859500Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2859813Z return mod(**inputs) 2025-08-14T21:34:44.2860192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2860592Z outputs = self.model( 2025-08-14T21:34:44.2860969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2861375Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2861772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2862172Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2862484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2862818Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2863224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2863675Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2863833Z 2025-08-14T21:34:44.2863931Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2864264Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2864563Z return mod(**inputs) 2025-08-14T21:34:44.2865007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2865420Z outputs = self.model( 2025-08-14T21:34:44.2865810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2866219Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2866611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2867024Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2867354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2867683Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2868093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2868542Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2868902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.2869210Z return self.act(input) 2025-08-14T21:34:44.2869321Z 2025-08-14T21:34:44.2869417Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2869746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2870044Z return mod(**inputs) 2025-08-14T21:34:44.2870452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2870878Z outputs = self.model( 2025-08-14T21:34:44.2871257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2871651Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2872045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2872446Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2872762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2873108Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2873522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 309, in forward 2025-08-14T21:34:44.2873943Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.2874073Z 2025-08-14T21:34:44.2874177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2874508Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2874812Z return mod(**inputs) 2025-08-14T21:34:44.2875203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2875603Z outputs = self.model( 2025-08-14T21:34:44.2875989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2876404Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2876808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2877214Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2877542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2877880Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2878293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2878713Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2879138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.2879619Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.2879809Z 2025-08-14T21:34:44.2879906Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2880241Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2880549Z return mod(**inputs) 2025-08-14T21:34:44.2880942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2881342Z outputs = self.model( 2025-08-14T21:34:44.2881736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2882150Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2882555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2882961Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2883290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2883631Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2884072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2884515Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2885051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.2885471Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.2885597Z 2025-08-14T21:34:44.2885693Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2886033Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2886380Z return mod(**inputs) 2025-08-14T21:34:44.2886764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2887163Z outputs = self.model( 2025-08-14T21:34:44.2887550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2887962Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2888354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2888764Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2889086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2889424Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2889825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2890252Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2890674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.2891097Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.2891227Z 2025-08-14T21:34:44.2891303Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2891504Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2891697Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2891880Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2892096Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2892429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2892729Z return mod(**inputs) 2025-08-14T21:34:44.2893108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2893509Z outputs = self.model( 2025-08-14T21:34:44.2893892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2894292Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2894692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2895097Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2895419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2895744Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2896151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2896570Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2897069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2897529Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2897934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.2898373Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.2898542Z 2025-08-14T21:34:44.2898636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2898965Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2899323Z return mod(**inputs) 2025-08-14T21:34:44.2899712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2900108Z outputs = self.model( 2025-08-14T21:34:44.2900499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2900911Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2901317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2901728Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2902059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2902403Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2902814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2903251Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2903685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2904139Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2904543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.2905026Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.2905178Z 2025-08-14T21:34:44.2905280Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2905611Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2905903Z return mod(**inputs) 2025-08-14T21:34:44.2906296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2906700Z outputs = self.model( 2025-08-14T21:34:44.2907082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2907493Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2907896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2908301Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2908620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2908957Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2909362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2909784Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2910192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.2910637Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.2910779Z 2025-08-14T21:34:44.2910880Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2911203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2911499Z return mod(**inputs) 2025-08-14T21:34:44.2911881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2912277Z outputs = self.model( 2025-08-14T21:34:44.2912647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2913068Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2913466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2913869Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2914187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2914521Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2914928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2915366Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2915534Z 2025-08-14T21:34:44.2915630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2915959Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2916261Z return mod(**inputs) 2025-08-14T21:34:44.2916638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2917038Z outputs = self.model( 2025-08-14T21:34:44.2917428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2917836Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2918225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2918628Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2918950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2919280Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2919685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2920129Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2920491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.2920802Z return self.act(input) 2025-08-14T21:34:44.2920911Z 2025-08-14T21:34:44.2921007Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2921338Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2921636Z return mod(**inputs) 2025-08-14T21:34:44.2922011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2922412Z outputs = self.model( 2025-08-14T21:34:44.2922795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2923191Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2923630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2924054Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2924380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2924711Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2925116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 309, in forward 2025-08-14T21:34:44.2925526Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.2925653Z 2025-08-14T21:34:44.2925772Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2926097Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2926391Z return mod(**inputs) 2025-08-14T21:34:44.2926780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2927173Z outputs = self.model( 2025-08-14T21:34:44.2927556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2927961Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2928355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2928747Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2929066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2929404Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2929799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2930221Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2930641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.2931117Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.2931305Z 2025-08-14T21:34:44.2931400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2931730Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2932028Z return mod(**inputs) 2025-08-14T21:34:44.2932410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2932803Z outputs = self.model( 2025-08-14T21:34:44.2933190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2933598Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2933989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2934391Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2934710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2935040Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2935436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2935859Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2936282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.2936691Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.2936869Z 2025-08-14T21:34:44.2936968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2937305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2937612Z return mod(**inputs) 2025-08-14T21:34:44.2937991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2938396Z outputs = self.model( 2025-08-14T21:34:44.2938782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2939205Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2939591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2939994Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2940320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2940653Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2941047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2941463Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2941878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.2942294Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.2942421Z 2025-08-14T21:34:44.2942495Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2942692Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2942885Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2943065Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2943283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2943612Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2943901Z return mod(**inputs) 2025-08-14T21:34:44.2944282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2944681Z outputs = self.model( 2025-08-14T21:34:44.2945140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2945546Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2945949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2946357Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2946683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2947017Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2947427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2947850Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2948261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2948691Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2951929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.2952418Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.2952593Z 2025-08-14T21:34:44.2952726Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2953077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2953383Z return mod(**inputs) 2025-08-14T21:34:44.2953779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2954173Z outputs = self.model( 2025-08-14T21:34:44.2954560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2955003Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2955421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2955832Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2956158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2956497Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2956899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2957320Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2957744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2958176Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2958584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.2959010Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.2959161Z 2025-08-14T21:34:44.2959262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2959591Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2959894Z return mod(**inputs) 2025-08-14T21:34:44.2960284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2960686Z outputs = self.model( 2025-08-14T21:34:44.2961066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2961472Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2961876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2962283Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2962601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2962941Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2963350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2963764Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2964181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.2964589Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.2964714Z 2025-08-14T21:34:44.2964814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2965143Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2965491Z return mod(**inputs) 2025-08-14T21:34:44.2965891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2966313Z outputs = self.model( 2025-08-14T21:34:44.2966689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2967093Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2967490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2967884Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2968207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2968563Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2968971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2969415Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2969584Z 2025-08-14T21:34:44.2969680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2970012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2970308Z return mod(**inputs) 2025-08-14T21:34:44.2970684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2971084Z outputs = self.model( 2025-08-14T21:34:44.2971467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2971867Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2972268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2972673Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2972997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2973323Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2973731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.2974178Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.2974536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.2974845Z return self.act(input) 2025-08-14T21:34:44.2974952Z 2025-08-14T21:34:44.2975048Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2975381Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2975671Z return mod(**inputs) 2025-08-14T21:34:44.2976059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2976457Z outputs = self.model( 2025-08-14T21:34:44.2976840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2977238Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2977633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2978038Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2978349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2978730Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2979157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 309, in forward 2025-08-14T21:34:44.2979587Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.2979713Z 2025-08-14T21:34:44.2979810Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2980144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2980445Z return mod(**inputs) 2025-08-14T21:34:44.2980827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2981220Z outputs = self.model( 2025-08-14T21:34:44.2981622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2982028Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2982421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2982824Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2983142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2983472Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2983868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2984298Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2984920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.2985410Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.2985602Z 2025-08-14T21:34:44.2985699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2986035Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2986339Z return mod(**inputs) 2025-08-14T21:34:44.2986736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2987147Z outputs = self.model( 2025-08-14T21:34:44.2987546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2987972Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2988365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2988773Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2989098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2989439Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2989839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2990263Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2990683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.2991093Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.2991217Z 2025-08-14T21:34:44.2991313Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2991646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2991989Z return mod(**inputs) 2025-08-14T21:34:44.2992390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2992824Z outputs = self.model( 2025-08-14T21:34:44.2993217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2993625Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2994020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2994425Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2994749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2995107Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2995503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2995924Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2996341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.2996423Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.2996427Z 2025-08-14T21:34:44.2996502Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2996582Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2996653Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2996727Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.2996822Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2997007Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2997073Z return mod(**inputs) 2025-08-14T21:34:44.2997355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.2997419Z outputs = self.model( 2025-08-14T21:34:44.2997709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.2997777Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.2998061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.2998125Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.2998327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.2998408Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.2998682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.2998772Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.2999048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.2999139Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.2999410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.2999530Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.2999533Z 2025-08-14T21:34:44.2999634Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.2999814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.2999874Z return mod(**inputs) 2025-08-14T21:34:44.3000182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3000247Z outputs = self.model( 2025-08-14T21:34:44.3000560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.3000638Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.3000915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.3000987Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.3001188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3001259Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3001567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.3001650Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.3001928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3002025Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3002290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3002399Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3002402Z 2025-08-14T21:34:44.3002495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3002677Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3002745Z return mod(**inputs) 2025-08-14T21:34:44.3003027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3003095Z outputs = self.model( 2025-08-14T21:34:44.3003376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.3003443Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.3003730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.3003796Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.3004007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3004079Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3004357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 296, in forward 2025-08-14T21:34:44.3004447Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:34:44.3004724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3004800Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3004812Z 2025-08-14T21:34:44.3004905Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3005087Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3005154Z return mod(**inputs) 2025-08-14T21:34:44.3005433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3005494Z outputs = self.model( 2025-08-14T21:34:44.3005783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.3005868Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.3006170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.3006262Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.3006468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3006548Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3006826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.3006938Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3006948Z 2025-08-14T21:34:44.3007072Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3007254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3007321Z return mod(**inputs) 2025-08-14T21:34:44.3007606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3007667Z outputs = self.model( 2025-08-14T21:34:44.3007959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.3008024Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.3008308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.3008371Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.3008574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3008655Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3008932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 307, in forward 2025-08-14T21:34:44.3009042Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3009243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3009306Z return self.act(input) 2025-08-14T21:34:44.3009309Z 2025-08-14T21:34:44.3009407Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3009589Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3009647Z return mod(**inputs) 2025-08-14T21:34:44.3009936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3009998Z outputs = self.model( 2025-08-14T21:34:44.3010287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1195, in forward 2025-08-14T21:34:44.3010353Z encoder_outputs = self.encoder( 2025-08-14T21:34:44.3010634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 812, in forward 2025-08-14T21:34:44.3010707Z layer_outputs = encoder_layer( 2025-08-14T21:34:44.3010910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3010981Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3011266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 309, in forward 2025-08-14T21:34:44.3011343Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3011347Z 2025-08-14T21:34:44.3011446Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3011643Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3011705Z return mod(**inputs) 2025-08-14T21:34:44.3012008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3012086Z outputs = self.model( 2025-08-14T21:34:44.3012372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3012437Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3012717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3012789Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3013006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3013078Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3013362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3013455Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3013736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3013872Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3013875Z 2025-08-14T21:34:44.3013968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3014154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3014213Z return mod(**inputs) 2025-08-14T21:34:44.3014501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3014561Z outputs = self.model( 2025-08-14T21:34:44.3014839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3014913Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3015192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3015263Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3015463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3015533Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3015818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3015910Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3016186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3016267Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3016270Z 2025-08-14T21:34:44.3016361Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3016547Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3016605Z return mod(**inputs) 2025-08-14T21:34:44.3016882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3016949Z outputs = self.model( 2025-08-14T21:34:44.3017229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3017318Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3017618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3017702Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3017915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3017986Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3018268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3018365Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3018645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3018752Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3018757Z 2025-08-14T21:34:44.3018831Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3018905Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3018986Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3019056Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3019151Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3019341Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3019401Z return mod(**inputs) 2025-08-14T21:34:44.3019693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3019754Z outputs = self.model( 2025-08-14T21:34:44.3020038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3020115Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3020401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3020476Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3020681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3020754Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3021042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3021131Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3021411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3021509Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3021780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3021910Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3021915Z 2025-08-14T21:34:44.3022008Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3022190Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3022256Z return mod(**inputs) 2025-08-14T21:34:44.3022540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3022608Z outputs = self.model( 2025-08-14T21:34:44.3022890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3022958Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3023268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3023350Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3023567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3023647Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3023922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3024016Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3024294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3024402Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3024677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3024850Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3024860Z 2025-08-14T21:34:44.3024965Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3025148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3025207Z return mod(**inputs) 2025-08-14T21:34:44.3025501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3025563Z outputs = self.model( 2025-08-14T21:34:44.3025855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3025921Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3026203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3026278Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3026483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3026554Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3026843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3026931Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3027219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3027294Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3027298Z 2025-08-14T21:34:44.3027391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3027582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3027643Z return mod(**inputs) 2025-08-14T21:34:44.3027936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3027997Z outputs = self.model( 2025-08-14T21:34:44.3028281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3028354Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3028634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3028700Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3028912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3029002Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3029308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3029434Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3029708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3029852Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3029856Z 2025-08-14T21:34:44.3029948Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3030131Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3030207Z return mod(**inputs) 2025-08-14T21:34:44.3030497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3030565Z outputs = self.model( 2025-08-14T21:34:44.3030851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3030918Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3031213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3031277Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3031489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3031559Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3031845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3031953Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3032239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3032319Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3032322Z 2025-08-14T21:34:44.3032414Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3032595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3032662Z return mod(**inputs) 2025-08-14T21:34:44.3032949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3033019Z outputs = self.model( 2025-08-14T21:34:44.3033306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3033374Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3033670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3033734Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3033940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3034019Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3034302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3034405Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3034692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3034789Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3034793Z 2025-08-14T21:34:44.3034877Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3034963Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3035056Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3035125Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3035219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3035410Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3035469Z return mod(**inputs) 2025-08-14T21:34:44.3035750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3035819Z outputs = self.model( 2025-08-14T21:34:44.3036119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3036199Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3036485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3036556Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3036769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3036844Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3037126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3037234Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3037515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3037617Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3037885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3038013Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3038016Z 2025-08-14T21:34:44.3038122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3038309Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3038380Z return mod(**inputs) 2025-08-14T21:34:44.3038663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3038726Z outputs = self.model( 2025-08-14T21:34:44.3039019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3039092Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3039379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3039457Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3039663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3039745Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3040026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3040128Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3040417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3040510Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3040800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3040917Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3040938Z 2025-08-14T21:34:44.3041033Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3041225Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3041284Z return mod(**inputs) 2025-08-14T21:34:44.3041572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3041632Z outputs = self.model( 2025-08-14T21:34:44.3041912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3042007Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3042288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3042354Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3042567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3042637Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3042922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3043019Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3043296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3043378Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3043380Z 2025-08-14T21:34:44.3043474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3043662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3043724Z return mod(**inputs) 2025-08-14T21:34:44.3044003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3044072Z outputs = self.model( 2025-08-14T21:34:44.3044351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3044416Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3044704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3044772Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3044984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3045054Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3045333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3045455Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3045458Z 2025-08-14T21:34:44.3045551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3045739Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3045797Z return mod(**inputs) 2025-08-14T21:34:44.3046078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3046147Z outputs = self.model( 2025-08-14T21:34:44.3046445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3046515Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3046816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3046923Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3047133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3047205Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3047480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3047595Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3047812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3047886Z return self.act(input) 2025-08-14T21:34:44.3047889Z 2025-08-14T21:34:44.3047983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3048167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3048235Z return mod(**inputs) 2025-08-14T21:34:44.3048515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3048575Z outputs = self.model( 2025-08-14T21:34:44.3048862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3048929Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3049219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3049285Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3049485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3049568Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3049845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:44.3049928Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3049931Z 2025-08-14T21:34:44.3050024Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3050201Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3050270Z return mod(**inputs) 2025-08-14T21:34:44.3050553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3050615Z outputs = self.model( 2025-08-14T21:34:44.3050906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3050976Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3051266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3051330Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3051530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3051608Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3051885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3051984Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3052279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3052444Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3052461Z 2025-08-14T21:34:44.3052562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3052741Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3052807Z return mod(**inputs) 2025-08-14T21:34:44.3053087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3053147Z outputs = self.model( 2025-08-14T21:34:44.3053435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3053520Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3053806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3053880Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3054084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3054162Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3054440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3054529Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3054813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3054887Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3054890Z 2025-08-14T21:34:44.3054990Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3055173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3055235Z return mod(**inputs) 2025-08-14T21:34:44.3055528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3055590Z outputs = self.model( 2025-08-14T21:34:44.3055873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3055944Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3056226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3056302Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3056507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3056579Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3056867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3056957Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3057245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3057324Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3057328Z 2025-08-14T21:34:44.3057398Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3057475Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3057546Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3057612Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3057710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3057907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3057978Z return mod(**inputs) 2025-08-14T21:34:44.3058297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3058359Z outputs = self.model( 2025-08-14T21:34:44.3058645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3058710Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3058987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3059073Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3059275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3059352Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3059629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3059719Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3060003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3060091Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3060362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3060487Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3060491Z 2025-08-14T21:34:44.3060584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3060774Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3060832Z return mod(**inputs) 2025-08-14T21:34:44.3061113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3061182Z outputs = self.model( 2025-08-14T21:34:44.3061460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3061532Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3061813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3061880Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3062093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3062165Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3062452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3062542Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3062820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3062917Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3063181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3063286Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3063291Z 2025-08-14T21:34:44.3063383Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3063579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3063648Z return mod(**inputs) 2025-08-14T21:34:44.3063946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3064028Z outputs = self.model( 2025-08-14T21:34:44.3064316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3064381Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3064669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3064799Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3065032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3065115Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3065396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3065497Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3065776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3065851Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3065855Z 2025-08-14T21:34:44.3065966Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3066148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3066209Z return mod(**inputs) 2025-08-14T21:34:44.3066502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3066567Z outputs = self.model( 2025-08-14T21:34:44.3066859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3066928Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3067207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3067282Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3067485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3067565Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3067840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3067943Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3068232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3068371Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3068375Z 2025-08-14T21:34:44.3068479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3068658Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3068718Z return mod(**inputs) 2025-08-14T21:34:44.3069006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3069068Z outputs = self.model( 2025-08-14T21:34:44.3069346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3069423Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3069719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3069825Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3070028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3070099Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3070382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3070480Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3070766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3070856Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3070859Z 2025-08-14T21:34:44.3070951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3071141Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3071200Z return mod(**inputs) 2025-08-14T21:34:44.3071481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3071550Z outputs = self.model( 2025-08-14T21:34:44.3071830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3071902Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3072178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3072244Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3072454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3072522Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3072807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3072904Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3073183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3073269Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3073272Z 2025-08-14T21:34:44.3073341Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3073412Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3073488Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3073555Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3073655Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3073833Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3073895Z return mod(**inputs) 2025-08-14T21:34:44.3074182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3074242Z outputs = self.model( 2025-08-14T21:34:44.3074520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3074592Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3074870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3074941Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3075182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3075257Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3075560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3075676Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3075963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3076050Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3076314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3076463Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3076467Z 2025-08-14T21:34:44.3076559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3076741Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3076808Z return mod(**inputs) 2025-08-14T21:34:44.3077093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3077161Z outputs = self.model( 2025-08-14T21:34:44.3077445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3077512Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3077805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3077872Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3078085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3078156Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3078439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3078545Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3078826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3078922Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3079189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3079285Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3079290Z 2025-08-14T21:34:44.3079391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3079576Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3079636Z return mod(**inputs) 2025-08-14T21:34:44.3079930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3079993Z outputs = self.model( 2025-08-14T21:34:44.3080284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3080351Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3080634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3080710Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3080913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3081006Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3081299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3081417Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3081711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3081785Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3081788Z 2025-08-14T21:34:44.3081880Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3082067Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3082145Z return mod(**inputs) 2025-08-14T21:34:44.3082435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3082496Z outputs = self.model( 2025-08-14T21:34:44.3082773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3082848Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3083127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3083199Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3083401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3083470Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3083753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3083866Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3083869Z 2025-08-14T21:34:44.3083963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3084148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3084208Z return mod(**inputs) 2025-08-14T21:34:44.3084495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3084554Z outputs = self.model( 2025-08-14T21:34:44.3084952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3085033Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3085312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3085390Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3085595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3085669Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3085960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3086068Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3086266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3086339Z return self.act(input) 2025-08-14T21:34:44.3086342Z 2025-08-14T21:34:44.3086435Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3086626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3086686Z return mod(**inputs) 2025-08-14T21:34:44.3087008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3087102Z outputs = self.model( 2025-08-14T21:34:44.3087407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3087480Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3087758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3087825Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3088034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3088130Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3088410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:44.3088491Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3088495Z 2025-08-14T21:34:44.3088590Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3088782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3088842Z return mod(**inputs) 2025-08-14T21:34:44.3089119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3089188Z outputs = self.model( 2025-08-14T21:34:44.3089463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3089536Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3089817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3089882Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3090094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3090166Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3090443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3090541Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3090817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3090960Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3090964Z 2025-08-14T21:34:44.3091058Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3091238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3091303Z return mod(**inputs) 2025-08-14T21:34:44.3091582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3091651Z outputs = self.model( 2025-08-14T21:34:44.3091928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3091993Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3092279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3092346Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3092554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3092642Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3092939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3093053Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3093329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3093402Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3093406Z 2025-08-14T21:34:44.3093506Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3093686Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3093780Z return mod(**inputs) 2025-08-14T21:34:44.3094070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3094133Z outputs = self.model( 2025-08-14T21:34:44.3094429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3094495Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3094794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3094858Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3095064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3095144Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3095427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3095519Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3095815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3095896Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3095900Z 2025-08-14T21:34:44.3095981Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3096052Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3096121Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3096197Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3096291Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3096476Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3096542Z return mod(**inputs) 2025-08-14T21:34:44.3096833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3096903Z outputs = self.model( 2025-08-14T21:34:44.3097192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3097259Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3097551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3097616Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3097829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3097901Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3098186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3098282Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3098585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3098692Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3098982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3099105Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3099108Z 2025-08-14T21:34:44.3099208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3099390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3099450Z return mod(**inputs) 2025-08-14T21:34:44.3099740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3099820Z outputs = self.model( 2025-08-14T21:34:44.3100107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3100177Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3100455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3100528Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3100730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3100801Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3101085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3101175Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3101460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3101549Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3101810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3101917Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3101921Z 2025-08-14T21:34:44.3102013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3102202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3102261Z return mod(**inputs) 2025-08-14T21:34:44.3102538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3102608Z outputs = self.model( 2025-08-14T21:34:44.3102887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3102962Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3103245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3103309Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3103513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3103583Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3103857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3103952Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3104241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3104325Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3104329Z 2025-08-14T21:34:44.3104455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3104640Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3104706Z return mod(**inputs) 2025-08-14T21:34:44.3105078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3105153Z outputs = self.model( 2025-08-14T21:34:44.3105438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3105527Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3105820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3105884Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3106087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3106168Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3106444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3106550Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3106825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3106962Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3106966Z 2025-08-14T21:34:44.3107070Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3107252Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3107318Z return mod(**inputs) 2025-08-14T21:34:44.3107603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3107663Z outputs = self.model( 2025-08-14T21:34:44.3107952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3108018Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3108295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3108368Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3108569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3108648Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3108927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3109027Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3109309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3109380Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3109383Z 2025-08-14T21:34:44.3109484Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3109665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3109724Z return mod(**inputs) 2025-08-14T21:34:44.3110027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3110091Z outputs = self.model( 2025-08-14T21:34:44.3110391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3110476Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3110755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3110828Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3111028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3111098Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3111406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3111505Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3111792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3111872Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3111875Z 2025-08-14T21:34:44.3111946Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3112026Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3112095Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3112164Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3112264Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3112443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3112511Z return mod(**inputs) 2025-08-14T21:34:44.3112791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3112850Z outputs = self.model( 2025-08-14T21:34:44.3113135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3113200Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3113483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3113547Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3113746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3113823Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3114099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3114198Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3114481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3114570Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3114838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3114957Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3114960Z 2025-08-14T21:34:44.3115051Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3115237Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3115296Z return mod(**inputs) 2025-08-14T21:34:44.3115582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3115661Z outputs = self.model( 2025-08-14T21:34:44.3115956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3116052Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3116332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3116397Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3116607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3116676Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3116961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3117076Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3117353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3117451Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3117713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3117816Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3117819Z 2025-08-14T21:34:44.3117911Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3118089Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3118155Z return mod(**inputs) 2025-08-14T21:34:44.3118435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3118501Z outputs = self.model( 2025-08-14T21:34:44.3118780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3118847Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3119132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3119197Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3119395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3119472Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3119750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3119853Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3120131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3120205Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3120210Z 2025-08-14T21:34:44.3120309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3120484Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3120549Z return mod(**inputs) 2025-08-14T21:34:44.3120826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3120885Z outputs = self.model( 2025-08-14T21:34:44.3121169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3121233Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3121525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3121618Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3121838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3121914Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3122189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3122297Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3122301Z 2025-08-14T21:34:44.3122398Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3122577Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3122661Z return mod(**inputs) 2025-08-14T21:34:44.3122952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3123013Z outputs = self.model( 2025-08-14T21:34:44.3123311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3123376Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3123661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3123733Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3123938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3124018Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3124305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3124413Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3124622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3124686Z return self.act(input) 2025-08-14T21:34:44.3124690Z 2025-08-14T21:34:44.3124790Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3124973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3125033Z return mod(**inputs) 2025-08-14T21:34:44.3125324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3125385Z outputs = self.model( 2025-08-14T21:34:44.3125673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3125756Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3126044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3126118Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3126320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3126390Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3126682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:44.3126756Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3126759Z 2025-08-14T21:34:44.3126860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3127043Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3127119Z return mod(**inputs) 2025-08-14T21:34:44.3127422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3127500Z outputs = self.model( 2025-08-14T21:34:44.3127781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3127854Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3128132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3128203Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3128407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3128516Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3128802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3128893Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3129175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3129312Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3129316Z 2025-08-14T21:34:44.3129409Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3129597Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3129655Z return mod(**inputs) 2025-08-14T21:34:44.3129941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3130002Z outputs = self.model( 2025-08-14T21:34:44.3130280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3130356Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3130634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3130698Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3130907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3130978Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3131261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3131351Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3131625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3131709Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3131714Z 2025-08-14T21:34:44.3131807Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3131992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3132051Z return mod(**inputs) 2025-08-14T21:34:44.3132329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3132396Z outputs = self.model( 2025-08-14T21:34:44.3132676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3132743Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3133044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3133111Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3133351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3133425Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3133701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3133797Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3134072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3134178Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3134182Z 2025-08-14T21:34:44.3134253Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3134323Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3134400Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3134469Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3134561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3134747Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3134805Z return mod(**inputs) 2025-08-14T21:34:44.3135092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3135153Z outputs = self.model( 2025-08-14T21:34:44.3135428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3135504Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3135782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3135845Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3136056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3136127Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3136408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3136496Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3136769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3136864Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3137126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3137252Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3137255Z 2025-08-14T21:34:44.3137350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3137528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3137597Z return mod(**inputs) 2025-08-14T21:34:44.3137876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3137943Z outputs = self.model( 2025-08-14T21:34:44.3138224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3138291Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3138590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3138658Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3138911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3139008Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3139288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3139385Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3139666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3139754Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3140044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3140145Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3140149Z 2025-08-14T21:34:44.3140252Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3140440Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3140503Z return mod(**inputs) 2025-08-14T21:34:44.3140795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3140859Z outputs = self.model( 2025-08-14T21:34:44.3141143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3141219Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3141505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3141581Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3141787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3141863Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3142148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3142239Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3142526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3142604Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3142609Z 2025-08-14T21:34:44.3142706Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3142899Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3142964Z return mod(**inputs) 2025-08-14T21:34:44.3143249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3143320Z outputs = self.model( 2025-08-14T21:34:44.3143600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3143676Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3143963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3144031Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3144245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3144322Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3144626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3144826Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3145116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3145263Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3145267Z 2025-08-14T21:34:44.3145362Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3145551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3145612Z return mod(**inputs) 2025-08-14T21:34:44.3145916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3145989Z outputs = self.model( 2025-08-14T21:34:44.3146270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3146340Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3146632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3146699Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3146910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3146982Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3147261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3147372Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3147655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3147740Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3147744Z 2025-08-14T21:34:44.3147841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3148022Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3148091Z return mod(**inputs) 2025-08-14T21:34:44.3148370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3148432Z outputs = self.model( 2025-08-14T21:34:44.3148719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3148788Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3149072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3149139Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3149344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3149422Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3149701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3149808Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3150085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3150165Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3150168Z 2025-08-14T21:34:44.3150265Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3150339Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3150408Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3150519Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3150616Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3150805Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3150865Z return mod(**inputs) 2025-08-14T21:34:44.3151149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3151221Z outputs = self.model( 2025-08-14T21:34:44.3151505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3151587Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3151877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3151941Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3152153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3152225Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3152502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3152605Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3152882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3152981Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3153246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3153369Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3153375Z 2025-08-14T21:34:44.3153476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3153657Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3153723Z return mod(**inputs) 2025-08-14T21:34:44.3154004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3154065Z outputs = self.model( 2025-08-14T21:34:44.3154353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3154420Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3154702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3154776Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3154982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3155061Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3155341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3155437Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3155722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3155815Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3156102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3156202Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3156206Z 2025-08-14T21:34:44.3156340Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3156532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3156592Z return mod(**inputs) 2025-08-14T21:34:44.3156872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3156941Z outputs = self.model( 2025-08-14T21:34:44.3157220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3157314Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3157601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3157667Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3157884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3157956Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3158248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3158347Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3158630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3158711Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3158716Z 2025-08-14T21:34:44.3158809Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3158992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3159060Z return mod(**inputs) 2025-08-14T21:34:44.3159350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3159421Z outputs = self.model( 2025-08-14T21:34:44.3159707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3159774Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3160066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3160130Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3160347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3160419Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3160702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3160823Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3160826Z 2025-08-14T21:34:44.3160918Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3161111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3161170Z return mod(**inputs) 2025-08-14T21:34:44.3161462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3161530Z outputs = self.model( 2025-08-14T21:34:44.3161818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3161908Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3162214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3162296Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3162501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3162571Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3162846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3162961Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3163155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3163235Z return self.act(input) 2025-08-14T21:34:44.3163245Z 2025-08-14T21:34:44.3163340Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3163518Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3163588Z return mod(**inputs) 2025-08-14T21:34:44.3163870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3163930Z outputs = self.model( 2025-08-14T21:34:44.3164218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3164283Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3164566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3164630Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3164832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3164909Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3165188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:44.3165263Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3165273Z 2025-08-14T21:34:44.3165364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3165542Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3165606Z return mod(**inputs) 2025-08-14T21:34:44.3165884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3165945Z outputs = self.model( 2025-08-14T21:34:44.3166231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3166296Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3166584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3166650Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3166850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3166929Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3167205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3167296Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3167605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3167743Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3167746Z 2025-08-14T21:34:44.3167879Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3168062Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3168120Z return mod(**inputs) 2025-08-14T21:34:44.3168406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3168466Z outputs = self.model( 2025-08-14T21:34:44.3168751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3168833Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3169116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3169190Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3169395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3169475Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3169757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3169848Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3170136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3170209Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3170213Z 2025-08-14T21:34:44.3170307Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3170497Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3170557Z return mod(**inputs) 2025-08-14T21:34:44.3170851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3170914Z outputs = self.model( 2025-08-14T21:34:44.3171195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3171267Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3171549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3171620Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3171823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3171895Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3172181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3172272Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3172550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3172635Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3172638Z 2025-08-14T21:34:44.3172709Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3172787Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3172855Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3172924Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3173027Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3173226Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3173290Z return mod(**inputs) 2025-08-14T21:34:44.3173594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3173671Z outputs = self.model( 2025-08-14T21:34:44.3173967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3174032Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3174321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3174392Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3174609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3174688Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3174969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3175061Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3175348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3175436Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3175701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3175831Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3175834Z 2025-08-14T21:34:44.3175928Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3176117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3176178Z return mod(**inputs) 2025-08-14T21:34:44.3176460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3176530Z outputs = self.model( 2025-08-14T21:34:44.3176810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3176883Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3177166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3177229Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3177438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3177511Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3177798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3177888Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3178168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3178260Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3178526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3178623Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3178634Z 2025-08-14T21:34:44.3178729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3178912Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3178978Z return mod(**inputs) 2025-08-14T21:34:44.3179274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3179369Z outputs = self.model( 2025-08-14T21:34:44.3179659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3179723Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3180010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3180074Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3180275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3180380Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3180661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3180748Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3181036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3181111Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3181114Z 2025-08-14T21:34:44.3181212Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3181471Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3181550Z return mod(**inputs) 2025-08-14T21:34:44.3181983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3182069Z outputs = self.model( 2025-08-14T21:34:44.3182484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3182559Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3182854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3182930Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3183141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3183214Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3183508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3183612Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3183907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3184048Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3184054Z 2025-08-14T21:34:44.3184148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3184343Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3184402Z return mod(**inputs) 2025-08-14T21:34:44.3184989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3185060Z outputs = self.model( 2025-08-14T21:34:44.3185349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3185429Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3185769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3185849Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3186123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3186228Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3186532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3186633Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3186917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3187027Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3187031Z 2025-08-14T21:34:44.3187127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3187324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3187386Z return mod(**inputs) 2025-08-14T21:34:44.3187675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3187747Z outputs = self.model( 2025-08-14T21:34:44.3188034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3188110Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3188399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3188467Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3188681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3188756Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3189042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3189150Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3189436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3189523Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3189527Z 2025-08-14T21:34:44.3189600Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3189674Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3189753Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3189825Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3189920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3190112Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3190172Z return mod(**inputs) 2025-08-14T21:34:44.3190467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3190538Z outputs = self.model( 2025-08-14T21:34:44.3190965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3191042Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3191336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3191437Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3191662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3191752Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3192061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3192176Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3192459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3192556Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3192831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3192960Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3192979Z 2025-08-14T21:34:44.3193074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3193262Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3193330Z return mod(**inputs) 2025-08-14T21:34:44.3193615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3193680Z outputs = self.model( 2025-08-14T21:34:44.3193968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3194035Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3194325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3194390Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3194603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3194675Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3195023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3195132Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3195417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3195505Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3195781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3195881Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3195884Z 2025-08-14T21:34:44.3195984Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3196167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3196228Z return mod(**inputs) 2025-08-14T21:34:44.3196523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3196585Z outputs = self.model( 2025-08-14T21:34:44.3196879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3196947Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3197234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3197308Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3197516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3197591Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3197902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3198019Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3198330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3198408Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3198411Z 2025-08-14T21:34:44.3198506Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3198701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3198760Z return mod(**inputs) 2025-08-14T21:34:44.3199104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3199183Z outputs = self.model( 2025-08-14T21:34:44.3199462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3199537Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3199813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3199879Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3200087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3200156Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3200438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3200549Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3200552Z 2025-08-14T21:34:44.3200644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3200829Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3200891Z return mod(**inputs) 2025-08-14T21:34:44.3201179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3201240Z outputs = self.model( 2025-08-14T21:34:44.3201518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3201591Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3201868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3201933Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3202142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3202212Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3202497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3202605Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3202798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3202868Z return self.act(input) 2025-08-14T21:34:44.3202872Z 2025-08-14T21:34:44.3202964Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3203148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3203208Z return mod(**inputs) 2025-08-14T21:34:44.3203501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3203572Z outputs = self.model( 2025-08-14T21:34:44.3203865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3203957Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3204244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3204307Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3204511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3204581Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3204882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:44.3204965Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3204968Z 2025-08-14T21:34:44.3205060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3205248Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3205309Z return mod(**inputs) 2025-08-14T21:34:44.3205587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3205655Z outputs = self.model( 2025-08-14T21:34:44.3205935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3206002Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3206289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3206357Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3206565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3206635Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3206913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3207011Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3207287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3207429Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3207433Z 2025-08-14T21:34:44.3207525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3207704Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3207771Z return mod(**inputs) 2025-08-14T21:34:44.3208053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3208121Z outputs = self.model( 2025-08-14T21:34:44.3208399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3208463Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3208749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3208813Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3209012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3209090Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3209383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3209499Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3209794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3209864Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3209868Z 2025-08-14T21:34:44.3209963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3210144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3210207Z return mod(**inputs) 2025-08-14T21:34:44.3210486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3210564Z outputs = self.model( 2025-08-14T21:34:44.3210849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3210917Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3211199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3211270Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3211471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3211547Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3211820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3211910Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3212194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3212270Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3212276Z 2025-08-14T21:34:44.3212356Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3212425Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3212494Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3212569Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3212661Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3212840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3212906Z return mod(**inputs) 2025-08-14T21:34:44.3213182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3213252Z outputs = self.model( 2025-08-14T21:34:44.3213534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3213600Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3213888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3213954Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3214152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3214229Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3214504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3214598Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3214889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3214979Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3215266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3215403Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3215406Z 2025-08-14T21:34:44.3215507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3215685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3215744Z return mod(**inputs) 2025-08-14T21:34:44.3216029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3216109Z outputs = self.model( 2025-08-14T21:34:44.3216398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3216464Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3216745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3216815Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3217016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3217085Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3217368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3217458Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3217747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3217833Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3218098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3218203Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3218207Z 2025-08-14T21:34:44.3218298Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3218485Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3218545Z return mod(**inputs) 2025-08-14T21:34:44.3218835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3218907Z outputs = self.model( 2025-08-14T21:34:44.3219195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3219262Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3219559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3219627Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3219841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3219914Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3220202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3220302Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3220587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3220687Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3220691Z 2025-08-14T21:34:44.3220786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3221579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3221652Z return mod(**inputs) 2025-08-14T21:34:44.3221942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3222005Z outputs = self.model( 2025-08-14T21:34:44.3222300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3222367Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3222682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3222761Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3222963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3223046Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3223324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3223426Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3223703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3223838Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3223842Z 2025-08-14T21:34:44.3223940Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3224121Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3224185Z return mod(**inputs) 2025-08-14T21:34:44.3224463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3224525Z outputs = self.model( 2025-08-14T21:34:44.3224887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3224958Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3225238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3225309Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3225515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3225592Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3225874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3225974Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3226261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3226334Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3226338Z 2025-08-14T21:34:44.3226438Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3226615Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3226674Z return mod(**inputs) 2025-08-14T21:34:44.3226965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3227025Z outputs = self.model( 2025-08-14T21:34:44.3227322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3227438Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3227721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3227792Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3227993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3228064Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3228347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3228462Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3228749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3228831Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3228836Z 2025-08-14T21:34:44.3228910Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3228992Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3229063Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3229133Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3229236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3229420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3229489Z return mod(**inputs) 2025-08-14T21:34:44.3229772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3229837Z outputs = self.model( 2025-08-14T21:34:44.3230128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3230197Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3230481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3230555Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3230757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3230836Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3231115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3231214Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3231501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3231591Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3231865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3231986Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3231990Z 2025-08-14T21:34:44.3232083Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3232276Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3232339Z return mod(**inputs) 2025-08-14T21:34:44.3232621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3232693Z outputs = self.model( 2025-08-14T21:34:44.3233025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3233117Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3233410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3233473Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3233681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3233752Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3234035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3234149Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3234427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3234522Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3234785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3234887Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3234890Z 2025-08-14T21:34:44.3234982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3235162Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3235228Z return mod(**inputs) 2025-08-14T21:34:44.3235511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3235574Z outputs = self.model( 2025-08-14T21:34:44.3235865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3235931Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3236219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3236284Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3236486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3236565Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3236844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3236946Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3237225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3237297Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3237300Z 2025-08-14T21:34:44.3237400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3237580Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3237639Z return mod(**inputs) 2025-08-14T21:34:44.3237928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3237987Z outputs = self.model( 2025-08-14T21:34:44.3238273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3238339Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3238633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3238706Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3238923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3239016Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3239293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3239402Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3239405Z 2025-08-14T21:34:44.3239502Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3239680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3239755Z return mod(**inputs) 2025-08-14T21:34:44.3240051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3240111Z outputs = self.model( 2025-08-14T21:34:44.3240402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3240470Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3240754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3240823Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3241030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3241106Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3241390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3241502Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3241707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3241774Z return self.act(input) 2025-08-14T21:34:44.3241777Z 2025-08-14T21:34:44.3241878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3242060Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3242118Z return mod(**inputs) 2025-08-14T21:34:44.3242411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3242471Z outputs = self.model( 2025-08-14T21:34:44.3242758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3242834Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3243119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3243192Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3243397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3243467Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3243758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:44.3243831Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3243834Z 2025-08-14T21:34:44.3243932Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3244113Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3244173Z return mod(**inputs) 2025-08-14T21:34:44.3244482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3244545Z outputs = self.model( 2025-08-14T21:34:44.3244861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3244938Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3245218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3245290Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3245494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3245581Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3245868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3245959Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3246235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3246383Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3246386Z 2025-08-14T21:34:44.3246477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3246665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3246722Z return mod(**inputs) 2025-08-14T21:34:44.3247007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3247077Z outputs = self.model( 2025-08-14T21:34:44.3247354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3247425Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3247703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3247769Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3247975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3248046Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3248330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3248420Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3248699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3248780Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3248783Z 2025-08-14T21:34:44.3248875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3249054Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3249120Z return mod(**inputs) 2025-08-14T21:34:44.3249398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3249467Z outputs = self.model( 2025-08-14T21:34:44.3249744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3249811Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3250123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3250190Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3250414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3250501Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3250780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3250877Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3251152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3251228Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3251255Z 2025-08-14T21:34:44.3251330Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3251401Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3251478Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3251546Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3251636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3251829Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3251887Z return mod(**inputs) 2025-08-14T21:34:44.3252167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3252234Z outputs = self.model( 2025-08-14T21:34:44.3252512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3252586Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3252865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3252930Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3253138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3253209Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3253490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3253579Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3253857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3253951Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3254215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3254339Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3254349Z 2025-08-14T21:34:44.3254440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3254620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3254686Z return mod(**inputs) 2025-08-14T21:34:44.3254964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3255023Z outputs = self.model( 2025-08-14T21:34:44.3255310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3255373Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3255660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3255739Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3255943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3256057Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3256333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3256429Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3256703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3256790Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3257060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3257177Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3257182Z 2025-08-14T21:34:44.3257273Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3257462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3257524Z return mod(**inputs) 2025-08-14T21:34:44.3257814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3257875Z outputs = self.model( 2025-08-14T21:34:44.3258156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3258230Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3258508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3258582Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3258784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3258857Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3259142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3259232Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3259508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3259592Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3259595Z 2025-08-14T21:34:44.3259687Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3259877Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3259937Z return mod(**inputs) 2025-08-14T21:34:44.3260216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3260287Z outputs = self.model( 2025-08-14T21:34:44.3260568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3260641Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3260918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3260982Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3261191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3261264Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3261560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3261669Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3261959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3262117Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3262120Z 2025-08-14T21:34:44.3262213Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3262395Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3262461Z return mod(**inputs) 2025-08-14T21:34:44.3262747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3262834Z outputs = self.model( 2025-08-14T21:34:44.3263114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3263178Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3263467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3263531Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3263738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3263808Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3264085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3264190Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3264466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3264538Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3264547Z 2025-08-14T21:34:44.3264641Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3264899Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3264972Z return mod(**inputs) 2025-08-14T21:34:44.3265252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3265313Z outputs = self.model( 2025-08-14T21:34:44.3265598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3265667Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3265952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3266017Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3266219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3266298Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3266575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3266672Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3266957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3267033Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3267038Z 2025-08-14T21:34:44.3267118Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3267189Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3267279Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3267358Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3267466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3267661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3267728Z return mod(**inputs) 2025-08-14T21:34:44.3268008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3268075Z outputs = self.model( 2025-08-14T21:34:44.3268354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3268444Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3268731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3268793Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3269002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3269075Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3269349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3269449Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3269726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3269813Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3270084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3270205Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3270208Z 2025-08-14T21:34:44.3270304Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3270487Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3270545Z return mod(**inputs) 2025-08-14T21:34:44.3270831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3270891Z outputs = self.model( 2025-08-14T21:34:44.3271178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3271244Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3271524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3271596Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3271797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3271869Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3272154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3272250Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3272534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3272622Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3272887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3273007Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3273011Z 2025-08-14T21:34:44.3273103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3273305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3273381Z return mod(**inputs) 2025-08-14T21:34:44.3273662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3273730Z outputs = self.model( 2025-08-14T21:34:44.3274010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3274084Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3274384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3274452Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3274659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3274733Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3275012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3275115Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3275391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3275472Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3275476Z 2025-08-14T21:34:44.3275568Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3275748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3275817Z return mod(**inputs) 2025-08-14T21:34:44.3276099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3276165Z outputs = self.model( 2025-08-14T21:34:44.3276444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3276511Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3276798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3276861Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3277060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3277140Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3277420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3277537Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3277542Z 2025-08-14T21:34:44.3277633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3277814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3277880Z return mod(**inputs) 2025-08-14T21:34:44.3278159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3278226Z outputs = self.model( 2025-08-14T21:34:44.3278506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3278572Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3278876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3278958Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3279176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3279255Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3279535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3279651Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3279846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3279926Z return self.act(input) 2025-08-14T21:34:44.3279929Z 2025-08-14T21:34:44.3280029Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3280213Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3280280Z return mod(**inputs) 2025-08-14T21:34:44.3280563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3280625Z outputs = self.model( 2025-08-14T21:34:44.3280915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3280981Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3281264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3281335Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3281539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3281620Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3281903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:44.3281977Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3281980Z 2025-08-14T21:34:44.3282077Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3282260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3282325Z return mod(**inputs) 2025-08-14T21:34:44.3282607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3282668Z outputs = self.model( 2025-08-14T21:34:44.3282957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3283023Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3283306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3283377Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3283578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3283654Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3283933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3284020Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3284308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3284465Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3284469Z 2025-08-14T21:34:44.3284570Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3285052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3285145Z return mod(**inputs) 2025-08-14T21:34:44.3285449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3285513Z outputs = self.model( 2025-08-14T21:34:44.3285860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3285926Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3286230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3286306Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3286505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3286579Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3286863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3286951Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3287243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3287317Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3287321Z 2025-08-14T21:34:44.3287414Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3287608Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3287668Z return mod(**inputs) 2025-08-14T21:34:44.3287963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3288026Z outputs = self.model( 2025-08-14T21:34:44.3288313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3288389Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3288674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3288741Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3288951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3289026Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3289321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3289415Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3289700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3289788Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3289791Z 2025-08-14T21:34:44.3289864Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3289946Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3290017Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3290086Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3290186Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3290374Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3290433Z return mod(**inputs) 2025-08-14T21:34:44.3290752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3290857Z outputs = self.model( 2025-08-14T21:34:44.3291152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3291220Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3291502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3291574Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3291780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3291873Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3292168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3292258Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3292549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3292640Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3292914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3293044Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3293048Z 2025-08-14T21:34:44.3293140Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3293329Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3293389Z return mod(**inputs) 2025-08-14T21:34:44.3293680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3293751Z outputs = self.model( 2025-08-14T21:34:44.3294042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3294115Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3294403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3294469Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3294681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3294754Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3295041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3295138Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3295422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3295519Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3295792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3295891Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3295894Z 2025-08-14T21:34:44.3295995Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3296179Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3296248Z return mod(**inputs) 2025-08-14T21:34:44.3296551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3296617Z outputs = self.model( 2025-08-14T21:34:44.3296929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3297016Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3297310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3297384Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3297591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3297671Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3297978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 398, in forward 2025-08-14T21:34:44.3298070Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:34:44.3298362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3298437Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3298440Z 2025-08-14T21:34:44.3298541Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3298724Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3298784Z return mod(**inputs) 2025-08-14T21:34:44.3299073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3299137Z outputs = self.model( 2025-08-14T21:34:44.3299426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3299499Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3299778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3299849Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3300047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3300116Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3300398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3300495Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3300780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 199, in forward 2025-08-14T21:34:44.3300916Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:34:44.3300919Z 2025-08-14T21:34:44.3301009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3301199Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3301257Z return mod(**inputs) 2025-08-14T21:34:44.3301542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3301601Z outputs = self.model( 2025-08-14T21:34:44.3301877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3301946Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3302227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3302308Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3302535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3302625Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3302914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3303015Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3303298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 218, in forward 2025-08-14T21:34:44.3303381Z key_states = self.k_proj(current_states) 2025-08-14T21:34:44.3303385Z 2025-08-14T21:34:44.3303495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3303681Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3303742Z return mod(**inputs) 2025-08-14T21:34:44.3304023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3304093Z outputs = self.model( 2025-08-14T21:34:44.3304374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3304438Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3304774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3304848Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3305059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3305132Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3305412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3305519Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3305800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 219, in forward 2025-08-14T21:34:44.3305885Z value_states = self.v_proj(current_states) 2025-08-14T21:34:44.3305888Z 2025-08-14T21:34:44.3305959Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3306030Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3306107Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3306177Z cudagraph partition due to non gpu ops 2025-08-14T21:34:44.3306269Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3306460Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3306535Z return mod(**inputs) 2025-08-14T21:34:44.3306828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3306892Z outputs = self.model( 2025-08-14T21:34:44.3307173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3307246Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3307530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3307597Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3307807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3307878Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3308182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3308293Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3308586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3308682Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3308944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:34:44.3309070Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:34:44.3309073Z 2025-08-14T21:34:44.3309166Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3309362Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3309428Z return mod(**inputs) 2025-08-14T21:34:44.3309708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3309779Z outputs = self.model( 2025-08-14T21:34:44.3310055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3310120Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3310405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3310470Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3310668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3310748Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3311025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3311126Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3311405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 237, in forward 2025-08-14T21:34:44.3311492Z attn_output, attn_weights = attention_interface( 2025-08-14T21:34:44.3311760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:34:44.3311855Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:34:44.3311858Z 2025-08-14T21:34:44.3311956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3312133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3312194Z return mod(**inputs) 2025-08-14T21:34:44.3312481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3312539Z outputs = self.model( 2025-08-14T21:34:44.3312821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3312894Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3313171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3313241Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3313438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3313511Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3313809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 415, in forward 2025-08-14T21:34:44.3313906Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:34:44.3314213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 251, in forward 2025-08-14T21:34:44.3314305Z attn_output = self.out_proj(attn_output) 2025-08-14T21:34:44.3314308Z 2025-08-14T21:34:44.3314403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3314592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3314652Z return mod(**inputs) 2025-08-14T21:34:44.3314932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3315017Z outputs = self.model( 2025-08-14T21:34:44.3315297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3315370Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3315652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3315717Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3315926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3315997Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3316282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3316391Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3316396Z 2025-08-14T21:34:44.3316487Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3316675Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3316735Z return mod(**inputs) 2025-08-14T21:34:44.3317014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3317082Z outputs = self.model( 2025-08-14T21:34:44.3317361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3317431Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3317727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3317792Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3318007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3318080Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3318372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 430, in forward 2025-08-14T21:34:44.3318484Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:34:44.3318685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:34:44.3318757Z return self.act(input) 2025-08-14T21:34:44.3318760Z 2025-08-14T21:34:44.3318854Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3319045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3319106Z return mod(**inputs) 2025-08-14T21:34:44.3319396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1375, in forward 2025-08-14T21:34:44.3319468Z outputs = self.model( 2025-08-14T21:34:44.3319772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1213, in forward 2025-08-14T21:34:44.3319871Z decoder_outputs = self.decoder( 2025-08-14T21:34:44.3320169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1057, in forward 2025-08-14T21:34:44.3320234Z layer_outputs = decoder_layer( 2025-08-14T21:34:44.3320443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:34:44.3320515Z return super().__call__(*args, **kwargs) 2025-08-14T21:34:44.3320799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 432, in forward 2025-08-14T21:34:44.3320926Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:34:44.3320930Z 2025-08-14T21:34:44.3321028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3321224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3321286Z return mod(**inputs) 2025-08-14T21:34:44.3321576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1393, in forward 2025-08-14T21:34:44.3321697Z lm_logits = self.lm_head(outputs[0]) + self.final_logits_bias 2025-08-14T21:34:44.3321701Z 2025-08-14T21:34:44.3321794Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:34:44.3321986Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:34:44.3322053Z return mod(**inputs) 2025-08-14T21:34:44.3322333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/blenderbot_small/modeling_blenderbot_small.py", line 1398, in forward 2025-08-14T21:34:44.3322497Z masked_lm_loss = loss_fct(lm_logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:34:44.3322501Z 2025-08-14T21:34:52.5973403Z Compilation time (from dynamo_timed): 17.903342288 2025-08-14T21:34:52.5989119Z pass 2025-08-14T21:34:52.5992865Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:52.5997353Z TIMING: _recursive_pre_grad_passes:0.00891 _recursive_joint_graph_passes:0.51619 _recursive_post_grad_passes:0.1076 async_compile.wait:0.69762 code_gen:7.73274 inductor_compile:9.83696 backend_compile:14.48151 gc:0.00011 entire_frame_compile:17.90334 total_wall_time:17.90334 2025-08-14T21:34:52.5998817Z STATS: call_* op count: 652 | FakeTensorMode.__torch_dispatch__:22579 | FakeTensor.__torch_dispatch__:8019 | ProxyTorchDispatchMode.__torch_dispatch__:8304 2025-08-14T21:34:52.5999309Z Dynamo produced 1 graphs covering 652 ops with 0 graph breaks (0 unique) 2025-08-14T21:34:56.8996009Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:34:56.8997161Z from pkg_resources import resource_filename 2025-08-14T21:34:57.4489766Z 2025-08-14T21:34:58.7090855Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:34:58.7094913Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:34:58.7108365Z cpu eval CamemBert 2025-08-14T21:34:59.1616261Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:59.3738549Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:34:59.6400034Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:35:06.4377579Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4381533Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4383351Z return mod(**inputs) 2025-08-14T21:35:06.4384370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4388790Z outputs = self.roberta( 2025-08-14T21:35:06.4394320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 886, in forward 2025-08-14T21:35:06.4396040Z embedding_output = self.embeddings( 2025-08-14T21:35:06.4396635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 90, in forward 2025-08-14T21:35:06.4400361Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:35:06.4401128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1590, in create_position_ids_from_input_ids 2025-08-14T21:35:06.4405598Z mask = input_ids.ne(padding_idx).int() 2025-08-14T21:35:06.4410665Z 2025-08-14T21:35:06.4410968Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4411248Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4411541Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4411756Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4412054Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4412810Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4417791Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4421741Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4422025Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4422230Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4422421Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4422611Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4422835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4423197Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4423515Z return mod(**inputs) 2025-08-14T21:35:06.4423908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4424299Z outputs = self.roberta( 2025-08-14T21:35:06.4424676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 886, in forward 2025-08-14T21:35:06.4425183Z embedding_output = self.embeddings( 2025-08-14T21:35:06.4425571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 90, in forward 2025-08-14T21:35:06.4426089Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:35:06.4426653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1591, in create_position_ids_from_input_ids 2025-08-14T21:35:06.4427223Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:35:06.4427458Z 2025-08-14T21:35:06.4427563Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4427916Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4428235Z return mod(**inputs) 2025-08-14T21:35:06.4428622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4429012Z outputs = self.roberta( 2025-08-14T21:35:06.4429536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 886, in forward 2025-08-14T21:35:06.4430353Z embedding_output = self.embeddings( 2025-08-14T21:35:06.4430833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 90, in forward 2025-08-14T21:35:06.4443931Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:35:06.4444567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1591, in create_position_ids_from_input_ids 2025-08-14T21:35:06.4445135Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:35:06.4445364Z 2025-08-14T21:35:06.4445481Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4445827Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4446293Z return mod(**inputs) 2025-08-14T21:35:06.4446681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4447065Z outputs = self.roberta( 2025-08-14T21:35:06.4447439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4447829Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4448548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4448919Z layer_outputs = layer_module( 2025-08-14T21:35:06.4449257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4449602Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4449995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4450377Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4450737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4451090Z return func(*args, **kwargs) 2025-08-14T21:35:06.4451454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4451832Z self_outputs = self.self( 2025-08-14T21:35:06.4452174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4452524Z return func(*args, **kwargs) 2025-08-14T21:35:06.4452883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4453389Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4453637Z 2025-08-14T21:35:06.4453747Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4454087Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4454390Z return mod(**inputs) 2025-08-14T21:35:06.4454757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4455136Z outputs = self.roberta( 2025-08-14T21:35:06.4455491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4455866Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4456241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4456618Z layer_outputs = layer_module( 2025-08-14T21:35:06.4456974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4457319Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4457727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4458136Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4458491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4458837Z return func(*args, **kwargs) 2025-08-14T21:35:06.4459201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4459566Z self_outputs = self.self( 2025-08-14T21:35:06.4459899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4460287Z return func(*args, **kwargs) 2025-08-14T21:35:06.4460666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4461042Z self.key(current_states) 2025-08-14T21:35:06.4461164Z 2025-08-14T21:35:06.4461264Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4461610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4461915Z return mod(**inputs) 2025-08-14T21:35:06.4462283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4462667Z outputs = self.roberta( 2025-08-14T21:35:06.4463033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4463413Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4463799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4464182Z layer_outputs = layer_module( 2025-08-14T21:35:06.4464508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4464979Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4465360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4465743Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4466091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4466432Z return func(*args, **kwargs) 2025-08-14T21:35:06.4466803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4467178Z self_outputs = self.self( 2025-08-14T21:35:06.4467510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4467851Z return func(*args, **kwargs) 2025-08-14T21:35:06.4468215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4468587Z self.value(current_states) 2025-08-14T21:35:06.4468707Z 2025-08-14T21:35:06.4468784Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4469011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4469341Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4469632Z return mod(**inputs) 2025-08-14T21:35:06.4469988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4470363Z outputs = self.roberta( 2025-08-14T21:35:06.4470736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4471134Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4471523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4471898Z layer_outputs = layer_module( 2025-08-14T21:35:06.4472209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4472548Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4472927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4473324Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4473678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4474022Z return func(*args, **kwargs) 2025-08-14T21:35:06.4474390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4474760Z self_outputs = self.self( 2025-08-14T21:35:06.4475100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4475445Z return func(*args, **kwargs) 2025-08-14T21:35:06.4475807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4476232Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4476414Z 2025-08-14T21:35:06.4476514Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4476848Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4477144Z return mod(**inputs) 2025-08-14T21:35:06.4477506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4477889Z outputs = self.roberta( 2025-08-14T21:35:06.4478240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4478614Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4478984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4479348Z layer_outputs = layer_module( 2025-08-14T21:35:06.4479668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4480001Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4480380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4480755Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4481107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4481450Z return func(*args, **kwargs) 2025-08-14T21:35:06.4481814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4482236Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4482662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4483045Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4483177Z 2025-08-14T21:35:06.4483273Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4483622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4483926Z return mod(**inputs) 2025-08-14T21:35:06.4484299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4485725Z outputs = self.roberta( 2025-08-14T21:35:06.4486086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4486468Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4486840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4487207Z layer_outputs = layer_module( 2025-08-14T21:35:06.4487598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4487936Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4488310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4488700Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4489077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4489448Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4489856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4490309Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4490729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4491115Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4491244Z 2025-08-14T21:35:06.4491342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4491676Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4491980Z return mod(**inputs) 2025-08-14T21:35:06.4492328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4492699Z outputs = self.roberta( 2025-08-14T21:35:06.4493056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4493429Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4493793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4494166Z layer_outputs = layer_module( 2025-08-14T21:35:06.4494489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4494814Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4495190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4495576Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4495946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4496303Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4496704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4497145Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4497561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4498018Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4498399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4498735Z return self.act(input) 2025-08-14T21:35:06.4498839Z 2025-08-14T21:35:06.4498934Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4499272Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4499571Z return mod(**inputs) 2025-08-14T21:35:06.4499928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4500293Z outputs = self.roberta( 2025-08-14T21:35:06.4500649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4501050Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4501423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4501800Z layer_outputs = layer_module( 2025-08-14T21:35:06.4502126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4502460Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4502833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4503218Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4503593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4503960Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4504363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4504896Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4505332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4505718Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4505846Z 2025-08-14T21:35:06.4505942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4506273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4506577Z return mod(**inputs) 2025-08-14T21:35:06.4506925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4507303Z outputs = self.roberta( 2025-08-14T21:35:06.4507663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4508038Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4508402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4508779Z layer_outputs = layer_module( 2025-08-14T21:35:06.4509102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4509430Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4509959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4510349Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4510707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4511050Z return func(*args, **kwargs) 2025-08-14T21:35:06.4511440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4511821Z self_outputs = self.self( 2025-08-14T21:35:06.4512207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4512544Z return func(*args, **kwargs) 2025-08-14T21:35:06.4512908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4513408Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4513652Z 2025-08-14T21:35:06.4513747Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4514097Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4514397Z return mod(**inputs) 2025-08-14T21:35:06.4514750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4515114Z outputs = self.roberta( 2025-08-14T21:35:06.4515471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4515842Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4516208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4516571Z layer_outputs = layer_module( 2025-08-14T21:35:06.4516885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4517217Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4517584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4517967Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4518322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4518666Z return func(*args, **kwargs) 2025-08-14T21:35:06.4519021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4519390Z self_outputs = self.self( 2025-08-14T21:35:06.4519724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4520056Z return func(*args, **kwargs) 2025-08-14T21:35:06.4520419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4520793Z self.key(current_states) 2025-08-14T21:35:06.4520899Z 2025-08-14T21:35:06.4521004Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4521331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4521636Z return mod(**inputs) 2025-08-14T21:35:06.4521995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4522366Z outputs = self.roberta( 2025-08-14T21:35:06.4522714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4523089Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4523459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4523825Z layer_outputs = layer_module( 2025-08-14T21:35:06.4524145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4524493Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4524891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4525287Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4525643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4525993Z return func(*args, **kwargs) 2025-08-14T21:35:06.4526358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4526735Z self_outputs = self.self( 2025-08-14T21:35:06.4527076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4527438Z return func(*args, **kwargs) 2025-08-14T21:35:06.4527796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4528171Z self.value(current_states) 2025-08-14T21:35:06.4528281Z 2025-08-14T21:35:06.4528368Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4528586Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4528917Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4529217Z return mod(**inputs) 2025-08-14T21:35:06.4529573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4529938Z outputs = self.roberta( 2025-08-14T21:35:06.4530296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4530673Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4531046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4531413Z layer_outputs = layer_module( 2025-08-14T21:35:06.4531738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4532072Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4532443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4532824Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4533178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4533522Z return func(*args, **kwargs) 2025-08-14T21:35:06.4533882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4534255Z self_outputs = self.self( 2025-08-14T21:35:06.4534593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4534931Z return func(*args, **kwargs) 2025-08-14T21:35:06.4535294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4535731Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4535902Z 2025-08-14T21:35:06.4536009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4536336Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4536638Z return mod(**inputs) 2025-08-14T21:35:06.4537002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4537370Z outputs = self.roberta( 2025-08-14T21:35:06.4537748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4538147Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4538532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4538900Z layer_outputs = layer_module( 2025-08-14T21:35:06.4539221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4539556Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4539935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4540331Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4540690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4541035Z return func(*args, **kwargs) 2025-08-14T21:35:06.4541395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4541823Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4542251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4542637Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4542767Z 2025-08-14T21:35:06.4542863Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4543196Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4543503Z return mod(**inputs) 2025-08-14T21:35:06.4543861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4544230Z outputs = self.roberta( 2025-08-14T21:35:06.4544591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4545041Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4545407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4545786Z layer_outputs = layer_module( 2025-08-14T21:35:06.4546111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4546447Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4546819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4547212Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4547587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4547946Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4548353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4548804Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4549217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4549593Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4549729Z 2025-08-14T21:35:06.4549826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4550161Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4550463Z return mod(**inputs) 2025-08-14T21:35:06.4550831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4551204Z outputs = self.roberta( 2025-08-14T21:35:06.4551598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4551970Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4552340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4552716Z layer_outputs = layer_module( 2025-08-14T21:35:06.4553038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4553368Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4553763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4554145Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4554518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4554876Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4555277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4555718Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4556124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4556528Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4556880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4557192Z return self.act(input) 2025-08-14T21:35:06.4557295Z 2025-08-14T21:35:06.4557392Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4557727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4558029Z return mod(**inputs) 2025-08-14T21:35:06.4558384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4558748Z outputs = self.roberta( 2025-08-14T21:35:06.4559101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4559473Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4559832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4560208Z layer_outputs = layer_module( 2025-08-14T21:35:06.4560527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4560861Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4561227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4561613Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4561982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4562336Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4562732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4563183Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4563623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4564001Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4564133Z 2025-08-14T21:35:06.4564257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4564620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4564915Z return mod(**inputs) 2025-08-14T21:35:06.4565273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4565643Z outputs = self.roberta( 2025-08-14T21:35:06.4565998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4566369Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4566755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4567130Z layer_outputs = layer_module( 2025-08-14T21:35:06.4567444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4567782Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4568161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4568548Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4568895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4569242Z return func(*args, **kwargs) 2025-08-14T21:35:06.4569607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4569983Z self_outputs = self.self( 2025-08-14T21:35:06.4570310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4570651Z return func(*args, **kwargs) 2025-08-14T21:35:06.4571015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4571513Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4571764Z 2025-08-14T21:35:06.4571860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4572195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4572496Z return mod(**inputs) 2025-08-14T21:35:06.4572845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4573221Z outputs = self.roberta( 2025-08-14T21:35:06.4573580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4573953Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4574316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4574692Z layer_outputs = layer_module( 2025-08-14T21:35:06.4575012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4575340Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4575719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4576104Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4576460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4576796Z return func(*args, **kwargs) 2025-08-14T21:35:06.4577175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4577614Z self_outputs = self.self( 2025-08-14T21:35:06.4577942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4578284Z return func(*args, **kwargs) 2025-08-14T21:35:06.4578644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4579017Z self.key(current_states) 2025-08-14T21:35:06.4579122Z 2025-08-14T21:35:06.4579218Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4579550Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4579878Z return mod(**inputs) 2025-08-14T21:35:06.4580232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4580610Z outputs = self.roberta( 2025-08-14T21:35:06.4580970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4581349Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4581710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4582086Z layer_outputs = layer_module( 2025-08-14T21:35:06.4582407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4582738Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4583109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4583492Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4583842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4584176Z return func(*args, **kwargs) 2025-08-14T21:35:06.4584540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4585088Z self_outputs = self.self( 2025-08-14T21:35:06.4585422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4585759Z return func(*args, **kwargs) 2025-08-14T21:35:06.4586128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4586506Z self.value(current_states) 2025-08-14T21:35:06.4586616Z 2025-08-14T21:35:06.4586700Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4586922Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4587259Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4587563Z return mod(**inputs) 2025-08-14T21:35:06.4587909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4588287Z outputs = self.roberta( 2025-08-14T21:35:06.4588639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4589007Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4589368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4589739Z layer_outputs = layer_module( 2025-08-14T21:35:06.4590089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4590419Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4590815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4591219Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4591575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4591912Z return func(*args, **kwargs) 2025-08-14T21:35:06.4592275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4592647Z self_outputs = self.self( 2025-08-14T21:35:06.4592969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4593332Z return func(*args, **kwargs) 2025-08-14T21:35:06.4593696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4594128Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4594304Z 2025-08-14T21:35:06.4594400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4594738Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4595037Z return mod(**inputs) 2025-08-14T21:35:06.4595397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4595764Z outputs = self.roberta( 2025-08-14T21:35:06.4596125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4596501Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4596865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4597239Z layer_outputs = layer_module( 2025-08-14T21:35:06.4597566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4597903Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4598272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4598653Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4599004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4599340Z return func(*args, **kwargs) 2025-08-14T21:35:06.4599706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4600132Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4600555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4600936Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4601072Z 2025-08-14T21:35:06.4601168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4601499Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4601798Z return mod(**inputs) 2025-08-14T21:35:06.4602148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4602519Z outputs = self.roberta( 2025-08-14T21:35:06.4602878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4603259Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4603633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4604038Z layer_outputs = layer_module( 2025-08-14T21:35:06.4604357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4604680Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4605050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4605435Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4605805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4606181Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4606582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4607026Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4607440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4607819Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4607949Z 2025-08-14T21:35:06.4608044Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4608371Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4608660Z return mod(**inputs) 2025-08-14T21:35:06.4609011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4609376Z outputs = self.roberta( 2025-08-14T21:35:06.4609727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4610089Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4610457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4610831Z layer_outputs = layer_module( 2025-08-14T21:35:06.4611142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4611473Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4611850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4612234Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4612598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4612962Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4613364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4613810Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4614220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4614633Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4614988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4615294Z return self.act(input) 2025-08-14T21:35:06.4615403Z 2025-08-14T21:35:06.4615498Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4615832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4616129Z return mod(**inputs) 2025-08-14T21:35:06.4616493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4616872Z outputs = self.roberta( 2025-08-14T21:35:06.4617237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4617598Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4617964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4618335Z layer_outputs = layer_module( 2025-08-14T21:35:06.4618657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4619001Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4619381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4619770Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4620141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4620496Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4620895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4621349Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4621770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4622152Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4622287Z 2025-08-14T21:35:06.4622382Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4622717Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4623015Z return mod(**inputs) 2025-08-14T21:35:06.4623368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4623744Z outputs = self.roberta( 2025-08-14T21:35:06.4624099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4624463Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4624906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4625293Z layer_outputs = layer_module( 2025-08-14T21:35:06.4625613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4625956Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4626339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4626724Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4627071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4627409Z return func(*args, **kwargs) 2025-08-14T21:35:06.4627766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4628127Z self_outputs = self.self( 2025-08-14T21:35:06.4628455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4628794Z return func(*args, **kwargs) 2025-08-14T21:35:06.4629156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4629673Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4629919Z 2025-08-14T21:35:06.4630043Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4630368Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4630656Z return mod(**inputs) 2025-08-14T21:35:06.4631000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4631375Z outputs = self.roberta( 2025-08-14T21:35:06.4631731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4632116Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4632489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4632863Z layer_outputs = layer_module( 2025-08-14T21:35:06.4633184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4633514Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4633892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4634273Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4634626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4634962Z return func(*args, **kwargs) 2025-08-14T21:35:06.4635328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4635697Z self_outputs = self.self( 2025-08-14T21:35:06.4636021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4636360Z return func(*args, **kwargs) 2025-08-14T21:35:06.4636725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4637100Z self.key(current_states) 2025-08-14T21:35:06.4637207Z 2025-08-14T21:35:06.4637304Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4637641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4637941Z return mod(**inputs) 2025-08-14T21:35:06.4638289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4638659Z outputs = self.roberta( 2025-08-14T21:35:06.4639016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4639388Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4639751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4640124Z layer_outputs = layer_module( 2025-08-14T21:35:06.4640444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4640777Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4641146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4641527Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4641875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4642210Z return func(*args, **kwargs) 2025-08-14T21:35:06.4642590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4642960Z self_outputs = self.self( 2025-08-14T21:35:06.4643322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4643656Z return func(*args, **kwargs) 2025-08-14T21:35:06.4644019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4644394Z self.value(current_states) 2025-08-14T21:35:06.4644504Z 2025-08-14T21:35:06.4644581Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4644806Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4645140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4645458Z return mod(**inputs) 2025-08-14T21:35:06.4645809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4646972Z outputs = self.roberta( 2025-08-14T21:35:06.4647482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4647896Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4648305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4648731Z layer_outputs = layer_module( 2025-08-14T21:35:06.4649061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4649415Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4649829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4650240Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4650654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4651055Z return func(*args, **kwargs) 2025-08-14T21:35:06.4651480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4651923Z self_outputs = self.self( 2025-08-14T21:35:06.4652319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4652727Z return func(*args, **kwargs) 2025-08-14T21:35:06.4653148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4653655Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4653877Z 2025-08-14T21:35:06.4653988Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4654410Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4654785Z return mod(**inputs) 2025-08-14T21:35:06.4655185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4655647Z outputs = self.roberta( 2025-08-14T21:35:06.4656069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4656499Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4656937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4657381Z layer_outputs = layer_module( 2025-08-14T21:35:06.4657893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4658295Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4658784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4659259Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4659632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4659974Z return func(*args, **kwargs) 2025-08-14T21:35:06.4660349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4660789Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4661272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4661680Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4661823Z 2025-08-14T21:35:06.4661938Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4662285Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4662603Z return mod(**inputs) 2025-08-14T21:35:06.4662972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4663359Z outputs = self.roberta( 2025-08-14T21:35:06.4663729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4664110Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4664493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4665023Z layer_outputs = layer_module( 2025-08-14T21:35:06.4665362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4665707Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4666119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4666515Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4666902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4667270Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4667686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4668150Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4668577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4668971Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4669111Z 2025-08-14T21:35:06.4669213Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4669557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4669862Z return mod(**inputs) 2025-08-14T21:35:06.4670228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4670608Z outputs = self.roberta( 2025-08-14T21:35:06.4670963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4671349Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4671723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4672123Z layer_outputs = layer_module( 2025-08-14T21:35:06.4672465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4672829Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4673218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4673623Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4673985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4674349Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4674748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4675209Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4675633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4676054Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4676417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4676734Z return self.act(input) 2025-08-14T21:35:06.4676845Z 2025-08-14T21:35:06.4676944Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4677284Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4677588Z return mod(**inputs) 2025-08-14T21:35:06.4677940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4678319Z outputs = self.roberta( 2025-08-14T21:35:06.4678681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4679054Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4679431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4679814Z layer_outputs = layer_module( 2025-08-14T21:35:06.4680139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4680470Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4680854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4681242Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4681619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4681983Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4682390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4682854Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4683281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4683666Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4683802Z 2025-08-14T21:35:06.4683903Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4684239Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4684535Z return mod(**inputs) 2025-08-14T21:35:06.4685183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4685643Z outputs = self.roberta( 2025-08-14T21:35:06.4686029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4686443Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4686816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4687194Z layer_outputs = layer_module( 2025-08-14T21:35:06.4687513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4687851Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4688233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4688642Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4688988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4689334Z return func(*args, **kwargs) 2025-08-14T21:35:06.4689692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4690055Z self_outputs = self.self( 2025-08-14T21:35:06.4690387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4690724Z return func(*args, **kwargs) 2025-08-14T21:35:06.4691082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4691572Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4691824Z 2025-08-14T21:35:06.4691919Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4692250Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4692547Z return mod(**inputs) 2025-08-14T21:35:06.4692895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4693264Z outputs = self.roberta( 2025-08-14T21:35:06.4693615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4693976Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4694337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4694707Z layer_outputs = layer_module( 2025-08-14T21:35:06.4695022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4695344Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4695718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4696102Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4696451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4696782Z return func(*args, **kwargs) 2025-08-14T21:35:06.4697141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4697508Z self_outputs = self.self( 2025-08-14T21:35:06.4697830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4698168Z return func(*args, **kwargs) 2025-08-14T21:35:06.4698546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4698923Z self.key(current_states) 2025-08-14T21:35:06.4699028Z 2025-08-14T21:35:06.4699141Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4699511Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4699817Z return mod(**inputs) 2025-08-14T21:35:06.4700169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4700550Z outputs = self.roberta( 2025-08-14T21:35:06.4700910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4701292Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4701689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4702066Z layer_outputs = layer_module( 2025-08-14T21:35:06.4702388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4702723Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4703093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4703476Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4703825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4704160Z return func(*args, **kwargs) 2025-08-14T21:35:06.4704522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4704965Z self_outputs = self.self( 2025-08-14T21:35:06.4705307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4705642Z return func(*args, **kwargs) 2025-08-14T21:35:06.4706014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4706404Z self.value(current_states) 2025-08-14T21:35:06.4706517Z 2025-08-14T21:35:06.4706595Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4706830Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4707173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4707484Z return mod(**inputs) 2025-08-14T21:35:06.4707842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4708228Z outputs = self.roberta( 2025-08-14T21:35:06.4708600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4708979Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4709370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4709745Z layer_outputs = layer_module( 2025-08-14T21:35:06.4710064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4710391Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4710768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4711151Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4711506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4711843Z return func(*args, **kwargs) 2025-08-14T21:35:06.4712238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4712626Z self_outputs = self.self( 2025-08-14T21:35:06.4712964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4713304Z return func(*args, **kwargs) 2025-08-14T21:35:06.4713665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4714093Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4714263Z 2025-08-14T21:35:06.4714360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4714687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4715005Z return mod(**inputs) 2025-08-14T21:35:06.4715351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4715722Z outputs = self.roberta( 2025-08-14T21:35:06.4716079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4716451Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4716809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4717179Z layer_outputs = layer_module( 2025-08-14T21:35:06.4717498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4717828Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4718194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4718575Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4718922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4719254Z return func(*args, **kwargs) 2025-08-14T21:35:06.4719614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4720032Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4720448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4720822Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4720956Z 2025-08-14T21:35:06.4721052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4721382Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4721684Z return mod(**inputs) 2025-08-14T21:35:06.4722029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4722402Z outputs = self.roberta( 2025-08-14T21:35:06.4722753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4723118Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4723487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4723855Z layer_outputs = layer_module( 2025-08-14T21:35:06.4724169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4724496Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4724881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4725267Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4725646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4726030Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4726430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4726875Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4727284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4727672Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4727825Z 2025-08-14T21:35:06.4727922Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4728253Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4728543Z return mod(**inputs) 2025-08-14T21:35:06.4728898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4729270Z outputs = self.roberta( 2025-08-14T21:35:06.4729615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4729987Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4730350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4730716Z layer_outputs = layer_module( 2025-08-14T21:35:06.4731030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4731361Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4731736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4732123Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4732482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4732843Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4733239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4733669Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4734082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4734489Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4734841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4735150Z return self.act(input) 2025-08-14T21:35:06.4735258Z 2025-08-14T21:35:06.4735356Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4735687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4735984Z return mod(**inputs) 2025-08-14T21:35:06.4736332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4736701Z outputs = self.roberta( 2025-08-14T21:35:06.4737055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4737420Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4737803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4738179Z layer_outputs = layer_module( 2025-08-14T21:35:06.4738516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4738855Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4739229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4739606Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4739967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4740326Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4740721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4741195Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4741611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4741990Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4742121Z 2025-08-14T21:35:06.4742217Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4742547Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4742837Z return mod(**inputs) 2025-08-14T21:35:06.4743186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4743554Z outputs = self.roberta( 2025-08-14T21:35:06.4743901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4744274Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4744644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4745101Z layer_outputs = layer_module( 2025-08-14T21:35:06.4745423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4745762Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4746144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4746532Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4746885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4747237Z return func(*args, **kwargs) 2025-08-14T21:35:06.4747604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4747975Z self_outputs = self.self( 2025-08-14T21:35:06.4748315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4748659Z return func(*args, **kwargs) 2025-08-14T21:35:06.4749025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4749526Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4749780Z 2025-08-14T21:35:06.4749878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4750214Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4750518Z return mod(**inputs) 2025-08-14T21:35:06.4750887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4751261Z outputs = self.roberta( 2025-08-14T21:35:06.4751648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4752039Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4752409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4752784Z layer_outputs = layer_module( 2025-08-14T21:35:06.4753103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4753425Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4753801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4754199Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4754542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4754886Z return func(*args, **kwargs) 2025-08-14T21:35:06.4755249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4755617Z self_outputs = self.self( 2025-08-14T21:35:06.4755939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4756277Z return func(*args, **kwargs) 2025-08-14T21:35:06.4756636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4757004Z self.key(current_states) 2025-08-14T21:35:06.4757112Z 2025-08-14T21:35:06.4757208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4757542Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4757837Z return mod(**inputs) 2025-08-14T21:35:06.4758182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4758556Z outputs = self.roberta( 2025-08-14T21:35:06.4758909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4759278Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4759636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4760006Z layer_outputs = layer_module( 2025-08-14T21:35:06.4760322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4760646Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4761017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4761398Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4761746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4762077Z return func(*args, **kwargs) 2025-08-14T21:35:06.4762435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4762803Z self_outputs = self.self( 2025-08-14T21:35:06.4763132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4763462Z return func(*args, **kwargs) 2025-08-14T21:35:06.4763821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4764209Z self.value(current_states) 2025-08-14T21:35:06.4764320Z 2025-08-14T21:35:06.4764395Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4764631Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4764982Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4765284Z return mod(**inputs) 2025-08-14T21:35:06.4765634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4766007Z outputs = self.roberta( 2025-08-14T21:35:06.4766361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4766728Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4767117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4767491Z layer_outputs = layer_module( 2025-08-14T21:35:06.4767815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4768144Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4768518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4768898Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4769239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4769580Z return func(*args, **kwargs) 2025-08-14T21:35:06.4769941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4770314Z self_outputs = self.self( 2025-08-14T21:35:06.4770639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4770980Z return func(*args, **kwargs) 2025-08-14T21:35:06.4771343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4771773Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4771944Z 2025-08-14T21:35:06.4772040Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4772373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4772671Z return mod(**inputs) 2025-08-14T21:35:06.4773019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4773392Z outputs = self.roberta( 2025-08-14T21:35:06.4773750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4774119Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4774479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4774853Z layer_outputs = layer_module( 2025-08-14T21:35:06.4775174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4775503Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4775871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4776248Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4776595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4776927Z return func(*args, **kwargs) 2025-08-14T21:35:06.4777305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4777751Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4778192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4778568Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4778705Z 2025-08-14T21:35:06.4778802Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4779136Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4779435Z return mod(**inputs) 2025-08-14T21:35:06.4779783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4780176Z outputs = self.roberta( 2025-08-14T21:35:06.4780540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4780913Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4781289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4781667Z layer_outputs = layer_module( 2025-08-14T21:35:06.4781994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4782324Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4782709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4783100Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4783469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4783838Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4784248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4784887Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4785318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4785722Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4785862Z 2025-08-14T21:35:06.4785962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4786319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4786623Z return mod(**inputs) 2025-08-14T21:35:06.4786983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4787391Z outputs = self.roberta( 2025-08-14T21:35:06.4787756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4788150Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4788534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4788924Z layer_outputs = layer_module( 2025-08-14T21:35:06.4789248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4789594Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4789986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4790382Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4790800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4791176Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4791635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4792084Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4792510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4792929Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4793291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4793631Z return self.act(input) 2025-08-14T21:35:06.4793744Z 2025-08-14T21:35:06.4793841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4794187Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4794489Z return mod(**inputs) 2025-08-14T21:35:06.4794848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4795227Z outputs = self.roberta( 2025-08-14T21:35:06.4795594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4795969Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4796349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4796728Z layer_outputs = layer_module( 2025-08-14T21:35:06.4797059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4797399Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4797790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4798184Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4798558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4798932Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4799343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4799818Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4800238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4800621Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4800757Z 2025-08-14T21:35:06.4800856Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4801196Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4801258Z return mod(**inputs) 2025-08-14T21:35:06.4801511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4801582Z outputs = self.roberta( 2025-08-14T21:35:06.4801832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4801900Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4802153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4802219Z layer_outputs = layer_module( 2025-08-14T21:35:06.4802452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4802527Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4802789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4802891Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4803115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4803186Z return func(*args, **kwargs) 2025-08-14T21:35:06.4803438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4803501Z self_outputs = self.self( 2025-08-14T21:35:06.4803749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4803812Z return func(*args, **kwargs) 2025-08-14T21:35:06.4804062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4804265Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4804268Z 2025-08-14T21:35:06.4804364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4804557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4804616Z return mod(**inputs) 2025-08-14T21:35:06.4804868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4804937Z outputs = self.roberta( 2025-08-14T21:35:06.4805186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4805259Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4805507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4805573Z layer_outputs = layer_module( 2025-08-14T21:35:06.4805781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4805852Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4806097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4806179Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4806397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4806466Z return func(*args, **kwargs) 2025-08-14T21:35:06.4806713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4806777Z self_outputs = self.self( 2025-08-14T21:35:06.4807006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4807069Z return func(*args, **kwargs) 2025-08-14T21:35:06.4807317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4807389Z self.key(current_states) 2025-08-14T21:35:06.4807392Z 2025-08-14T21:35:06.4807487Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4807674Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4807733Z return mod(**inputs) 2025-08-14T21:35:06.4807985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4808068Z outputs = self.roberta( 2025-08-14T21:35:06.4808330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4808421Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4808668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4808732Z layer_outputs = layer_module( 2025-08-14T21:35:06.4808941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4809012Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4809259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4809357Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4809579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4809646Z return func(*args, **kwargs) 2025-08-14T21:35:06.4809895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4809957Z self_outputs = self.self( 2025-08-14T21:35:06.4810183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4810243Z return func(*args, **kwargs) 2025-08-14T21:35:06.4810500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4810565Z self.value(current_states) 2025-08-14T21:35:06.4810568Z 2025-08-14T21:35:06.4810643Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4810741Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4810925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4810984Z return mod(**inputs) 2025-08-14T21:35:06.4811245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4811307Z outputs = self.roberta( 2025-08-14T21:35:06.4811562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4811626Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4811876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4811947Z layer_outputs = layer_module( 2025-08-14T21:35:06.4812150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4812220Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4812476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4812551Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4812777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4812837Z return func(*args, **kwargs) 2025-08-14T21:35:06.4813086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4813156Z self_outputs = self.self( 2025-08-14T21:35:06.4813376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4813445Z return func(*args, **kwargs) 2025-08-14T21:35:06.4813751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4813875Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4813878Z 2025-08-14T21:35:06.4813996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4814195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4814255Z return mod(**inputs) 2025-08-14T21:35:06.4814517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4814579Z outputs = self.roberta( 2025-08-14T21:35:06.4814835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4814901Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4815183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4815258Z layer_outputs = layer_module( 2025-08-14T21:35:06.4815461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4815540Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4815787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4815860Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4816083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4816144Z return func(*args, **kwargs) 2025-08-14T21:35:06.4816390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4816518Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4816766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4816849Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4816854Z 2025-08-14T21:35:06.4816946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4817125Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4817192Z return mod(**inputs) 2025-08-14T21:35:06.4817442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4817509Z outputs = self.roberta( 2025-08-14T21:35:06.4817756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4817823Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4818079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4818144Z layer_outputs = layer_module( 2025-08-14T21:35:06.4818347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4818427Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4818673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4818754Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4818991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4819062Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4819346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4819469Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4819744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4819834Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4819837Z 2025-08-14T21:35:06.4819930Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4820120Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4820180Z return mod(**inputs) 2025-08-14T21:35:06.4820433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4820502Z outputs = self.roberta( 2025-08-14T21:35:06.4820778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4820852Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4821100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4821167Z layer_outputs = layer_module( 2025-08-14T21:35:06.4821376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4821448Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4821696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4821778Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4822015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4822092Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4822372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4822481Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4822738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4822838Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4823037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4823100Z return self.act(input) 2025-08-14T21:35:06.4823103Z 2025-08-14T21:35:06.4823195Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4823383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4823443Z return mod(**inputs) 2025-08-14T21:35:06.4823703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4823763Z outputs = self.roberta( 2025-08-14T21:35:06.4824013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4824085Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4824335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4824398Z layer_outputs = layer_module( 2025-08-14T21:35:06.4824608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4824679Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4825013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4825097Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4825352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4825447Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4825743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4825863Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4826121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4826194Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4826198Z 2025-08-14T21:35:06.4826300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4826505Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4826566Z return mod(**inputs) 2025-08-14T21:35:06.4826829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4826893Z outputs = self.roberta( 2025-08-14T21:35:06.4827153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4827219Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4827471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4827544Z layer_outputs = layer_module( 2025-08-14T21:35:06.4827746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4827820Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4828078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4828152Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4828384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4828447Z return func(*args, **kwargs) 2025-08-14T21:35:06.4828699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4828773Z self_outputs = self.self( 2025-08-14T21:35:06.4828995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4829065Z return func(*args, **kwargs) 2025-08-14T21:35:06.4829314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4829507Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4829510Z 2025-08-14T21:35:06.4829610Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4829792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4829853Z return mod(**inputs) 2025-08-14T21:35:06.4830111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4830171Z outputs = self.roberta( 2025-08-14T21:35:06.4830428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4830494Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4830751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4830824Z layer_outputs = layer_module( 2025-08-14T21:35:06.4831042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4831137Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4831401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4831475Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4831700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4831763Z return func(*args, **kwargs) 2025-08-14T21:35:06.4832009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4832094Z self_outputs = self.self( 2025-08-14T21:35:06.4832322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4832394Z return func(*args, **kwargs) 2025-08-14T21:35:06.4832645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4832711Z self.key(current_states) 2025-08-14T21:35:06.4832715Z 2025-08-14T21:35:06.4832816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4832997Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4833064Z return mod(**inputs) 2025-08-14T21:35:06.4833317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4833378Z outputs = self.roberta( 2025-08-14T21:35:06.4833641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4833708Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4833961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4834035Z layer_outputs = layer_module( 2025-08-14T21:35:06.4834241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4834320Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4834570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4834645Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4834870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4834934Z return func(*args, **kwargs) 2025-08-14T21:35:06.4835184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4835253Z self_outputs = self.self( 2025-08-14T21:35:06.4835475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4835545Z return func(*args, **kwargs) 2025-08-14T21:35:06.4835792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4835855Z self.value(current_states) 2025-08-14T21:35:06.4835858Z 2025-08-14T21:35:06.4835940Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4836033Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4836222Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4836283Z return mod(**inputs) 2025-08-14T21:35:06.4836549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4836619Z outputs = self.roberta( 2025-08-14T21:35:06.4836886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4836968Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4837229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4837292Z layer_outputs = layer_module( 2025-08-14T21:35:06.4837503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4837573Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4837822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4837919Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4838140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4838201Z return func(*args, **kwargs) 2025-08-14T21:35:06.4838457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4838519Z self_outputs = self.self( 2025-08-14T21:35:06.4838743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4838804Z return func(*args, **kwargs) 2025-08-14T21:35:06.4839049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4839176Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4839181Z 2025-08-14T21:35:06.4839273Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4839458Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4839517Z return mod(**inputs) 2025-08-14T21:35:06.4839768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4839837Z outputs = self.roberta( 2025-08-14T21:35:06.4840084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4840148Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4840405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4840468Z layer_outputs = layer_module( 2025-08-14T21:35:06.4840678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4840748Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4840994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4841076Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4841294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4841361Z return func(*args, **kwargs) 2025-08-14T21:35:06.4841608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4841725Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4841980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4842055Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4842059Z 2025-08-14T21:35:06.4842167Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4842359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4842473Z return mod(**inputs) 2025-08-14T21:35:06.4842738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4842800Z outputs = self.roberta( 2025-08-14T21:35:06.4843049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4843121Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4843373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4843463Z layer_outputs = layer_module( 2025-08-14T21:35:06.4843666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4843741Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4844001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4844079Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4844319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4844397Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4844677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4844793Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4845044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4845119Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4845123Z 2025-08-14T21:35:06.4845223Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4845408Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4845474Z return mod(**inputs) 2025-08-14T21:35:06.4845728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4845789Z outputs = self.roberta( 2025-08-14T21:35:06.4846045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4846111Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4846360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4846432Z layer_outputs = layer_module( 2025-08-14T21:35:06.4846638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4846718Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4846969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4847043Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4847288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4847356Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4847638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4847747Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4848012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4848124Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4848334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4848415Z return self.act(input) 2025-08-14T21:35:06.4848427Z 2025-08-14T21:35:06.4848521Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4848702Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4848771Z return mod(**inputs) 2025-08-14T21:35:06.4849025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4849087Z outputs = self.roberta( 2025-08-14T21:35:06.4849365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4849432Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4849687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4849755Z layer_outputs = layer_module( 2025-08-14T21:35:06.4849958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4850038Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4850285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4850361Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4850603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4850673Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4850955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4851077Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4851329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4851412Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4851416Z 2025-08-14T21:35:06.4851508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4851694Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4851753Z return mod(**inputs) 2025-08-14T21:35:06.4852002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4852072Z outputs = self.roberta( 2025-08-14T21:35:06.4852321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4852386Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4852642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4852705Z layer_outputs = layer_module( 2025-08-14T21:35:06.4852913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4852983Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4853229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4853309Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4853531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4853621Z return func(*args, **kwargs) 2025-08-14T21:35:06.4853892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4853976Z self_outputs = self.self( 2025-08-14T21:35:06.4854205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4854268Z return func(*args, **kwargs) 2025-08-14T21:35:06.4854519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4854718Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4854721Z 2025-08-14T21:35:06.4854831Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4855019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4855079Z return mod(**inputs) 2025-08-14T21:35:06.4855335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4855405Z outputs = self.roberta( 2025-08-14T21:35:06.4855656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4855727Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4855975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4856039Z layer_outputs = layer_module( 2025-08-14T21:35:06.4856249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4856321Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4856571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4856652Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4856873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4856944Z return func(*args, **kwargs) 2025-08-14T21:35:06.4857194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4857257Z self_outputs = self.self( 2025-08-14T21:35:06.4857484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4857546Z return func(*args, **kwargs) 2025-08-14T21:35:06.4857797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4857869Z self.key(current_states) 2025-08-14T21:35:06.4857873Z 2025-08-14T21:35:06.4857968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4858158Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4858219Z return mod(**inputs) 2025-08-14T21:35:06.4858471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4858539Z outputs = self.roberta( 2025-08-14T21:35:06.4858793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4858864Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4859116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4859181Z layer_outputs = layer_module( 2025-08-14T21:35:06.4859405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4859479Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4859743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4859838Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4860062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4860130Z return func(*args, **kwargs) 2025-08-14T21:35:06.4860383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4860446Z self_outputs = self.self( 2025-08-14T21:35:06.4860690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4860751Z return func(*args, **kwargs) 2025-08-14T21:35:06.4861000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4861077Z self.value(current_states) 2025-08-14T21:35:06.4861080Z 2025-08-14T21:35:06.4861153Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4861253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4861436Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4861495Z return mod(**inputs) 2025-08-14T21:35:06.4861754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4861815Z outputs = self.roberta( 2025-08-14T21:35:06.4862075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4862142Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4862394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4862469Z layer_outputs = layer_module( 2025-08-14T21:35:06.4862673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4862745Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4863001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4863074Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4863302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4863364Z return func(*args, **kwargs) 2025-08-14T21:35:06.4863614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4863685Z self_outputs = self.self( 2025-08-14T21:35:06.4863907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4863969Z return func(*args, **kwargs) 2025-08-14T21:35:06.4864223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4864345Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4864348Z 2025-08-14T21:35:06.4864445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4864626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4864685Z return mod(**inputs) 2025-08-14T21:35:06.4865034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4865104Z outputs = self.roberta( 2025-08-14T21:35:06.4865391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4865475Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4865732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4865805Z layer_outputs = layer_module( 2025-08-14T21:35:06.4866013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4866084Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4866346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4866439Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4866669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4866730Z return func(*args, **kwargs) 2025-08-14T21:35:06.4866980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4867103Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4867350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4867432Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4867435Z 2025-08-14T21:35:06.4867529Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4867709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4867776Z return mod(**inputs) 2025-08-14T21:35:06.4868030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4868091Z outputs = self.roberta( 2025-08-14T21:35:06.4868345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4868412Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4868664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4868728Z layer_outputs = layer_module( 2025-08-14T21:35:06.4868929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4869009Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4869258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4869342Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4869580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4869652Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4869937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4870046Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4870293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4870374Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4870378Z 2025-08-14T21:35:06.4870470Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4870658Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4870734Z return mod(**inputs) 2025-08-14T21:35:06.4871003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4871091Z outputs = self.roberta( 2025-08-14T21:35:06.4871343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4871417Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4871668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4871732Z layer_outputs = layer_module( 2025-08-14T21:35:06.4871946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4872035Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4872284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4872367Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4872605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4872684Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4872963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4873070Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4873325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4873426Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4873629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4873693Z return self.act(input) 2025-08-14T21:35:06.4873697Z 2025-08-14T21:35:06.4873789Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4873979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4874038Z return mod(**inputs) 2025-08-14T21:35:06.4874290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4874360Z outputs = self.roberta( 2025-08-14T21:35:06.4874609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4874681Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4874930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4874996Z layer_outputs = layer_module( 2025-08-14T21:35:06.4875204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4875276Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4875533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4875607Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4875841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4875917Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4876194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4876316Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4876591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4876667Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4876705Z 2025-08-14T21:35:06.4876808Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4876991Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4877051Z return mod(**inputs) 2025-08-14T21:35:06.4877311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4877371Z outputs = self.roberta( 2025-08-14T21:35:06.4877630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4877713Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4877965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4878035Z layer_outputs = layer_module( 2025-08-14T21:35:06.4878239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4878310Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4878567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4878640Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4878870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4878932Z return func(*args, **kwargs) 2025-08-14T21:35:06.4879183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4879254Z self_outputs = self.self( 2025-08-14T21:35:06.4879476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4879544Z return func(*args, **kwargs) 2025-08-14T21:35:06.4879795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4879982Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4879986Z 2025-08-14T21:35:06.4880084Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4880263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4880320Z return mod(**inputs) 2025-08-14T21:35:06.4880576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4880638Z outputs = self.roberta( 2025-08-14T21:35:06.4880894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4880959Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4881209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4881279Z layer_outputs = layer_module( 2025-08-14T21:35:06.4881480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4881556Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4881803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4881878Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4882120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4882184Z return func(*args, **kwargs) 2025-08-14T21:35:06.4882447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4882532Z self_outputs = self.self( 2025-08-14T21:35:06.4882752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4882821Z return func(*args, **kwargs) 2025-08-14T21:35:06.4883068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4883131Z self.key(current_states) 2025-08-14T21:35:06.4883134Z 2025-08-14T21:35:06.4883235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4883434Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4883493Z return mod(**inputs) 2025-08-14T21:35:06.4883756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4883820Z outputs = self.roberta( 2025-08-14T21:35:06.4884076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4884142Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4884390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4884461Z layer_outputs = layer_module( 2025-08-14T21:35:06.4884780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4884871Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4885129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4885208Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4885441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4885505Z return func(*args, **kwargs) 2025-08-14T21:35:06.4885763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4885836Z self_outputs = self.self( 2025-08-14T21:35:06.4886082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4886153Z return func(*args, **kwargs) 2025-08-14T21:35:06.4886401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4886467Z self.value(current_states) 2025-08-14T21:35:06.4886471Z 2025-08-14T21:35:06.4886554Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4886649Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4886838Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4886899Z return mod(**inputs) 2025-08-14T21:35:06.4887154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4887226Z outputs = self.roberta( 2025-08-14T21:35:06.4887474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4887541Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4887803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4887869Z layer_outputs = layer_module( 2025-08-14T21:35:06.4888132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4888205Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4888498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4888585Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4888806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4888870Z return func(*args, **kwargs) 2025-08-14T21:35:06.4889128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4889192Z self_outputs = self.self( 2025-08-14T21:35:06.4889443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4889508Z return func(*args, **kwargs) 2025-08-14T21:35:06.4889758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4889888Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4889892Z 2025-08-14T21:35:06.4889983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4890174Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4890233Z return mod(**inputs) 2025-08-14T21:35:06.4890485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4890552Z outputs = self.roberta( 2025-08-14T21:35:06.4890802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4890868Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4891126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4891194Z layer_outputs = layer_module( 2025-08-14T21:35:06.4891402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4891473Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4891718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4891798Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4892016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4892087Z return func(*args, **kwargs) 2025-08-14T21:35:06.4892337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4892456Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4892710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4892785Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4892788Z 2025-08-14T21:35:06.4892878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4893064Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4893123Z return mod(**inputs) 2025-08-14T21:35:06.4893381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4893441Z outputs = self.roberta( 2025-08-14T21:35:06.4893702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4893779Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4894045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4894133Z layer_outputs = layer_module( 2025-08-14T21:35:06.4894337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4894409Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4894667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4894744Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4894984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4895076Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4895356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4895473Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4895720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4895794Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4895797Z 2025-08-14T21:35:06.4895896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4896074Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4896139Z return mod(**inputs) 2025-08-14T21:35:06.4896390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4896452Z outputs = self.roberta( 2025-08-14T21:35:06.4896706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4896773Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4897020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4897091Z layer_outputs = layer_module( 2025-08-14T21:35:06.4897292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4897370Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4897618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4897695Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4897939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4898009Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4898293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4898402Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4898648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4898757Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4898950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4899014Z return self.act(input) 2025-08-14T21:35:06.4899019Z 2025-08-14T21:35:06.4899119Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4899319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4899388Z return mod(**inputs) 2025-08-14T21:35:06.4899655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4899733Z outputs = self.roberta( 2025-08-14T21:35:06.4899989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4900054Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4900304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4900373Z layer_outputs = layer_module( 2025-08-14T21:35:06.4900574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4900694Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4900947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4901022Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4901270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4901339Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4901627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4901748Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4901995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4902079Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4902083Z 2025-08-14T21:35:06.4902173Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4902364Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4902422Z return mod(**inputs) 2025-08-14T21:35:06.4902681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4902748Z outputs = self.roberta( 2025-08-14T21:35:06.4902999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4903063Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4903325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4903391Z layer_outputs = layer_module( 2025-08-14T21:35:06.4903604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4903678Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4903928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4904012Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4904236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4904298Z return func(*args, **kwargs) 2025-08-14T21:35:06.4904556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4904617Z self_outputs = self.self( 2025-08-14T21:35:06.4904914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4904987Z return func(*args, **kwargs) 2025-08-14T21:35:06.4905257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4905476Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4905494Z 2025-08-14T21:35:06.4905588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4905776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4905835Z return mod(**inputs) 2025-08-14T21:35:06.4906086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4906157Z outputs = self.roberta( 2025-08-14T21:35:06.4906407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4906499Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4906758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4906820Z layer_outputs = layer_module( 2025-08-14T21:35:06.4907031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4907104Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4907352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4907435Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4907653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4907722Z return func(*args, **kwargs) 2025-08-14T21:35:06.4907976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4908039Z self_outputs = self.self( 2025-08-14T21:35:06.4908265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4908330Z return func(*args, **kwargs) 2025-08-14T21:35:06.4908578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4908648Z self.key(current_states) 2025-08-14T21:35:06.4908651Z 2025-08-14T21:35:06.4908744Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4908928Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4908986Z return mod(**inputs) 2025-08-14T21:35:06.4909239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4909306Z outputs = self.roberta( 2025-08-14T21:35:06.4909560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4909629Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4909882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4909946Z layer_outputs = layer_module( 2025-08-14T21:35:06.4910156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4910226Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4910475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4910556Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4910777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4910861Z return func(*args, **kwargs) 2025-08-14T21:35:06.4911123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4911202Z self_outputs = self.self( 2025-08-14T21:35:06.4911429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4911490Z return func(*args, **kwargs) 2025-08-14T21:35:06.4911738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4911809Z self.value(current_states) 2025-08-14T21:35:06.4911812Z 2025-08-14T21:35:06.4911886Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4911986Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4912181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4912240Z return mod(**inputs) 2025-08-14T21:35:06.4912498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4912559Z outputs = self.roberta( 2025-08-14T21:35:06.4912815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4912880Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4913128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4913199Z layer_outputs = layer_module( 2025-08-14T21:35:06.4913401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4913473Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4913727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4913799Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4914027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4914091Z return func(*args, **kwargs) 2025-08-14T21:35:06.4914337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4914407Z self_outputs = self.self( 2025-08-14T21:35:06.4914626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4914686Z return func(*args, **kwargs) 2025-08-14T21:35:06.4914941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4915063Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4915066Z 2025-08-14T21:35:06.4915166Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4915347Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4915406Z return mod(**inputs) 2025-08-14T21:35:06.4915667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4915726Z outputs = self.roberta( 2025-08-14T21:35:06.4915980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4916046Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4916293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4916367Z layer_outputs = layer_module( 2025-08-14T21:35:06.4916583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4916658Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4916943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4917019Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4917248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4917309Z return func(*args, **kwargs) 2025-08-14T21:35:06.4917559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4917682Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4917953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4918036Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4918040Z 2025-08-14T21:35:06.4918134Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4918314Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4918380Z return mod(**inputs) 2025-08-14T21:35:06.4918629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4918689Z outputs = self.roberta( 2025-08-14T21:35:06.4918942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4919008Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4919265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4919329Z layer_outputs = layer_module( 2025-08-14T21:35:06.4919531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4919612Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4919858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4919944Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4920179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4920249Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4920534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4920643Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4920889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4920970Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4920975Z 2025-08-14T21:35:06.4921066Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4921253Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4921310Z return mod(**inputs) 2025-08-14T21:35:06.4921559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4921626Z outputs = self.roberta( 2025-08-14T21:35:06.4921874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4921945Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4922208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4922274Z layer_outputs = layer_module( 2025-08-14T21:35:06.4922499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4922586Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4922832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4922914Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4923151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4923226Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4923524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4923631Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4923890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4923992Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4924193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4924256Z return self.act(input) 2025-08-14T21:35:06.4924259Z 2025-08-14T21:35:06.4924350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4924539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4924597Z return mod(**inputs) 2025-08-14T21:35:06.4924851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4924920Z outputs = self.roberta( 2025-08-14T21:35:06.4925171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4925247Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4925496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4925558Z layer_outputs = layer_module( 2025-08-14T21:35:06.4925766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4925838Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4926091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4926166Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4926405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4926481Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4926760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4926881Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4927138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4927212Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4927215Z 2025-08-14T21:35:06.4927315Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4927496Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4927557Z return mod(**inputs) 2025-08-14T21:35:06.4927836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4927900Z outputs = self.roberta( 2025-08-14T21:35:06.4928172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4928263Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4928515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4928588Z layer_outputs = layer_module( 2025-08-14T21:35:06.4928790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4928863Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4929117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4929208Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4929441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4929504Z return func(*args, **kwargs) 2025-08-14T21:35:06.4929756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4929827Z self_outputs = self.self( 2025-08-14T21:35:06.4930048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4930109Z return func(*args, **kwargs) 2025-08-14T21:35:06.4930364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 325, in forward 2025-08-14T21:35:06.4930556Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:35:06.4930560Z 2025-08-14T21:35:06.4930661Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4930842Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4930906Z return mod(**inputs) 2025-08-14T21:35:06.4931166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4931226Z outputs = self.roberta( 2025-08-14T21:35:06.4931483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4931548Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4931798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4931871Z layer_outputs = layer_module( 2025-08-14T21:35:06.4932074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4932144Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4932400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4932476Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4932704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4932765Z return func(*args, **kwargs) 2025-08-14T21:35:06.4933012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4933081Z self_outputs = self.self( 2025-08-14T21:35:06.4933301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4933371Z return func(*args, **kwargs) 2025-08-14T21:35:06.4933641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 353, in forward 2025-08-14T21:35:06.4933708Z self.key(current_states) 2025-08-14T21:35:06.4933740Z 2025-08-14T21:35:06.4933846Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4934029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4934089Z return mod(**inputs) 2025-08-14T21:35:06.4934350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4934409Z outputs = self.roberta( 2025-08-14T21:35:06.4934667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4934748Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4935000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4935071Z layer_outputs = layer_module( 2025-08-14T21:35:06.4935275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4935355Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4935600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4935676Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4935902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4935962Z return func(*args, **kwargs) 2025-08-14T21:35:06.4936211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4936280Z self_outputs = self.self( 2025-08-14T21:35:06.4936502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4936570Z return func(*args, **kwargs) 2025-08-14T21:35:06.4936822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 358, in forward 2025-08-14T21:35:06.4936884Z self.value(current_states) 2025-08-14T21:35:06.4936888Z 2025-08-14T21:35:06.4936967Z cudagraph partition due to non gpu ops 2025-08-14T21:35:06.4937060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4937240Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4937304Z return mod(**inputs) 2025-08-14T21:35:06.4937556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4937624Z outputs = self.roberta( 2025-08-14T21:35:06.4937873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4937939Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4938195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4938259Z layer_outputs = layer_module( 2025-08-14T21:35:06.4938467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4938536Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4938781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4938863Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4939084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4939161Z return func(*args, **kwargs) 2025-08-14T21:35:06.4939433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 467, in forward 2025-08-14T21:35:06.4939511Z self_outputs = self.self( 2025-08-14T21:35:06.4939736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4939797Z return func(*args, **kwargs) 2025-08-14T21:35:06.4940043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 389, in forward 2025-08-14T21:35:06.4940172Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:35:06.4940176Z 2025-08-14T21:35:06.4940267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4940473Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4940534Z return mod(**inputs) 2025-08-14T21:35:06.4940787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4940857Z outputs = self.roberta( 2025-08-14T21:35:06.4941104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4941170Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4941426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4941489Z layer_outputs = layer_module( 2025-08-14T21:35:06.4941696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4941768Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4942016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 540, in forward 2025-08-14T21:35:06.4942096Z self_attention_outputs = self.attention( 2025-08-14T21:35:06.4942316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:35:06.4942379Z return func(*args, **kwargs) 2025-08-14T21:35:06.4942634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 477, in forward 2025-08-14T21:35:06.4942750Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:35:06.4943006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 413, in forward 2025-08-14T21:35:06.4943080Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4943085Z 2025-08-14T21:35:06.4943177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4943365Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4943424Z return mod(**inputs) 2025-08-14T21:35:06.4943682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4943745Z outputs = self.roberta( 2025-08-14T21:35:06.4943992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4944066Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4944312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4944374Z layer_outputs = layer_module( 2025-08-14T21:35:06.4944590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4944663Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4945014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4945114Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4945370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4945451Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4945730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4945849Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4946098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 493, in forward 2025-08-14T21:35:06.4946194Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4946197Z 2025-08-14T21:35:06.4946300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4946483Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4946546Z return mod(**inputs) 2025-08-14T21:35:06.4946804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4946865Z outputs = self.roberta( 2025-08-14T21:35:06.4947118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4947184Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4947430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4947503Z layer_outputs = layer_module( 2025-08-14T21:35:06.4947704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4947783Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4948030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4948105Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4948347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4948416Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4948689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 578, in feed_forward_chunk 2025-08-14T21:35:06.4948803Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:06.4949050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 494, in forward 2025-08-14T21:35:06.4949159Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:06.4949352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:06.4949417Z return self.act(input) 2025-08-14T21:35:06.4949420Z 2025-08-14T21:35:06.4949520Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4949700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4949765Z return mod(**inputs) 2025-08-14T21:35:06.4950013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1038, in forward 2025-08-14T21:35:06.4950074Z outputs = self.roberta( 2025-08-14T21:35:06.4950326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 950, in forward 2025-08-14T21:35:06.4950392Z encoder_outputs = self.encoder( 2025-08-14T21:35:06.4950654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 632, in forward 2025-08-14T21:35:06.4950729Z layer_outputs = layer_module( 2025-08-14T21:35:06.4950971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:06.4951052Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:06.4951298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 570, in forward 2025-08-14T21:35:06.4951374Z layer_output = apply_chunking_to_forward( 2025-08-14T21:35:06.4951618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:35:06.4951687Z return forward_fn(*input_tensors) 2025-08-14T21:35:06.4952715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 579, in feed_forward_chunk 2025-08-14T21:35:06.4952834Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:06.4953086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 507, in forward 2025-08-14T21:35:06.4953169Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:06.4953174Z 2025-08-14T21:35:06.4953268Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4953453Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4953520Z return mod(**inputs) 2025-08-14T21:35:06.4953772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1052, in forward 2025-08-14T21:35:06.4953875Z prediction_scores = self.lm_head(sequence_output) 2025-08-14T21:35:06.4954126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 756, in forward 2025-08-14T21:35:06.4954191Z x = self.dense(features) 2025-08-14T21:35:06.4954195Z 2025-08-14T21:35:06.4954297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4954479Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4954544Z return mod(**inputs) 2025-08-14T21:35:06.4954797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1052, in forward 2025-08-14T21:35:06.4954887Z prediction_scores = self.lm_head(sequence_output) 2025-08-14T21:35:06.4955140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 761, in forward 2025-08-14T21:35:06.4955201Z x = self.decoder(x) 2025-08-14T21:35:06.4955206Z 2025-08-14T21:35:06.4955298Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:06.4955486Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:06.4955543Z return mod(**inputs) 2025-08-14T21:35:06.4955802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/camembert/modeling_camembert.py", line 1059, in forward 2025-08-14T21:35:06.4955978Z masked_lm_loss = loss_fct(prediction_scores.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:35:06.4955981Z 2025-08-14T21:35:14.0761330Z Compilation time (from dynamo_timed): 13.229159002 2025-08-14T21:35:14.0829411Z pass 2025-08-14T21:35:14.0833546Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:35:14.0838168Z TIMING: _recursive_pre_grad_passes:0.00617 _recursive_joint_graph_passes:0.33512 _recursive_post_grad_passes:0.07271 async_compile.wait:0.71179 code_gen:6.60264 inductor_compile:7.68094 backend_compile:10.53089 gc:0.00028 entire_frame_compile:13.22916 total_wall_time:13.22916 2025-08-14T21:35:14.0839858Z STATS: call_* op count: 297 | FakeTensorMode.__torch_dispatch__:12436 | FakeTensor.__torch_dispatch__:4756 | ProxyTorchDispatchMode.__torch_dispatch__:4530 2025-08-14T21:35:14.0840398Z Dynamo produced 1 graphs covering 297 ops with 0 graph breaks (0 unique) 2025-08-14T21:35:18.1088677Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:35:18.1089519Z from pkg_resources import resource_filename 2025-08-14T21:35:18.7196752Z 2025-08-14T21:35:26.7482526Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:35:26.7486632Z loading model: 0it [00:08, ?it/s] 2025-08-14T21:35:26.7507740Z cpu eval DebertaV2ForMaskedLM 2025-08-14T21:35:26.8699905Z Compilation time (from dynamo_timed): 0 2025-08-14T21:35:26.8703973Z pass_due_to_skip 2025-08-14T21:35:26.8709011Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:35:26.8713150Z TIMING: total_wall_time:0 2025-08-14T21:35:26.8717371Z STATS: call_* op count: 0 2025-08-14T21:35:26.8722044Z Dynamo produced 0 graphs covering 0 ops with 0 graph breaks (0 unique) 2025-08-14T21:35:30.5598778Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:35:30.5599739Z from pkg_resources import resource_filename 2025-08-14T21:35:31.1313211Z 2025-08-14T21:35:37.7154684Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:35:37.7158904Z loading model: 0it [00:06, ?it/s] 2025-08-14T21:35:37.7179488Z cpu eval DebertaV2ForQuestionAnswering 2025-08-14T21:35:40.1564430Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:35:41.2785754Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:35:42.2460945Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:35:56.0690292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0694878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0699027Z return mod(**inputs) 2025-08-14T21:35:56.0703278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0707548Z outputs = self.deberta( 2025-08-14T21:35:56.0710964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0715320Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0715814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0716342Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0716696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0717047Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0717440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0717844Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0718242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0718628Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0720156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.0720749Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.0721042Z 2025-08-14T21:35:56.0721149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0721496Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0721808Z return mod(**inputs) 2025-08-14T21:35:56.0722171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0722549Z outputs = self.deberta( 2025-08-14T21:35:56.0722912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0723562Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0723927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0724325Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0724681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0725018Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0725401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0725791Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0726181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0726547Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0726922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.0727390Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0727607Z 2025-08-14T21:35:56.0727712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0728042Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0728342Z return mod(**inputs) 2025-08-14T21:35:56.0728698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0729070Z outputs = self.deberta( 2025-08-14T21:35:56.0729439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0729814Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0730182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0730557Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0730896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0731231Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0731603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0731987Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0732412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0732798Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0733166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.0733663Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.0734193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.0734685Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.0734859Z 2025-08-14T21:35:56.0734956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0735294Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0735591Z return mod(**inputs) 2025-08-14T21:35:56.0735947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0736335Z outputs = self.deberta( 2025-08-14T21:35:56.0736691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0737064Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0737432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0737817Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0738161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0738504Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0738882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0739277Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0739668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0740043Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0740410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.0740921Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.0741179Z 2025-08-14T21:35:56.0741275Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0741613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0741909Z return mod(**inputs) 2025-08-14T21:35:56.0742269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0742645Z outputs = self.deberta( 2025-08-14T21:35:56.0742997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0743371Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0743740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0744132Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0744466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0744916Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0745315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0745715Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0746101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0746488Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0746889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.0747408Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.0747673Z 2025-08-14T21:35:56.0747775Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0748126Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0748441Z return mod(**inputs) 2025-08-14T21:35:56.0748808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0749202Z outputs = self.deberta( 2025-08-14T21:35:56.0749578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0749959Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0750327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0750725Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0751078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0751416Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0751793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0752192Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0752588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0752963Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0753345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.0753845Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0754073Z 2025-08-14T21:35:56.0754177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0754504Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0754800Z return mod(**inputs) 2025-08-14T21:35:56.0755152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0755519Z outputs = self.deberta( 2025-08-14T21:35:56.0755867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0756238Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0756604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0756976Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0757319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0757651Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0758024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0758407Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0758794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0759173Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0759582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.0760055Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0760578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.0761062Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.0761238Z 2025-08-14T21:35:56.0761342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0761668Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0761969Z return mod(**inputs) 2025-08-14T21:35:56.0762324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0762716Z outputs = self.deberta( 2025-08-14T21:35:56.0763076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0763450Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0763819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0764199Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0764537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0764868Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0765234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0765627Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0766017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0766394Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0766762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.0767137Z context_layer = torch.bmm( 2025-08-14T21:35:56.0767245Z 2025-08-14T21:35:56.0767349Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0767694Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0767992Z return mod(**inputs) 2025-08-14T21:35:56.0768346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0768718Z outputs = self.deberta( 2025-08-14T21:35:56.0769066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0769439Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0769808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0770196Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0770529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0770866Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0771244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0771633Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0772013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0772390Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0772778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.0773266Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.0773508Z 2025-08-14T21:35:56.0773602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0773934Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0774241Z return mod(**inputs) 2025-08-14T21:35:56.0774597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0774973Z outputs = self.deberta( 2025-08-14T21:35:56.0775328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0775724Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0776084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0776471Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0776813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0777138Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0777515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0777910Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0778294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.0778705Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.0779131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.0779519Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0779645Z 2025-08-14T21:35:56.0779753Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0780082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0780386Z return mod(**inputs) 2025-08-14T21:35:56.0780739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0781104Z outputs = self.deberta( 2025-08-14T21:35:56.0781479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0781856Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0782223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0782607Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0782951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0783284Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0783656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.0784065Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.0784479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.0785069Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0785206Z 2025-08-14T21:35:56.0785306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0785710Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0786017Z return mod(**inputs) 2025-08-14T21:35:56.0786396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0786795Z outputs = self.deberta( 2025-08-14T21:35:56.0787155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0787532Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0787904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0788282Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0788627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0788998Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0789375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.0789795Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.0790212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.0790618Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.0790978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.0791296Z return self.act(input) 2025-08-14T21:35:56.0791399Z 2025-08-14T21:35:56.0791501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0791836Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0792129Z return mod(**inputs) 2025-08-14T21:35:56.0792484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0792857Z outputs = self.deberta( 2025-08-14T21:35:56.0793207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0793581Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0793945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0794388Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0794731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0795066Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0795446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.0795876Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.0796296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.0796686Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0796813Z 2025-08-14T21:35:56.0796918Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0797246Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0797551Z return mod(**inputs) 2025-08-14T21:35:56.0797904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0798275Z outputs = self.deberta( 2025-08-14T21:35:56.0798624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0799032Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0799419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0799818Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0800157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0800487Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0800877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0801262Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0801658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0802078Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0802457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.0802937Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.0803173Z 2025-08-14T21:35:56.0803267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0803604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0803906Z return mod(**inputs) 2025-08-14T21:35:56.0804256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0804638Z outputs = self.deberta( 2025-08-14T21:35:56.0804998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0805366Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0805739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0806128Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0806469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0806798Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0807177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0807577Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0807962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0808338Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0808711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.0809181Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0809400Z 2025-08-14T21:35:56.0809495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0809830Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0810128Z return mod(**inputs) 2025-08-14T21:35:56.0810481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0810847Z outputs = self.deberta( 2025-08-14T21:35:56.0811199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0811574Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0811952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0812339Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0812699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0813049Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0813420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0813809Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0814196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0814569Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0814954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.0815433Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.0815942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.0816408Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.0816582Z 2025-08-14T21:35:56.0816677Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0817007Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0817306Z return mod(**inputs) 2025-08-14T21:35:56.0817659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0818030Z outputs = self.deberta( 2025-08-14T21:35:56.0818385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0818758Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0819119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0819505Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0819843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0820177Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0820545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0820935Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0821325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0821692Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0822065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.0822569Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.0822815Z 2025-08-14T21:35:56.0822917Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0823243Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0823540Z return mod(**inputs) 2025-08-14T21:35:56.0823894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0824267Z outputs = self.deberta( 2025-08-14T21:35:56.0824614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0825093Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0825502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0825899Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0826239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0826569Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0826941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0827321Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0827710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0828102Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0828476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.0828966Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.0829218Z 2025-08-14T21:35:56.0829312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0829648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0829947Z return mod(**inputs) 2025-08-14T21:35:56.0830294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0830664Z outputs = self.deberta( 2025-08-14T21:35:56.0831017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0831378Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0831743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0832127Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0832465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0832788Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0833164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0833550Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0833930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0834303Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0834679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.0835159Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0835387Z 2025-08-14T21:35:56.0835482Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0835814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0836115Z return mod(**inputs) 2025-08-14T21:35:56.0836469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0836831Z outputs = self.deberta( 2025-08-14T21:35:56.0837189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0837565Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0837941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0838329Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0838701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0839039Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0839409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0839802Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0840188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0840582Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0840960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.0841445Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0841966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.0842441Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.0842619Z 2025-08-14T21:35:56.0842719Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0843059Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0843363Z return mod(**inputs) 2025-08-14T21:35:56.0843717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0844097Z outputs = self.deberta( 2025-08-14T21:35:56.0844460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0844840Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0845212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0845603Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0845949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0846287Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0846659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0847056Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0847450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0847828Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0848212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.0848590Z context_layer = torch.bmm( 2025-08-14T21:35:56.0848700Z 2025-08-14T21:35:56.0848808Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0849141Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0849448Z return mod(**inputs) 2025-08-14T21:35:56.0849808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0850182Z outputs = self.deberta( 2025-08-14T21:35:56.0850532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0850925Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0851296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0851720Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0852060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0852394Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0852768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0853153Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0853544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0853936Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0854301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.0854784Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.0855016Z 2025-08-14T21:35:56.0855112Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0855455Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0855750Z return mod(**inputs) 2025-08-14T21:35:56.0856106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0856474Z outputs = self.deberta( 2025-08-14T21:35:56.0856828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0857191Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0857556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0857941Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0858272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0858600Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0858973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0859359Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0859736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.0860147Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.0860556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.0860934Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0861061Z 2025-08-14T21:35:56.0861160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0861490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0861794Z return mod(**inputs) 2025-08-14T21:35:56.0862144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0862512Z outputs = self.deberta( 2025-08-14T21:35:56.0862865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0863237Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0863609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0863998Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0864352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0864697Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0865138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.0865558Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.0865975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.0866349Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0866507Z 2025-08-14T21:35:56.0866603Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0866935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0867236Z return mod(**inputs) 2025-08-14T21:35:56.0867583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0867956Z outputs = self.deberta( 2025-08-14T21:35:56.0868312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0868686Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0869042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0869424Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0869759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0870084Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0870459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.0870871Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.0871282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.0871678Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.0872031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.0872343Z return self.act(input) 2025-08-14T21:35:56.0872448Z 2025-08-14T21:35:56.0872549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0872872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0873171Z return mod(**inputs) 2025-08-14T21:35:56.0873521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0873885Z outputs = self.deberta( 2025-08-14T21:35:56.0874238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0875408Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0875771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0876148Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0876492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0876825Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0877201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.0877639Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.0878086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.0878489Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0878616Z 2025-08-14T21:35:56.0878712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0879047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0879351Z return mod(**inputs) 2025-08-14T21:35:56.0879707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0880074Z outputs = self.deberta( 2025-08-14T21:35:56.0880447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0880821Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0881192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0881571Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0881908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0882240Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0882607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0882999Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0883389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0883765Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0884137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.0884751Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.0884987Z 2025-08-14T21:35:56.0885092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0885430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0885726Z return mod(**inputs) 2025-08-14T21:35:56.0886088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0886462Z outputs = self.deberta( 2025-08-14T21:35:56.0886811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0887187Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0887561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0887948Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0888285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0888622Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0888997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0889388Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0889772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0890148Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0890601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.0891094Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0891349Z 2025-08-14T21:35:56.0891447Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0891783Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0892090Z return mod(**inputs) 2025-08-14T21:35:56.0892435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0892807Z outputs = self.deberta( 2025-08-14T21:35:56.0893163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0893559Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0893919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0894303Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0894646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0894973Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0895349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0895738Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0896123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0896490Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0896865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.0897339Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.0897851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.0898304Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.0898484Z 2025-08-14T21:35:56.0898577Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0898909Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0899206Z return mod(**inputs) 2025-08-14T21:35:56.0899550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0899922Z outputs = self.deberta( 2025-08-14T21:35:56.0900274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0900635Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0901002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0901386Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0901722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0902046Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0902420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0902809Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0903191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0903581Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0903980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.0904497Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.0904743Z 2025-08-14T21:35:56.0904892Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0905234Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0905538Z return mod(**inputs) 2025-08-14T21:35:56.0905897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0906286Z outputs = self.deberta( 2025-08-14T21:35:56.0906645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0907020Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0907386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0907777Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0908119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0908456Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0908828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0909221Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0909613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0909992Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0910363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.0910868Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.0911112Z 2025-08-14T21:35:56.0911217Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0911549Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0911845Z return mod(**inputs) 2025-08-14T21:35:56.0912204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0912579Z outputs = self.deberta( 2025-08-14T21:35:56.0912927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0913303Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0913670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0914058Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0914391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0914726Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0915101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0915483Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0915872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0916247Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0916635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.0917127Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0917375Z 2025-08-14T21:35:56.0917471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0917802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0918104Z return mod(**inputs) 2025-08-14T21:35:56.0918450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0918819Z outputs = self.deberta( 2025-08-14T21:35:56.0919174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0919559Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0919918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0920300Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0920640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0920962Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0921338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0921725Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0922113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0922488Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0922864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.0923340Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0923854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.0924305Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.0924482Z 2025-08-14T21:35:56.0924575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0924906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0925197Z return mod(**inputs) 2025-08-14T21:35:56.0925552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0925920Z outputs = self.deberta( 2025-08-14T21:35:56.0926273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0926637Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0927006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0927387Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0927723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0928045Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0928417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0928805Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0929183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0929596Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0929984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.0930371Z context_layer = torch.bmm( 2025-08-14T21:35:56.0930480Z 2025-08-14T21:35:56.0930575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0930911Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0931211Z return mod(**inputs) 2025-08-14T21:35:56.0931560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0931930Z outputs = self.deberta( 2025-08-14T21:35:56.0932295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0932666Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0933025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0933407Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0933744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0934070Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0934432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0934819Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0935206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0935571Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0935945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.0936422Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.0936646Z 2025-08-14T21:35:56.0936748Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0937073Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0937372Z return mod(**inputs) 2025-08-14T21:35:56.0937730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0938100Z outputs = self.deberta( 2025-08-14T21:35:56.0938446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0938820Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0939191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0939568Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0939908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0940239Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0940611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0940991Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0941379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.0941793Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.0942219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.0942596Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0942731Z 2025-08-14T21:35:56.0942860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0943195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0943484Z return mod(**inputs) 2025-08-14T21:35:56.0943837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0944205Z outputs = self.deberta( 2025-08-14T21:35:56.0944558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0945008Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0945388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0945780Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0946125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0946454Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0946836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.0947256Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.0947669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.0948055Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0948193Z 2025-08-14T21:35:56.0948290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0948625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0948916Z return mod(**inputs) 2025-08-14T21:35:56.0949276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0949651Z outputs = self.deberta( 2025-08-14T21:35:56.0950003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0950369Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0950737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0951120Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0951450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0951786Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0952166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.0952583Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.0952991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.0953397Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.0953750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.0954067Z return self.act(input) 2025-08-14T21:35:56.0954171Z 2025-08-14T21:35:56.0954267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0954600Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0954904Z return mod(**inputs) 2025-08-14T21:35:56.0955285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0955663Z outputs = self.deberta( 2025-08-14T21:35:56.0956047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0956425Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0956793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0957180Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0957521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0957847Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0958237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.0958668Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.0959094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.0959470Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.0959603Z 2025-08-14T21:35:56.0959697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0960024Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0960325Z return mod(**inputs) 2025-08-14T21:35:56.0960672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0961041Z outputs = self.deberta( 2025-08-14T21:35:56.0961395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0961760Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0962131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0962519Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0962858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0963181Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0963556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0963946Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0964335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0964705Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0965078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.0965557Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.0965780Z 2025-08-14T21:35:56.0965879Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0966204Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0966501Z return mod(**inputs) 2025-08-14T21:35:56.0966854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0967216Z outputs = self.deberta( 2025-08-14T21:35:56.0967569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0967943Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0968327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0968718Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0969073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0969402Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0969770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0970160Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0970550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0970942Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0971313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.0971786Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0972011Z 2025-08-14T21:35:56.0972108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0972443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0972735Z return mod(**inputs) 2025-08-14T21:35:56.0973091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0973462Z outputs = self.deberta( 2025-08-14T21:35:56.0973806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0974182Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0974549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0974934Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0975269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0975602Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0975976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0976364Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0976746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0977122Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0977501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.0977974Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.0978487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.0978953Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.0979127Z 2025-08-14T21:35:56.0979231Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0979557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0979856Z return mod(**inputs) 2025-08-14T21:35:56.0980211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0980586Z outputs = self.deberta( 2025-08-14T21:35:56.0980950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0981324Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0981705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0982104Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0982433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0982766Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0983141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0983525Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0983931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0984308Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0984873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.0985379Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.0985635Z 2025-08-14T21:35:56.0985733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0986069Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0986371Z return mod(**inputs) 2025-08-14T21:35:56.0986723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0987096Z outputs = self.deberta( 2025-08-14T21:35:56.0987451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0987817Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0988186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0988577Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0988918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0989245Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0989619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0990007Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0990388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0990774Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0991146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.0991640Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.0991884Z 2025-08-14T21:35:56.0991980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0992312Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0992610Z return mod(**inputs) 2025-08-14T21:35:56.0992959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0993324Z outputs = self.deberta( 2025-08-14T21:35:56.0993680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.0994096Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.0994463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.0994896Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.0995242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.0995577Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.0995947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.0996341Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.0996731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.0997137Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.0997507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.0997987Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.0998219Z 2025-08-14T21:35:56.0998314Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.0998642Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.0998932Z return mod(**inputs) 2025-08-14T21:35:56.0999285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.0999654Z outputs = self.deberta( 2025-08-14T21:35:56.1000001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1000373Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1000737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1001124Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1001453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1001778Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1002150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1002535Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1002915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1003288Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1003659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1004127Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1004639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1005099Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1005271Z 2025-08-14T21:35:56.1005377Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1005700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1006002Z return mod(**inputs) 2025-08-14T21:35:56.1006357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1006802Z outputs = self.deberta( 2025-08-14T21:35:56.1007361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1007815Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1008272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1008736Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1009149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1009537Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1010023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1010449Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1010907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1029547Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1030019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1030421Z context_layer = torch.bmm( 2025-08-14T21:35:56.1030535Z 2025-08-14T21:35:56.1030648Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1030994Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1031311Z return mod(**inputs) 2025-08-14T21:35:56.1031682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1032075Z outputs = self.deberta( 2025-08-14T21:35:56.1032446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1032838Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1033227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1033627Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1033975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1034314Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1034698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1035085Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1035488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1035875Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1036265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1036746Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1036982Z 2025-08-14T21:35:56.1037083Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1037437Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1037745Z return mod(**inputs) 2025-08-14T21:35:56.1038168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1038554Z outputs = self.deberta( 2025-08-14T21:35:56.1038914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1039291Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1039726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1040163Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1040544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1040890Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1041266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1041659Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1042055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1042486Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1042900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1043284Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1043412Z 2025-08-14T21:35:56.1043520Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1043849Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1044152Z return mod(**inputs) 2025-08-14T21:35:56.1044510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1044882Z outputs = self.deberta( 2025-08-14T21:35:56.1045230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1045605Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1045976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1046353Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1046698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1047036Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1047411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1047817Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1048229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1048608Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1048736Z 2025-08-14T21:35:56.1048839Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1049165Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1049464Z return mod(**inputs) 2025-08-14T21:35:56.1049820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1050183Z outputs = self.deberta( 2025-08-14T21:35:56.1050537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1050906Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1051273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1051650Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1051990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1052326Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1052710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1053144Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1053570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1053976Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1054319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1054635Z return self.act(input) 2025-08-14T21:35:56.1054745Z 2025-08-14T21:35:56.1054840Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1055178Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1055491Z return mod(**inputs) 2025-08-14T21:35:56.1055849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1056225Z outputs = self.deberta( 2025-08-14T21:35:56.1056576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1056950Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1057318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1057701Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1058031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1058365Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1058748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1059178Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1059604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1059987Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1060114Z 2025-08-14T21:35:56.1060219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1060545Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1060848Z return mod(**inputs) 2025-08-14T21:35:56.1061205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1061566Z outputs = self.deberta( 2025-08-14T21:35:56.1061919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1062289Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1062656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1063035Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1063374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1063704Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1064068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1064456Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1064923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1065312Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1065698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1066203Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1066443Z 2025-08-14T21:35:56.1066549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1066885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1067177Z return mod(**inputs) 2025-08-14T21:35:56.1067538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1067913Z outputs = self.deberta( 2025-08-14T21:35:56.1068267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1068658Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1069027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1069410Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1069743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1070072Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1070444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1070831Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1071216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1071590Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1071962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1072426Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1072649Z 2025-08-14T21:35:56.1072745Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1073081Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1073374Z return mod(**inputs) 2025-08-14T21:35:56.1073719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1074088Z outputs = self.deberta( 2025-08-14T21:35:56.1074440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1074809Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1075167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1075547Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1075888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1076213Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1076587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1076976Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1077368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1077733Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1078110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1078610Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1079139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1079616Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1079800Z 2025-08-14T21:35:56.1079899Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1080233Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1080526Z return mod(**inputs) 2025-08-14T21:35:56.1080888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1081282Z outputs = self.deberta( 2025-08-14T21:35:56.1081638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1082005Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1082379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1082763Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1083103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1083429Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1083806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1084193Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1084712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1085109Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1085493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1086011Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1086268Z 2025-08-14T21:35:56.1086366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1086708Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1087019Z return mod(**inputs) 2025-08-14T21:35:56.1087385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1087762Z outputs = self.deberta( 2025-08-14T21:35:56.1088127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1088509Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1088877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1089272Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1089614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1089950Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1090323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1090718Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1091109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1091486Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1091908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1092442Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1092714Z 2025-08-14T21:35:56.1092826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1093175Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1093482Z return mod(**inputs) 2025-08-14T21:35:56.1093853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1094239Z outputs = self.deberta( 2025-08-14T21:35:56.1094613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1094984Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1095349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1095739Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1096068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1096397Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1096770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1097151Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1097541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1097918Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1098296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1098774Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1099007Z 2025-08-14T21:35:56.1099101Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1099434Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1099735Z return mod(**inputs) 2025-08-14T21:35:56.1100081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1100451Z outputs = self.deberta( 2025-08-14T21:35:56.1100805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1101179Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1101538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1101926Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1102265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1102592Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1102967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1103356Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1103745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1104114Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1104502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1105042Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1105595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1106061Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1106245Z 2025-08-14T21:35:56.1106352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1106687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1106993Z return mod(**inputs) 2025-08-14T21:35:56.1107355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1107741Z outputs = self.deberta( 2025-08-14T21:35:56.1108100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1108477Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1108849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1109224Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1109561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1109891Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1110266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1110648Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1111037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1111407Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1111771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1112140Z context_layer = torch.bmm( 2025-08-14T21:35:56.1112255Z 2025-08-14T21:35:56.1112350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1112683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1112973Z return mod(**inputs) 2025-08-14T21:35:56.1113326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1113694Z outputs = self.deberta( 2025-08-14T21:35:56.1114049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1114410Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1114780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1115165Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1115496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1115826Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1116200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1116585Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1116965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1117339Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1117724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1118220Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1118457Z 2025-08-14T21:35:56.1118552Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1118883Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1119180Z return mod(**inputs) 2025-08-14T21:35:56.1119529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1119903Z outputs = self.deberta( 2025-08-14T21:35:56.1120256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1120655Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1121017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1121408Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1121748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1122078Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1122449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1122839Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1123227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1123634Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1124048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1124433Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1124560Z 2025-08-14T21:35:56.1124667Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1124994Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1125296Z return mod(**inputs) 2025-08-14T21:35:56.1125648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1126021Z outputs = self.deberta( 2025-08-14T21:35:56.1126369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1126745Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1127114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1127492Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1127832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1128167Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1128541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1128951Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1129368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1129750Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1129879Z 2025-08-14T21:35:56.1129982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1130324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1130631Z return mod(**inputs) 2025-08-14T21:35:56.1131005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1131390Z outputs = self.deberta( 2025-08-14T21:35:56.1131748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1132117Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1132487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1132863Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1133202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1133548Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1133916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1134329Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1134741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1135146Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1135503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1135816Z return self.act(input) 2025-08-14T21:35:56.1135918Z 2025-08-14T21:35:56.1136018Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1136344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1136643Z return mod(**inputs) 2025-08-14T21:35:56.1136997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1137368Z outputs = self.deberta( 2025-08-14T21:35:56.1137714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1138085Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1138449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1138823Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1139161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1139491Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1139864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1140286Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1140709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1141092Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1141217Z 2025-08-14T21:35:56.1141317Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1141641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1141936Z return mod(**inputs) 2025-08-14T21:35:56.1142284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1142645Z outputs = self.deberta( 2025-08-14T21:35:56.1142996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1143380Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1143765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1144158Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1144498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1144896Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1145269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1145660Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1146051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1146447Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1146818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1147296Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1147527Z 2025-08-14T21:35:56.1147624Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1147957Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1148248Z return mod(**inputs) 2025-08-14T21:35:56.1148607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1148978Z outputs = self.deberta( 2025-08-14T21:35:56.1149323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1149696Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1150065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1150455Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1150788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1151117Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1151491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1151882Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1152263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1152645Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1153019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1153488Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1153702Z 2025-08-14T21:35:56.1153797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1154130Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1154430Z return mod(**inputs) 2025-08-14T21:35:56.1154777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1155147Z outputs = self.deberta( 2025-08-14T21:35:56.1155498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1155872Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1156252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1156638Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1157005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1157330Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1157702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1158087Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1158473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1158838Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1159231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1159702Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1160211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1160665Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1160844Z 2025-08-14T21:35:56.1160939Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1161269Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1161562Z return mod(**inputs) 2025-08-14T21:35:56.1161912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1162282Z outputs = self.deberta( 2025-08-14T21:35:56.1162635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1162999Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1163365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1163749Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1164082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1164402Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1164773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1165160Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1165547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1165914Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1166287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1166792Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1167038Z 2025-08-14T21:35:56.1167139Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1167466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1167762Z return mod(**inputs) 2025-08-14T21:35:56.1168118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1168483Z outputs = self.deberta( 2025-08-14T21:35:56.1168869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1169245Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1169624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1170031Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1170366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1170696Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1171061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1171449Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1171837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1172226Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1172598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1173099Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1173349Z 2025-08-14T21:35:56.1173447Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1173778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1174069Z return mod(**inputs) 2025-08-14T21:35:56.1174423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1174792Z outputs = self.deberta( 2025-08-14T21:35:56.1175139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1175517Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1175883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1176267Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1176597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1176927Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1177298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1177687Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1178068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1178441Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1178815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1179296Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1179529Z 2025-08-14T21:35:56.1179625Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1179957Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1180253Z return mod(**inputs) 2025-08-14T21:35:56.1180600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1180972Z outputs = self.deberta( 2025-08-14T21:35:56.1181324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1181697Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1182071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1182472Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1182825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1183149Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1183524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1183912Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1184300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1184868Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1185260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1185749Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1186270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1186725Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1186908Z 2025-08-14T21:35:56.1187004Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1187346Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1187646Z return mod(**inputs) 2025-08-14T21:35:56.1187995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1188370Z outputs = self.deberta( 2025-08-14T21:35:56.1188731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1189098Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1189467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1189851Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1190190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1190514Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1190891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1191283Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1191673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1192040Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1192418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1192789Z context_layer = torch.bmm( 2025-08-14T21:35:56.1192896Z 2025-08-14T21:35:56.1192993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1193330Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1193630Z return mod(**inputs) 2025-08-14T21:35:56.1193985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1194349Z outputs = self.deberta( 2025-08-14T21:35:56.1194706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1195112Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1195497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1195908Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1196242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1196568Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1196937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1197326Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1197717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1198116Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1198481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1198965Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1199187Z 2025-08-14T21:35:56.1199290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1199620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1199912Z return mod(**inputs) 2025-08-14T21:35:56.1200266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1200634Z outputs = self.deberta( 2025-08-14T21:35:56.1200982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1201354Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1201723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1202106Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1202436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1202768Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1203142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1203524Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1203912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1204326Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1204735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1205109Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1205242Z 2025-08-14T21:35:56.1205342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1205673Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1205976Z return mod(**inputs) 2025-08-14T21:35:56.1206322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1206688Z outputs = self.deberta( 2025-08-14T21:35:56.1207041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1207406Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1207785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1208170Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1208518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1208856Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1209230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1209640Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1210053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1210429Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1210580Z 2025-08-14T21:35:56.1210674Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1211006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1211298Z return mod(**inputs) 2025-08-14T21:35:56.1211656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1212029Z outputs = self.deberta( 2025-08-14T21:35:56.1212382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1212746Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1213114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1213498Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1213835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1214163Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1214540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1214955Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1215361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1215767Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1216120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1216435Z return self.act(input) 2025-08-14T21:35:56.1216540Z 2025-08-14T21:35:56.1216634Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1216969Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1217271Z return mod(**inputs) 2025-08-14T21:35:56.1217622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1217996Z outputs = self.deberta( 2025-08-14T21:35:56.1218357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1218734Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1219098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1219484Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1219826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1220159Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1220531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1220975Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1221425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1221814Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1221947Z 2025-08-14T21:35:56.1222042Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1222370Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1222671Z return mod(**inputs) 2025-08-14T21:35:56.1223018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1223411Z outputs = self.deberta( 2025-08-14T21:35:56.1223764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1224128Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1224486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1224933Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1225275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1225600Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1225977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1226371Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1226768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1227144Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1227524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1228004Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1228228Z 2025-08-14T21:35:56.1228335Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1228661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1228958Z return mod(**inputs) 2025-08-14T21:35:56.1229313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1229678Z outputs = self.deberta( 2025-08-14T21:35:56.1230030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1230401Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1230769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1231146Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1231485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1231816Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1232190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1232582Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1232972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1233352Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1233739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1234228Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1234466Z 2025-08-14T21:35:56.1234562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1234891Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1235183Z return mod(**inputs) 2025-08-14T21:35:56.1235537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1235907Z outputs = self.deberta( 2025-08-14T21:35:56.1236262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1236657Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1237022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1237403Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1237736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1238068Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1238436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1238824Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1239204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1239578Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1239951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1240429Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1240933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1241395Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1241567Z 2025-08-14T21:35:56.1241671Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1242001Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1242293Z return mod(**inputs) 2025-08-14T21:35:56.1242647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1243023Z outputs = self.deberta( 2025-08-14T21:35:56.1243375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1243746Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1244110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1244491Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1244818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1245150Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1245522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1245902Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1246291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1246674Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1247064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1248004Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1248258Z 2025-08-14T21:35:56.1248355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1248689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1248989Z return mod(**inputs) 2025-08-14T21:35:56.1249335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1249726Z outputs = self.deberta( 2025-08-14T21:35:56.1250078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1250447Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1250805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1251185Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1251522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1251844Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1252221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1252606Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1252992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1253361Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1253757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1254258Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1254503Z 2025-08-14T21:35:56.1254602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1254930Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1255228Z return mod(**inputs) 2025-08-14T21:35:56.1255584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1255944Z outputs = self.deberta( 2025-08-14T21:35:56.1256298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1256668Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1257033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1257409Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1257745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1258072Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1258444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1258823Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1259207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1259581Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1259959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1260456Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1260702Z 2025-08-14T21:35:56.1260799Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1261130Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1261422Z return mod(**inputs) 2025-08-14T21:35:56.1261775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1262144Z outputs = self.deberta( 2025-08-14T21:35:56.1262494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1262873Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1263239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1263624Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1263951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1264277Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1264652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1265108Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1265494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1265871Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1266243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1266722Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1267228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1267690Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1267863Z 2025-08-14T21:35:56.1267968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1268301Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1268590Z return mod(**inputs) 2025-08-14T21:35:56.1268943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1269313Z outputs = self.deberta( 2025-08-14T21:35:56.1269662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1270031Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1270398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1270778Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1271105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1271434Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1271803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1272188Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1272596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1272973Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1273359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1273736Z context_layer = torch.bmm( 2025-08-14T21:35:56.1273852Z 2025-08-14T21:35:56.1273946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1274279Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1274577Z return mod(**inputs) 2025-08-14T21:35:56.1274926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1275297Z outputs = self.deberta( 2025-08-14T21:35:56.1275662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1276022Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1276391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1276775Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1277111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1277435Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1277806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1278191Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1278573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1278932Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1279302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1279776Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1279998Z 2025-08-14T21:35:56.1280090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1280419Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1280712Z return mod(**inputs) 2025-08-14T21:35:56.1281063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1281424Z outputs = self.deberta( 2025-08-14T21:35:56.1281776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1282140Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1282498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1282873Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1283209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1283535Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1283900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1284284Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1284812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1285241Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1285681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1286068Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1286195Z 2025-08-14T21:35:56.1286335Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1286654Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1286938Z return mod(**inputs) 2025-08-14T21:35:56.1287292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1287666Z outputs = self.deberta( 2025-08-14T21:35:56.1288016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1288411Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1288776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1289159Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1289489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1289822Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1290189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1290596Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1290993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1291364Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1291492Z 2025-08-14T21:35:56.1291591Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1291915Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1292214Z return mod(**inputs) 2025-08-14T21:35:56.1292562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1292933Z outputs = self.deberta( 2025-08-14T21:35:56.1293277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1293646Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1294012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1294385Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1294720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1295053Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1295302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1295418Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1295666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1295761Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1295954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1296014Z return self.act(input) 2025-08-14T21:35:56.1296017Z 2025-08-14T21:35:56.1296111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1296290Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1296347Z return mod(**inputs) 2025-08-14T21:35:56.1296614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1296676Z outputs = self.deberta( 2025-08-14T21:35:56.1296980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1297047Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1297293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1297372Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1297574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1297642Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1297908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1298029Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1298284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1298359Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1298363Z 2025-08-14T21:35:56.1298455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1298649Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1298706Z return mod(**inputs) 2025-08-14T21:35:56.1298969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1299032Z outputs = self.deberta( 2025-08-14T21:35:56.1299283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1299354Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1299607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1299685Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1299897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1299967Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1300224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1300309Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1300560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1300641Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1300894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1301069Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1301073Z 2025-08-14T21:35:56.1301162Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1301347Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1301413Z return mod(**inputs) 2025-08-14T21:35:56.1301668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1301730Z outputs = self.deberta( 2025-08-14T21:35:56.1301989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1302050Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1302319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1302424Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1302629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1302708Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1302955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1303047Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1303296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1303390Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1303645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1303814Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1303818Z 2025-08-14T21:35:56.1303920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1304101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1304160Z return mod(**inputs) 2025-08-14T21:35:56.1304420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1304481Z outputs = self.deberta( 2025-08-14T21:35:56.1304729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1304878Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1305141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1305227Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1305436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1305509Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1305765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1305849Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1306098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1306178Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1306427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1306606Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1306892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1307015Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1307026Z 2025-08-14T21:35:56.1307121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1307304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1307369Z return mod(**inputs) 2025-08-14T21:35:56.1307624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1307687Z outputs = self.deberta( 2025-08-14T21:35:56.1307960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1308025Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1308297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1308393Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1308592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1308670Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1308915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1308997Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1309266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1309336Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1309590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1309786Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1309790Z 2025-08-14T21:35:56.1309883Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1310068Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1310126Z return mod(**inputs) 2025-08-14T21:35:56.1310383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1310444Z outputs = self.deberta( 2025-08-14T21:35:56.1310693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1310763Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1311012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1311090Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1311299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1311369Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1311622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1311703Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1311947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1312025Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1312269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1312470Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1312473Z 2025-08-14T21:35:56.1312567Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1312746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1312811Z return mod(**inputs) 2025-08-14T21:35:56.1313059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1313126Z outputs = self.deberta( 2025-08-14T21:35:56.1313375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1313451Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1313721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1313812Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1314013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1314092Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1314335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1314424Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1314669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1314753Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1315007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1315184Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1315188Z 2025-08-14T21:35:56.1315286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1315464Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1315520Z return mod(**inputs) 2025-08-14T21:35:56.1315778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1315838Z outputs = self.deberta( 2025-08-14T21:35:56.1316080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1316155Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1316402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1316488Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1316688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1316758Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1317008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1317090Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1317341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1317411Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1317657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1317837Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1318121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1318247Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1318250Z 2025-08-14T21:35:56.1318344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1318526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1318591Z return mod(**inputs) 2025-08-14T21:35:56.1318842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1318905Z outputs = self.deberta( 2025-08-14T21:35:56.1319174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1319242Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1319527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1319606Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1319808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1319888Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1320134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1320233Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1320488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1320556Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1320811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1320875Z context_layer = torch.bmm( 2025-08-14T21:35:56.1320879Z 2025-08-14T21:35:56.1320973Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1321162Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1321222Z return mod(**inputs) 2025-08-14T21:35:56.1321477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1321536Z outputs = self.deberta( 2025-08-14T21:35:56.1321784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1321857Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1322106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1322184Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1322391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1322463Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1322715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1322796Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1323040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1323116Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1323366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1323547Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1323551Z 2025-08-14T21:35:56.1323645Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1323823Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1323890Z return mod(**inputs) 2025-08-14T21:35:56.1324143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1324209Z outputs = self.deberta( 2025-08-14T21:35:56.1324456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1324520Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1324787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1324880Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1325101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1325178Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1325421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1325509Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1325755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1325892Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1326147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1326222Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1326225Z 2025-08-14T21:35:56.1326325Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1326504Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1326563Z return mod(**inputs) 2025-08-14T21:35:56.1326817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1326878Z outputs = self.deberta( 2025-08-14T21:35:56.1327123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1327195Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1327441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1327524Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1327725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1327797Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1328050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1328157Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1328406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1328478Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1328483Z 2025-08-14T21:35:56.1328574Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1328760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1328818Z return mod(**inputs) 2025-08-14T21:35:56.1329068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1329135Z outputs = self.deberta( 2025-08-14T21:35:56.1329379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1329448Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1329694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1329770Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1329974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1330044Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1330312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1330435Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1330701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1330808Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1331002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1331064Z return self.act(input) 2025-08-14T21:35:56.1331068Z 2025-08-14T21:35:56.1331168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1331367Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1331433Z return mod(**inputs) 2025-08-14T21:35:56.1331685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1331747Z outputs = self.deberta( 2025-08-14T21:35:56.1332004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1332068Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1332314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1332398Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1332600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1332677Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1332925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1333047Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1333304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1333381Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1333384Z 2025-08-14T21:35:56.1333480Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1333660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1333715Z return mod(**inputs) 2025-08-14T21:35:56.1333967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1334026Z outputs = self.deberta( 2025-08-14T21:35:56.1334276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1334339Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1334583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1334660Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1334860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1334931Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1335185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1335268Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1335521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1335592Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1335852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1336045Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1336062Z 2025-08-14T21:35:56.1336156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1336345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1336403Z return mod(**inputs) 2025-08-14T21:35:56.1336656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1336723Z outputs = self.deberta( 2025-08-14T21:35:56.1336976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1337059Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1337316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1337395Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1337606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1337678Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1337925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1338017Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1338266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1338344Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1338593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1338761Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1338766Z 2025-08-14T21:35:56.1338866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1339051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1339108Z return mod(**inputs) 2025-08-14T21:35:56.1339368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1339430Z outputs = self.deberta( 2025-08-14T21:35:56.1339682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1339745Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1339995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1340076Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1340279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1340355Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1340602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1340685Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1340936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1341008Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1341270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1341442Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1341732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1341866Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1341869Z 2025-08-14T21:35:56.1341960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1342135Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1342199Z return mod(**inputs) 2025-08-14T21:35:56.1342448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1342530Z outputs = self.deberta( 2025-08-14T21:35:56.1342786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1342851Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1343115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1343192Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1343406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1343477Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1343731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1343819Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1344074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1344144Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1344402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1344600Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1344604Z 2025-08-14T21:35:56.1344705Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1344949Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1345016Z return mod(**inputs) 2025-08-14T21:35:56.1345279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1345343Z outputs = self.deberta( 2025-08-14T21:35:56.1345602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1345667Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1345917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1346003Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1346209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1346281Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1346538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1346619Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1346877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1346947Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1347218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1347460Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1347464Z 2025-08-14T21:35:56.1347562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1347753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1347812Z return mod(**inputs) 2025-08-14T21:35:56.1348060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1348125Z outputs = self.deberta( 2025-08-14T21:35:56.1348386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1348452Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1348706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1348785Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1348993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1349065Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1349311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1349403Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1349647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1349724Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1349973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1350150Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1350155Z 2025-08-14T21:35:56.1350259Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1350442Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1350508Z return mod(**inputs) 2025-08-14T21:35:56.1350759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1350820Z outputs = self.deberta( 2025-08-14T21:35:56.1351065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1351132Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1351381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1351461Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1351659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1351740Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1351984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1352067Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1352320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1352394Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1352654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1352836Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1353148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1353274Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1353278Z 2025-08-14T21:35:56.1353370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1353551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1353618Z return mod(**inputs) 2025-08-14T21:35:56.1353872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1353955Z outputs = self.deberta( 2025-08-14T21:35:56.1354203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1354267Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1354524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1354599Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1354803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1354874Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1355121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1355212Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1355461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1355529Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1355782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1355847Z context_layer = torch.bmm( 2025-08-14T21:35:56.1355850Z 2025-08-14T21:35:56.1355950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1356131Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1356188Z return mod(**inputs) 2025-08-14T21:35:56.1356444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1356503Z outputs = self.deberta( 2025-08-14T21:35:56.1356754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1356820Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1357066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1357149Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1357349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1357420Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1357667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1357746Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1357999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1358068Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1358326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1358523Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1358539Z 2025-08-14T21:35:56.1358634Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1358823Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1358881Z return mod(**inputs) 2025-08-14T21:35:56.1359129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1359198Z outputs = self.deberta( 2025-08-14T21:35:56.1359444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1359524Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1359782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1359859Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1360070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1360143Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1360391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1360478Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1360727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1360844Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1361094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1361169Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1361172Z 2025-08-14T21:35:56.1361275Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1361459Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1361518Z return mod(**inputs) 2025-08-14T21:35:56.1361777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1361840Z outputs = self.deberta( 2025-08-14T21:35:56.1362092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1362159Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1362406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1362491Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1362695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1362774Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1363022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1363131Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1363382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1363452Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1363456Z 2025-08-14T21:35:56.1363551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1363748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1363809Z return mod(**inputs) 2025-08-14T21:35:56.1364078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1364157Z outputs = self.deberta( 2025-08-14T21:35:56.1364408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1364479Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1364727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1364806Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1365007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1365093Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1365348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1365456Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1365705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1365813Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1366009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1366078Z return self.act(input) 2025-08-14T21:35:56.1366081Z 2025-08-14T21:35:56.1366173Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1366357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1366424Z return mod(**inputs) 2025-08-14T21:35:56.1366678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1366748Z outputs = self.deberta( 2025-08-14T21:35:56.1366996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1367061Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1367312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1367387Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1367587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1367664Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1367912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1368037Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1368284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1368355Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1368358Z 2025-08-14T21:35:56.1368457Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1368637Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1368703Z return mod(**inputs) 2025-08-14T21:35:56.1368952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1369015Z outputs = self.deberta( 2025-08-14T21:35:56.1369267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1369351Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1369613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1369711Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1369912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1369986Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1370231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1370314Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1370568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1370653Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1370903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1371076Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1371080Z 2025-08-14T21:35:56.1371172Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1371359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1371417Z return mod(**inputs) 2025-08-14T21:35:56.1371670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1371739Z outputs = self.deberta( 2025-08-14T21:35:56.1371988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1372057Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1372307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1372390Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1372597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1372667Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1372922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1373008Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1373256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1373333Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1373581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1373746Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1373757Z 2025-08-14T21:35:56.1373845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1374025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1374087Z return mod(**inputs) 2025-08-14T21:35:56.1374335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1374397Z outputs = self.deberta( 2025-08-14T21:35:56.1374643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1374710Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1374974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1375049Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1375277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1375358Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1375610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1375701Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1375949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1376033Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1376291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1376463Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1376749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1376879Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1376883Z 2025-08-14T21:35:56.1376975Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1377163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1377221Z return mod(**inputs) 2025-08-14T21:35:56.1377475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1377546Z outputs = self.deberta( 2025-08-14T21:35:56.1377796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1377868Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1378118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1378194Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1378402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1378471Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1378718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1378807Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1379053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1379126Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1379370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1379562Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1379566Z 2025-08-14T21:35:56.1379658Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1379839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1379904Z return mod(**inputs) 2025-08-14T21:35:56.1380155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1380214Z outputs = self.deberta( 2025-08-14T21:35:56.1380480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1380545Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1380809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1380907Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1381109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1381185Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1381430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1381513Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1381767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1381849Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1382100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1382291Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1382295Z 2025-08-14T21:35:56.1382386Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1382569Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1382627Z return mod(**inputs) 2025-08-14T21:35:56.1382887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1382947Z outputs = self.deberta( 2025-08-14T21:35:56.1383193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1383261Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1383505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1383582Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1383788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1383856Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1384103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1384185Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1384431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1384506Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1384956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1385148Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1385154Z 2025-08-14T21:35:56.1385249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1385432Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1385498Z return mod(**inputs) 2025-08-14T21:35:56.1385753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1385816Z outputs = self.deberta( 2025-08-14T21:35:56.1386071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1386140Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1386429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1386526Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1386748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1386828Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1387077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1387170Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1387419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1387517Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1387772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1387946Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1388231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1388358Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1388361Z 2025-08-14T21:35:56.1388455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1388643Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1388704Z return mod(**inputs) 2025-08-14T21:35:56.1388955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1389025Z outputs = self.deberta( 2025-08-14T21:35:56.1389275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1389346Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1389593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1389669Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1389871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1389942Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1390189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1390270Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1390514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1390589Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1390839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1390906Z context_layer = torch.bmm( 2025-08-14T21:35:56.1390909Z 2025-08-14T21:35:56.1391009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1391190Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1391261Z return mod(**inputs) 2025-08-14T21:35:56.1391517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1391579Z outputs = self.deberta( 2025-08-14T21:35:56.1391849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1391931Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1392200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1392302Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1392508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1392587Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1392843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1392926Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1393189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1393273Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1393535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1393716Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1393721Z 2025-08-14T21:35:56.1393816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1394004Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1394064Z return mod(**inputs) 2025-08-14T21:35:56.1394330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1394392Z outputs = self.deberta( 2025-08-14T21:35:56.1394648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1394723Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1394979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1395059Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1395277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1395351Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1395609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1395694Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1395947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1396066Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1396326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1396409Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1396413Z 2025-08-14T21:35:56.1396512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1396698Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1396766Z return mod(**inputs) 2025-08-14T21:35:56.1397023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1397085Z outputs = self.deberta( 2025-08-14T21:35:56.1397346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1397413Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1397689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1397769Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1397990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1398084Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1398336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1398452Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1398706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1398781Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1398798Z 2025-08-14T21:35:56.1398905Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1399094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1399154Z return mod(**inputs) 2025-08-14T21:35:56.1399421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1399484Z outputs = self.deberta( 2025-08-14T21:35:56.1399746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1399812Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1400068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1400153Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1400360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1400442Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1400698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1400810Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1401076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1401179Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1401378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1401452Z return self.act(input) 2025-08-14T21:35:56.1401455Z 2025-08-14T21:35:56.1401549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1401745Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1401807Z return mod(**inputs) 2025-08-14T21:35:56.1402069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1402140Z outputs = self.deberta( 2025-08-14T21:35:56.1402399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1402472Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1402728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1402806Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1403022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1403097Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1403369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1403505Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1403775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1403875Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1403878Z 2025-08-14T21:35:56.1403972Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1404155Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1404221Z return mod(**inputs) 2025-08-14T21:35:56.1404481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1404569Z outputs = self.deberta( 2025-08-14T21:35:56.1404828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1404895Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1405159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1405242Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1405450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1405531Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1405788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1405883Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1406140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1406213Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1406483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1406660Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1406665Z 2025-08-14T21:35:56.1406768Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1406955Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1407016Z return mod(**inputs) 2025-08-14T21:35:56.1407282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1407343Z outputs = self.deberta( 2025-08-14T21:35:56.1407606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1407680Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1407937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1408025Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1408231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1408304Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1408567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1408652Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1408915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1408988Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1409257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1409450Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1409478Z 2025-08-14T21:35:56.1409576Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1409761Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1409829Z return mod(**inputs) 2025-08-14T21:35:56.1410086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1410157Z outputs = self.deberta( 2025-08-14T21:35:56.1410415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1410498Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1410757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1410833Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1411046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1411117Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1411366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1411456Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1411704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1411774Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1412031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1412202Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1412493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1412614Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1412617Z 2025-08-14T21:35:56.1412710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1412901Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1412960Z return mod(**inputs) 2025-08-14T21:35:56.1413224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1413287Z outputs = self.deberta( 2025-08-14T21:35:56.1413536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1413607Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1413856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1413942Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1414144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1414213Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1414466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1414548Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1414797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1414884Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1415149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1415396Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1415400Z 2025-08-14T21:35:56.1415492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1415672Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1415739Z return mod(**inputs) 2025-08-14T21:35:56.1415988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1416072Z outputs = self.deberta( 2025-08-14T21:35:56.1416323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1416389Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1416645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1416722Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1416924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1417002Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1417249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1417340Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1417589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1417659Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1417913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1418107Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1418111Z 2025-08-14T21:35:56.1418210Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1418390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1418449Z return mod(**inputs) 2025-08-14T21:35:56.1418707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1418768Z outputs = self.deberta( 2025-08-14T21:35:56.1419019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1419092Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1419340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1419425Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1419625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1419695Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1419950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1420033Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1420287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1420357Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1420617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1420818Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1420839Z 2025-08-14T21:35:56.1420936Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1421126Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1421187Z return mod(**inputs) 2025-08-14T21:35:56.1421439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1421509Z outputs = self.deberta( 2025-08-14T21:35:56.1421756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1421837Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1422095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1422172Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1422381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1422452Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1422698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1422788Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1423036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1423115Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1423365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1423537Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1423830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1423949Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1423952Z 2025-08-14T21:35:56.1424051Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1424233Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1424293Z return mod(**inputs) 2025-08-14T21:35:56.1424555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1424619Z outputs = self.deberta( 2025-08-14T21:35:56.1424942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1425026Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1425276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1425361Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1425563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1425634Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1425888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1425974Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1426241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1426321Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1426592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1426683Z context_layer = torch.bmm( 2025-08-14T21:35:56.1426686Z 2025-08-14T21:35:56.1426783Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1426966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1427033Z return mod(**inputs) 2025-08-14T21:35:56.1427286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1427358Z outputs = self.deberta( 2025-08-14T21:35:56.1427623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1427692Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1427952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1428031Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1428232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1428306Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1428549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1428634Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1428878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1428946Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1429198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1429370Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1429375Z 2025-08-14T21:35:56.1429470Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1429648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1429706Z return mod(**inputs) 2025-08-14T21:35:56.1429959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1430017Z outputs = self.deberta( 2025-08-14T21:35:56.1430259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1430331Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1430579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1430659Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1430859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1430930Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1431174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1431253Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1431496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1431599Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1431858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1431939Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1431942Z 2025-08-14T21:35:56.1432066Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1432253Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1432308Z return mod(**inputs) 2025-08-14T21:35:56.1432554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1432624Z outputs = self.deberta( 2025-08-14T21:35:56.1432870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1432953Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1433210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1433287Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1433495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1433568Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1433815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1433928Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1434175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1434248Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1434260Z 2025-08-14T21:35:56.1434352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1434533Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1434597Z return mod(**inputs) 2025-08-14T21:35:56.1434851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1434913Z outputs = self.deberta( 2025-08-14T21:35:56.1435169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1435233Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1435490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1435567Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1435769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1435847Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1436099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1436206Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1436460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1436562Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1436761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1436824Z return self.act(input) 2025-08-14T21:35:56.1436828Z 2025-08-14T21:35:56.1436917Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1437108Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1437166Z return mod(**inputs) 2025-08-14T21:35:56.1437437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1437499Z outputs = self.deberta( 2025-08-14T21:35:56.1437773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1437847Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1438092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1438169Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1438369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1438452Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1438703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1438821Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1439066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1439142Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1439145Z 2025-08-14T21:35:56.1439236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1439418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1439472Z return mod(**inputs) 2025-08-14T21:35:56.1439718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1439784Z outputs = self.deberta( 2025-08-14T21:35:56.1440031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1440091Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1440341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1440416Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1440614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1440683Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1440924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1441011Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1441252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1441330Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1441577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1441751Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1441755Z 2025-08-14T21:35:56.1441853Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1442031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1442099Z return mod(**inputs) 2025-08-14T21:35:56.1442352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1442412Z outputs = self.deberta( 2025-08-14T21:35:56.1442665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1442744Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1442993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1443107Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1443311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1443389Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1443638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1443722Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1443979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1444064Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1444315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1444483Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1444488Z 2025-08-14T21:35:56.1444581Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1444761Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1444816Z return mod(**inputs) 2025-08-14T21:35:56.1445062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1445123Z outputs = self.deberta( 2025-08-14T21:35:56.1445367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1445436Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1445684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1445756Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1445965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1446033Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1446281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1446361Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1446603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1446672Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1446918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1447085Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1447372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1447490Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1447493Z 2025-08-14T21:35:56.1447585Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1447763Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1447818Z return mod(**inputs) 2025-08-14T21:35:56.1448077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1448139Z outputs = self.deberta( 2025-08-14T21:35:56.1448408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1448475Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1448743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1448843Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1449044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1449115Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1449366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1449449Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1449719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1449789Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1450037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1450235Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1450238Z 2025-08-14T21:35:56.1450329Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1450520Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1450580Z return mod(**inputs) 2025-08-14T21:35:56.1450833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1450905Z outputs = self.deberta( 2025-08-14T21:35:56.1451157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1451229Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1451479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1451558Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1451767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1451837Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1452082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1452171Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1452419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1452498Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1452746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1452942Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1452946Z 2025-08-14T21:35:56.1453047Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1453228Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1453296Z return mod(**inputs) 2025-08-14T21:35:56.1453546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1453607Z outputs = self.deberta( 2025-08-14T21:35:56.1453868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1453954Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1454218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1454319Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1454520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1454596Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1454841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1454922Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1455175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1455260Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1455514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1455692Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1455697Z 2025-08-14T21:35:56.1455790Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1455978Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1456037Z return mod(**inputs) 2025-08-14T21:35:56.1456290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1456362Z outputs = self.deberta( 2025-08-14T21:35:56.1456611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1456703Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1457017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1457099Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1457309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1457382Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1457632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1457714Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1457991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1458085Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1458364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1458535Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1458830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1458949Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1458953Z 2025-08-14T21:35:56.1459052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1459234Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1459293Z return mod(**inputs) 2025-08-14T21:35:56.1459551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1459613Z outputs = self.deberta( 2025-08-14T21:35:56.1459890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1459987Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1460253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1460337Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1460538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1460616Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1460865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1460962Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1461219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1461287Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1461535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1461608Z context_layer = torch.bmm( 2025-08-14T21:35:56.1461612Z 2025-08-14T21:35:56.1461706Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1461895Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1461954Z return mod(**inputs) 2025-08-14T21:35:56.1462206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1462276Z outputs = self.deberta( 2025-08-14T21:35:56.1462528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1462603Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1462853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1462933Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1463142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1463213Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1463460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1463550Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1463795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1463872Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1464120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1464292Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1464297Z 2025-08-14T21:35:56.1464397Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1464604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1464671Z return mod(**inputs) 2025-08-14T21:35:56.1464985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1465054Z outputs = self.deberta( 2025-08-14T21:35:56.1465317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1465386Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1465691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1465794Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1466012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1466092Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1466339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1466422Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1466677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1466803Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1467071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1467150Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1467158Z 2025-08-14T21:35:56.1467252Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1467447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1467506Z return mod(**inputs) 2025-08-14T21:35:56.1467764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1467832Z outputs = self.deberta( 2025-08-14T21:35:56.1468087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1468162Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1468420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1468497Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1468714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1468787Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1469050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1469160Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1469416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1469498Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1469503Z 2025-08-14T21:35:56.1469598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1469785Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1469854Z return mod(**inputs) 2025-08-14T21:35:56.1470111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1470181Z outputs = self.deberta( 2025-08-14T21:35:56.1470437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1470502Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1470764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1470842Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1471055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1471125Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1471394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1471538Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1471796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1471900Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1472103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1472167Z return self.act(input) 2025-08-14T21:35:56.1472171Z 2025-08-14T21:35:56.1472271Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1472476Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1472538Z return mod(**inputs) 2025-08-14T21:35:56.1472805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1472869Z outputs = self.deberta( 2025-08-14T21:35:56.1473131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1473197Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1473452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1473537Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1473741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1473814Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1474076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1474201Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1474462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1474540Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1474544Z 2025-08-14T21:35:56.1474637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1474829Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1474890Z return mod(**inputs) 2025-08-14T21:35:56.1475154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1475219Z outputs = self.deberta( 2025-08-14T21:35:56.1475475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1475547Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1475804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1475884Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1476096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1476168Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1476430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1476518Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1476771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1476852Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1477122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1477336Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1477340Z 2025-08-14T21:35:56.1477437Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1477629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1477699Z return mod(**inputs) 2025-08-14T21:35:56.1477964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1478028Z outputs = self.deberta( 2025-08-14T21:35:56.1478309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1478377Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1478648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1478727Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1478929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1479006Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1479255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1479346Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1479592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1479662Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1479921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1480085Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1480089Z 2025-08-14T21:35:56.1480189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1480370Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1480428Z return mod(**inputs) 2025-08-14T21:35:56.1480687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1480748Z outputs = self.deberta( 2025-08-14T21:35:56.1480994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1481069Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1481318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1481403Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1481608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1481679Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1481933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1482015Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1482262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1482341Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1482601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1482782Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1483081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1483214Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1483225Z 2025-08-14T21:35:56.1483319Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1483503Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1483569Z return mod(**inputs) 2025-08-14T21:35:56.1483819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1483897Z outputs = self.deberta( 2025-08-14T21:35:56.1484156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1484223Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1484480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1484556Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1484904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1484992Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1485247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1485334Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1485604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1485677Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1485942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1486147Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1486151Z 2025-08-14T21:35:56.1486247Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1486448Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1486508Z return mod(**inputs) 2025-08-14T21:35:56.1486769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1486833Z outputs = self.deberta( 2025-08-14T21:35:56.1487082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1487156Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1487407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1487485Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1487694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1487767Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1488022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1488105Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1488356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1488489Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1488736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1488977Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1488981Z 2025-08-14T21:35:56.1489077Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1489258Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1489327Z return mod(**inputs) 2025-08-14T21:35:56.1489578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1489639Z outputs = self.deberta( 2025-08-14T21:35:56.1489921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1489988Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1490249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1490327Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1490528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1490607Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1490856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1490946Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1491194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1491263Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1491519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1491695Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1491700Z 2025-08-14T21:35:56.1491799Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1491980Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1492039Z return mod(**inputs) 2025-08-14T21:35:56.1492299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1492359Z outputs = self.deberta( 2025-08-14T21:35:56.1492607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1492680Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1492930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1493014Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1493217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1493288Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1493541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1493624Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1493880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1493950Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1494212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1494395Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1495258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1495378Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1495389Z 2025-08-14T21:35:56.1495483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1495662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1495727Z return mod(**inputs) 2025-08-14T21:35:56.1495980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1496057Z outputs = self.deberta( 2025-08-14T21:35:56.1496321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1496387Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1496648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1496724Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1496927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1497006Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1497260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1497344Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1497606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1497675Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1497932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1497999Z context_layer = torch.bmm( 2025-08-14T21:35:56.1498002Z 2025-08-14T21:35:56.1498094Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1498282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1498343Z return mod(**inputs) 2025-08-14T21:35:56.1498606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1498667Z outputs = self.deberta( 2025-08-14T21:35:56.1498917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1498990Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1499243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1499320Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1499529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1499600Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1499857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1499941Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1500193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1500268Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1500535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1500729Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1500746Z 2025-08-14T21:35:56.1500842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1501023Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1501089Z return mod(**inputs) 2025-08-14T21:35:56.1501339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1501399Z outputs = self.deberta( 2025-08-14T21:35:56.1501655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1501738Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1501995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1502074Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1502278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1502355Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1502602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1502691Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1502937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1503045Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1503301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1503375Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1503382Z 2025-08-14T21:35:56.1503479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1503659Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1503719Z return mod(**inputs) 2025-08-14T21:35:56.1503977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1504037Z outputs = self.deberta( 2025-08-14T21:35:56.1504285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1504357Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1504605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1504688Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1504958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1505035Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1505297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1505408Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1505658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1505743Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1505749Z 2025-08-14T21:35:56.1505843Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1506049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1506112Z return mod(**inputs) 2025-08-14T21:35:56.1506395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1506479Z outputs = self.deberta( 2025-08-14T21:35:56.1506726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1506798Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1507046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1507121Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1507347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1507419Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1507671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1507789Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1508036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1508143Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1508338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1508403Z return self.act(input) 2025-08-14T21:35:56.1508406Z 2025-08-14T21:35:56.1508505Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1508687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1508753Z return mod(**inputs) 2025-08-14T21:35:56.1509006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1509069Z outputs = self.deberta( 2025-08-14T21:35:56.1509326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1509391Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1509639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1509724Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1509926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1510005Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1510254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1510378Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1510640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1510717Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1510720Z 2025-08-14T21:35:56.1510820Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1510998Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1511057Z return mod(**inputs) 2025-08-14T21:35:56.1511315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1511377Z outputs = self.deberta( 2025-08-14T21:35:56.1511638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1511714Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1511980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1512078Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1512279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1512348Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1512602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1512686Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1512944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1513028Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1513276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1513457Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1513461Z 2025-08-14T21:35:56.1513551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1513738Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1513797Z return mod(**inputs) 2025-08-14T21:35:56.1514046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1514114Z outputs = self.deberta( 2025-08-14T21:35:56.1514361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1514426Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1514679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1514757Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1514961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1515031Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1515275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1515369Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1515614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1515685Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1515937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1516103Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1516107Z 2025-08-14T21:35:56.1516206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1516384Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1516443Z return mod(**inputs) 2025-08-14T21:35:56.1516699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1516760Z outputs = self.deberta( 2025-08-14T21:35:56.1517010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1517075Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1517336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1517420Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1517657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1517737Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1517980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1518062Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1518314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1518399Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1518647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1518825Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1519110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1519237Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1519241Z 2025-08-14T21:35:56.1519334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1519515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1519580Z return mod(**inputs) 2025-08-14T21:35:56.1519832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1519902Z outputs = self.deberta( 2025-08-14T21:35:56.1520151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1520218Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1520476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1520554Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1520754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1520833Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1521078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1521169Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1521418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1521491Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1521746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1521942Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1521946Z 2025-08-14T21:35:56.1522047Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1522229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1522289Z return mod(**inputs) 2025-08-14T21:35:56.1522547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1522609Z outputs = self.deberta( 2025-08-14T21:35:56.1522878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1522947Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1523207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1523305Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1523507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1523580Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1523835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1523917Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1524186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1524258Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1524504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1524706Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1524711Z 2025-08-14T21:35:56.1524804Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1524989Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1525047Z return mod(**inputs) 2025-08-14T21:35:56.1525296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1525362Z outputs = self.deberta( 2025-08-14T21:35:56.1525610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1525676Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1525933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1526011Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1526216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1526287Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1526532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1526622Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1526867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1526944Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1527190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1527364Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1527369Z 2025-08-14T21:35:56.1527469Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1527649Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1527707Z return mod(**inputs) 2025-08-14T21:35:56.1527965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1528027Z outputs = self.deberta( 2025-08-14T21:35:56.1528279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1528346Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1528606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1528708Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1528924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1529001Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1529250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1529338Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1529591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1529675Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1529924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1530107Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1530389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1530514Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1530517Z 2025-08-14T21:35:56.1530609Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1530786Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1530853Z return mod(**inputs) 2025-08-14T21:35:56.1531104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1531173Z outputs = self.deberta( 2025-08-14T21:35:56.1531422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1531486Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1531744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1531821Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1532025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1532096Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1532337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1532428Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1532675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1532743Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1532994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1533060Z context_layer = torch.bmm( 2025-08-14T21:35:56.1533063Z 2025-08-14T21:35:56.1533161Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1533339Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1533397Z return mod(**inputs) 2025-08-14T21:35:56.1533652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1533713Z outputs = self.deberta( 2025-08-14T21:35:56.1533965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1534044Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1534307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1534406Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1534606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1534675Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1534929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1535012Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1535266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1535351Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1535601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1535783Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1535788Z 2025-08-14T21:35:56.1535880Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1536066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1536124Z return mod(**inputs) 2025-08-14T21:35:56.1536376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1536444Z outputs = self.deberta( 2025-08-14T21:35:56.1536692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1536759Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1537015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1537094Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1537302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1537373Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1537620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1537709Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1537956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1538070Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1538321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1538396Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1538402Z 2025-08-14T21:35:56.1538500Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1538678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1538737Z return mod(**inputs) 2025-08-14T21:35:56.1538997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1539059Z outputs = self.deberta( 2025-08-14T21:35:56.1539315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1539381Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1539650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1539736Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1539952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1540045Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1540295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1540401Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1540655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1540730Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1540747Z 2025-08-14T21:35:56.1540841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1541029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1541088Z return mod(**inputs) 2025-08-14T21:35:56.1541348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1541411Z outputs = self.deberta( 2025-08-14T21:35:56.1541657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1541728Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1541973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1542056Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1542256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1542325Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1542580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1542688Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1542934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1543043Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1543236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1543309Z return self.act(input) 2025-08-14T21:35:56.1543312Z 2025-08-14T21:35:56.1543404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1543585Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1543654Z return mod(**inputs) 2025-08-14T21:35:56.1543906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1543976Z outputs = self.deberta( 2025-08-14T21:35:56.1544223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1544288Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1544541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1544618Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1544882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1544972Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1545246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1545376Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1545640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1545731Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1545735Z 2025-08-14T21:35:56.1545836Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1546019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1546087Z return mod(**inputs) 2025-08-14T21:35:56.1546338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1546417Z outputs = self.deberta( 2025-08-14T21:35:56.1546673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1546737Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1546986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1547068Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1547266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1547341Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1547585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1547668Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1547920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1547989Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1548242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1548416Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1548419Z 2025-08-14T21:35:56.1548511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1548697Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1548755Z return mod(**inputs) 2025-08-14T21:35:56.1549004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1549074Z outputs = self.deberta( 2025-08-14T21:35:56.1549321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1549394Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1549642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1549721Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1549928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1549999Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1550251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1550336Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1550583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1550662Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1550925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1551108Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1551133Z 2025-08-14T21:35:56.1551227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1551408Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1551474Z return mod(**inputs) 2025-08-14T21:35:56.1551728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1551791Z outputs = self.deberta( 2025-08-14T21:35:56.1552049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1552130Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1552388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1552467Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1552669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1552747Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1552993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1553076Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1553330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1553400Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1553654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1553823Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1554106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1554235Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1554238Z 2025-08-14T21:35:56.1554330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1554517Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1554575Z return mod(**inputs) 2025-08-14T21:35:56.1554825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1554895Z outputs = self.deberta( 2025-08-14T21:35:56.1555141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1555212Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1555463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1555539Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1555744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1555814Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1556059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1556149Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1556413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1556493Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1556752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1556961Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1556965Z 2025-08-14T21:35:56.1557069Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1557251Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1557318Z return mod(**inputs) 2025-08-14T21:35:56.1557569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1557655Z outputs = self.deberta( 2025-08-14T21:35:56.1557911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1557977Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1558226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1558312Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1558511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1558591Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1558837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1558921Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1559174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1559244Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1559498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1559693Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1559696Z 2025-08-14T21:35:56.1559788Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1559975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1560034Z return mod(**inputs) 2025-08-14T21:35:56.1560293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1560355Z outputs = self.deberta( 2025-08-14T21:35:56.1560601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1560674Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1560923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1561003Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1561210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1561279Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1561530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1561613Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1561857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1561935Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1562198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1562394Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1562410Z 2025-08-14T21:35:56.1562504Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1562684Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1562748Z return mod(**inputs) 2025-08-14T21:35:56.1563000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1563061Z outputs = self.deberta( 2025-08-14T21:35:56.1563315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1563393Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1563650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1563728Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1563931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1564009Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1564259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1564348Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1564596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1564665Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1564923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1565094Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1565381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1565508Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1565512Z 2025-08-14T21:35:56.1565603Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1565793Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1565853Z return mod(**inputs) 2025-08-14T21:35:56.1566107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1566180Z outputs = self.deberta( 2025-08-14T21:35:56.1566430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1566503Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1566756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1566833Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1567044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1567115Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1567363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1567455Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1567722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1567800Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1568065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1568146Z context_layer = torch.bmm( 2025-08-14T21:35:56.1568149Z 2025-08-14T21:35:56.1568251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1568433Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1568500Z return mod(**inputs) 2025-08-14T21:35:56.1568751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1568828Z outputs = self.deberta( 2025-08-14T21:35:56.1569090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1569157Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1569408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1569492Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1569693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1569772Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1570018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1570101Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1570357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1570427Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1570681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1570853Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1570858Z 2025-08-14T21:35:56.1570952Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1571140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1571198Z return mod(**inputs) 2025-08-14T21:35:56.1571456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1571519Z outputs = self.deberta( 2025-08-14T21:35:56.1571766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1571840Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1572089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1572168Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1572377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1572446Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1572703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1572787Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1573036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1573152Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1573418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1573502Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1573535Z 2025-08-14T21:35:56.1573630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1573811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1573877Z return mod(**inputs) 2025-08-14T21:35:56.1574127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1574188Z outputs = self.deberta( 2025-08-14T21:35:56.1574443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1574521Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1574775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1574851Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1575051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1575129Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1575375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1575482Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1575736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1575809Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1575814Z 2025-08-14T21:35:56.1575910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1576089Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1576147Z return mod(**inputs) 2025-08-14T21:35:56.1576406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1576467Z outputs = self.deberta( 2025-08-14T21:35:56.1576717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1576782Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1577030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1577114Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1577318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1577388Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1577642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1577750Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1578006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1578106Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1578299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1578370Z return self.act(input) 2025-08-14T21:35:56.1578373Z 2025-08-14T21:35:56.1578465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1578656Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1578716Z return mod(**inputs) 2025-08-14T21:35:56.1578983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1579070Z outputs = self.deberta( 2025-08-14T21:35:56.1579346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1579413Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1579672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1579748Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1579957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1580044Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1580289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1580417Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1580665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1580746Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1580750Z 2025-08-14T21:35:56.1580843Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1581023Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1581090Z return mod(**inputs) 2025-08-14T21:35:56.1581341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1581405Z outputs = self.deberta( 2025-08-14T21:35:56.1581659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1581725Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1581979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1582055Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1582253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1582331Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1582574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1582667Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1582913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1582983Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1583234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1583406Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1583410Z 2025-08-14T21:35:56.1583508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1583688Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1583746Z return mod(**inputs) 2025-08-14T21:35:56.1584000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1584061Z outputs = self.deberta( 2025-08-14T21:35:56.1584307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1584394Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1584752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1584956Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1585176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1585251Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1585525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1585617Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1585897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1586007Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1586271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1586454Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1586459Z 2025-08-14T21:35:56.1586558Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1586753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1586822Z return mod(**inputs) 2025-08-14T21:35:56.1587075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1587148Z outputs = self.deberta( 2025-08-14T21:35:56.1587416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1587488Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1587765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1587847Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1588072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1588147Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1588415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1588513Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1588779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1588854Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1589132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1589314Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1589628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1589756Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1589759Z 2025-08-14T21:35:56.1589857Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1590058Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1590120Z return mod(**inputs) 2025-08-14T21:35:56.1590402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1590469Z outputs = self.deberta( 2025-08-14T21:35:56.1590759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1590841Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1591148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1591232Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1591458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1591533Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1591810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1591899Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1592185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1592266Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1592536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1592757Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1592761Z 2025-08-14T21:35:56.1592862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1593058Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1593128Z return mod(**inputs) 2025-08-14T21:35:56.1593399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1593474Z outputs = self.deberta( 2025-08-14T21:35:56.1593743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1593813Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1594089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1594173Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1594389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1594475Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1594743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1594838Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1595107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1595181Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1595458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1595671Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1595675Z 2025-08-14T21:35:56.1595774Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1595956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1596015Z return mod(**inputs) 2025-08-14T21:35:56.1596274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1596336Z outputs = self.deberta( 2025-08-14T21:35:56.1596584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1596694Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1596955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1597056Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1597258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1597328Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1597581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1597664Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1597916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1597999Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1598247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1598430Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1598434Z 2025-08-14T21:35:56.1598526Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1598714Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1598775Z return mod(**inputs) 2025-08-14T21:35:56.1599026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1599095Z outputs = self.deberta( 2025-08-14T21:35:56.1599342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1599408Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1599662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1599739Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1599951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1600021Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1600268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1600359Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1600604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1600674Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1600927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1601103Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1601393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1601511Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1601515Z 2025-08-14T21:35:56.1601607Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1601796Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1601855Z return mod(**inputs) 2025-08-14T21:35:56.1602115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1602177Z outputs = self.deberta( 2025-08-14T21:35:56.1602436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1602540Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1602802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1602887Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1603085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1603156Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1603407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1603506Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1603753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1603828Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1604077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1604149Z context_layer = torch.bmm( 2025-08-14T21:35:56.1604153Z 2025-08-14T21:35:56.1604247Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1604428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1604492Z return mod(**inputs) 2025-08-14T21:35:56.1604741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1604811Z outputs = self.deberta( 2025-08-14T21:35:56.1605056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1605121Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1605372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1605448Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1605646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1605721Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1605967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1606053Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1606300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1606369Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1606621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1606793Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1606797Z 2025-08-14T21:35:56.1606895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1607073Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1607133Z return mod(**inputs) 2025-08-14T21:35:56.1607392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1607454Z outputs = self.deberta( 2025-08-14T21:35:56.1607701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1607774Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1608033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1608148Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1608351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1608421Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1608674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1608755Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1609005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1609126Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1609375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1609457Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1609463Z 2025-08-14T21:35:56.1609558Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1609740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1609808Z return mod(**inputs) 2025-08-14T21:35:56.1610061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1610132Z outputs = self.deberta( 2025-08-14T21:35:56.1610380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1610451Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1610707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1610786Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1610996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1611071Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1611321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1611441Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1611690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1611767Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1611781Z 2025-08-14T21:35:56.1611875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1612059Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1612129Z return mod(**inputs) 2025-08-14T21:35:56.1612381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1612448Z outputs = self.deberta( 2025-08-14T21:35:56.1612705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1612773Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1613028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1613108Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1613313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1613394Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1613657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1613792Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1614049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1614149Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1614350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1614412Z return self.act(input) 2025-08-14T21:35:56.1614416Z 2025-08-14T21:35:56.1614507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1614711Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1614770Z return mod(**inputs) 2025-08-14T21:35:56.1615032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1615094Z outputs = self.deberta( 2025-08-14T21:35:56.1615344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1615416Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1615666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1615743Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1615949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1616021Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1616279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1616400Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1616652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1616735Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1616739Z 2025-08-14T21:35:56.1616829Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1617017Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1617076Z return mod(**inputs) 2025-08-14T21:35:56.1617328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1617398Z outputs = self.deberta( 2025-08-14T21:35:56.1617649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1617712Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1617969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1618048Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1618256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1618325Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1618574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1618665Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1618912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1618990Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1619252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1619453Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1619457Z 2025-08-14T21:35:56.1619558Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1619737Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1619796Z return mod(**inputs) 2025-08-14T21:35:56.1620055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1620116Z outputs = self.deberta( 2025-08-14T21:35:56.1620386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1620452Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1620701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1620786Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1620986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1621063Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1621308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1621389Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1621643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1621712Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1621960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1622130Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1622134Z 2025-08-14T21:35:56.1622227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1622411Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1622469Z return mod(**inputs) 2025-08-14T21:35:56.1622721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1622788Z outputs = self.deberta( 2025-08-14T21:35:56.1623033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1623106Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1623353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1623430Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1623639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1623710Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1623956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1624047Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1624294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1624373Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1624640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1624874Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1625218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1625341Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1625344Z 2025-08-14T21:35:56.1625449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1625634Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1625693Z return mod(**inputs) 2025-08-14T21:35:56.1625954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1626033Z outputs = self.deberta( 2025-08-14T21:35:56.1626292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1626359Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1626608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1626695Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1626896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1626968Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1627224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1627307Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1627569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1627638Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1627884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1628087Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1628090Z 2025-08-14T21:35:56.1628183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1628375Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1628433Z return mod(**inputs) 2025-08-14T21:35:56.1628683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1628753Z outputs = self.deberta( 2025-08-14T21:35:56.1629001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1629066Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1629319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1629396Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1629602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1629672Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1629916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1630007Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1630254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1630346Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1630609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1630817Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1630821Z 2025-08-14T21:35:56.1630923Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1631104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1631170Z return mod(**inputs) 2025-08-14T21:35:56.1631426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1631490Z outputs = self.deberta( 2025-08-14T21:35:56.1631772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1631838Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1632085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1632172Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1632372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1632450Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1632695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1632777Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1633029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1633097Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1633351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1633526Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1633531Z 2025-08-14T21:35:56.1633623Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1633808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1633867Z return mod(**inputs) 2025-08-14T21:35:56.1634118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1634185Z outputs = self.deberta( 2025-08-14T21:35:56.1634432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1634505Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1634751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1634829Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1635036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1635106Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1635356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1635438Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1635682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1635756Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1636017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1636205Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1636510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1636629Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1636633Z 2025-08-14T21:35:56.1636731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1636910Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1636969Z return mod(**inputs) 2025-08-14T21:35:56.1637227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1637303Z outputs = self.deberta( 2025-08-14T21:35:56.1637562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1637629Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1637879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1637963Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1638161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1638231Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1638482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1638566Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1638822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1638890Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1639139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1639211Z context_layer = torch.bmm( 2025-08-14T21:35:56.1639215Z 2025-08-14T21:35:56.1639308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1639493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1639553Z return mod(**inputs) 2025-08-14T21:35:56.1639807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1639876Z outputs = self.deberta( 2025-08-14T21:35:56.1640126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1640193Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1640451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1640529Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1640736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1640807Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1641056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1641148Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1641396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1641472Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1641733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1641928Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1641945Z 2025-08-14T21:35:56.1642048Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1642230Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1642295Z return mod(**inputs) 2025-08-14T21:35:56.1642547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1642609Z outputs = self.deberta( 2025-08-14T21:35:56.1642865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1642955Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1643205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1643291Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1643493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1643568Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1643819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1643902Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1644157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1644265Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1644522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1644598Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1644604Z 2025-08-14T21:35:56.1644697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1644882Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1644941Z return mod(**inputs) 2025-08-14T21:35:56.1645192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1645261Z outputs = self.deberta( 2025-08-14T21:35:56.1645511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1645584Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1645837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1645915Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1646124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1646197Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1646446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1646561Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1646810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1646892Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1646896Z 2025-08-14T21:35:56.1646991Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1647187Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1647258Z return mod(**inputs) 2025-08-14T21:35:56.1647520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1647603Z outputs = self.deberta( 2025-08-14T21:35:56.1647859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1647924Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1648185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1648260Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1648477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1648554Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1648800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1648914Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1649159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1649258Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1649459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1649523Z return self.act(input) 2025-08-14T21:35:56.1649526Z 2025-08-14T21:35:56.1649623Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1649806Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1649864Z return mod(**inputs) 2025-08-14T21:35:56.1650123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1650183Z outputs = self.deberta( 2025-08-14T21:35:56.1650431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1650503Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1650744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1650826Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1651024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1651097Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1651350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1651468Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1651720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1651795Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1651799Z 2025-08-14T21:35:56.1651890Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1652079Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1652137Z return mod(**inputs) 2025-08-14T21:35:56.1652387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1652455Z outputs = self.deberta( 2025-08-14T21:35:56.1652718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1652792Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1653052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1653144Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1653351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1653422Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1653677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1653761Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1654025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1654104Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1654352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1654533Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1654537Z 2025-08-14T21:35:56.1654630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1654806Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1654871Z return mod(**inputs) 2025-08-14T21:35:56.1655125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1655185Z outputs = self.deberta( 2025-08-14T21:35:56.1655443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1655509Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1655765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1655841Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1656041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1656117Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1656366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1656454Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1656701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1656772Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1657027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1657191Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1657196Z 2025-08-14T21:35:56.1657288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1657475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1657532Z return mod(**inputs) 2025-08-14T21:35:56.1657791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1657852Z outputs = self.deberta( 2025-08-14T21:35:56.1658100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1658173Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1658436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1658535Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1658759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1658829Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1659079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1659163Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1659407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1659498Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1659746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1659925Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1660211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1660331Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1660335Z 2025-08-14T21:35:56.1660438Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1660617Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1660683Z return mod(**inputs) 2025-08-14T21:35:56.1660934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1660996Z outputs = self.deberta( 2025-08-14T21:35:56.1661250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1661315Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1661563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1661649Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1661850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1661929Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1662178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1662262Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1662518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1662587Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1662844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1663040Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1663043Z 2025-08-14T21:35:56.1663135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1663322Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1663380Z return mod(**inputs) 2025-08-14T21:35:56.1663639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1663702Z outputs = self.deberta( 2025-08-14T21:35:56.1663964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1664036Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1664299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1664399Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1664607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1664676Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1664994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1665082Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1665350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1665429Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1665679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1665881Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1665885Z 2025-08-14T21:35:56.1665978Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1666159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1666226Z return mod(**inputs) 2025-08-14T21:35:56.1666478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1666541Z outputs = self.deberta( 2025-08-14T21:35:56.1666799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1666865Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1667121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1667200Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1667401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1667478Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1667724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1667813Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1668061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1668129Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1668385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1668562Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1668566Z 2025-08-14T21:35:56.1668666Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1668844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1668902Z return mod(**inputs) 2025-08-14T21:35:56.1669162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1669224Z outputs = self.deberta( 2025-08-14T21:35:56.1669470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1669543Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1669804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1669905Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1670120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1670192Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1670451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1670534Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1670786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1670877Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1671125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1671311Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1671599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1671719Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1671729Z 2025-08-14T21:35:56.1671823Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1672004Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1672069Z return mod(**inputs) 2025-08-14T21:35:56.1672320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1672382Z outputs = self.deberta( 2025-08-14T21:35:56.1672636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1672702Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1672956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1673032Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1673233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1673309Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1673554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1673638Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1673892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1673959Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1674216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1674282Z context_layer = torch.bmm( 2025-08-14T21:35:56.1674285Z 2025-08-14T21:35:56.1674378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1674565Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1674623Z return mod(**inputs) 2025-08-14T21:35:56.1674881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1674944Z outputs = self.deberta( 2025-08-14T21:35:56.1675191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1675284Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1675547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1675640Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1675848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1675920Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1676172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1676255Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1676500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1676595Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1676844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1677023Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1677028Z 2025-08-14T21:35:56.1677121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1677300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1677366Z return mod(**inputs) 2025-08-14T21:35:56.1677616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1677677Z outputs = self.deberta( 2025-08-14T21:35:56.1677931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1677996Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1678252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1678331Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1678534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1678611Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1678855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1678943Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1679187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1679293Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1679551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1679626Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1679631Z 2025-08-14T21:35:56.1679722Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1679910Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1679968Z return mod(**inputs) 2025-08-14T21:35:56.1680227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1680287Z outputs = self.deberta( 2025-08-14T21:35:56.1680534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1680605Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1680867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1680954Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1681168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1681253Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1681505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1681612Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1681858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1681939Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1681972Z 2025-08-14T21:35:56.1682068Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1682253Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1682314Z return mod(**inputs) 2025-08-14T21:35:56.1682566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1682635Z outputs = self.deberta( 2025-08-14T21:35:56.1682886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1682958Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1683205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1683282Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1683492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1683563Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1683808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1683923Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1684168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1684275Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1684479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1684545Z return self.act(input) 2025-08-14T21:35:56.1684548Z 2025-08-14T21:35:56.1684815Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1685013Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1685083Z return mod(**inputs) 2025-08-14T21:35:56.1685345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1685411Z outputs = self.deberta( 2025-08-14T21:35:56.1685679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1685747Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1686004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1686090Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1686303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1686383Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1686664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1686790Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1687073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1687169Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1687172Z 2025-08-14T21:35:56.1687278Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1687457Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1687519Z return mod(**inputs) 2025-08-14T21:35:56.1687782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1687868Z outputs = self.deberta( 2025-08-14T21:35:56.1688116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1688186Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1688435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1688521Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1688721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1688792Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1689044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1689130Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1689390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1689460Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1689708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1689890Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1689893Z 2025-08-14T21:35:56.1689987Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1690174Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1690234Z return mod(**inputs) 2025-08-14T21:35:56.1690484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1690551Z outputs = self.deberta( 2025-08-14T21:35:56.1690800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1690867Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1691124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1691201Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1691409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1691477Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1691721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1691812Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1692058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1692128Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1692394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1692576Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1692593Z 2025-08-14T21:35:56.1692694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1692875Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1692933Z return mod(**inputs) 2025-08-14T21:35:56.1693192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1693251Z outputs = self.deberta( 2025-08-14T21:35:56.1693505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1693585Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1693834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1693917Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1694119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1694189Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1694441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1694522Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1694775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1694843Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1695094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1695272Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1695556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1695681Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1695685Z 2025-08-14T21:35:56.1695776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1695954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1696020Z return mod(**inputs) 2025-08-14T21:35:56.1696273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1696341Z outputs = self.deberta( 2025-08-14T21:35:56.1696593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1696658Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1696918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1696994Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1697194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1697270Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1697520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1697612Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1697874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1697945Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1698214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1698421Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1698425Z 2025-08-14T21:35:56.1698525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1698706Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1698766Z return mod(**inputs) 2025-08-14T21:35:56.1699026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1699103Z outputs = self.deberta( 2025-08-14T21:35:56.1699355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1699427Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1699677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1699766Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1699968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1700039Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1700295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1700377Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1700637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1700706Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1700953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1701154Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1701157Z 2025-08-14T21:35:56.1701251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1701443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1701502Z return mod(**inputs) 2025-08-14T21:35:56.1701754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1701823Z outputs = self.deberta( 2025-08-14T21:35:56.1702072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1702136Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1702393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1702470Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1702681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1702751Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1702999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1703088Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1703334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1703410Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1703672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1703862Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1703880Z 2025-08-14T21:35:56.1703982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1704164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1704221Z return mod(**inputs) 2025-08-14T21:35:56.1704479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1704541Z outputs = self.deberta( 2025-08-14T21:35:56.1704847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1704938Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1705188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1705275Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1705475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1705553Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1705801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1705884Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1706138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1706208Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1706457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1706640Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1706923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1707049Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1707053Z 2025-08-14T21:35:56.1707146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1707326Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1707394Z return mod(**inputs) 2025-08-14T21:35:56.1707645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1707714Z outputs = self.deberta( 2025-08-14T21:35:56.1707963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1708030Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1708284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1708358Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1708557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1708633Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1708878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1708967Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1709236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1709306Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1709577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1709656Z context_layer = torch.bmm( 2025-08-14T21:35:56.1709659Z 2025-08-14T21:35:56.1709759Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1709938Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1709995Z return mod(**inputs) 2025-08-14T21:35:56.1710250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1710327Z outputs = self.deberta( 2025-08-14T21:35:56.1710572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1710644Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1710891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1710975Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1711176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1711246Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1711501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1711582Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1711837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1711907Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1712156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1712338Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1712342Z 2025-08-14T21:35:56.1712436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1712626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1712685Z return mod(**inputs) 2025-08-14T21:35:56.1712935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1713004Z outputs = self.deberta( 2025-08-14T21:35:56.1713253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1713317Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1713574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1713654Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1713861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1713931Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1714179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1714268Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1714516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1714630Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1715000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1715079Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1715112Z 2025-08-14T21:35:56.1715217Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1715400Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1715460Z return mod(**inputs) 2025-08-14T21:35:56.1715720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1715782Z outputs = self.deberta( 2025-08-14T21:35:56.1716036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1716118Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1716371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1716459Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1716664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1716744Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1716995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1717104Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1717362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1717437Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1717442Z 2025-08-14T21:35:56.1717535Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1717727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1717788Z return mod(**inputs) 2025-08-14T21:35:56.1718051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1718113Z outputs = self.deberta( 2025-08-14T21:35:56.1718362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1718437Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1718687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1718769Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1718973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1719043Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1719300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1719409Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1719660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1719768Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1719961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1720034Z return self.act(input) 2025-08-14T21:35:56.1720037Z 2025-08-14T21:35:56.1720129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1720313Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1720379Z return mod(**inputs) 2025-08-14T21:35:56.1720652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1720728Z outputs = self.deberta( 2025-08-14T21:35:56.1720998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1721062Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1721317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1721394Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1721594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1721690Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1721935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1722060Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1722305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1722380Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1722384Z 2025-08-14T21:35:56.1722481Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1722660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1722725Z return mod(**inputs) 2025-08-14T21:35:56.1722973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1723034Z outputs = self.deberta( 2025-08-14T21:35:56.1723286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1723349Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1723595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1723680Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1723881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1723958Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1724201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1724285Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1724537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1724606Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1724851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1725030Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1725033Z 2025-08-14T21:35:56.1725124Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1725313Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1725371Z return mod(**inputs) 2025-08-14T21:35:56.1725623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1725690Z outputs = self.deberta( 2025-08-14T21:35:56.1725935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1726020Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1726284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1726374Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1726581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1726651Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1726903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1726986Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1727232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1727329Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1727582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1727750Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1727762Z 2025-08-14T21:35:56.1727856Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1728034Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1728102Z return mod(**inputs) 2025-08-14T21:35:56.1728355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1728417Z outputs = self.deberta( 2025-08-14T21:35:56.1728671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1728736Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1728993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1729072Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1729275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1729352Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1729599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1729684Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1729939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1730010Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1730270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1730441Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1730728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1730854Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1730858Z 2025-08-14T21:35:56.1730950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1731138Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1731196Z return mod(**inputs) 2025-08-14T21:35:56.1731451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1731521Z outputs = self.deberta( 2025-08-14T21:35:56.1731785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1731857Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1732135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1732215Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1732424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1732495Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1732742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1732856Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1733107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1733186Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1733437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1733636Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1733639Z 2025-08-14T21:35:56.1733743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1733926Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1733997Z return mod(**inputs) 2025-08-14T21:35:56.1734252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1734320Z outputs = self.deberta( 2025-08-14T21:35:56.1734580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1734647Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1734899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1734988Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1735192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1735274Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1735524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1735612Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1735872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1735945Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1736203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1736398Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1736402Z 2025-08-14T21:35:56.1736498Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1736686Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1736748Z return mod(**inputs) 2025-08-14T21:35:56.1737002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1737076Z outputs = self.deberta( 2025-08-14T21:35:56.1737327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1737415Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1737678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1737769Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1737978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1738049Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1738304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1738388Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1738634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1738726Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1738973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1739147Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1739158Z 2025-08-14T21:35:56.1739251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1739430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1739494Z return mod(**inputs) 2025-08-14T21:35:56.1739745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1739805Z outputs = self.deberta( 2025-08-14T21:35:56.1740060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1740126Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1740380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1740458Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1740658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1740734Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1740978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1741060Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1741311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1741382Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1741635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1741807Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1742090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1742215Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1742219Z 2025-08-14T21:35:56.1742309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1742496Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1742554Z return mod(**inputs) 2025-08-14T21:35:56.1742806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1742876Z outputs = self.deberta( 2025-08-14T21:35:56.1743139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1743227Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1743492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1743568Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1743778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1743850Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1744099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1744202Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1744455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1744530Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1744839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1744913Z context_layer = torch.bmm( 2025-08-14T21:35:56.1744917Z 2025-08-14T21:35:56.1745021Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1745200Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1745269Z return mod(**inputs) 2025-08-14T21:35:56.1745520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1745589Z outputs = self.deberta( 2025-08-14T21:35:56.1745848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1745913Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1746163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1746249Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1746449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1746529Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1746775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1746858Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1747113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1747184Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1747439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1747612Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1747617Z 2025-08-14T21:35:56.1747709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1747896Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1747955Z return mod(**inputs) 2025-08-14T21:35:56.1748205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1748274Z outputs = self.deberta( 2025-08-14T21:35:56.1748521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1748592Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1748854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1748970Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1749181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1749253Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1749507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1749589Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1749838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1749967Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1750220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1750295Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1750308Z 2025-08-14T21:35:56.1750400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1750585Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1750649Z return mod(**inputs) 2025-08-14T21:35:56.1750907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1750967Z outputs = self.deberta( 2025-08-14T21:35:56.1751225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1751289Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1751545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1751621Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1751827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1751905Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1752151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1752258Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1752513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1752586Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1752590Z 2025-08-14T21:35:56.1752689Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1752872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1752930Z return mod(**inputs) 2025-08-14T21:35:56.1753191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1753252Z outputs = self.deberta( 2025-08-14T21:35:56.1753507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1753570Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1753819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1753902Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1754106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1754190Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1754450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1754588Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1754842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1754944Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1755136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1755208Z return self.act(input) 2025-08-14T21:35:56.1755211Z 2025-08-14T21:35:56.1755301Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1755505Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1755564Z return mod(**inputs) 2025-08-14T21:35:56.1755815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1755886Z outputs = self.deberta( 2025-08-14T21:35:56.1756135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1756200Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1756456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1756533Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1756740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1756813Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1757063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1757190Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1757439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1757523Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1757526Z 2025-08-14T21:35:56.1757619Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1757801Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1757868Z return mod(**inputs) 2025-08-14T21:35:56.1758122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1758185Z outputs = self.deberta( 2025-08-14T21:35:56.1758441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1758505Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1758762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1758839Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1759041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1759119Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1759365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1759456Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1759705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1759790Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1760045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1760245Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1760249Z 2025-08-14T21:35:56.1760349Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1760530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1760588Z return mod(**inputs) 2025-08-14T21:35:56.1760844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1760904Z outputs = self.deberta( 2025-08-14T21:35:56.1761170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1761243Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1761491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1761574Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1761774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1761843Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1762095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1762179Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1762426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1762507Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1762754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1762928Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1762933Z 2025-08-14T21:35:56.1763028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1763210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1763276Z return mod(**inputs) 2025-08-14T21:35:56.1763529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1763598Z outputs = self.deberta( 2025-08-14T21:35:56.1763846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1763911Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1764165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1764242Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1764444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1764523Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1764768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1764857Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1765101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1765171Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1765439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1765610Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1765930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1766050Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1766054Z 2025-08-14T21:35:56.1766146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1766332Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1766391Z return mod(**inputs) 2025-08-14T21:35:56.1766646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1766723Z outputs = self.deberta( 2025-08-14T21:35:56.1766971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1767042Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1767293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1767370Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1767577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1767649Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1767904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1767989Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1768238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1768314Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1768560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1768768Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1768771Z 2025-08-14T21:35:56.1768864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1769044Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1769112Z return mod(**inputs) 2025-08-14T21:35:56.1769363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1769425Z outputs = self.deberta( 2025-08-14T21:35:56.1769683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1769748Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1770007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1770087Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1770289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1770368Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1770615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1770707Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1770956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1771040Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1771319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1771539Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1771543Z 2025-08-14T21:35:56.1771643Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1771825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1771884Z return mod(**inputs) 2025-08-14T21:35:56.1772146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1772224Z outputs = self.deberta( 2025-08-14T21:35:56.1772474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1772549Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1772798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1772882Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1773084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1773155Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1773410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1773492Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1773744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1773816Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1774063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1774247Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1774251Z 2025-08-14T21:35:56.1774344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1774526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1774593Z return mod(**inputs) 2025-08-14T21:35:56.1774845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1774913Z outputs = self.deberta( 2025-08-14T21:35:56.1775162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1775227Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1775485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1775563Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1775771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1775842Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1776089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1776177Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1776424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1776494Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1776767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1776957Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1777260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1777380Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1777383Z 2025-08-14T21:35:56.1777476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1777669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1777729Z return mod(**inputs) 2025-08-14T21:35:56.1777987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1778065Z outputs = self.deberta( 2025-08-14T21:35:56.1778316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1778391Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1778643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1778728Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1778932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1779004Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1779258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1779345Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1779595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1779673Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1779922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1779996Z context_layer = torch.bmm( 2025-08-14T21:35:56.1779999Z 2025-08-14T21:35:56.1780092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1780274Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1780341Z return mod(**inputs) 2025-08-14T21:35:56.1780593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1780656Z outputs = self.deberta( 2025-08-14T21:35:56.1780911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1780977Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1781236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1781315Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1781517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1781595Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1781840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1781931Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1782177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1782249Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1782519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1782726Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1782729Z 2025-08-14T21:35:56.1782830Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1783010Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1783068Z return mod(**inputs) 2025-08-14T21:35:56.1783327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1783387Z outputs = self.deberta( 2025-08-14T21:35:56.1783652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1783726Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1783974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1784057Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1784259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1784330Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1784687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1784822Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1785095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1785207Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1785463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1785551Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1785557Z 2025-08-14T21:35:56.1785655Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1785847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1785920Z return mod(**inputs) 2025-08-14T21:35:56.1786197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1786266Z outputs = self.deberta( 2025-08-14T21:35:56.1786525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1786604Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1786865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1786942Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1787159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1787234Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1787493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1787614Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1787877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1787953Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1787958Z 2025-08-14T21:35:56.1788060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1788279Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1788348Z return mod(**inputs) 2025-08-14T21:35:56.1788636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1788723Z outputs = self.deberta( 2025-08-14T21:35:56.1788988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1789054Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1789319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1789395Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1789626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1789714Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1789973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1790086Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1790348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1790451Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1790653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1790718Z return self.act(input) 2025-08-14T21:35:56.1790721Z 2025-08-14T21:35:56.1790817Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1791013Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1791074Z return mod(**inputs) 2025-08-14T21:35:56.1791340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1791405Z outputs = self.deberta( 2025-08-14T21:35:56.1791655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1791728Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1791984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1792061Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1792277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1792351Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1792612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1792734Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1792992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1793076Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1793079Z 2025-08-14T21:35:56.1793174Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1793367Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1793428Z return mod(**inputs) 2025-08-14T21:35:56.1793690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1793761Z outputs = self.deberta( 2025-08-14T21:35:56.1794041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1794111Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1794391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1794485Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1794696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1794767Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1795019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1795112Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1795383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1795454Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1795717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1795894Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1795898Z 2025-08-14T21:35:56.1796001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1796185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1796242Z return mod(**inputs) 2025-08-14T21:35:56.1796509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1796569Z outputs = self.deberta( 2025-08-14T21:35:56.1796831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1796899Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1797152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1797236Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1797442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1797519Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1797770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1797855Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1798111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1798182Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1798441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1798614Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1798618Z 2025-08-14T21:35:56.1798711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1798896Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1798955Z return mod(**inputs) 2025-08-14T21:35:56.1799204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1799272Z outputs = self.deberta( 2025-08-14T21:35:56.1799516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1799589Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1799848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1799939Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1800160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1800231Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1800477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1800569Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1800813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1800905Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1801158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1801327Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1801625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1801746Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1801749Z 2025-08-14T21:35:56.1801848Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1802031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1802089Z return mod(**inputs) 2025-08-14T21:35:56.1802355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1802419Z outputs = self.deberta( 2025-08-14T21:35:56.1802677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1802743Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1802998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1803080Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1803283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1803353Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1803611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1803696Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1803954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1804022Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1804275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1804480Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1804483Z 2025-08-14T21:35:56.1804582Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1804769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1804827Z return mod(**inputs) 2025-08-14T21:35:56.1805080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1805151Z outputs = self.deberta( 2025-08-14T21:35:56.1805418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1805486Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1805764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1805855Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1806067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1806137Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1806381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1806471Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1806729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1806803Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1807048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1807239Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1807242Z 2025-08-14T21:35:56.1807339Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1807517Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1807576Z return mod(**inputs) 2025-08-14T21:35:56.1807833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1807895Z outputs = self.deberta( 2025-08-14T21:35:56.1808149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1808214Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1808458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1808544Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1808743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1808818Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1809063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1809144Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1809394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1809463Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1809708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1809887Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1809891Z 2025-08-14T21:35:56.1809983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1810167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1810225Z return mod(**inputs) 2025-08-14T21:35:56.1810474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1810542Z outputs = self.deberta( 2025-08-14T21:35:56.1810786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1810857Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1811116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1811221Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1811430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1811500Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1811748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1811839Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1812088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1812189Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1812437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1812611Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1812902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1813019Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1813023Z 2025-08-14T21:35:56.1813121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1813302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1813361Z return mod(**inputs) 2025-08-14T21:35:56.1813617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1813680Z outputs = self.deberta( 2025-08-14T21:35:56.1813933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1813998Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1814244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1814326Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1814524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1814594Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1814845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1814927Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1815181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1815248Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1815497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1815569Z context_layer = torch.bmm( 2025-08-14T21:35:56.1815572Z 2025-08-14T21:35:56.1815662Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1815849Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1815907Z return mod(**inputs) 2025-08-14T21:35:56.1816154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1816223Z outputs = self.deberta( 2025-08-14T21:35:56.1816484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1816550Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1816821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1816913Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1817118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1817188Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1817433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1817522Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1817767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1817858Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1818103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1818278Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1818282Z 2025-08-14T21:35:56.1818380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1818560Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1818620Z return mod(**inputs) 2025-08-14T21:35:56.1818878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1818940Z outputs = self.deberta( 2025-08-14T21:35:56.1819195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1819260Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1819506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1819593Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1819793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1819872Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1820115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1820196Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1820447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1820555Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1820805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1820888Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1820893Z 2025-08-14T21:35:56.1821028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1821262Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1821349Z return mod(**inputs) 2025-08-14T21:35:56.1821626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1821745Z outputs = self.deberta( 2025-08-14T21:35:56.1822003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1822222Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1822506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1822607Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1822899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1823009Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1823322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1823464Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1823741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1823865Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1823884Z 2025-08-14T21:35:56.1823998Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1824218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1824313Z return mod(**inputs) 2025-08-14T21:35:56.1824601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1824710Z outputs = self.deberta( 2025-08-14T21:35:56.1825077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1825167Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1825449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1825566Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1825836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1825926Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1826196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1826349Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1826608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1826790Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1827005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1827091Z return self.act(input) 2025-08-14T21:35:56.1827095Z 2025-08-14T21:35:56.1827252Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1827459Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1827576Z return mod(**inputs) 2025-08-14T21:35:56.1827861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1827946Z outputs = self.deberta( 2025-08-14T21:35:56.1828238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1828324Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1828591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1828725Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1828956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1829072Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1829356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1829499Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1829804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1829932Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1829935Z 2025-08-14T21:35:56.1830083Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1830286Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1830367Z return mod(**inputs) 2025-08-14T21:35:56.1830674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1830758Z outputs = self.deberta( 2025-08-14T21:35:56.1831106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1831191Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1831462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1831593Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1831814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1831949Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1832227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1832331Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1832636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1832726Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1832995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1833223Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1833227Z 2025-08-14T21:35:56.1833349Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1833581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1833663Z return mod(**inputs) 2025-08-14T21:35:56.1833940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1834039Z outputs = self.deberta( 2025-08-14T21:35:56.1834321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1834447Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1834718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1834818Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1835069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1835149Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1848031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1848208Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1848516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1848601Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1848943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1849154Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1849206Z 2025-08-14T21:35:56.1849324Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1849521Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1849586Z return mod(**inputs) 2025-08-14T21:35:56.1849853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1849921Z outputs = self.deberta( 2025-08-14T21:35:56.1850175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1850280Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1850536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1850631Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1850841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1850914Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1851172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1851256Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1851510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1851583Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1851834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1852018Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1852311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1852441Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1852452Z 2025-08-14T21:35:56.1852553Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1852740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1852810Z return mod(**inputs) 2025-08-14T21:35:56.1853066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1853133Z outputs = self.deberta( 2025-08-14T21:35:56.1853391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1853461Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1853714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1853795Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1854000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1854083Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1854332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1854422Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1854707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1854779Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1855045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1855262Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1855267Z 2025-08-14T21:35:56.1855362Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1855553Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1855613Z return mod(**inputs) 2025-08-14T21:35:56.1855873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1855949Z outputs = self.deberta( 2025-08-14T21:35:56.1856202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1856275Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1856525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1856605Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1856816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1856887Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1857144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1857228Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1857479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1857555Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1857806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1858003Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1858006Z 2025-08-14T21:35:56.1858099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1858283Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1858350Z return mod(**inputs) 2025-08-14T21:35:56.1858605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1858674Z outputs = self.deberta( 2025-08-14T21:35:56.1858925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1858990Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1859250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1859329Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1859532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1859612Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1859863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1859954Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1860205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1860276Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1860545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1860739Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1860755Z 2025-08-14T21:35:56.1860859Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1861041Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1861102Z return mod(**inputs) 2025-08-14T21:35:56.1861361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1861421Z outputs = self.deberta( 2025-08-14T21:35:56.1861683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1861756Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1862002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1862090Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1862292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1862365Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1862618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1862702Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1862954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1863025Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1863273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1863456Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1863740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1863871Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1863874Z 2025-08-14T21:35:56.1863967Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1864150Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1864216Z return mod(**inputs) 2025-08-14T21:35:56.1864468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1864531Z outputs = self.deberta( 2025-08-14T21:35:56.1864912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1864989Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1865246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1865323Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1865524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1865603Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1865850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1865941Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1866217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1866290Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1866564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1866648Z context_layer = torch.bmm( 2025-08-14T21:35:56.1866652Z 2025-08-14T21:35:56.1866746Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1866938Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1866998Z return mod(**inputs) 2025-08-14T21:35:56.1867265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1867344Z outputs = self.deberta( 2025-08-14T21:35:56.1867596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1867670Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1867925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1868006Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1868216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1868290Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1868547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1868632Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1868882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1868960Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1869212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1869398Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1869402Z 2025-08-14T21:35:56.1869494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1869676Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1869745Z return mod(**inputs) 2025-08-14T21:35:56.1869996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1870065Z outputs = self.deberta( 2025-08-14T21:35:56.1870316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1870382Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1870636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1870715Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1870918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1870994Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1871243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1871332Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1871581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1871692Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1871966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1872061Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1872078Z 2025-08-14T21:35:56.1872179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1872361Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1872420Z return mod(**inputs) 2025-08-14T21:35:56.1872680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1872742Z outputs = self.deberta( 2025-08-14T21:35:56.1872990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1873086Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1873332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1873414Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1873617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1873688Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1873941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1874051Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1874303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1874378Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1874383Z 2025-08-14T21:35:56.1874474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1874663Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1874722Z return mod(**inputs) 2025-08-14T21:35:56.1874974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1875047Z outputs = self.deberta( 2025-08-14T21:35:56.1875292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1875365Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1875612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1875689Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1875897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1875969Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1876223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1876332Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1876576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1876684Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1876877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1876939Z return self.act(input) 2025-08-14T21:35:56.1876943Z 2025-08-14T21:35:56.1877043Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1877227Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1877293Z return mod(**inputs) 2025-08-14T21:35:56.1877558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1877650Z outputs = self.deberta( 2025-08-14T21:35:56.1877907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1877972Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1878217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1878301Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1878497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1878594Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1878843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1878964Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1879220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1879297Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1879300Z 2025-08-14T21:35:56.1879399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1879581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1879641Z return mod(**inputs) 2025-08-14T21:35:56.1879897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1879959Z outputs = self.deberta( 2025-08-14T21:35:56.1880214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1880279Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1880528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1880615Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1880815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1880887Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1881142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1881226Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1881482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1881553Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1881798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1881979Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1881983Z 2025-08-14T21:35:56.1882079Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1882259Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1882324Z return mod(**inputs) 2025-08-14T21:35:56.1882574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1882633Z outputs = self.deberta( 2025-08-14T21:35:56.1882889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1882967Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1883237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1883327Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1883526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1883603Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1883852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1883935Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1884191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1884274Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1884529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 237, in forward 2025-08-14T21:35:56.1884855Z key_layer = self.transpose_for_scores(self.key_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1884863Z 2025-08-14T21:35:56.1884962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1885157Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1885220Z return mod(**inputs) 2025-08-14T21:35:56.1885489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1885553Z outputs = self.deberta( 2025-08-14T21:35:56.1885808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1885887Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1886146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1886228Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1886455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1886527Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1886783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1886868Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1887126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1887208Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1887467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 236, in forward 2025-08-14T21:35:56.1887651Z query_layer = self.transpose_for_scores(self.query_proj(query_states), self.num_attention_heads) 2025-08-14T21:35:56.1887950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1888077Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1888080Z 2025-08-14T21:35:56.1888185Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1888372Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1888441Z return mod(**inputs) 2025-08-14T21:35:56.1888700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1888765Z outputs = self.deberta( 2025-08-14T21:35:56.1889071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1889163Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1889438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1889524Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1889731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1889809Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1890065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1890176Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1890443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1890516Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1890781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1890986Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1890990Z 2025-08-14T21:35:56.1891084Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1891278Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1891337Z return mod(**inputs) 2025-08-14T21:35:56.1891601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1891674Z outputs = self.deberta( 2025-08-14T21:35:56.1891931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1892007Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1892266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1892346Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1892560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1892631Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1892891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1892976Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1893233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1893310Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1893567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 248, in forward 2025-08-14T21:35:56.1893767Z attention_scores = torch.bmm(query_layer, key_layer.transpose(-1, -2) / scale.to(dtype=query_layer.dtype)) 2025-08-14T21:35:56.1893778Z 2025-08-14T21:35:56.1893872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1894060Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1894125Z return mod(**inputs) 2025-08-14T21:35:56.1894385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1894449Z outputs = self.deberta( 2025-08-14T21:35:56.1894728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1894796Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1895076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1895182Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1895387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1895469Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1895720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1895805Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1896067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1896155Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1896416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1896595Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1896599Z 2025-08-14T21:35:56.1896695Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1896888Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1896948Z return mod(**inputs) 2025-08-14T21:35:56.1897213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1897273Z outputs = self.deberta( 2025-08-14T21:35:56.1897525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1897596Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1897847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1897935Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1898140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1898214Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1898470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1898554Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1898806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1898884Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1899148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 238, in forward 2025-08-14T21:35:56.1899327Z value_layer = self.transpose_for_scores(self.value_proj(hidden_states), self.num_attention_heads) 2025-08-14T21:35:56.1899610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 194, in transpose_for_scores 2025-08-14T21:35:56.1899729Z return x.permute(0, 2, 1, 3).contiguous().view(-1, x.size(1), x.size(-1)) 2025-08-14T21:35:56.1899733Z 2025-08-14T21:35:56.1899832Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1900012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1900076Z return mod(**inputs) 2025-08-14T21:35:56.1900328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1900388Z outputs = self.deberta( 2025-08-14T21:35:56.1900654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1900751Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1901000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1901086Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1901288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1901367Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1901614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1901714Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1901973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1902042Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1902297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 268, in forward 2025-08-14T21:35:56.1902364Z context_layer = torch.bmm( 2025-08-14T21:35:56.1902367Z 2025-08-14T21:35:56.1902460Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1902648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1902708Z return mod(**inputs) 2025-08-14T21:35:56.1902959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1903029Z outputs = self.deberta( 2025-08-14T21:35:56.1903279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1903351Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1903600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1903679Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1903887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1903958Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1904208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1904289Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1904536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 371, in forward 2025-08-14T21:35:56.1904614Z self_output, att_matrix = self.self( 2025-08-14T21:35:56.1904957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 272, in forward 2025-08-14T21:35:56.1905140Z context_layer.view(-1, self.num_attention_heads, context_layer.size(-2), context_layer.size(-1)) 2025-08-14T21:35:56.1905153Z 2025-08-14T21:35:56.1905249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1905429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1905496Z return mod(**inputs) 2025-08-14T21:35:56.1905748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1905809Z outputs = self.deberta( 2025-08-14T21:35:56.1906066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1906150Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1906406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1906515Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1906721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1906800Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1907049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 438, in forward 2025-08-14T21:35:56.1907132Z attention_output, att_matrix = self.attention( 2025-08-14T21:35:56.1907391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 381, in forward 2025-08-14T21:35:56.1907511Z attention_output = self.output(self_output, query_states) 2025-08-14T21:35:56.1907764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 52, in forward 2025-08-14T21:35:56.1907841Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1907846Z 2025-08-14T21:35:56.1907937Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1908125Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1908182Z return mod(**inputs) 2025-08-14T21:35:56.1908439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1908500Z outputs = self.deberta( 2025-08-14T21:35:56.1908745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1908820Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1909067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1909145Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1909356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1909424Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1909675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1909783Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1910027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 400, in forward 2025-08-14T21:35:56.1910112Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1910115Z 2025-08-14T21:35:56.1910205Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1910393Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1910452Z return mod(**inputs) 2025-08-14T21:35:56.1910701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1910770Z outputs = self.deberta( 2025-08-14T21:35:56.1911017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1911081Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1911333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1911409Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1911617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1911704Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1911966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 446, in forward 2025-08-14T21:35:56.1912095Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:35:56.1912344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 401, in forward 2025-08-14T21:35:56.1912451Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:35:56.1912645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:35:56.1912707Z return self.act(input) 2025-08-14T21:35:56.1912710Z 2025-08-14T21:35:56.1912806Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1913001Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1913061Z return mod(**inputs) 2025-08-14T21:35:56.1913323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1231, in forward 2025-08-14T21:35:56.1913385Z outputs = self.deberta( 2025-08-14T21:35:56.1913638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 786, in forward 2025-08-14T21:35:56.1913702Z encoder_outputs = self.encoder( 2025-08-14T21:35:56.1913948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 659, in forward 2025-08-14T21:35:56.1914031Z output_states, attn_weights = layer_module( 2025-08-14T21:35:56.1914232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:35:56.1914312Z return super().__call__(*args, **kwargs) 2025-08-14T21:35:56.1914560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 447, in forward 2025-08-14T21:35:56.1914680Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:35:56.1914939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 415, in forward 2025-08-14T21:35:56.1915016Z hidden_states = self.dense(hidden_states) 2025-08-14T21:35:56.1915019Z 2025-08-14T21:35:56.1915120Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1915302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1915362Z return mod(**inputs) 2025-08-14T21:35:56.1915620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1244, in forward 2025-08-14T21:35:56.1915699Z logits = self.qa_outputs(sequence_output) 2025-08-14T21:35:56.1915702Z 2025-08-14T21:35:56.1915798Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1915984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1916045Z return mod(**inputs) 2025-08-14T21:35:56.1916306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1262, in forward 2025-08-14T21:35:56.1916403Z start_loss = loss_fct(start_logits, start_positions) 2025-08-14T21:35:56.1916407Z 2025-08-14T21:35:56.1916499Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:35:56.1916686Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:35:56.1916746Z return mod(**inputs) 2025-08-14T21:35:56.1916998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/deberta_v2/modeling_deberta_v2.py", line 1263, in forward 2025-08-14T21:35:56.1917090Z end_loss = loss_fct(end_logits, end_positions) 2025-08-14T21:35:56.1917093Z 2025-08-14T21:36:06.7810777Z Compilation time (from dynamo_timed): 22.764702232 2025-08-14T21:36:06.7812452Z pass 2025-08-14T21:36:06.7815598Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:06.7820274Z TIMING: _recursive_pre_grad_passes:0.0117 _recursive_joint_graph_passes:0.99846 _recursive_post_grad_passes:0.27931 async_compile.wait:0.50654 code_gen:9.63965 inductor_compile:12.2945 backend_compile:18.20616 gc:0.00028 entire_frame_compile:22.7647 total_wall_time:22.7647 2025-08-14T21:36:06.7821924Z STATS: call_* op count: 1087 | FakeTensorMode.__torch_dispatch__:30540 | FakeTensor.__torch_dispatch__:11359 | ProxyTorchDispatchMode.__torch_dispatch__:11524 2025-08-14T21:36:11.3540667Z Dynamo produced 1 graphs covering 1087 ops with 0 graph breaks (0 unique) 2025-08-14T21:36:11.3541944Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:36:11.3542793Z from pkg_resources import resource_filename 2025-08-14T21:36:11.8891718Z 2025-08-14T21:36:12.5496276Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:36:12.5498299Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:36:12.5501836Z cpu eval DistilBertForMaskedLM 2025-08-14T21:36:12.8277786Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:12.8735696Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:12.9373269Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:17.2457245Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2457654Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2462225Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2466259Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2468388Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2473031Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2475169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2479802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2481634Z return mod(**inputs) 2025-08-14T21:36:17.2482178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2485499Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2486067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2488533Z return self.transformer( 2025-08-14T21:36:17.2489012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2489481Z layer_outputs = layer_module( 2025-08-14T21:36:17.2492236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2492648Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2498204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2498994Z sa_output = self.attention( 2025-08-14T21:36:17.2499401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:17.2499982Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:17.2500180Z 2025-08-14T21:36:17.2500293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2500955Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2501281Z return mod(**inputs) 2025-08-14T21:36:17.2501722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2502178Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2502555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2502934Z return self.transformer( 2025-08-14T21:36:17.2503305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2503676Z layer_outputs = layer_module( 2025-08-14T21:36:17.2504100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2504441Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2504936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2505326Z sa_output = self.attention( 2025-08-14T21:36:17.2505703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:17.2506131Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2506295Z 2025-08-14T21:36:17.2506401Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2506731Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2507046Z return mod(**inputs) 2025-08-14T21:36:17.2507417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2507796Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2508177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2508556Z return self.transformer( 2025-08-14T21:36:17.2508919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2509293Z layer_outputs = layer_module( 2025-08-14T21:36:17.2509615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2509954Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2510329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2510708Z sa_output = self.attention( 2025-08-14T21:36:17.2511077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:17.2511504Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2511669Z 2025-08-14T21:36:17.2511748Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2511972Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2512304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2512605Z return mod(**inputs) 2025-08-14T21:36:17.2512956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2513341Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2513718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2514098Z return self.transformer( 2025-08-14T21:36:17.2514494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2514880Z layer_outputs = layer_module( 2025-08-14T21:36:17.2515248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2515578Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2515957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2516335Z sa_output = self.attention( 2025-08-14T21:36:17.2516701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:17.2517122Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:17.2517326Z 2025-08-14T21:36:17.2517424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2517753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2518044Z return mod(**inputs) 2025-08-14T21:36:17.2518405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2518792Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2519167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2519540Z return self.transformer( 2025-08-14T21:36:17.2519905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2520280Z layer_outputs = layer_module( 2025-08-14T21:36:17.2520596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2520931Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2521316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2521696Z sa_output = self.attention( 2025-08-14T21:36:17.2522053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:17.2522442Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:17.2522576Z 2025-08-14T21:36:17.2522670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2522999Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2523290Z return mod(**inputs) 2025-08-14T21:36:17.2523648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2524030Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2524399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2524779Z return self.transformer( 2025-08-14T21:36:17.2525147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2525525Z layer_outputs = layer_module( 2025-08-14T21:36:17.2525838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2526173Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2526553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2526969Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2527393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2527898Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2528422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2528781Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2529162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:17.2529540Z x = self.lin1(input) 2025-08-14T21:36:17.2529638Z 2025-08-14T21:36:17.2529742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2530064Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2530384Z return mod(**inputs) 2025-08-14T21:36:17.2530749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2531134Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2531509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2531897Z return self.transformer( 2025-08-14T21:36:17.2532265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2532638Z layer_outputs = layer_module( 2025-08-14T21:36:17.2532959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2533297Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2533682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2534097Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2534515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2535019Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2535503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2535863Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2536247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:17.2536632Z x = self.activation(x) 2025-08-14T21:36:17.2536932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:17.2537254Z return self.act(input) 2025-08-14T21:36:17.2537362Z 2025-08-14T21:36:17.2537460Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2537792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2538090Z return mod(**inputs) 2025-08-14T21:36:17.2538450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2538832Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2539200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2539583Z return self.transformer( 2025-08-14T21:36:17.2539950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2540330Z layer_outputs = layer_module( 2025-08-14T21:36:17.2540663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2541001Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2541401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2541830Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2542233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2542724Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2543203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2543598Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2543971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:17.2544343Z x = self.lin2(x) 2025-08-14T21:36:17.2544435Z 2025-08-14T21:36:17.2544538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2544966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2545278Z return mod(**inputs) 2025-08-14T21:36:17.2545638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2546022Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2546394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2546777Z return self.transformer( 2025-08-14T21:36:17.2547149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2547522Z layer_outputs = layer_module( 2025-08-14T21:36:17.2547847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2548189Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2548575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2548948Z sa_output = self.attention( 2025-08-14T21:36:17.2549323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:17.2549752Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:17.2549918Z 2025-08-14T21:36:17.2550021Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2550348Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2550647Z return mod(**inputs) 2025-08-14T21:36:17.2551010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2551387Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2551765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2552147Z return self.transformer( 2025-08-14T21:36:17.2552514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2552884Z layer_outputs = layer_module( 2025-08-14T21:36:17.2553205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2553542Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2553948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2554322Z sa_output = self.attention( 2025-08-14T21:36:17.2554703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:17.2555142Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2555306Z 2025-08-14T21:36:17.2555406Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2555737Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2556038Z return mod(**inputs) 2025-08-14T21:36:17.2556397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2556787Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2557161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2557539Z return self.transformer( 2025-08-14T21:36:17.2557893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2558271Z layer_outputs = layer_module( 2025-08-14T21:36:17.2558587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2558918Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2559288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2559660Z sa_output = self.attention( 2025-08-14T21:36:17.2560026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:17.2560447Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2560613Z 2025-08-14T21:36:17.2560688Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2560910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2561239Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2561529Z return mod(**inputs) 2025-08-14T21:36:17.2561884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2562264Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2562637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2563008Z return self.transformer( 2025-08-14T21:36:17.2563373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2563750Z layer_outputs = layer_module( 2025-08-14T21:36:17.2564062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2564400Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2564780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2565153Z sa_output = self.attention( 2025-08-14T21:36:17.2565508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:17.2565950Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:17.2566125Z 2025-08-14T21:36:17.2566221Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2566549Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2566837Z return mod(**inputs) 2025-08-14T21:36:17.2567209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2567619Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2567989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2568365Z return self.transformer( 2025-08-14T21:36:17.2568731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2569105Z layer_outputs = layer_module( 2025-08-14T21:36:17.2569418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2569774Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2570162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2570542Z sa_output = self.attention( 2025-08-14T21:36:17.2570905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:17.2571291Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:17.2571417Z 2025-08-14T21:36:17.2571521Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2571842Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2572140Z return mod(**inputs) 2025-08-14T21:36:17.2572496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2572874Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2573240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2573615Z return self.transformer( 2025-08-14T21:36:17.2573985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2574353Z layer_outputs = layer_module( 2025-08-14T21:36:17.2574668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2574996Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2575376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2575781Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2576193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2576689Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2577166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2577527Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2577908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:17.2578286Z x = self.lin1(input) 2025-08-14T21:36:17.2578383Z 2025-08-14T21:36:17.2578485Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2578811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2579113Z return mod(**inputs) 2025-08-14T21:36:17.2579471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2579848Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2580242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2580636Z return self.transformer( 2025-08-14T21:36:17.2581017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2581388Z layer_outputs = layer_module( 2025-08-14T21:36:17.2581710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2582045Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2582421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2582854Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2583268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2583761Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2584227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2584762Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2585199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:17.2585583Z x = self.activation(x) 2025-08-14T21:36:17.2585876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:17.2586191Z return self.act(input) 2025-08-14T21:36:17.2586293Z 2025-08-14T21:36:17.2586396Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2586717Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2587020Z return mod(**inputs) 2025-08-14T21:36:17.2587379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2587761Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2588127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2588506Z return self.transformer( 2025-08-14T21:36:17.2588874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2589250Z layer_outputs = layer_module( 2025-08-14T21:36:17.2589559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2589888Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2590265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2590668Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2591074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2591562Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2592027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2592378Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2592752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:17.2593126Z x = self.lin2(x) 2025-08-14T21:36:17.2593216Z 2025-08-14T21:36:17.2593378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2593703Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2594067Z return mod(**inputs) 2025-08-14T21:36:17.2594426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2594798Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2595173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2595550Z return self.transformer( 2025-08-14T21:36:17.2595915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2596314Z layer_outputs = layer_module( 2025-08-14T21:36:17.2596633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2596966Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2597339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2597718Z sa_output = self.attention( 2025-08-14T21:36:17.2598083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:17.2598505Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:17.2598670Z 2025-08-14T21:36:17.2598763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2599095Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2599395Z return mod(**inputs) 2025-08-14T21:36:17.2599750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2600120Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2600495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2600873Z return self.transformer( 2025-08-14T21:36:17.2601228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2601605Z layer_outputs = layer_module( 2025-08-14T21:36:17.2601925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2602260Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2602633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2603012Z sa_output = self.attention( 2025-08-14T21:36:17.2603380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:17.2603799Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2603960Z 2025-08-14T21:36:17.2604055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2604385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2604681Z return mod(**inputs) 2025-08-14T21:36:17.2605027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2605405Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2605778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2606158Z return self.transformer( 2025-08-14T21:36:17.2606531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2606925Z layer_outputs = layer_module( 2025-08-14T21:36:17.2607262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2607594Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2607969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2608345Z sa_output = self.attention( 2025-08-14T21:36:17.2608710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:17.2609152Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2609325Z 2025-08-14T21:36:17.2609398Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2609620Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2609946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2610241Z return mod(**inputs) 2025-08-14T21:36:17.2610601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2610980Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2611351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2611730Z return self.transformer( 2025-08-14T21:36:17.2612101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2612482Z layer_outputs = layer_module( 2025-08-14T21:36:17.2612796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2613131Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2613517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2613894Z sa_output = self.attention( 2025-08-14T21:36:17.2614253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:17.2614691Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:17.2614861Z 2025-08-14T21:36:17.2614965Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2615289Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2615598Z return mod(**inputs) 2025-08-14T21:36:17.2615961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2616340Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2616709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2617088Z return self.transformer( 2025-08-14T21:36:17.2617453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2617830Z layer_outputs = layer_module( 2025-08-14T21:36:17.2618140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2618475Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2618857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2619226Z sa_output = self.attention( 2025-08-14T21:36:17.2619608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:17.2620019Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:17.2620163Z 2025-08-14T21:36:17.2620267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2620588Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2620891Z return mod(**inputs) 2025-08-14T21:36:17.2621248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2621618Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2621993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2622384Z return self.transformer( 2025-08-14T21:36:17.2622748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2623116Z layer_outputs = layer_module( 2025-08-14T21:36:17.2623437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2623770Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2624149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2624551Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2625029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2625535Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2626017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2626390Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2626777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:17.2627158Z x = self.lin1(input) 2025-08-14T21:36:17.2627257Z 2025-08-14T21:36:17.2627353Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2627687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2627989Z return mod(**inputs) 2025-08-14T21:36:17.2628352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2628732Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2629114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2629499Z return self.transformer( 2025-08-14T21:36:17.2629861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2630242Z layer_outputs = layer_module( 2025-08-14T21:36:17.2630562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2630897Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2631271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2631777Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2632190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2632705Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2633196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2633591Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2633981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:17.2634364Z x = self.activation(x) 2025-08-14T21:36:17.2634660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:17.2634973Z return self.act(input) 2025-08-14T21:36:17.2635074Z 2025-08-14T21:36:17.2635176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2635516Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2635816Z return mod(**inputs) 2025-08-14T21:36:17.2636172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2636555Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2636926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2637303Z return self.transformer( 2025-08-14T21:36:17.2637665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2638032Z layer_outputs = layer_module( 2025-08-14T21:36:17.2638353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2638687Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2639069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2639473Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2639883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2640373Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2640842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2641198Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2641577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:17.2641949Z x = self.lin2(x) 2025-08-14T21:36:17.2642042Z 2025-08-14T21:36:17.2642136Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2642470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2642767Z return mod(**inputs) 2025-08-14T21:36:17.2643117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2643488Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2643859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2644231Z return self.transformer( 2025-08-14T21:36:17.2644588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2644954Z layer_outputs = layer_module( 2025-08-14T21:36:17.2645273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2645603Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2645987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2646393Z sa_output = self.attention( 2025-08-14T21:36:17.2646773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:17.2647197Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:17.2647363Z 2025-08-14T21:36:17.2647458Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2647787Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2648088Z return mod(**inputs) 2025-08-14T21:36:17.2648438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2648836Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2649213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2649594Z return self.transformer( 2025-08-14T21:36:17.2649953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2650327Z layer_outputs = layer_module( 2025-08-14T21:36:17.2650647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2650977Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2651351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2651728Z sa_output = self.attention( 2025-08-14T21:36:17.2652096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:17.2652507Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2652674Z 2025-08-14T21:36:17.2652772Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2653103Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2653402Z return mod(**inputs) 2025-08-14T21:36:17.2653750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2654126Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2654499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2654878Z return self.transformer( 2025-08-14T21:36:17.2655235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2655612Z layer_outputs = layer_module( 2025-08-14T21:36:17.2655931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2656256Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2656639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2657013Z sa_output = self.attention( 2025-08-14T21:36:17.2657378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:17.2657793Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2657962Z 2025-08-14T21:36:17.2658038Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2658258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2658594Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2658894Z return mod(**inputs) 2025-08-14T21:36:17.2659438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2659847Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2660218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2660599Z return self.transformer( 2025-08-14T21:36:17.2660967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2661348Z layer_outputs = layer_module( 2025-08-14T21:36:17.2661665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2662016Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2662400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2662771Z sa_output = self.attention( 2025-08-14T21:36:17.2663140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:17.2663570Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:17.2663740Z 2025-08-14T21:36:17.2663845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2664167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2664468Z return mod(**inputs) 2025-08-14T21:36:17.2664876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2665270Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2665645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2666032Z return self.transformer( 2025-08-14T21:36:17.2666409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2666782Z layer_outputs = layer_module( 2025-08-14T21:36:17.2667107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2667443Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2667830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2668207Z sa_output = self.attention( 2025-08-14T21:36:17.2668587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:17.2668981Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:17.2669110Z 2025-08-14T21:36:17.2669209Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2669547Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2669847Z return mod(**inputs) 2025-08-14T21:36:17.2670208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2670600Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2670976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2671358Z return self.transformer( 2025-08-14T21:36:17.2671722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2672118Z layer_outputs = layer_module( 2025-08-14T21:36:17.2672446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2672811Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2673186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2673596Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2674006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2674498Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2674994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2675362Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2675754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:17.2676133Z x = self.lin1(input) 2025-08-14T21:36:17.2676230Z 2025-08-14T21:36:17.2676328Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2676660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2676961Z return mod(**inputs) 2025-08-14T21:36:17.2677313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2677696Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2678072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2678451Z return self.transformer( 2025-08-14T21:36:17.2678812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2679194Z layer_outputs = layer_module( 2025-08-14T21:36:17.2679517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2679854Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2680235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2680652Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2681062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2681550Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2682034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2682398Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2682782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:17.2683161Z x = self.activation(x) 2025-08-14T21:36:17.2683469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:17.2683783Z return self.act(input) 2025-08-14T21:36:17.2683886Z 2025-08-14T21:36:17.2683991Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2684319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2684796Z return mod(**inputs) 2025-08-14T21:36:17.2685159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2685584Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2685991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2686395Z return self.transformer( 2025-08-14T21:36:17.2686762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2687133Z layer_outputs = layer_module( 2025-08-14T21:36:17.2687459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2687796Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2688173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2688607Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2689016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2689513Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2689981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2690342Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2690722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:17.2691096Z x = self.lin2(x) 2025-08-14T21:36:17.2691187Z 2025-08-14T21:36:17.2691283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2691614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2691916Z return mod(**inputs) 2025-08-14T21:36:17.2692269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2692651Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2693026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2693400Z return self.transformer( 2025-08-14T21:36:17.2693757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2694134Z layer_outputs = layer_module( 2025-08-14T21:36:17.2694453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2694785Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2695160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2695538Z sa_output = self.attention( 2025-08-14T21:36:17.2695909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:17.2696328Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:17.2696502Z 2025-08-14T21:36:17.2696599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2696929Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2697228Z return mod(**inputs) 2025-08-14T21:36:17.2697577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2697961Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2698334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2698741Z return self.transformer( 2025-08-14T21:36:17.2699116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2699511Z layer_outputs = layer_module( 2025-08-14T21:36:17.2699828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2700154Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2700536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2700913Z sa_output = self.attention( 2025-08-14T21:36:17.2701287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:17.2701732Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2701903Z 2025-08-14T21:36:17.2702002Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2702337Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2702639Z return mod(**inputs) 2025-08-14T21:36:17.2702989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2703370Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2703745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2704113Z return self.transformer( 2025-08-14T21:36:17.2704482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2704912Z layer_outputs = layer_module( 2025-08-14T21:36:17.2705237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2705565Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2705947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2706332Z sa_output = self.attention( 2025-08-14T21:36:17.2706692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:17.2707116Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2707286Z 2025-08-14T21:36:17.2707358Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2707578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2707903Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2708204Z return mod(**inputs) 2025-08-14T21:36:17.2708564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2708946Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2709315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2709690Z return self.transformer( 2025-08-14T21:36:17.2710050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2710418Z layer_outputs = layer_module( 2025-08-14T21:36:17.2710736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2711069Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2711467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2711846Z sa_output = self.attention( 2025-08-14T21:36:17.2712229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:17.2712711Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:17.2712884Z 2025-08-14T21:36:17.2712987Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2713311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2713613Z return mod(**inputs) 2025-08-14T21:36:17.2713973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2714347Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2714742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2715122Z return self.transformer( 2025-08-14T21:36:17.2715490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2715863Z layer_outputs = layer_module( 2025-08-14T21:36:17.2716183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2716518Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2716890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2717268Z sa_output = self.attention( 2025-08-14T21:36:17.2717636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:17.2718027Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:17.2718152Z 2025-08-14T21:36:17.2718248Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2718581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2718885Z return mod(**inputs) 2025-08-14T21:36:17.2719242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2719614Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2719988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2720366Z return self.transformer( 2025-08-14T21:36:17.2720722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2721104Z layer_outputs = layer_module( 2025-08-14T21:36:17.2721424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2721756Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2722132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2722549Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2722955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2723446Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2723912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2724278Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2724674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:17.2725049Z x = self.lin1(input) 2025-08-14T21:36:17.2725155Z 2025-08-14T21:36:17.2725266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2725614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2725915Z return mod(**inputs) 2025-08-14T21:36:17.2726262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2726645Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2727022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2727394Z return self.transformer( 2025-08-14T21:36:17.2727774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2728152Z layer_outputs = layer_module( 2025-08-14T21:36:17.2728470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2728793Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2729177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2729590Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2730001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2730486Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2730962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2731328Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2731709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:17.2732085Z x = self.activation(x) 2025-08-14T21:36:17.2732388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:17.2732704Z return self.act(input) 2025-08-14T21:36:17.2732804Z 2025-08-14T21:36:17.2732901Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2733235Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2733537Z return mod(**inputs) 2025-08-14T21:36:17.2733894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2734272Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2734653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2735034Z return self.transformer( 2025-08-14T21:36:17.2735393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2735771Z layer_outputs = layer_module( 2025-08-14T21:36:17.2736092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2736425Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2736796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2737207Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2737617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2738127Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2738610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2738994Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2739387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:17.2739758Z x = self.lin2(x) 2025-08-14T21:36:17.2739862Z 2025-08-14T21:36:17.2739961Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2740300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2740630Z return mod(**inputs) 2025-08-14T21:36:17.2740981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2741358Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2741736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2742111Z return self.transformer( 2025-08-14T21:36:17.2742467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2742842Z layer_outputs = layer_module( 2025-08-14T21:36:17.2743161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2743487Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2743868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2744244Z sa_output = self.attention( 2025-08-14T21:36:17.2744608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:17.2745103Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:17.2745282Z 2025-08-14T21:36:17.2745381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2745712Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2746012Z return mod(**inputs) 2025-08-14T21:36:17.2746365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2746747Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2747125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2747498Z return self.transformer( 2025-08-14T21:36:17.2747869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2748248Z layer_outputs = layer_module( 2025-08-14T21:36:17.2748577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2748906Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2749294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2749669Z sa_output = self.attention( 2025-08-14T21:36:17.2750028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:17.2750448Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2750618Z 2025-08-14T21:36:17.2750716Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2751066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2751359Z return mod(**inputs) 2025-08-14T21:36:17.2751730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2752128Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2752499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2752867Z return self.transformer( 2025-08-14T21:36:17.2753229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2753604Z layer_outputs = layer_module( 2025-08-14T21:36:17.2753939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2754277Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2754662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2755043Z sa_output = self.attention( 2025-08-14T21:36:17.2755404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:17.2755829Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:17.2755993Z 2025-08-14T21:36:17.2756074Z cudagraph partition due to non gpu ops 2025-08-14T21:36:17.2756297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2756621Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2756921Z return mod(**inputs) 2025-08-14T21:36:17.2757284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2757665Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2758043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2758428Z return self.transformer( 2025-08-14T21:36:17.2758791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2759162Z layer_outputs = layer_module( 2025-08-14T21:36:17.2759482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2759818Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2760195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2760575Z sa_output = self.attention( 2025-08-14T21:36:17.2760943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:17.2761380Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:17.2761552Z 2025-08-14T21:36:17.2761649Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2761978Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2762274Z return mod(**inputs) 2025-08-14T21:36:17.2762633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2763013Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2763388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2763771Z return self.transformer( 2025-08-14T21:36:17.2764148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2764530Z layer_outputs = layer_module( 2025-08-14T21:36:17.2764889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2765229Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2765608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:17.2765991Z sa_output = self.attention( 2025-08-14T21:36:17.2766368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:17.2766754Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:17.2766905Z 2025-08-14T21:36:17.2767000Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2767332Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2767633Z return mod(**inputs) 2025-08-14T21:36:17.2767984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2768366Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2768745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2769121Z return self.transformer( 2025-08-14T21:36:17.2769477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2769852Z layer_outputs = layer_module( 2025-08-14T21:36:17.2770170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2770496Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2770878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2771288Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2771699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2772184Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2772657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2773025Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2773406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:17.2773783Z x = self.lin1(input) 2025-08-14T21:36:17.2773886Z 2025-08-14T21:36:17.2773982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2774315Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2774614Z return mod(**inputs) 2025-08-14T21:36:17.2774974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2775356Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2775730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2776096Z return self.transformer( 2025-08-14T21:36:17.2776459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2776833Z layer_outputs = layer_module( 2025-08-14T21:36:17.2777166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2777498Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2777892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2778317Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2778719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2779213Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2779688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2780072Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2780447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:17.2780827Z x = self.activation(x) 2025-08-14T21:36:17.2781133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:17.2781449Z return self.act(input) 2025-08-14T21:36:17.2781550Z 2025-08-14T21:36:17.2781646Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2781980Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2782280Z return mod(**inputs) 2025-08-14T21:36:17.2782627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 826, in forward 2025-08-14T21:36:17.2783006Z dlbrt_output = self.distilbert( 2025-08-14T21:36:17.2783386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:17.2783764Z return self.transformer( 2025-08-14T21:36:17.2784124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:17.2784505Z layer_outputs = layer_module( 2025-08-14T21:36:17.2785032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:17.2785372Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:17.2785769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:17.2786200Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:17.2786615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:17.2787102Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:17.2787575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:17.2787946Z return forward_fn(*input_tensors) 2025-08-14T21:36:17.2788341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:17.2788716Z x = self.lin2(x) 2025-08-14T21:36:17.2788818Z 2025-08-14T21:36:17.2788917Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2789256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2789564Z return mod(**inputs) 2025-08-14T21:36:17.2789921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 836, in forward 2025-08-14T21:36:17.2790389Z prediction_logits = self.vocab_transform(hidden_states) # (bs, seq_length, dim) 2025-08-14T21:36:17.2790641Z 2025-08-14T21:36:17.2790749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2791104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2791444Z return mod(**inputs) 2025-08-14T21:36:17.2791807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 839, in forward 2025-08-14T21:36:17.2792307Z prediction_logits = self.vocab_projector(prediction_logits) # (bs, seq_length, vocab_size) 2025-08-14T21:36:17.2792535Z 2025-08-14T21:36:17.2792634Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:17.2792970Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:17.2793303Z return mod(**inputs) 2025-08-14T21:36:17.2793669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 843, in forward 2025-08-14T21:36:17.2794167Z mlm_loss = self.mlm_loss_fct(prediction_logits.view(-1, prediction_logits.size(-1)), labels.view(-1)) 2025-08-14T21:36:17.2794403Z 2025-08-14T21:36:23.5416024Z Compilation time (from dynamo_timed): 9.680234472 2025-08-14T21:36:23.5420758Z pass 2025-08-14T21:36:23.5425678Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:23.5431123Z TIMING: _recursive_pre_grad_passes:0.00455 _recursive_joint_graph_passes:0.23064 _recursive_post_grad_passes:0.04764 async_compile.wait:0.65918 code_gen:6.00208 inductor_compile:6.7978 backend_compile:8.42153 gc:0.00117 entire_frame_compile:9.68023 total_wall_time:9.68023 2025-08-14T21:36:23.5436971Z STATS: call_* op count: 153 | FakeTensorMode.__torch_dispatch__:6660 | FakeTensor.__torch_dispatch__:2532 | ProxyTorchDispatchMode.__torch_dispatch__:2359 2025-08-14T21:36:23.5441326Z Dynamo produced 1 graphs covering 153 ops with 0 graph breaks (0 unique) 2025-08-14T21:36:27.5890642Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:36:27.5891508Z from pkg_resources import resource_filename 2025-08-14T21:36:28.1212741Z 2025-08-14T21:36:28.6296843Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:36:28.6297308Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:36:28.6299535Z cpu eval DistilBertForQuestionAnswering 2025-08-14T21:36:28.8836130Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:28.9246300Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:28.9640546Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:33.2769224Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2773612Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2774032Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2774364Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2774677Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2775390Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2775789Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2776155Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2776470Z return mod(**inputs) 2025-08-14T21:36:33.2776871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2777296Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2777937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2778340Z return self.transformer( 2025-08-14T21:36:33.2778771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2779209Z layer_outputs = layer_module( 2025-08-14T21:36:33.2779543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2779890Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2780284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2780680Z sa_output = self.attention( 2025-08-14T21:36:33.2781056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:33.2781547Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:33.2781718Z 2025-08-14T21:36:33.2781826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2782158Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2782466Z return mod(**inputs) 2025-08-14T21:36:33.2782829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2783212Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2783597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2783973Z return self.transformer( 2025-08-14T21:36:33.2784342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2784978Z layer_outputs = layer_module( 2025-08-14T21:36:33.2785309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2785653Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2786038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2786418Z sa_output = self.attention( 2025-08-14T21:36:33.2786783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:33.2787203Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2787368Z 2025-08-14T21:36:33.2787468Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2787805Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2788104Z return mod(**inputs) 2025-08-14T21:36:33.2788469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2788855Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2789244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2789629Z return self.transformer( 2025-08-14T21:36:33.2789987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2790367Z layer_outputs = layer_module( 2025-08-14T21:36:33.2790690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2793401Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2793799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2794229Z sa_output = self.attention( 2025-08-14T21:36:33.2794643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:33.2795066Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2795239Z 2025-08-14T21:36:33.2795316Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2795543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2795877Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2796170Z return mod(**inputs) 2025-08-14T21:36:33.2796538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2796997Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2797407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2797791Z return self.transformer( 2025-08-14T21:36:33.2798156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2798535Z layer_outputs = layer_module( 2025-08-14T21:36:33.2798851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2799194Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2799579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2799950Z sa_output = self.attention( 2025-08-14T21:36:33.2800325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:33.2800760Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:33.2800930Z 2025-08-14T21:36:33.2801034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2801359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2801659Z return mod(**inputs) 2025-08-14T21:36:33.2802020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2802408Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2802782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2803157Z return self.transformer( 2025-08-14T21:36:33.2803527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2803911Z layer_outputs = layer_module( 2025-08-14T21:36:33.2804235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2804572Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2804954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2805321Z sa_output = self.attention( 2025-08-14T21:36:33.2805684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:33.2806068Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:33.2806195Z 2025-08-14T21:36:33.2806304Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2806624Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2806985Z return mod(**inputs) 2025-08-14T21:36:33.2807362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2807745Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2808145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2808525Z return self.transformer( 2025-08-14T21:36:33.2808889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2809262Z layer_outputs = layer_module( 2025-08-14T21:36:33.2809583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2809917Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2810312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2810727Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2811138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2811630Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2812103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2812470Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2812853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:33.2813229Z x = self.lin1(input) 2025-08-14T21:36:33.2813331Z 2025-08-14T21:36:33.2813427Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2813761Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2814060Z return mod(**inputs) 2025-08-14T21:36:33.2814411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2814797Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2815179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2815555Z return self.transformer( 2025-08-14T21:36:33.2815909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2816287Z layer_outputs = layer_module( 2025-08-14T21:36:33.2816604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2816936Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2817308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2817720Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2818127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2818608Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2819084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2819446Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2819823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:33.2820236Z x = self.activation(x) 2025-08-14T21:36:33.2820553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:33.2820871Z return self.act(input) 2025-08-14T21:36:33.2820969Z 2025-08-14T21:36:33.2821085Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2821414Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2821711Z return mod(**inputs) 2025-08-14T21:36:33.2822067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2822447Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2822835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2823236Z return self.transformer( 2025-08-14T21:36:33.2823604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2823978Z layer_outputs = layer_module( 2025-08-14T21:36:33.2824298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2824630Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2825097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2825510Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2825928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2826428Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2826909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2827273Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2827663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:33.2828041Z x = self.lin2(x) 2025-08-14T21:36:33.2828135Z 2025-08-14T21:36:33.2828231Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2828568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2828871Z return mod(**inputs) 2025-08-14T21:36:33.2829235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2829617Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2830006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2830388Z return self.transformer( 2025-08-14T21:36:33.2830751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2831135Z layer_outputs = layer_module( 2025-08-14T21:36:33.2831457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2831794Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2832170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2832550Z sa_output = self.attention( 2025-08-14T21:36:33.2832918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:33.2833371Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:33.2833537Z 2025-08-14T21:36:33.2833647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2833981Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2834296Z return mod(**inputs) 2025-08-14T21:36:33.2834654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2835042Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2835426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2835800Z return self.transformer( 2025-08-14T21:36:33.2836157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2836564Z layer_outputs = layer_module( 2025-08-14T21:36:33.2836886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2837210Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2837593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2837968Z sa_output = self.attention( 2025-08-14T21:36:33.2838332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:33.2838745Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2838914Z 2025-08-14T21:36:33.2839010Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2839333Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2839633Z return mod(**inputs) 2025-08-14T21:36:33.2839985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2840373Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2840757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2841126Z return self.transformer( 2025-08-14T21:36:33.2841488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2841864Z layer_outputs = layer_module( 2025-08-14T21:36:33.2842179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2842503Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2842881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2843263Z sa_output = self.attention( 2025-08-14T21:36:33.2843633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:33.2844048Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2844219Z 2025-08-14T21:36:33.2844295Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2844512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2844835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2845134Z return mod(**inputs) 2025-08-14T21:36:33.2845494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2845883Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2846284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2846686Z return self.transformer( 2025-08-14T21:36:33.2847068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2847445Z layer_outputs = layer_module( 2025-08-14T21:36:33.2847765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2848099Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2848481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2848851Z sa_output = self.attention( 2025-08-14T21:36:33.2849216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:33.2849666Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:33.2849837Z 2025-08-14T21:36:33.2849941Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2850268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2850566Z return mod(**inputs) 2025-08-14T21:36:33.2850930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2851311Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2851694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2852070Z return self.transformer( 2025-08-14T21:36:33.2852432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2852807Z layer_outputs = layer_module( 2025-08-14T21:36:33.2853129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2853467Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2853844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2854224Z sa_output = self.attention( 2025-08-14T21:36:33.2854594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:33.2854984Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:33.2855112Z 2025-08-14T21:36:33.2855207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2855538Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2855841Z return mod(**inputs) 2025-08-14T21:36:33.2856204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2856582Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2856964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2857342Z return self.transformer( 2025-08-14T21:36:33.2857699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2858078Z layer_outputs = layer_module( 2025-08-14T21:36:33.2858397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2858729Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2859101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2859536Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2859964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2860476Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2860950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2861316Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2861697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:33.2862065Z x = self.lin1(input) 2025-08-14T21:36:33.2862170Z 2025-08-14T21:36:33.2862266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2862616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2862916Z return mod(**inputs) 2025-08-14T21:36:33.2863273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2863665Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2864052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2864433Z return self.transformer( 2025-08-14T21:36:33.2864884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2865281Z layer_outputs = layer_module( 2025-08-14T21:36:33.2865608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2865942Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2866334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2866754Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2867290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2867781Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2868260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2868625Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2869005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:33.2869388Z x = self.activation(x) 2025-08-14T21:36:33.2869695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:33.2870014Z return self.act(input) 2025-08-14T21:36:33.2870113Z 2025-08-14T21:36:33.2870216Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2870544Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2870845Z return mod(**inputs) 2025-08-14T21:36:33.2871203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2871585Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2871972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2872351Z return self.transformer( 2025-08-14T21:36:33.2872737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2873122Z layer_outputs = layer_module( 2025-08-14T21:36:33.2873446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2873795Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2874168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2874579Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2874985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2875475Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2875944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2876328Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2876707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:33.2877077Z x = self.lin2(x) 2025-08-14T21:36:33.2877170Z 2025-08-14T21:36:33.2877266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2877598Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2877895Z return mod(**inputs) 2025-08-14T21:36:33.2878245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2878632Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2879014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2879395Z return self.transformer( 2025-08-14T21:36:33.2879753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2880136Z layer_outputs = layer_module( 2025-08-14T21:36:33.2880455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2880791Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2881162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2881543Z sa_output = self.attention( 2025-08-14T21:36:33.2881913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:33.2882333Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:33.2882510Z 2025-08-14T21:36:33.2882608Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2882939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2883237Z return mod(**inputs) 2025-08-14T21:36:33.2883590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2884056Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2884440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2885064Z return self.transformer( 2025-08-14T21:36:33.2885430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2885811Z layer_outputs = layer_module( 2025-08-14T21:36:33.2886187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2886544Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2886970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2887353Z sa_output = self.attention( 2025-08-14T21:36:33.2887718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:33.2888130Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2888299Z 2025-08-14T21:36:33.2888396Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2888729Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2889017Z return mod(**inputs) 2025-08-14T21:36:33.2889405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2889797Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2890179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2890552Z return self.transformer( 2025-08-14T21:36:33.2890917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2891294Z layer_outputs = layer_module( 2025-08-14T21:36:33.2891616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2891944Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2892324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2892706Z sa_output = self.attention( 2025-08-14T21:36:33.2893067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:33.2893491Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2893663Z 2025-08-14T21:36:33.2893738Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2893959Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2894282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2894580Z return mod(**inputs) 2025-08-14T21:36:33.2894943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2895333Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2895711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2896093Z return self.transformer( 2025-08-14T21:36:33.2896460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2896835Z layer_outputs = layer_module( 2025-08-14T21:36:33.2897158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2897493Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2897875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2898249Z sa_output = self.attention( 2025-08-14T21:36:33.2898616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:33.2899076Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:33.2899250Z 2025-08-14T21:36:33.2899346Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2899691Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2899988Z return mod(**inputs) 2025-08-14T21:36:33.2900365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2900745Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2901124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2901499Z return self.transformer( 2025-08-14T21:36:33.2901864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2902237Z layer_outputs = layer_module( 2025-08-14T21:36:33.2902574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2902914Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2903294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2903671Z sa_output = self.attention( 2025-08-14T21:36:33.2904036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:33.2904425Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:33.2904552Z 2025-08-14T21:36:33.2904647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2905029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2905329Z return mod(**inputs) 2025-08-14T21:36:33.2905687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2906078Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2906461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2906842Z return self.transformer( 2025-08-14T21:36:33.2907201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2907582Z layer_outputs = layer_module( 2025-08-14T21:36:33.2907905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2908241Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2908617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2909038Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2909451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2909946Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2910426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2910788Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2911167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:33.2911554Z x = self.lin1(input) 2025-08-14T21:36:33.2911663Z 2025-08-14T21:36:33.2911763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2912113Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2912458Z return mod(**inputs) 2025-08-14T21:36:33.2912850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2913238Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2913640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2914012Z return self.transformer( 2025-08-14T21:36:33.2914379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2914756Z layer_outputs = layer_module( 2025-08-14T21:36:33.2915072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2915397Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2915793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2916204Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2916611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2917094Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2917565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2917926Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2918300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:33.2918668Z x = self.activation(x) 2025-08-14T21:36:33.2918968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:33.2919279Z return self.act(input) 2025-08-14T21:36:33.2919379Z 2025-08-14T21:36:33.2919475Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2919808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2920105Z return mod(**inputs) 2025-08-14T21:36:33.2920465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2920844Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2921230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2921603Z return self.transformer( 2025-08-14T21:36:33.2921958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2922339Z layer_outputs = layer_module( 2025-08-14T21:36:33.2922656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2922987Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2923358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2923767Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2924173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2924658Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2925129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2925518Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2925919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:33.2926299Z x = self.lin2(x) 2025-08-14T21:36:33.2926397Z 2025-08-14T21:36:33.2926511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2926853Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2927161Z return mod(**inputs) 2025-08-14T21:36:33.2927529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2927928Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2928318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2928728Z return self.transformer( 2025-08-14T21:36:33.2929092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2929474Z layer_outputs = layer_module( 2025-08-14T21:36:33.2929802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2930136Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2930521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2930910Z sa_output = self.attention( 2025-08-14T21:36:33.2931284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:33.2931715Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:33.2931895Z 2025-08-14T21:36:33.2931992Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2932330Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2932633Z return mod(**inputs) 2025-08-14T21:36:33.2932995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2933399Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2933790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2934170Z return self.transformer( 2025-08-14T21:36:33.2934544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2934931Z layer_outputs = layer_module( 2025-08-14T21:36:33.2935256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2935593Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2935981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2936366Z sa_output = self.attention( 2025-08-14T21:36:33.2936735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:33.2937164Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2937333Z 2025-08-14T21:36:33.2937430Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2937765Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2938064Z return mod(**inputs) 2025-08-14T21:36:33.2938432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2938861Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2939271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2939647Z return self.transformer( 2025-08-14T21:36:33.2940021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2940402Z layer_outputs = layer_module( 2025-08-14T21:36:33.2940713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2941046Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2941426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2941804Z sa_output = self.attention( 2025-08-14T21:36:33.2942189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:33.2942610Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2942775Z 2025-08-14T21:36:33.2942859Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2943080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2943400Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2943700Z return mod(**inputs) 2025-08-14T21:36:33.2944057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2944440Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2944892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2945295Z return self.transformer( 2025-08-14T21:36:33.2945672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2946066Z layer_outputs = layer_module( 2025-08-14T21:36:33.2946387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2946721Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2947098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2947483Z sa_output = self.attention( 2025-08-14T21:36:33.2947853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:33.2948288Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:33.2948464Z 2025-08-14T21:36:33.2948562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2948896Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2949195Z return mod(**inputs) 2025-08-14T21:36:33.2949559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2949941Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2950324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2950698Z return self.transformer( 2025-08-14T21:36:33.2951055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2951434Z layer_outputs = layer_module( 2025-08-14T21:36:33.2951752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2952114Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2952515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2952895Z sa_output = self.attention( 2025-08-14T21:36:33.2953277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:33.2953663Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:33.2953789Z 2025-08-14T21:36:33.2953884Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2954213Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2954513Z return mod(**inputs) 2025-08-14T21:36:33.2954865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2955271Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2955651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2956028Z return self.transformer( 2025-08-14T21:36:33.2956389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2956763Z layer_outputs = layer_module( 2025-08-14T21:36:33.2957075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2957399Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2957777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2958187Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2958596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2959082Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2959640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2960010Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2960393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:33.2960802Z x = self.lin1(input) 2025-08-14T21:36:33.2960911Z 2025-08-14T21:36:33.2961006Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2961342Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2961645Z return mod(**inputs) 2025-08-14T21:36:33.2962011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2962404Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2962792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2963167Z return self.transformer( 2025-08-14T21:36:33.2963535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2963917Z layer_outputs = layer_module( 2025-08-14T21:36:33.2964241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2964575Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2964965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2965406Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2965832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2966341Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2966821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2967186Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2967561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:33.2967940Z x = self.activation(x) 2025-08-14T21:36:33.2968240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:33.2968567Z return self.act(input) 2025-08-14T21:36:33.2968668Z 2025-08-14T21:36:33.2968765Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2969096Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2969395Z return mod(**inputs) 2025-08-14T21:36:33.2969743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2970128Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2970510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2970889Z return self.transformer( 2025-08-14T21:36:33.2971248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2971631Z layer_outputs = layer_module( 2025-08-14T21:36:33.2971951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2972280Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2972659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.2973069Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.2973474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.2973961Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.2974433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.2974805Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.2975182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:33.2975547Z x = self.lin2(x) 2025-08-14T21:36:33.2975645Z 2025-08-14T21:36:33.2975740Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2976070Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2976370Z return mod(**inputs) 2025-08-14T21:36:33.2976721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2977104Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2977483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2977857Z return self.transformer( 2025-08-14T21:36:33.2978241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2978617Z layer_outputs = layer_module( 2025-08-14T21:36:33.2978951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2979325Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2979709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2980085Z sa_output = self.attention( 2025-08-14T21:36:33.2980443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:33.2980868Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:33.2981041Z 2025-08-14T21:36:33.2981137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2981491Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2981783Z return mod(**inputs) 2025-08-14T21:36:33.2982144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2982533Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2982918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2983287Z return self.transformer( 2025-08-14T21:36:33.2983652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2984028Z layer_outputs = layer_module( 2025-08-14T21:36:33.2984336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2984868Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2985288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2985692Z sa_output = self.attention( 2025-08-14T21:36:33.2986076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:33.2986525Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2986684Z 2025-08-14T21:36:33.2986786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2987114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2987407Z return mod(**inputs) 2025-08-14T21:36:33.2987767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2988164Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2988543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2988923Z return self.transformer( 2025-08-14T21:36:33.2989285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2989666Z layer_outputs = layer_module( 2025-08-14T21:36:33.2989974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2990305Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2990687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2991062Z sa_output = self.attention( 2025-08-14T21:36:33.2991423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:33.2991905Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.2992094Z 2025-08-14T21:36:33.2992179Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.2992391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2992739Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2993040Z return mod(**inputs) 2025-08-14T21:36:33.2993399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2993775Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2994152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2994529Z return self.transformer( 2025-08-14T21:36:33.2994913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.2995294Z layer_outputs = layer_module( 2025-08-14T21:36:33.2995613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.2995948Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.2996327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.2996703Z sa_output = self.attention( 2025-08-14T21:36:33.2997073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:33.2997511Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:33.2997682Z 2025-08-14T21:36:33.2997779Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.2998113Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.2998418Z return mod(**inputs) 2025-08-14T21:36:33.2998774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.2999171Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.2999555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.2999933Z return self.transformer( 2025-08-14T21:36:33.3000290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3000670Z layer_outputs = layer_module( 2025-08-14T21:36:33.3000988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3001327Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3001710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.3002089Z sa_output = self.attention( 2025-08-14T21:36:33.3002458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:33.3002841Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:33.3002975Z 2025-08-14T21:36:33.3003070Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3003404Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3003704Z return mod(**inputs) 2025-08-14T21:36:33.3004054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3004462Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3004857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3005230Z return self.transformer( 2025-08-14T21:36:33.3005609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3005994Z layer_outputs = layer_module( 2025-08-14T21:36:33.3006312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3006640Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3007019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.3007434Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.3007844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.3008350Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.3008826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.3009190Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.3009561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:33.3009935Z x = self.lin1(input) 2025-08-14T21:36:33.3010038Z 2025-08-14T21:36:33.3010132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3010463Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3010756Z return mod(**inputs) 2025-08-14T21:36:33.3011114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3011504Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3011889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3012263Z return self.transformer( 2025-08-14T21:36:33.3012624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3013001Z layer_outputs = layer_module( 2025-08-14T21:36:33.3013316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3013649Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3014030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.3014446Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.3014846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.3015338Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.3015811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.3016172Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.3016542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:33.3016919Z x = self.activation(x) 2025-08-14T21:36:33.3017217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:33.3017524Z return self.act(input) 2025-08-14T21:36:33.3017667Z 2025-08-14T21:36:33.3017762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3018103Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3018404Z return mod(**inputs) 2025-08-14T21:36:33.3018767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3019157Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3019540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3019916Z return self.transformer( 2025-08-14T21:36:33.3020272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3020648Z layer_outputs = layer_module( 2025-08-14T21:36:33.3020970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3021313Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3021697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.3022110Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.3022520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.3023007Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.3023483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.3023848Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.3024229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:33.3024598Z x = self.lin2(x) 2025-08-14T21:36:33.3024696Z 2025-08-14T21:36:33.3024849Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3025195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3025491Z return mod(**inputs) 2025-08-14T21:36:33.3025853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3026249Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3026637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3027013Z return self.transformer( 2025-08-14T21:36:33.3027380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3027764Z layer_outputs = layer_module( 2025-08-14T21:36:33.3028076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3028415Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3028802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.3029184Z sa_output = self.attention( 2025-08-14T21:36:33.3029545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 390, in forward 2025-08-14T21:36:33.3029975Z q = shape(self.q_lin(query)) # (bs, n_heads, q_length, dim_per_head) 2025-08-14T21:36:33.3030148Z 2025-08-14T21:36:33.3030245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3030577Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3030892Z return mod(**inputs) 2025-08-14T21:36:33.3031269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3031663Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3032057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3032446Z return self.transformer( 2025-08-14T21:36:33.3032820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3033204Z layer_outputs = layer_module( 2025-08-14T21:36:33.3033518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3033856Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3034263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.3034641Z sa_output = self.attention( 2025-08-14T21:36:33.3035002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 391, in forward 2025-08-14T21:36:33.3035423Z k = shape(self.k_lin(key)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.3035585Z 2025-08-14T21:36:33.3035688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3036009Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3036307Z return mod(**inputs) 2025-08-14T21:36:33.3036662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3037043Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3037418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3037791Z return self.transformer( 2025-08-14T21:36:33.3038153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3038529Z layer_outputs = layer_module( 2025-08-14T21:36:33.3038838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3039168Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3039546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.3039914Z sa_output = self.attention( 2025-08-14T21:36:33.3040273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 392, in forward 2025-08-14T21:36:33.3040696Z v = shape(self.v_lin(value)) # (bs, n_heads, k_length, dim_per_head) 2025-08-14T21:36:33.3040858Z 2025-08-14T21:36:33.3040939Z cudagraph partition due to non gpu ops 2025-08-14T21:36:33.3041150Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3041475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3041771Z return mod(**inputs) 2025-08-14T21:36:33.3042119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3042505Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3042879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3043248Z return self.transformer( 2025-08-14T21:36:33.3043599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3043994Z layer_outputs = layer_module( 2025-08-14T21:36:33.3044326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3044661Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3045056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.3045438Z sa_output = self.attention( 2025-08-14T21:36:33.3045807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 402, in forward 2025-08-14T21:36:33.3046233Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:33.3046414Z 2025-08-14T21:36:33.3046511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3046846Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3047160Z return mod(**inputs) 2025-08-14T21:36:33.3047516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3047907Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3048291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3048656Z return self.transformer( 2025-08-14T21:36:33.3049016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3049394Z layer_outputs = layer_module( 2025-08-14T21:36:33.3049710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3050034Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3050417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 476, in forward 2025-08-14T21:36:33.3050797Z sa_output = self.attention( 2025-08-14T21:36:33.3051166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 412, in forward 2025-08-14T21:36:33.3051547Z attn_output = self.out_lin(attn_output) 2025-08-14T21:36:33.3051679Z 2025-08-14T21:36:33.3051775Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3052103Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3052394Z return mod(**inputs) 2025-08-14T21:36:33.3052752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3053135Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3053514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3053890Z return self.transformer( 2025-08-14T21:36:33.3054254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3054647Z layer_outputs = layer_module( 2025-08-14T21:36:33.3054954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3055286Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3055664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.3056072Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.3056474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.3056989Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.3057477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.3057860Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.3058236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 431, in ff_chunk 2025-08-14T21:36:33.3058611Z x = self.lin1(input) 2025-08-14T21:36:33.3058709Z 2025-08-14T21:36:33.3058810Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3059139Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3059430Z return mod(**inputs) 2025-08-14T21:36:33.3059784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3060187Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3060562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3060943Z return self.transformer( 2025-08-14T21:36:33.3061304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3061679Z layer_outputs = layer_module( 2025-08-14T21:36:33.3061987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3062318Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3062697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.3063098Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.3063509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.3064005Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.3064481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.3064913Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.3065298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 432, in ff_chunk 2025-08-14T21:36:33.3065683Z x = self.activation(x) 2025-08-14T21:36:33.3065989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:36:33.3066296Z return self.act(input) 2025-08-14T21:36:33.3066406Z 2025-08-14T21:36:33.3066507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3066844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3067141Z return mod(**inputs) 2025-08-14T21:36:33.3067503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1031, in forward 2025-08-14T21:36:33.3067893Z distilbert_output = self.distilbert( 2025-08-14T21:36:33.3068279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 736, in forward 2025-08-14T21:36:33.3068647Z return self.transformer( 2025-08-14T21:36:33.3069021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 541, in forward 2025-08-14T21:36:33.3069397Z layer_outputs = layer_module( 2025-08-14T21:36:33.3069715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:33.3070062Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:33.3070459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 494, in forward 2025-08-14T21:36:33.3070873Z ffn_output = self.ffn(sa_output) # (bs, seq_length, dim) 2025-08-14T21:36:33.3071291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 428, in forward 2025-08-14T21:36:33.3071786Z return apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, input) 2025-08-14T21:36:33.3072258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:36:33.3072623Z return forward_fn(*input_tensors) 2025-08-14T21:36:33.3072995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 433, in ff_chunk 2025-08-14T21:36:33.3073392Z x = self.lin2(x) 2025-08-14T21:36:33.3073491Z 2025-08-14T21:36:33.3073588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3073923Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3074218Z return mod(**inputs) 2025-08-14T21:36:33.3074576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1043, in forward 2025-08-14T21:36:33.3075005Z logits = self.qa_outputs(hidden_states) # (bs, max_query_len, 2) 2025-08-14T21:36:33.3075168Z 2025-08-14T21:36:33.3075271Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3075593Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3075890Z return mod(**inputs) 2025-08-14T21:36:33.3076245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1061, in forward 2025-08-14T21:36:33.3076648Z start_loss = loss_fct(start_logits, start_positions) 2025-08-14T21:36:33.3076799Z 2025-08-14T21:36:33.3076895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:33.3077224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:36:33.3077519Z return mod(**inputs) 2025-08-14T21:36:33.3077868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/distilbert/modeling_distilbert.py", line 1062, in forward 2025-08-14T21:36:33.3078267Z end_loss = loss_fct(end_logits, end_positions) 2025-08-14T21:36:33.3078402Z 2025-08-14T21:36:39.4646633Z Compilation time (from dynamo_timed): 9.584852183 2025-08-14T21:36:39.4648287Z pass 2025-08-14T21:36:39.4648743Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:39.4651994Z TIMING: _recursive_pre_grad_passes:0.00468 _recursive_joint_graph_passes:0.22163 _recursive_post_grad_passes:0.05567 async_compile.wait:0.64003 code_gen:5.91182 inductor_compile:6.74049 backend_compile:8.34336 gc:9e-05 entire_frame_compile:9.58485 total_wall_time:9.58485 2025-08-14T21:36:39.4653051Z STATS: call_* op count: 161 | FakeTensorMode.__torch_dispatch__:6705 | FakeTensor.__torch_dispatch__:2556 | ProxyTorchDispatchMode.__torch_dispatch__:2400 2025-08-14T21:36:39.4654972Z Dynamo produced 1 graphs covering 161 ops with 0 graph breaks (0 unique) 2025-08-14T21:36:43.4262413Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:36:43.4263245Z from pkg_resources import resource_filename 2025-08-14T21:36:43.9540550Z 2025-08-14T21:36:45.7574998Z loading model: 0it [00:00, ?it/s]`loss_type=None` was set in the config but it is unrecognised.Using the default loss: `ForCausalLMLoss`. 2025-08-14T21:36:45.7576341Z WARNING:transformers.modeling_utils:`loss_type=None` was set in the config but it is unrecognised.Using the default loss: `ForCausalLMLoss`. 2025-08-14T21:36:45.7850743Z 2025-08-14T21:36:45.7859671Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:36:45.7861408Z cpu eval DistillGPT2 2025-08-14T21:36:46.1402300Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:46.2776709Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:46.4356665Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:51.7492365Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7496032Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7499437Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7500969Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7501775Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7502248Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7506800Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7510927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7515314Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7518871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7523069Z outputs = block( 2025-08-14T21:36:51.7527053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7528519Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7528916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7529306Z return func(*args, **kwargs) 2025-08-14T21:36:51.7529680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7530201Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7530574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7530925Z return func(*args, **kwargs) 2025-08-14T21:36:51.7531264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:36:51.7531721Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:36:51.7532152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7532531Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7532698Z 2025-08-14T21:36:51.7532776Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7532977Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7533169Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7533352Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7533570Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7533958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7534327Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7534685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7535034Z outputs = block( 2025-08-14T21:36:51.7535339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7535855Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7536323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7536675Z return func(*args, **kwargs) 2025-08-14T21:36:51.7537068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7537429Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7537787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7538136Z return func(*args, **kwargs) 2025-08-14T21:36:51.7538467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7538843Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7539296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:36:51.7539746Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:51.7539918Z 2025-08-14T21:36:51.7540016Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7540400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7540763Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7541117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7541451Z outputs = block( 2025-08-14T21:36:51.7541747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7542079Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7542418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7542767Z return func(*args, **kwargs) 2025-08-14T21:36:51.7543106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7543469Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7543818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7544154Z return func(*args, **kwargs) 2025-08-14T21:36:51.7544491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7544987Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7545405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:36:51.7545841Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:36:51.7545993Z 2025-08-14T21:36:51.7546102Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7546482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7546854Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7547217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7547567Z outputs = block( 2025-08-14T21:36:51.7547865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7548203Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7548555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7548917Z return func(*args, **kwargs) 2025-08-14T21:36:51.7549263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7549648Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7550025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7550359Z return func(*args, **kwargs) 2025-08-14T21:36:51.7550693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:36:51.7551050Z attn_output = self.c_proj(attn_output) 2025-08-14T21:36:51.7551378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7551739Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7551905Z 2025-08-14T21:36:51.7552004Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7552444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7552807Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7553166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7553510Z outputs = block( 2025-08-14T21:36:51.7553804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7554132Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7554477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7554818Z return func(*args, **kwargs) 2025-08-14T21:36:51.7555148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7555530Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7555911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:36:51.7556270Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:36:51.7556596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7556964Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7557123Z 2025-08-14T21:36:51.7557226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7557609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7557969Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7558327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7558672Z outputs = block( 2025-08-14T21:36:51.7558963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7559293Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7559642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7559980Z return func(*args, **kwargs) 2025-08-14T21:36:51.7560312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7560687Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7561061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:36:51.7561409Z hidden_states = self.act(hidden_states) 2025-08-14T21:36:51.7561754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:36:51.7562195Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:36:51.7562414Z 2025-08-14T21:36:51.7562514Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7562925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7563289Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7563644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7563981Z outputs = block( 2025-08-14T21:36:51.7564270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7564600Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7564970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7565311Z return func(*args, **kwargs) 2025-08-14T21:36:51.7565661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7566049Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7566430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:36:51.7566794Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:36:51.7567135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7567505Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7567668Z 2025-08-14T21:36:51.7567776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7568160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7568533Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7568900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7569242Z outputs = block( 2025-08-14T21:36:51.7569548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7569887Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7570237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7570579Z return func(*args, **kwargs) 2025-08-14T21:36:51.7570927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7571301Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7571660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7572009Z return func(*args, **kwargs) 2025-08-14T21:36:51.7572358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:36:51.7572821Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:36:51.7573243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7573614Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7573780Z 2025-08-14T21:36:51.7573858Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7574065Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7574260Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7574475Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7574691Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7575076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7575457Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7575819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7576160Z outputs = block( 2025-08-14T21:36:51.7576452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7576783Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7577131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7577466Z return func(*args, **kwargs) 2025-08-14T21:36:51.7577826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7578191Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7578547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7578881Z return func(*args, **kwargs) 2025-08-14T21:36:51.7579220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7579588Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7579991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:36:51.7580433Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:51.7580604Z 2025-08-14T21:36:51.7580699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7581082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7581439Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7581793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7582138Z outputs = block( 2025-08-14T21:36:51.7582435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7582759Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7583105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7583446Z return func(*args, **kwargs) 2025-08-14T21:36:51.7583777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7584144Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7584503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7585055Z return func(*args, **kwargs) 2025-08-14T21:36:51.7585395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7585772Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7586184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:36:51.7586607Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:36:51.7586757Z 2025-08-14T21:36:51.7586852Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7587234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7587662Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7588045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7588386Z outputs = block( 2025-08-14T21:36:51.7588708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7589046Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7589388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7589732Z return func(*args, **kwargs) 2025-08-14T21:36:51.7590076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7590433Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7590792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7591161Z return func(*args, **kwargs) 2025-08-14T21:36:51.7591500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:36:51.7591851Z attn_output = self.c_proj(attn_output) 2025-08-14T21:36:51.7592181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7592545Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7592701Z 2025-08-14T21:36:51.7592805Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7593178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7593539Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7593896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7594232Z outputs = block( 2025-08-14T21:36:51.7594531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7594860Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7595209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7595541Z return func(*args, **kwargs) 2025-08-14T21:36:51.7595877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7596251Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7596624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:36:51.7596971Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:36:51.7597302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7597665Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7597820Z 2025-08-14T21:36:51.7597915Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7598292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7598653Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7599006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7599337Z outputs = block( 2025-08-14T21:36:51.7599632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7599962Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7600321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7600661Z return func(*args, **kwargs) 2025-08-14T21:36:51.7601016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7601414Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7601784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:36:51.7602141Z hidden_states = self.act(hidden_states) 2025-08-14T21:36:51.7602468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:36:51.7602891Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:36:51.7603109Z 2025-08-14T21:36:51.7603206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7603604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7603965Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7604310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7604650Z outputs = block( 2025-08-14T21:36:51.7604947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7605277Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7605615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7605956Z return func(*args, **kwargs) 2025-08-14T21:36:51.7606290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7606667Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7607034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:36:51.7607396Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:36:51.7607730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7608087Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7608249Z 2025-08-14T21:36:51.7608345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7608723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7609083Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7609427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7609768Z outputs = block( 2025-08-14T21:36:51.7610063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7610394Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7610733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7611073Z return func(*args, **kwargs) 2025-08-14T21:36:51.7611409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7611762Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7612114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7612455Z return func(*args, **kwargs) 2025-08-14T21:36:51.7612790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:36:51.7613268Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:36:51.7613703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7614092Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7614252Z 2025-08-14T21:36:51.7614336Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7614531Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7614725Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7614917Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7615127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7615509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7615899Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7616258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7616612Z outputs = block( 2025-08-14T21:36:51.7616935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7617276Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7617635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7617993Z return func(*args, **kwargs) 2025-08-14T21:36:51.7618341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7618712Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7619074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7619427Z return func(*args, **kwargs) 2025-08-14T21:36:51.7619777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7620154Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7620576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:36:51.7621030Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:51.7621204Z 2025-08-14T21:36:51.7621312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7621698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7622070Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7622441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7622792Z outputs = block( 2025-08-14T21:36:51.7623095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7623435Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7623790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7624131Z return func(*args, **kwargs) 2025-08-14T21:36:51.7624479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7624930Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7625291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7625624Z return func(*args, **kwargs) 2025-08-14T21:36:51.7625990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7626387Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7626792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:36:51.7627227Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:36:51.7627383Z 2025-08-14T21:36:51.7627478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7627857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7628216Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7628574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7628916Z outputs = block( 2025-08-14T21:36:51.7629251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7629578Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7629928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7630270Z return func(*args, **kwargs) 2025-08-14T21:36:51.7630601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7630965Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7631323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7631663Z return func(*args, **kwargs) 2025-08-14T21:36:51.7631994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:36:51.7632355Z attn_output = self.c_proj(attn_output) 2025-08-14T21:36:51.7632687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7633044Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7633211Z 2025-08-14T21:36:51.7633309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7633692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7634057Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7634405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7634749Z outputs = block( 2025-08-14T21:36:51.7635045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7635379Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7635722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7636063Z return func(*args, **kwargs) 2025-08-14T21:36:51.7636402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7636772Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7637145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:36:51.7637502Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:36:51.7637830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7638187Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7638353Z 2025-08-14T21:36:51.7638450Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7638851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7639232Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7639598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7639941Z outputs = block( 2025-08-14T21:36:51.7640234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7640558Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7640905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7641245Z return func(*args, **kwargs) 2025-08-14T21:36:51.7641585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7641970Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7642346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:36:51.7642697Z hidden_states = self.act(hidden_states) 2025-08-14T21:36:51.7643010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:36:51.7643425Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:36:51.7643644Z 2025-08-14T21:36:51.7643739Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7644114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7644466Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7644819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7645161Z outputs = block( 2025-08-14T21:36:51.7645456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7645779Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7646124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7646464Z return func(*args, **kwargs) 2025-08-14T21:36:51.7646794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7647167Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7647540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:36:51.7647902Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:36:51.7648231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7648598Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7648757Z 2025-08-14T21:36:51.7648915Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7649365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7649780Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7661899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7662286Z outputs = block( 2025-08-14T21:36:51.7662620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7662981Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7663436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7663794Z return func(*args, **kwargs) 2025-08-14T21:36:51.7664183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 442, in forward 2025-08-14T21:36:51.7664619Z hidden_states = residual + feed_forward_hidden_states 2025-08-14T21:36:51.7664842Z 2025-08-14T21:36:51.7664953Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7665356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7665731Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7666107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7666452Z outputs = block( 2025-08-14T21:36:51.7666767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7667140Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7667487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7667839Z return func(*args, **kwargs) 2025-08-14T21:36:51.7668191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7668566Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7668921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7669268Z return func(*args, **kwargs) 2025-08-14T21:36:51.7669610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:36:51.7670073Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:36:51.7670501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7670876Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7671037Z 2025-08-14T21:36:51.7671124Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7671316Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7671510Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7671700Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7671916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7672297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7672667Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7673027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7673370Z outputs = block( 2025-08-14T21:36:51.7673674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7674016Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7674365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7674701Z return func(*args, **kwargs) 2025-08-14T21:36:51.7675041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7675411Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7675760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7676103Z return func(*args, **kwargs) 2025-08-14T21:36:51.7676484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7676877Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7677289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:36:51.7677753Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:51.7677926Z 2025-08-14T21:36:51.7678030Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7678420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7678781Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7679140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7679484Z outputs = block( 2025-08-14T21:36:51.7679801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7680143Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7680496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7680844Z return func(*args, **kwargs) 2025-08-14T21:36:51.7681177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7681545Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7681902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7682236Z return func(*args, **kwargs) 2025-08-14T21:36:51.7682575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7682950Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7683362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:36:51.7683778Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:36:51.7683937Z 2025-08-14T21:36:51.7684035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7684422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7684930Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7685292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7685647Z outputs = block( 2025-08-14T21:36:51.7685952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7686290Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7686645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7686992Z return func(*args, **kwargs) 2025-08-14T21:36:51.7687333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7687695Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7688056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7688399Z return func(*args, **kwargs) 2025-08-14T21:36:51.7688739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:36:51.7689093Z attn_output = self.c_proj(attn_output) 2025-08-14T21:36:51.7689427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7689847Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7690006Z 2025-08-14T21:36:51.7690126Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7690537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7690909Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7691271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7691607Z outputs = block( 2025-08-14T21:36:51.7691906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7692244Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7692585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7692954Z return func(*args, **kwargs) 2025-08-14T21:36:51.7693298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7693684Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7694057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:36:51.7694416Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:36:51.7694750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7695119Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7695277Z 2025-08-14T21:36:51.7695372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7695754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7696122Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7696472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7696816Z outputs = block( 2025-08-14T21:36:51.7697115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7697447Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7697783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7698127Z return func(*args, **kwargs) 2025-08-14T21:36:51.7698469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7698850Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7699226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:36:51.7699584Z hidden_states = self.act(hidden_states) 2025-08-14T21:36:51.7699898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:36:51.7700316Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:36:51.7700538Z 2025-08-14T21:36:51.7700633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7701013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7701370Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7701725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7702063Z outputs = block( 2025-08-14T21:36:51.7702400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7702745Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7703088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7703444Z return func(*args, **kwargs) 2025-08-14T21:36:51.7703777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7704154Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7704527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:36:51.7704959Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:36:51.7705289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7705688Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7705848Z 2025-08-14T21:36:51.7705952Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7706332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7706701Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7707064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7707408Z outputs = block( 2025-08-14T21:36:51.7707698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7708034Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7708383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7708729Z return func(*args, **kwargs) 2025-08-14T21:36:51.7709061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7709428Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7709786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7710124Z return func(*args, **kwargs) 2025-08-14T21:36:51.7710465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:36:51.7710923Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:36:51.7711353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7711712Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7711882Z 2025-08-14T21:36:51.7711957Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7712163Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7712355Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7712546Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7712761Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7713145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7713504Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7713863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7714203Z outputs = block( 2025-08-14T21:36:51.7714491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7714825Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7716056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7716419Z return func(*args, **kwargs) 2025-08-14T21:36:51.7716760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7717152Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7717519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7717863Z return func(*args, **kwargs) 2025-08-14T21:36:51.7718195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7718573Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7718986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:36:51.7719446Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:51.7719622Z 2025-08-14T21:36:51.7719720Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7720107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7720475Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7720825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7721168Z outputs = block( 2025-08-14T21:36:51.7721463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7721796Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7722136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7722482Z return func(*args, **kwargs) 2025-08-14T21:36:51.7722828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7723187Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7723546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7723889Z return func(*args, **kwargs) 2025-08-14T21:36:51.7724229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7724592Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7725001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:36:51.7725429Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:36:51.7725582Z 2025-08-14T21:36:51.7725686Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7726066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7726433Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7726796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7727135Z outputs = block( 2025-08-14T21:36:51.7727434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7727768Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7728118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7728458Z return func(*args, **kwargs) 2025-08-14T21:36:51.7728800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7729186Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7729549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7729904Z return func(*args, **kwargs) 2025-08-14T21:36:51.7730248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:36:51.7730607Z attn_output = self.c_proj(attn_output) 2025-08-14T21:36:51.7730934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7731302Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7731458Z 2025-08-14T21:36:51.7731561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7731942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7732349Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7732706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7733047Z outputs = block( 2025-08-14T21:36:51.7733339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7733671Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7734015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7734356Z return func(*args, **kwargs) 2025-08-14T21:36:51.7734685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7735057Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7735432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:36:51.7735782Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:36:51.7736117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7736481Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7736639Z 2025-08-14T21:36:51.7736738Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7737107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7737468Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7737821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7738158Z outputs = block( 2025-08-14T21:36:51.7738447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7738775Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7739119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7739452Z return func(*args, **kwargs) 2025-08-14T21:36:51.7739792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7740168Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7740540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:36:51.7740886Z hidden_states = self.act(hidden_states) 2025-08-14T21:36:51.7741206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:36:51.7741646Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:36:51.7741858Z 2025-08-14T21:36:51.7741975Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7742367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7742731Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7743086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7743421Z outputs = block( 2025-08-14T21:36:51.7743716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7744051Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7744397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7744828Z return func(*args, **kwargs) 2025-08-14T21:36:51.7745184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7745565Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7745938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:36:51.7746314Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:36:51.7746658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7747032Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7747194Z 2025-08-14T21:36:51.7747293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7747689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7748063Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7748426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7748765Z outputs = block( 2025-08-14T21:36:51.7749065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7749405Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7749748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7750095Z return func(*args, **kwargs) 2025-08-14T21:36:51.7750436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 442, in forward 2025-08-14T21:36:51.7750823Z hidden_states = residual + feed_forward_hidden_states 2025-08-14T21:36:51.7750978Z 2025-08-14T21:36:51.7751075Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7751460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7751826Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7752185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7752525Z outputs = block( 2025-08-14T21:36:51.7752824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7753159Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7753503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7753849Z return func(*args, **kwargs) 2025-08-14T21:36:51.7754188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7754572Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7754937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7755290Z return func(*args, **kwargs) 2025-08-14T21:36:51.7755645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:36:51.7756099Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:36:51.7756531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7756907Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7757069Z 2025-08-14T21:36:51.7757152Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7757350Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7757564Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7757755Z cudagraph partition due to non gpu ops 2025-08-14T21:36:51.7757964Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7758348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7758714Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7759060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7759400Z outputs = block( 2025-08-14T21:36:51.7759693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7760023Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7760360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7760705Z return func(*args, **kwargs) 2025-08-14T21:36:51.7761045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7761405Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7761752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7762090Z return func(*args, **kwargs) 2025-08-14T21:36:51.7762429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7762791Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7763197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:36:51.7763637Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:36:51.7763806Z 2025-08-14T21:36:51.7763907Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7764280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7764642Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7764995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7765336Z outputs = block( 2025-08-14T21:36:51.7765625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7765953Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7766296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7766628Z return func(*args, **kwargs) 2025-08-14T21:36:51.7766984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7767347Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7767717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7768063Z return func(*args, **kwargs) 2025-08-14T21:36:51.7768400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:36:51.7768767Z attn_output, attn_weights = attention_interface( 2025-08-14T21:36:51.7769165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:36:51.7769587Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:36:51.7769742Z 2025-08-14T21:36:51.7769834Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7770230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7770588Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7770949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7771291Z outputs = block( 2025-08-14T21:36:51.7771590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7771915Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7772262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7772604Z return func(*args, **kwargs) 2025-08-14T21:36:51.7772936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:36:51.7773306Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:36:51.7773663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7774002Z return func(*args, **kwargs) 2025-08-14T21:36:51.7774338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:36:51.7774699Z attn_output = self.c_proj(attn_output) 2025-08-14T21:36:51.7775030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7775390Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7775556Z 2025-08-14T21:36:51.7775653Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7776037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7776404Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7776755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7777099Z outputs = block( 2025-08-14T21:36:51.7777399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7777733Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7778074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7778417Z return func(*args, **kwargs) 2025-08-14T21:36:51.7778757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7779129Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7779508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:36:51.7779890Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:36:51.7780241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7780602Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7780766Z 2025-08-14T21:36:51.7780875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7781261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7781622Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7781964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7782303Z outputs = block( 2025-08-14T21:36:51.7782591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7782949Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7783298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7783643Z return func(*args, **kwargs) 2025-08-14T21:36:51.7783978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7784359Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7784939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:36:51.7785305Z hidden_states = self.act(hidden_states) 2025-08-14T21:36:51.7785623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:36:51.7786041Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:36:51.7786261Z 2025-08-14T21:36:51.7786363Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7786743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1183, in forward 2025-08-14T21:36:51.7787099Z transformer_outputs = self.transformer( 2025-08-14T21:36:51.7787455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:36:51.7787794Z outputs = block( 2025-08-14T21:36:51.7788082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:36:51.7788415Z return super().__call__(*args, **kwargs) 2025-08-14T21:36:51.7788759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:36:51.7789096Z return func(*args, **kwargs) 2025-08-14T21:36:51.7789428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:36:51.7789807Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:36:51.7790177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:36:51.7790533Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:36:51.7790866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:36:51.7791228Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:36:51.7791384Z 2025-08-14T21:36:51.7791484Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:51.7791852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1207, in forward 2025-08-14T21:36:51.7792241Z logits = self.lm_head(hidden_states[:, slice_indices, :]) 2025-08-14T21:36:51.7792440Z 2025-08-14T21:36:58.8395121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:36:58.8399472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 67, in ForCausalLMLoss 2025-08-14T21:36:58.8403491Z loss = fixed_cross_entropy(logits, shift_labels, num_items_in_batch, ignore_index, **kwargs) 2025-08-14T21:36:58.8405415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 36, in fixed_cross_entropy 2025-08-14T21:36:58.8407755Z loss = nn.functional.cross_entropy(source, target, ignore_index=ignore_index, reduction=reduction) 2025-08-14T21:36:58.8411730Z 2025-08-14T21:36:59.9287055Z Compilation time (from dynamo_timed): 12.332895871 2025-08-14T21:36:59.9423131Z pass 2025-08-14T21:36:59.9428522Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:36:59.9433365Z TIMING: gc:0.00374 entire_frame_compile:12.3329 _recursive_pre_grad_passes:0.00643 _recursive_joint_graph_passes:0.20088 _recursive_post_grad_passes:0.05192 async_compile.wait:1.28871 code_gen:7.58091 inductor_compile:8.19963 backend_compile:9.73045 total_wall_time:12.3329 2025-08-14T21:36:59.9434783Z STATS: call_* op count: 299 | FakeTensorMode.__torch_dispatch__:7245 | FakeTensor.__torch_dispatch__:2465 | ProxyTorchDispatchMode.__torch_dispatch__:2190 2025-08-14T21:36:59.9435271Z Dynamo produced 2 graphs covering 299 ops with 2 graph breaks (1 unique) 2025-08-14T21:37:04.1556709Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:37:04.1557585Z from pkg_resources import resource_filename 2025-08-14T21:37:04.7040454Z 2025-08-14T21:37:04.7054763Z loading model: 0it [00:00, ?it/s]If you want to use `ElectraForCausalLM` as a standalone, add `is_decoder=True.` 2025-08-14T21:37:04.7056725Z WARNING:transformers.models.electra.modeling_electra:If you want to use `ElectraForCausalLM` as a standalone, add `is_decoder=True.` 2025-08-14T21:37:05.0672279Z 2025-08-14T21:37:05.0672843Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:37:05.0689899Z cpu eval ElectraForCausalLM 2025-08-14T21:37:05.2312413Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:05.3131384Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:05.3820920Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:12.6062872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6067238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6069277Z return mod(**inputs) 2025-08-14T21:37:12.6074061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6075929Z outputs = self.electra( 2025-08-14T21:37:12.6079011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 797, in forward 2025-08-14T21:37:12.6083482Z hidden_states = self.embeddings_project(hidden_states) 2025-08-14T21:37:12.6083830Z 2025-08-14T21:37:12.6084091Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6084454Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6084928Z return mod(**inputs) 2025-08-14T21:37:12.6085398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6089517Z outputs = self.electra( 2025-08-14T21:37:12.6091810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6092494Z hidden_states = self.encoder( 2025-08-14T21:37:12.6097265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6098325Z layer_outputs = layer_module( 2025-08-14T21:37:12.6098685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6099035Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6099434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6099825Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6100275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6104411Z return func(*args, **kwargs) 2025-08-14T21:37:12.6109193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6111020Z self_outputs = self.self( 2025-08-14T21:37:12.6116340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6118188Z return func(*args, **kwargs) 2025-08-14T21:37:12.6121172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6121707Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6126253Z 2025-08-14T21:37:12.6130709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6132980Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6136688Z return mod(**inputs) 2025-08-14T21:37:12.6139557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6141191Z outputs = self.electra( 2025-08-14T21:37:12.6141581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6141960Z hidden_states = self.encoder( 2025-08-14T21:37:12.6142322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6148389Z layer_outputs = layer_module( 2025-08-14T21:37:12.6148793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6149145Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6149540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6149939Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6150310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6150661Z return func(*args, **kwargs) 2025-08-14T21:37:12.6152039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6152544Z self_outputs = self.self( 2025-08-14T21:37:12.6152907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6153244Z return func(*args, **kwargs) 2025-08-14T21:37:12.6153662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6154038Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6157303Z 2025-08-14T21:37:12.6161628Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6165924Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6170184Z return mod(**inputs) 2025-08-14T21:37:12.6172730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6173257Z outputs = self.electra( 2025-08-14T21:37:12.6175606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6176052Z hidden_states = self.encoder( 2025-08-14T21:37:12.6180571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6181892Z layer_outputs = layer_module( 2025-08-14T21:37:12.6182282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6182784Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6183174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6183556Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6183923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6184266Z return func(*args, **kwargs) 2025-08-14T21:37:12.6184765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6185222Z self_outputs = self.self( 2025-08-14T21:37:12.6185563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6185919Z return func(*args, **kwargs) 2025-08-14T21:37:12.6186292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6186702Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6186838Z 2025-08-14T21:37:12.6186921Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6187114Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6187337Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6187679Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6187976Z return mod(**inputs) 2025-08-14T21:37:12.6188327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6188695Z outputs = self.electra( 2025-08-14T21:37:12.6189041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6189411Z hidden_states = self.encoder( 2025-08-14T21:37:12.6189774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6190140Z layer_outputs = layer_module( 2025-08-14T21:37:12.6190461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6190827Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6191192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6191566Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6191920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6192255Z return func(*args, **kwargs) 2025-08-14T21:37:12.6192606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6193136Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6193590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6193966Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6194138Z 2025-08-14T21:37:12.6194239Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6194577Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6194886Z return mod(**inputs) 2025-08-14T21:37:12.6195519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6195890Z outputs = self.electra( 2025-08-14T21:37:12.6196236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6196625Z hidden_states = self.encoder( 2025-08-14T21:37:12.6196981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6197339Z layer_outputs = layer_module( 2025-08-14T21:37:12.6197660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6197988Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6198357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6198729Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6199098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6199455Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6199848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6200287Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6200685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6201055Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6201186Z 2025-08-14T21:37:12.6201283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6201611Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6201902Z return mod(**inputs) 2025-08-14T21:37:12.6202247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6202609Z outputs = self.electra( 2025-08-14T21:37:12.6202946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6203309Z hidden_states = self.encoder( 2025-08-14T21:37:12.6203664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6204027Z layer_outputs = layer_module( 2025-08-14T21:37:12.6204337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6204675Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6205041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6205410Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6205768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6206161Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6206573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6207006Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6207426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6207831Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6208186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6208498Z return self.act(input) 2025-08-14T21:37:12.6208616Z 2025-08-14T21:37:12.6208714Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6209050Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6209356Z return mod(**inputs) 2025-08-14T21:37:12.6209716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6210080Z outputs = self.electra( 2025-08-14T21:37:12.6210427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6210783Z hidden_states = self.encoder( 2025-08-14T21:37:12.6211142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6211502Z layer_outputs = layer_module( 2025-08-14T21:37:12.6211818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6212144Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6212512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6212889Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6213251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6213613Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6214005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6214453Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6214864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6215233Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6215365Z 2025-08-14T21:37:12.6215461Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6215795Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6216095Z return mod(**inputs) 2025-08-14T21:37:12.6216443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6216805Z outputs = self.electra( 2025-08-14T21:37:12.6217144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6217505Z hidden_states = self.encoder( 2025-08-14T21:37:12.6217860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6218223Z layer_outputs = layer_module( 2025-08-14T21:37:12.6218534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6218870Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6219241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6219640Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6220038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6220407Z return func(*args, **kwargs) 2025-08-14T21:37:12.6220766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6221120Z self_outputs = self.self( 2025-08-14T21:37:12.6221456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6221801Z return func(*args, **kwargs) 2025-08-14T21:37:12.6222160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6222533Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6222689Z 2025-08-14T21:37:12.6222786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6223125Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6223424Z return mod(**inputs) 2025-08-14T21:37:12.6223776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6224138Z outputs = self.electra( 2025-08-14T21:37:12.6224484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6224919Z hidden_states = self.encoder( 2025-08-14T21:37:12.6225279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6225648Z layer_outputs = layer_module( 2025-08-14T21:37:12.6225964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6226306Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6226681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6227059Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6227410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6227761Z return func(*args, **kwargs) 2025-08-14T21:37:12.6228122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6228487Z self_outputs = self.self( 2025-08-14T21:37:12.6228820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6229167Z return func(*args, **kwargs) 2025-08-14T21:37:12.6229519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6229881Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6230013Z 2025-08-14T21:37:12.6230111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6230444Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6230743Z return mod(**inputs) 2025-08-14T21:37:12.6231083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6231445Z outputs = self.electra( 2025-08-14T21:37:12.6231793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6232150Z hidden_states = self.encoder( 2025-08-14T21:37:12.6232527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6232886Z layer_outputs = layer_module( 2025-08-14T21:37:12.6233222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6233572Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6233937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6234306Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6234654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6234986Z return func(*args, **kwargs) 2025-08-14T21:37:12.6235335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6235716Z self_outputs = self.self( 2025-08-14T21:37:12.6236042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6236394Z return func(*args, **kwargs) 2025-08-14T21:37:12.6236755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6237133Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6237259Z 2025-08-14T21:37:12.6237336Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6237540Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6237762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6238091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6238398Z return mod(**inputs) 2025-08-14T21:37:12.6238749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6239123Z outputs = self.electra( 2025-08-14T21:37:12.6239473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6239842Z hidden_states = self.encoder( 2025-08-14T21:37:12.6240206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6240569Z layer_outputs = layer_module( 2025-08-14T21:37:12.6240894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6241233Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6241603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6241973Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6242334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6242682Z return func(*args, **kwargs) 2025-08-14T21:37:12.6243042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6243458Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6243874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6244251Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6244383Z 2025-08-14T21:37:12.6244481Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6244820Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6245125Z return mod(**inputs) 2025-08-14T21:37:12.6245472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6245877Z outputs = self.electra( 2025-08-14T21:37:12.6246238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6246624Z hidden_states = self.encoder( 2025-08-14T21:37:12.6246970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6247328Z layer_outputs = layer_module( 2025-08-14T21:37:12.6247643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6247973Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6248331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6248726Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6249094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6249458Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6249840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6250272Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6250674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6251036Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6251168Z 2025-08-14T21:37:12.6251263Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6251595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6251898Z return mod(**inputs) 2025-08-14T21:37:12.6252235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6252599Z outputs = self.electra( 2025-08-14T21:37:12.6252949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6253314Z hidden_states = self.encoder( 2025-08-14T21:37:12.6253660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6254019Z layer_outputs = layer_module( 2025-08-14T21:37:12.6254336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6254660Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6255025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6255402Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6255777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6256131Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6256521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6256951Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6257355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6257740Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6258094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6258438Z return self.act(input) 2025-08-14T21:37:12.6258545Z 2025-08-14T21:37:12.6258644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6259004Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6259307Z return mod(**inputs) 2025-08-14T21:37:12.6259668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6260023Z outputs = self.electra( 2025-08-14T21:37:12.6260372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6260732Z hidden_states = self.encoder( 2025-08-14T21:37:12.6261080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6261438Z layer_outputs = layer_module( 2025-08-14T21:37:12.6261756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6262111Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6262472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6262846Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6263219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6263576Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6263958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6264400Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6264887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6265271Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6265408Z 2025-08-14T21:37:12.6265509Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6265847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6266152Z return mod(**inputs) 2025-08-14T21:37:12.6266494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6266859Z outputs = self.electra( 2025-08-14T21:37:12.6267205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6267571Z hidden_states = self.encoder( 2025-08-14T21:37:12.6267921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6268290Z layer_outputs = layer_module( 2025-08-14T21:37:12.6268612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6268940Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6269313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6269686Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6270042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6270383Z return func(*args, **kwargs) 2025-08-14T21:37:12.6270739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6271103Z self_outputs = self.self( 2025-08-14T21:37:12.6271430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6271814Z return func(*args, **kwargs) 2025-08-14T21:37:12.6272182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6272558Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6272686Z 2025-08-14T21:37:12.6272795Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6273129Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6273430Z return mod(**inputs) 2025-08-14T21:37:12.6273768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6274132Z outputs = self.electra( 2025-08-14T21:37:12.6274478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6274856Z hidden_states = self.encoder( 2025-08-14T21:37:12.6275202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6275563Z layer_outputs = layer_module( 2025-08-14T21:37:12.6275883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6276217Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6276576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6276947Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6277300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6277634Z return func(*args, **kwargs) 2025-08-14T21:37:12.6277987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6278351Z self_outputs = self.self( 2025-08-14T21:37:12.6278681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6279014Z return func(*args, **kwargs) 2025-08-14T21:37:12.6279365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6279731Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6279854Z 2025-08-14T21:37:12.6279948Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6280276Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6280577Z return mod(**inputs) 2025-08-14T21:37:12.6280920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6281278Z outputs = self.electra( 2025-08-14T21:37:12.6281621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6281980Z hidden_states = self.encoder( 2025-08-14T21:37:12.6282333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6282685Z layer_outputs = layer_module( 2025-08-14T21:37:12.6282999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6283332Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6283691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6284063Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6284414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6284903Z return func(*args, **kwargs) 2025-08-14T21:37:12.6285289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6285653Z self_outputs = self.self( 2025-08-14T21:37:12.6286011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6286347Z return func(*args, **kwargs) 2025-08-14T21:37:12.6286697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6287067Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6287189Z 2025-08-14T21:37:12.6287272Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6287464Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6287688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6288047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6288341Z return mod(**inputs) 2025-08-14T21:37:12.6288687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6289052Z outputs = self.electra( 2025-08-14T21:37:12.6289395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6289746Z hidden_states = self.encoder( 2025-08-14T21:37:12.6290099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6290457Z layer_outputs = layer_module( 2025-08-14T21:37:12.6290762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6291097Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6291462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6291835Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6292182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6292527Z return func(*args, **kwargs) 2025-08-14T21:37:12.6292875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6293286Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6293689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6294060Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6294189Z 2025-08-14T21:37:12.6294292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6294613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6294910Z return mod(**inputs) 2025-08-14T21:37:12.6295252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6295611Z outputs = self.electra( 2025-08-14T21:37:12.6295943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6296300Z hidden_states = self.encoder( 2025-08-14T21:37:12.6296653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6297008Z layer_outputs = layer_module( 2025-08-14T21:37:12.6297313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6297669Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6298049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6298417Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6298798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6299163Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6299557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6299990Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6300400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6300792Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6300919Z 2025-08-14T21:37:12.6301023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6301350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6301651Z return mod(**inputs) 2025-08-14T21:37:12.6301998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6302354Z outputs = self.electra( 2025-08-14T21:37:12.6302701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6303066Z hidden_states = self.encoder( 2025-08-14T21:37:12.6303420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6303772Z layer_outputs = layer_module( 2025-08-14T21:37:12.6304093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6304427Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6304837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6305222Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6305592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6305958Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6306341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6306776Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6307180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6307582Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6307930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6308247Z return self.act(input) 2025-08-14T21:37:12.6308350Z 2025-08-14T21:37:12.6308457Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6308781Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6309077Z return mod(**inputs) 2025-08-14T21:37:12.6309421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6309783Z outputs = self.electra( 2025-08-14T21:37:12.6310124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6310511Z hidden_states = self.encoder( 2025-08-14T21:37:12.6310866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6311238Z layer_outputs = layer_module( 2025-08-14T21:37:12.6311573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6311910Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6312279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6312651Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6313024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6313390Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6313785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6314269Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6314701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6315092Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6315236Z 2025-08-14T21:37:12.6315342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6315670Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6315976Z return mod(**inputs) 2025-08-14T21:37:12.6316324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6316682Z outputs = self.electra( 2025-08-14T21:37:12.6317030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6317401Z hidden_states = self.encoder( 2025-08-14T21:37:12.6317764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6318123Z layer_outputs = layer_module( 2025-08-14T21:37:12.6318451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6318795Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6319160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6319541Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6319903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6320255Z return func(*args, **kwargs) 2025-08-14T21:37:12.6320611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6320978Z self_outputs = self.self( 2025-08-14T21:37:12.6321324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6321666Z return func(*args, **kwargs) 2025-08-14T21:37:12.6322026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6322405Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6322533Z 2025-08-14T21:37:12.6322640Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6322973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6323282Z return mod(**inputs) 2025-08-14T21:37:12.6323632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6324019Z outputs = self.electra( 2025-08-14T21:37:12.6324378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6324760Z hidden_states = self.encoder( 2025-08-14T21:37:12.6325131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6325494Z layer_outputs = layer_module( 2025-08-14T21:37:12.6325822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6326166Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6326542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6326913Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6327320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6327672Z return func(*args, **kwargs) 2025-08-14T21:37:12.6328027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6328396Z self_outputs = self.self( 2025-08-14T21:37:12.6328734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6329084Z return func(*args, **kwargs) 2025-08-14T21:37:12.6329438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6329816Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6329972Z 2025-08-14T21:37:12.6330079Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6330422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6330721Z return mod(**inputs) 2025-08-14T21:37:12.6331072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6331441Z outputs = self.electra( 2025-08-14T21:37:12.6331785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6332152Z hidden_states = self.encoder( 2025-08-14T21:37:12.6332511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6332878Z layer_outputs = layer_module( 2025-08-14T21:37:12.6333194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6333532Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6333905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6334280Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6334640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6334996Z return func(*args, **kwargs) 2025-08-14T21:37:12.6335355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6335716Z self_outputs = self.self( 2025-08-14T21:37:12.6336052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6336404Z return func(*args, **kwargs) 2025-08-14T21:37:12.6336757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6337156Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6337290Z 2025-08-14T21:37:12.6337366Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6337584Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6337805Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6338160Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6338471Z return mod(**inputs) 2025-08-14T21:37:12.6338821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6339197Z outputs = self.electra( 2025-08-14T21:37:12.6339552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6339925Z hidden_states = self.encoder( 2025-08-14T21:37:12.6340272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6340660Z layer_outputs = layer_module( 2025-08-14T21:37:12.6340978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6341309Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6341670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6342039Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6342386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6342720Z return func(*args, **kwargs) 2025-08-14T21:37:12.6343070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6343479Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6343893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6344260Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6344394Z 2025-08-14T21:37:12.6344491Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6344901Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6345218Z return mod(**inputs) 2025-08-14T21:37:12.6345568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6345942Z outputs = self.electra( 2025-08-14T21:37:12.6346307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6346662Z hidden_states = self.encoder( 2025-08-14T21:37:12.6347027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6347391Z layer_outputs = layer_module( 2025-08-14T21:37:12.6347710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6348040Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6348408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6348784Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6349149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6349511Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6349900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6350360Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6350774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6351148Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6351289Z 2025-08-14T21:37:12.6351395Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6351725Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6352019Z return mod(**inputs) 2025-08-14T21:37:12.6352364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6352728Z outputs = self.electra( 2025-08-14T21:37:12.6353066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6353446Z hidden_states = self.encoder( 2025-08-14T21:37:12.6353798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6354160Z layer_outputs = layer_module( 2025-08-14T21:37:12.6354470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6354805Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6355168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6355529Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6355894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6356252Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6356637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6357066Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6357469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6357870Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6358221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6358528Z return self.act(input) 2025-08-14T21:37:12.6358638Z 2025-08-14T21:37:12.6358733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6359065Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6359356Z return mod(**inputs) 2025-08-14T21:37:12.6359702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6360066Z outputs = self.electra( 2025-08-14T21:37:12.6360408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6360762Z hidden_states = self.encoder( 2025-08-14T21:37:12.6361113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6361470Z layer_outputs = layer_module( 2025-08-14T21:37:12.6361781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6362105Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6362471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6362840Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6363224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6363604Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6364021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6364467Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6364872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6365243Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6365367Z 2025-08-14T21:37:12.6365473Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6365804Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6366098Z return mod(**inputs) 2025-08-14T21:37:12.6366467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6366833Z outputs = self.electra( 2025-08-14T21:37:12.6367173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6367538Z hidden_states = self.encoder( 2025-08-14T21:37:12.6367895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6368254Z layer_outputs = layer_module( 2025-08-14T21:37:12.6368566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6368899Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6369264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6369633Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6369992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6370337Z return func(*args, **kwargs) 2025-08-14T21:37:12.6370697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6371055Z self_outputs = self.self( 2025-08-14T21:37:12.6371386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6371730Z return func(*args, **kwargs) 2025-08-14T21:37:12.6372072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6372443Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6372574Z 2025-08-14T21:37:12.6372675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6373007Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6373300Z return mod(**inputs) 2025-08-14T21:37:12.6373646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6374004Z outputs = self.electra( 2025-08-14T21:37:12.6374347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6374702Z hidden_states = self.encoder( 2025-08-14T21:37:12.6375058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6375421Z layer_outputs = layer_module( 2025-08-14T21:37:12.6375730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6376084Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6376471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6376844Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6377213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6377559Z return func(*args, **kwargs) 2025-08-14T21:37:12.6377909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6378268Z self_outputs = self.self( 2025-08-14T21:37:12.6378594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6378934Z return func(*args, **kwargs) 2025-08-14T21:37:12.6379286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6379667Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6379796Z 2025-08-14T21:37:12.6379893Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6380224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6380522Z return mod(**inputs) 2025-08-14T21:37:12.6380858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6381218Z outputs = self.electra( 2025-08-14T21:37:12.6381562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6381918Z hidden_states = self.encoder( 2025-08-14T21:37:12.6382271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6382638Z layer_outputs = layer_module( 2025-08-14T21:37:12.6382956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6383283Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6383654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6384030Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6384377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6384939Z return func(*args, **kwargs) 2025-08-14T21:37:12.6385308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6385681Z self_outputs = self.self( 2025-08-14T21:37:12.6386026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6386377Z return func(*args, **kwargs) 2025-08-14T21:37:12.6386732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6387113Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6387242Z 2025-08-14T21:37:12.6387319Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6387526Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6387755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6388085Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6388395Z return mod(**inputs) 2025-08-14T21:37:12.6388745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6389114Z outputs = self.electra( 2025-08-14T21:37:12.6389503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6389896Z hidden_states = self.encoder( 2025-08-14T21:37:12.6390288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6390657Z layer_outputs = layer_module( 2025-08-14T21:37:12.6390986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6391336Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6391718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6392093Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6392453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6392834Z return func(*args, **kwargs) 2025-08-14T21:37:12.6393188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6393613Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6394030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6394415Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6394544Z 2025-08-14T21:37:12.6394643Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6394984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6395295Z return mod(**inputs) 2025-08-14T21:37:12.6395646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6396012Z outputs = self.electra( 2025-08-14T21:37:12.6396365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6396737Z hidden_states = self.encoder( 2025-08-14T21:37:12.6397094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6397462Z layer_outputs = layer_module( 2025-08-14T21:37:12.6397784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6398125Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6398489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6398883Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6399252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6399614Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6399996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6400431Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6400833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6401196Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6401329Z 2025-08-14T21:37:12.6401426Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6401755Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6402056Z return mod(**inputs) 2025-08-14T21:37:12.6402395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6402776Z outputs = self.electra( 2025-08-14T21:37:12.6403139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6403495Z hidden_states = self.encoder( 2025-08-14T21:37:12.6403869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6404231Z layer_outputs = layer_module( 2025-08-14T21:37:12.6404552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6404878Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6405246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6405620Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6406011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6406371Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6406767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6407206Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6407604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6408005Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6408357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6408675Z return self.act(input) 2025-08-14T21:37:12.6408778Z 2025-08-14T21:37:12.6408873Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6409210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6409514Z return mod(**inputs) 2025-08-14T21:37:12.6409855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6410221Z outputs = self.electra( 2025-08-14T21:37:12.6410570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6410932Z hidden_states = self.encoder( 2025-08-14T21:37:12.6411283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6411647Z layer_outputs = layer_module( 2025-08-14T21:37:12.6411965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6412302Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6412663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6413038Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6413411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6413767Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6414159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6414606Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6415026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6415395Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6415546Z 2025-08-14T21:37:12.6415642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6416001Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6416305Z return mod(**inputs) 2025-08-14T21:37:12.6416659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6417028Z outputs = self.electra( 2025-08-14T21:37:12.6417372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6417728Z hidden_states = self.encoder( 2025-08-14T21:37:12.6418083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6418444Z layer_outputs = layer_module( 2025-08-14T21:37:12.6418766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6419114Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6419484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6419862Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6420208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6420550Z return func(*args, **kwargs) 2025-08-14T21:37:12.6420903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6421264Z self_outputs = self.self( 2025-08-14T21:37:12.6421589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6421928Z return func(*args, **kwargs) 2025-08-14T21:37:12.6422286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6422656Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6422782Z 2025-08-14T21:37:12.6422877Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6423210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6423509Z return mod(**inputs) 2025-08-14T21:37:12.6423848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6424210Z outputs = self.electra( 2025-08-14T21:37:12.6424550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6424977Z hidden_states = self.encoder( 2025-08-14T21:37:12.6425333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6425707Z layer_outputs = layer_module( 2025-08-14T21:37:12.6426033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6426369Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6426744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6427123Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6427477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6427815Z return func(*args, **kwargs) 2025-08-14T21:37:12.6428169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6428551Z self_outputs = self.self( 2025-08-14T21:37:12.6428884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6429233Z return func(*args, **kwargs) 2025-08-14T21:37:12.6429608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6429981Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6430104Z 2025-08-14T21:37:12.6430200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6430535Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6430837Z return mod(**inputs) 2025-08-14T21:37:12.6431183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6431536Z outputs = self.electra( 2025-08-14T21:37:12.6431881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6432268Z hidden_states = self.encoder( 2025-08-14T21:37:12.6432619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6432982Z layer_outputs = layer_module( 2025-08-14T21:37:12.6433298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6433634Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6433993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6434365Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6434714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6435060Z return func(*args, **kwargs) 2025-08-14T21:37:12.6435407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6435771Z self_outputs = self.self( 2025-08-14T21:37:12.6436105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6436441Z return func(*args, **kwargs) 2025-08-14T21:37:12.6436794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6437165Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6437288Z 2025-08-14T21:37:12.6437370Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6437559Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6437778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6438108Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6438404Z return mod(**inputs) 2025-08-14T21:37:12.6438753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6439114Z outputs = self.electra( 2025-08-14T21:37:12.6439460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6439815Z hidden_states = self.encoder( 2025-08-14T21:37:12.6440166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6440522Z layer_outputs = layer_module( 2025-08-14T21:37:12.6440834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6441168Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6441533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6441932Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6442296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6442658Z return func(*args, **kwargs) 2025-08-14T21:37:12.6443009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6443424Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6443828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6444200Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6444324Z 2025-08-14T21:37:12.6444427Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6444769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6445068Z return mod(**inputs) 2025-08-14T21:37:12.6445413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6445774Z outputs = self.electra( 2025-08-14T21:37:12.6446114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6446476Z hidden_states = self.encoder( 2025-08-14T21:37:12.6446828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6447181Z layer_outputs = layer_module( 2025-08-14T21:37:12.6447498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6447830Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6448199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6448567Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6448940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6449302Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6449696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6450128Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6450532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6450904Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6451028Z 2025-08-14T21:37:12.6451130Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6451466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6451769Z return mod(**inputs) 2025-08-14T21:37:12.6452123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6452477Z outputs = self.electra( 2025-08-14T21:37:12.6452824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6453183Z hidden_states = self.encoder( 2025-08-14T21:37:12.6453539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6453891Z layer_outputs = layer_module( 2025-08-14T21:37:12.6454208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6454586Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6454963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6455343Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6455730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6456095Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6456480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6456910Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6457314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6457712Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6458077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6458394Z return self.act(input) 2025-08-14T21:37:12.6458496Z 2025-08-14T21:37:12.6458599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6458923Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6459227Z return mod(**inputs) 2025-08-14T21:37:12.6459569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6459930Z outputs = self.electra( 2025-08-14T21:37:12.6460269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6460632Z hidden_states = self.encoder( 2025-08-14T21:37:12.6460987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6461347Z layer_outputs = layer_module( 2025-08-14T21:37:12.6461662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6461997Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6462362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6462728Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6463092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6463452Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6463833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6464282Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6464700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6465146Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6465279Z 2025-08-14T21:37:12.6465381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6465720Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6466028Z return mod(**inputs) 2025-08-14T21:37:12.6466380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6466738Z outputs = self.electra( 2025-08-14T21:37:12.6467094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6467497Z hidden_states = self.encoder( 2025-08-14T21:37:12.6467851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6468232Z layer_outputs = layer_module( 2025-08-14T21:37:12.6468575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6468919Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6469289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6469672Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6470034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6470387Z return func(*args, **kwargs) 2025-08-14T21:37:12.6470744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6471289Z self_outputs = self.self( 2025-08-14T21:37:12.6471626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6471961Z return func(*args, **kwargs) 2025-08-14T21:37:12.6472319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6472690Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6472817Z 2025-08-14T21:37:12.6472924Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6473254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6473555Z return mod(**inputs) 2025-08-14T21:37:12.6473897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6474251Z outputs = self.electra( 2025-08-14T21:37:12.6474598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6474958Z hidden_states = self.encoder( 2025-08-14T21:37:12.6475309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6475661Z layer_outputs = layer_module( 2025-08-14T21:37:12.6475975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6476310Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6476671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6477036Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6477384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6477726Z return func(*args, **kwargs) 2025-08-14T21:37:12.6478070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6478430Z self_outputs = self.self( 2025-08-14T21:37:12.6478761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6479101Z return func(*args, **kwargs) 2025-08-14T21:37:12.6479442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6479809Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6479932Z 2025-08-14T21:37:12.6480037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6480359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6480678Z return mod(**inputs) 2025-08-14T21:37:12.6481023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6481402Z outputs = self.electra( 2025-08-14T21:37:12.6481754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6482117Z hidden_states = self.encoder( 2025-08-14T21:37:12.6482469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6482827Z layer_outputs = layer_module( 2025-08-14T21:37:12.6483137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6483471Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6483837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6484223Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6484674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6485047Z return func(*args, **kwargs) 2025-08-14T21:37:12.6485415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6485797Z self_outputs = self.self( 2025-08-14T21:37:12.6486132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6486479Z return func(*args, **kwargs) 2025-08-14T21:37:12.6486823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6487202Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6487340Z 2025-08-14T21:37:12.6487417Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6487620Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6487832Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6488164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6488466Z return mod(**inputs) 2025-08-14T21:37:12.6488803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6489167Z outputs = self.electra( 2025-08-14T21:37:12.6489511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6489870Z hidden_states = self.encoder( 2025-08-14T21:37:12.6490220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6490587Z layer_outputs = layer_module( 2025-08-14T21:37:12.6490907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6491235Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6491602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6491972Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6492320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6492652Z return func(*args, **kwargs) 2025-08-14T21:37:12.6492999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6493409Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6493817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6494220Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6494353Z 2025-08-14T21:37:12.6494471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6494828Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6495125Z return mod(**inputs) 2025-08-14T21:37:12.6495475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6495841Z outputs = self.electra( 2025-08-14T21:37:12.6496189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6496546Z hidden_states = self.encoder( 2025-08-14T21:37:12.6496899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6497285Z layer_outputs = layer_module( 2025-08-14T21:37:12.6497596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6497930Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6498299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6498672Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6499033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6499396Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6499785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6500220Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6500622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6500992Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6501118Z 2025-08-14T21:37:12.6501222Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6501555Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6501851Z return mod(**inputs) 2025-08-14T21:37:12.6502190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6502548Z outputs = self.electra( 2025-08-14T21:37:12.6502885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6503244Z hidden_states = self.encoder( 2025-08-14T21:37:12.6503598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6503962Z layer_outputs = layer_module( 2025-08-14T21:37:12.6504271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6504601Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6505024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6505401Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6505774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6506137Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6506532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6506988Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6507418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6507821Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6508190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6508503Z return self.act(input) 2025-08-14T21:37:12.6508614Z 2025-08-14T21:37:12.6508709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6509042Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6509331Z return mod(**inputs) 2025-08-14T21:37:12.6509675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6510036Z outputs = self.electra( 2025-08-14T21:37:12.6510408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6510764Z hidden_states = self.encoder( 2025-08-14T21:37:12.6511117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6511478Z layer_outputs = layer_module( 2025-08-14T21:37:12.6511787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6512115Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6512479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6512850Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6513209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6513573Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6513963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6514411Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6514818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6515191Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6515319Z 2025-08-14T21:37:12.6515425Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6515747Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6516045Z return mod(**inputs) 2025-08-14T21:37:12.6516389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6516755Z outputs = self.electra( 2025-08-14T21:37:12.6517096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6517456Z hidden_states = self.encoder( 2025-08-14T21:37:12.6517813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6518170Z layer_outputs = layer_module( 2025-08-14T21:37:12.6518480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6518811Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6519175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6519539Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6519913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6520254Z return func(*args, **kwargs) 2025-08-14T21:37:12.6520619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6520992Z self_outputs = self.self( 2025-08-14T21:37:12.6521223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6521287Z return func(*args, **kwargs) 2025-08-14T21:37:12.6521526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6521608Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6521612Z 2025-08-14T21:37:12.6521710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6521899Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6521980Z return mod(**inputs) 2025-08-14T21:37:12.6522224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6522294Z outputs = self.electra( 2025-08-14T21:37:12.6522536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6522601Z hidden_states = self.encoder( 2025-08-14T21:37:12.6522848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6522912Z layer_outputs = layer_module( 2025-08-14T21:37:12.6523125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6523198Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6523441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6523526Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6523746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6523819Z return func(*args, **kwargs) 2025-08-14T21:37:12.6524057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6524123Z self_outputs = self.self( 2025-08-14T21:37:12.6524350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6524413Z return func(*args, **kwargs) 2025-08-14T21:37:12.6524650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6524732Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6524736Z 2025-08-14T21:37:12.6524832Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6525025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6525084Z return mod(**inputs) 2025-08-14T21:37:12.6525327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6525397Z outputs = self.electra( 2025-08-14T21:37:12.6525638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6525710Z hidden_states = self.encoder( 2025-08-14T21:37:12.6525945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6526009Z layer_outputs = layer_module( 2025-08-14T21:37:12.6526234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6526306Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6526555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6526653Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6526874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6526942Z return func(*args, **kwargs) 2025-08-14T21:37:12.6527178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6527242Z self_outputs = self.self( 2025-08-14T21:37:12.6527467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6527548Z return func(*args, **kwargs) 2025-08-14T21:37:12.6527796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6527877Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6527880Z 2025-08-14T21:37:12.6527956Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6528035Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6528132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6528318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6528386Z return mod(**inputs) 2025-08-14T21:37:12.6528635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6528697Z outputs = self.electra( 2025-08-14T21:37:12.6528948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6529016Z hidden_states = self.encoder( 2025-08-14T21:37:12.6529267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6529333Z layer_outputs = layer_module( 2025-08-14T21:37:12.6529540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6529619Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6529864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6529945Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6530169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6530231Z return func(*args, **kwargs) 2025-08-14T21:37:12.6530486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6530605Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6530850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6530936Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6530939Z 2025-08-14T21:37:12.6531035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6531227Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6531287Z return mod(**inputs) 2025-08-14T21:37:12.6531534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6531607Z outputs = self.electra( 2025-08-14T21:37:12.6531860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6531934Z hidden_states = self.encoder( 2025-08-14T21:37:12.6532185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6532268Z layer_outputs = layer_module( 2025-08-14T21:37:12.6532479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6532550Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6532786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6532871Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6533104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6533200Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6533467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6533575Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6533821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6533894Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6533898Z 2025-08-14T21:37:12.6533998Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6534179Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6534239Z return mod(**inputs) 2025-08-14T21:37:12.6534484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6534549Z outputs = self.electra( 2025-08-14T21:37:12.6534783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6534855Z hidden_states = self.encoder( 2025-08-14T21:37:12.6535096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6535167Z layer_outputs = layer_module( 2025-08-14T21:37:12.6535367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6535436Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6535681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6535756Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6535995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6536067Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6536332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6536448Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6536684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6536784Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6536985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6537048Z return self.act(input) 2025-08-14T21:37:12.6537052Z 2025-08-14T21:37:12.6537151Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6537332Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6537410Z return mod(**inputs) 2025-08-14T21:37:12.6537674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6537738Z outputs = self.electra( 2025-08-14T21:37:12.6537998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6538072Z hidden_states = self.encoder( 2025-08-14T21:37:12.6538315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6538385Z layer_outputs = layer_module( 2025-08-14T21:37:12.6538590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6538662Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6538911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6539002Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6539244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6539314Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6539581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6539708Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6539946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6540019Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6540030Z 2025-08-14T21:37:12.6540121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6540305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6540370Z return mod(**inputs) 2025-08-14T21:37:12.6540613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6540675Z outputs = self.electra( 2025-08-14T21:37:12.6540920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6540984Z hidden_states = self.encoder( 2025-08-14T21:37:12.6541228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6541291Z layer_outputs = layer_module( 2025-08-14T21:37:12.6541490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6541570Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6541808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6541882Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6542111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6542173Z return func(*args, **kwargs) 2025-08-14T21:37:12.6542420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6542485Z self_outputs = self.self( 2025-08-14T21:37:12.6542703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6542773Z return func(*args, **kwargs) 2025-08-14T21:37:12.6543010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6543108Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6543112Z 2025-08-14T21:37:12.6543218Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6543401Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6543483Z return mod(**inputs) 2025-08-14T21:37:12.6543724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6543785Z outputs = self.electra( 2025-08-14T21:37:12.6544026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6544088Z hidden_states = self.encoder( 2025-08-14T21:37:12.6544333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6544422Z layer_outputs = layer_module( 2025-08-14T21:37:12.6544625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6544704Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6545015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6545094Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6545325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6545388Z return func(*args, **kwargs) 2025-08-14T21:37:12.6545635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6545699Z self_outputs = self.self( 2025-08-14T21:37:12.6545921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6545996Z return func(*args, **kwargs) 2025-08-14T21:37:12.6546235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6546315Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6546320Z 2025-08-14T21:37:12.6546415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6546596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6546667Z return mod(**inputs) 2025-08-14T21:37:12.6546910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6546973Z outputs = self.electra( 2025-08-14T21:37:12.6547222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6547291Z hidden_states = self.encoder( 2025-08-14T21:37:12.6547540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6547606Z layer_outputs = layer_module( 2025-08-14T21:37:12.6547811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6547894Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6548131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6548203Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6548431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6548491Z return func(*args, **kwargs) 2025-08-14T21:37:12.6548737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6548820Z self_outputs = self.self( 2025-08-14T21:37:12.6549052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6549122Z return func(*args, **kwargs) 2025-08-14T21:37:12.6549381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6549463Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6549467Z 2025-08-14T21:37:12.6549539Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6549610Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6549711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6549894Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6549953Z return mod(**inputs) 2025-08-14T21:37:12.6550222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6550287Z outputs = self.electra( 2025-08-14T21:37:12.6550529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6550595Z hidden_states = self.encoder( 2025-08-14T21:37:12.6550832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6550902Z layer_outputs = layer_module( 2025-08-14T21:37:12.6551103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6551174Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6551417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6551492Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6551720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6551782Z return func(*args, **kwargs) 2025-08-14T21:37:12.6552022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6552146Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6552385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6552465Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6552469Z 2025-08-14T21:37:12.6552561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6552740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6552808Z return mod(**inputs) 2025-08-14T21:37:12.6553050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6553110Z outputs = self.electra( 2025-08-14T21:37:12.6553355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6553416Z hidden_states = self.encoder( 2025-08-14T21:37:12.6553663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6553727Z layer_outputs = layer_module( 2025-08-14T21:37:12.6553927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6554002Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6554238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6554339Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6554586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6554657Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6554951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6555061Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6555299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6555379Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6555382Z 2025-08-14T21:37:12.6555474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6555661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6555740Z return mod(**inputs) 2025-08-14T21:37:12.6555991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6556060Z outputs = self.electra( 2025-08-14T21:37:12.6556305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6556375Z hidden_states = self.encoder( 2025-08-14T21:37:12.6556618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6556680Z layer_outputs = layer_module( 2025-08-14T21:37:12.6556894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6556965Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6557211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6557296Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6557537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6557615Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6557890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6558000Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6558250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6558352Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6558558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6558624Z return self.act(input) 2025-08-14T21:37:12.6558628Z 2025-08-14T21:37:12.6558724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6558915Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6558975Z return mod(**inputs) 2025-08-14T21:37:12.6559226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6559296Z outputs = self.electra( 2025-08-14T21:37:12.6559543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6559615Z hidden_states = self.encoder( 2025-08-14T21:37:12.6559859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6559942Z layer_outputs = layer_module( 2025-08-14T21:37:12.6560156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6560252Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6560507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6560593Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6560829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6560903Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6561170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6561288Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6561551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6561626Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6561629Z 2025-08-14T21:37:12.6561731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6561913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6561971Z return mod(**inputs) 2025-08-14T21:37:12.6562216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6562278Z outputs = self.electra( 2025-08-14T21:37:12.6562514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6562585Z hidden_states = self.encoder( 2025-08-14T21:37:12.6562822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6562893Z layer_outputs = layer_module( 2025-08-14T21:37:12.6563094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6563164Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6563408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6563483Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6563709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6563772Z return func(*args, **kwargs) 2025-08-14T21:37:12.6564009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6564081Z self_outputs = self.self( 2025-08-14T21:37:12.6564303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6564367Z return func(*args, **kwargs) 2025-08-14T21:37:12.6564610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6564685Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6564689Z 2025-08-14T21:37:12.6564788Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6564967Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6565027Z return mod(**inputs) 2025-08-14T21:37:12.6565275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6565336Z outputs = self.electra( 2025-08-14T21:37:12.6565577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6565660Z hidden_states = self.encoder( 2025-08-14T21:37:12.6565915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6565986Z layer_outputs = layer_module( 2025-08-14T21:37:12.6566205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6566279Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6566527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6566600Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6566827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6566890Z return func(*args, **kwargs) 2025-08-14T21:37:12.6567150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6567223Z self_outputs = self.self( 2025-08-14T21:37:12.6567442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6567505Z return func(*args, **kwargs) 2025-08-14T21:37:12.6567752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6567823Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6567826Z 2025-08-14T21:37:12.6567927Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6568110Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6568169Z return mod(**inputs) 2025-08-14T21:37:12.6568416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6568478Z outputs = self.electra( 2025-08-14T21:37:12.6568725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6568789Z hidden_states = self.encoder( 2025-08-14T21:37:12.6569029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6569098Z layer_outputs = layer_module( 2025-08-14T21:37:12.6569300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6569370Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6569616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6569690Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6569920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6569984Z return func(*args, **kwargs) 2025-08-14T21:37:12.6570225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6570295Z self_outputs = self.self( 2025-08-14T21:37:12.6570515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6570576Z return func(*args, **kwargs) 2025-08-14T21:37:12.6570824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6570898Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6570901Z 2025-08-14T21:37:12.6570981Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6571072Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6571167Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6571370Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6571431Z return mod(**inputs) 2025-08-14T21:37:12.6571701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6571765Z outputs = self.electra( 2025-08-14T21:37:12.6572005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6572078Z hidden_states = self.encoder( 2025-08-14T21:37:12.6572316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6572380Z layer_outputs = layer_module( 2025-08-14T21:37:12.6572587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6572676Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6572926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6573002Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6573223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6573295Z return func(*args, **kwargs) 2025-08-14T21:37:12.6573532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6573651Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6573913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6573992Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6573996Z 2025-08-14T21:37:12.6574099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6574283Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6574342Z return mod(**inputs) 2025-08-14T21:37:12.6574594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6574655Z outputs = self.electra( 2025-08-14T21:37:12.6574900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6574961Z hidden_states = self.encoder( 2025-08-14T21:37:12.6575200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6575270Z layer_outputs = layer_module( 2025-08-14T21:37:12.6575478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6575549Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6575797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6575874Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6576118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6576187Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6576457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6576574Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6576812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6576911Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6576914Z 2025-08-14T21:37:12.6577023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6577224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6577293Z return mod(**inputs) 2025-08-14T21:37:12.6577535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6577597Z outputs = self.electra( 2025-08-14T21:37:12.6577843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6577905Z hidden_states = self.encoder( 2025-08-14T21:37:12.6578149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6578231Z layer_outputs = layer_module( 2025-08-14T21:37:12.6578435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6578515Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6578754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6578836Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6579072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6579141Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6579416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6579524Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6579763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6579873Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6580067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6580139Z return self.act(input) 2025-08-14T21:37:12.6580142Z 2025-08-14T21:37:12.6580236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6580418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6580484Z return mod(**inputs) 2025-08-14T21:37:12.6580728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6580796Z outputs = self.electra( 2025-08-14T21:37:12.6581033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6581098Z hidden_states = self.encoder( 2025-08-14T21:37:12.6581344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6581408Z layer_outputs = layer_module( 2025-08-14T21:37:12.6581609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6581688Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6581924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6582008Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6582241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6582310Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6582616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6582747Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6583013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6583088Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6583091Z 2025-08-14T21:37:12.6583186Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6583376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6583435Z return mod(**inputs) 2025-08-14T21:37:12.6583676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6583742Z outputs = self.electra( 2025-08-14T21:37:12.6583996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6584068Z hidden_states = self.encoder( 2025-08-14T21:37:12.6584306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6584368Z layer_outputs = layer_module( 2025-08-14T21:37:12.6584718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6584842Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6585100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6585186Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6585415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6585493Z return func(*args, **kwargs) 2025-08-14T21:37:12.6585740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6585807Z self_outputs = self.self( 2025-08-14T21:37:12.6586071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6586136Z return func(*args, **kwargs) 2025-08-14T21:37:12.6586386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6586459Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6586463Z 2025-08-14T21:37:12.6586558Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6586746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6586807Z return mod(**inputs) 2025-08-14T21:37:12.6587054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6587127Z outputs = self.electra( 2025-08-14T21:37:12.6587367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6587440Z hidden_states = self.encoder( 2025-08-14T21:37:12.6587681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6587745Z layer_outputs = layer_module( 2025-08-14T21:37:12.6587958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6588029Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6588276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6588385Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6588629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6588702Z return func(*args, **kwargs) 2025-08-14T21:37:12.6588965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6589031Z self_outputs = self.self( 2025-08-14T21:37:12.6589259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6589320Z return func(*args, **kwargs) 2025-08-14T21:37:12.6589565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6589635Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6589639Z 2025-08-14T21:37:12.6589761Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6589954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6590015Z return mod(**inputs) 2025-08-14T21:37:12.6590262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6590331Z outputs = self.electra( 2025-08-14T21:37:12.6590571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6590642Z hidden_states = self.encoder( 2025-08-14T21:37:12.6590882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6590945Z layer_outputs = layer_module( 2025-08-14T21:37:12.6591156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6591231Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6591479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6591556Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6591779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6591849Z return func(*args, **kwargs) 2025-08-14T21:37:12.6592089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6592151Z self_outputs = self.self( 2025-08-14T21:37:12.6592381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6592444Z return func(*args, **kwargs) 2025-08-14T21:37:12.6592690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6592766Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6592770Z 2025-08-14T21:37:12.6592843Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6592924Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6593019Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6593205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6593272Z return mod(**inputs) 2025-08-14T21:37:12.6593516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6593585Z outputs = self.electra( 2025-08-14T21:37:12.6593824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6593906Z hidden_states = self.encoder( 2025-08-14T21:37:12.6594156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6594236Z layer_outputs = layer_module( 2025-08-14T21:37:12.6594458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6594532Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6594770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6594850Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6595071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6595133Z return func(*args, **kwargs) 2025-08-14T21:37:12.6595381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6595515Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6595763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6595838Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6595843Z 2025-08-14T21:37:12.6595937Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6596128Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6596191Z return mod(**inputs) 2025-08-14T21:37:12.6596442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6596506Z outputs = self.electra( 2025-08-14T21:37:12.6596745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6596821Z hidden_states = self.encoder( 2025-08-14T21:37:12.6597062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6597128Z layer_outputs = layer_module( 2025-08-14T21:37:12.6597339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6597411Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6597660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6597739Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6597978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6598057Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6598329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6598445Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6598693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6598772Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6598775Z 2025-08-14T21:37:12.6598879Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6599064Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6599126Z return mod(**inputs) 2025-08-14T21:37:12.6599379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6599442Z outputs = self.electra( 2025-08-14T21:37:12.6599688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6599771Z hidden_states = self.encoder( 2025-08-14T21:37:12.6600025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6600111Z layer_outputs = layer_module( 2025-08-14T21:37:12.6600314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6600386Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6600631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6600705Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6600948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6601042Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6601312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6601428Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6601666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6601775Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6601968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6602032Z return self.act(input) 2025-08-14T21:37:12.6602035Z 2025-08-14T21:37:12.6602135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6602317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6602377Z return mod(**inputs) 2025-08-14T21:37:12.6602628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6602691Z outputs = self.electra( 2025-08-14T21:37:12.6602939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6603004Z hidden_states = self.encoder( 2025-08-14T21:37:12.6603242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6603311Z layer_outputs = layer_module( 2025-08-14T21:37:12.6603509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6603585Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6603820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6603898Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6604139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6604207Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6604472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6604599Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6604835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6604914Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6604917Z 2025-08-14T21:37:12.6605011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6605194Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6605277Z return mod(**inputs) 2025-08-14T21:37:12.6605536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6605607Z outputs = self.electra( 2025-08-14T21:37:12.6605861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6605927Z hidden_states = self.encoder( 2025-08-14T21:37:12.6606172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6606236Z layer_outputs = layer_module( 2025-08-14T21:37:12.6606436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6606515Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6606752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6606852Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6607076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6607140Z return func(*args, **kwargs) 2025-08-14T21:37:12.6607386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6607449Z self_outputs = self.self( 2025-08-14T21:37:12.6607671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6607740Z return func(*args, **kwargs) 2025-08-14T21:37:12.6607978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:12.6608060Z query_layer = self.query(hidden_states) 2025-08-14T21:37:12.6608065Z 2025-08-14T21:37:12.6608160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6608345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6608412Z return mod(**inputs) 2025-08-14T21:37:12.6608654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6608724Z outputs = self.electra( 2025-08-14T21:37:12.6608964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6609026Z hidden_states = self.encoder( 2025-08-14T21:37:12.6609271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6609334Z layer_outputs = layer_module( 2025-08-14T21:37:12.6609538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6609617Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6609859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6609944Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6610165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6610227Z return func(*args, **kwargs) 2025-08-14T21:37:12.6610471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6610534Z self_outputs = self.self( 2025-08-14T21:37:12.6610759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6610840Z return func(*args, **kwargs) 2025-08-14T21:37:12.6611077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:12.6611167Z key_layer = self.key(current_states) 2025-08-14T21:37:12.6611170Z 2025-08-14T21:37:12.6611266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6611463Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6611532Z return mod(**inputs) 2025-08-14T21:37:12.6611774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6611843Z outputs = self.electra( 2025-08-14T21:37:12.6612081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6612144Z hidden_states = self.encoder( 2025-08-14T21:37:12.6612409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6612472Z layer_outputs = layer_module( 2025-08-14T21:37:12.6612673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6612753Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6612989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6613069Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6613287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6613348Z return func(*args, **kwargs) 2025-08-14T21:37:12.6613592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:12.6613659Z self_outputs = self.self( 2025-08-14T21:37:12.6613884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6613949Z return func(*args, **kwargs) 2025-08-14T21:37:12.6614189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:12.6614271Z value_layer = self.value(current_states) 2025-08-14T21:37:12.6614274Z 2025-08-14T21:37:12.6614346Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6614417Z cudagraph partition due to non gpu ops 2025-08-14T21:37:12.6614518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6614699Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6614766Z return mod(**inputs) 2025-08-14T21:37:12.6615007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6615071Z outputs = self.electra( 2025-08-14T21:37:12.6615315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6615379Z hidden_states = self.encoder( 2025-08-14T21:37:12.6615616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6615689Z layer_outputs = layer_module( 2025-08-14T21:37:12.6615890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6615968Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6616204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:12.6616278Z self_attention_outputs = self.attention( 2025-08-14T21:37:12.6616524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:12.6616588Z return func(*args, **kwargs) 2025-08-14T21:37:12.6616839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:12.6616981Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:12.6617222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:12.6617303Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6617307Z 2025-08-14T21:37:12.6617399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6617581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6617648Z return mod(**inputs) 2025-08-14T21:37:12.6617891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6617977Z outputs = self.electra( 2025-08-14T21:37:12.6618218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6618282Z hidden_states = self.encoder( 2025-08-14T21:37:12.6618531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6618593Z layer_outputs = layer_module( 2025-08-14T21:37:12.6618798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6618876Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6619117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6619201Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6619445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6619515Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6619796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6619903Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6620150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:12.6620224Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6620227Z 2025-08-14T21:37:12.6620320Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6620511Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6620575Z return mod(**inputs) 2025-08-14T21:37:12.6620818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6620888Z outputs = self.electra( 2025-08-14T21:37:12.6621130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6621200Z hidden_states = self.encoder( 2025-08-14T21:37:12.6621443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6621507Z layer_outputs = layer_module( 2025-08-14T21:37:12.6621720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6621792Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6622039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6622131Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6622383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6622462Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6622751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:12.6622860Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:12.6623104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:12.6623205Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:12.6623407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:12.6623489Z return self.act(input) 2025-08-14T21:37:12.6623492Z 2025-08-14T21:37:12.6623584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6623776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6623835Z return mod(**inputs) 2025-08-14T21:37:12.6624083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1543, in forward 2025-08-14T21:37:12.6624144Z outputs = self.electra( 2025-08-14T21:37:12.6624379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:12.6624449Z hidden_states = self.encoder( 2025-08-14T21:37:12.6624684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:12.6624746Z layer_outputs = layer_module( 2025-08-14T21:37:12.6625022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:12.6625102Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:12.6625354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:12.6625430Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:12.6625665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:12.6625742Z return forward_fn(*input_tensors) 2025-08-14T21:37:12.6626015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:12.6626144Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:12.6626388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:12.6626465Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:12.6626469Z 2025-08-14T21:37:12.6626573Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6626757Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6626820Z return mod(**inputs) 2025-08-14T21:37:12.6627071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1560, in forward 2025-08-14T21:37:12.6627241Z prediction_scores = self.generator_lm_head(self.generator_predictions(sequence_output)) 2025-08-14T21:37:12.6627492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 640, in forward 2025-08-14T21:37:12.6627588Z hidden_states = self.dense(generator_hidden_states) 2025-08-14T21:37:12.6627592Z 2025-08-14T21:37:12.6627686Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6627903Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6627965Z return mod(**inputs) 2025-08-14T21:37:12.6628231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1560, in forward 2025-08-14T21:37:12.6628421Z prediction_scores = self.generator_lm_head(self.generator_predictions(sequence_output)) 2025-08-14T21:37:12.6628425Z 2025-08-14T21:37:12.6628520Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:12.6628712Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:12.6628773Z return mod(**inputs) 2025-08-14T21:37:12.6629020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1564, in forward 2025-08-14T21:37:12.6629087Z lm_loss = self.loss_function( 2025-08-14T21:37:12.6629331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 67, in ForCausalLMLoss 2025-08-14T21:37:12.6629503Z loss = fixed_cross_entropy(logits, shift_labels, num_items_in_batch, ignore_index, **kwargs) 2025-08-14T21:37:12.6629735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 36, in fixed_cross_entropy 2025-08-14T21:37:12.6629915Z loss = nn.functional.cross_entropy(source, target, ignore_index=ignore_index, reduction=reduction) 2025-08-14T21:37:12.6629927Z 2025-08-14T21:37:19.8885850Z Compilation time (from dynamo_timed): 13.580757711 2025-08-14T21:37:19.8964948Z pass 2025-08-14T21:37:19.8968076Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:19.8972181Z TIMING: _recursive_pre_grad_passes:0.00625 _recursive_joint_graph_passes:0.40651 _recursive_post_grad_passes:0.07084 async_compile.wait:0.69004 code_gen:6.7081 inductor_compile:7.78699 backend_compile:11.00871 gc:0.00011 entire_frame_compile:13.58076 total_wall_time:13.58076 2025-08-14T21:37:19.8973236Z STATS: call_* op count: 377 | FakeTensorMode.__torch_dispatch__:15041 | FakeTensor.__torch_dispatch__:4687 | ProxyTorchDispatchMode.__torch_dispatch__:5671 2025-08-14T21:37:19.8973701Z Dynamo produced 1 graphs covering 377 ops with 0 graph breaks (0 unique) 2025-08-14T21:37:24.0467344Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:37:24.0468354Z from pkg_resources import resource_filename 2025-08-14T21:37:24.6618990Z 2025-08-14T21:37:24.9865930Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:37:24.9870176Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:37:24.9881576Z cpu eval ElectraForQuestionAnswering 2025-08-14T21:37:25.0982060Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:25.1544437Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:25.2122085Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:32.3967131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.3971947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.3977050Z return mod(**inputs) 2025-08-14T21:37:32.3979230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.3979669Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.3980244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 797, in forward 2025-08-14T21:37:32.3981111Z hidden_states = self.embeddings_project(hidden_states) 2025-08-14T21:37:32.3981703Z 2025-08-14T21:37:32.3982277Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.3982830Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.3983216Z return mod(**inputs) 2025-08-14T21:37:32.3983609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.3984016Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.3984410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.3985066Z hidden_states = self.encoder( 2025-08-14T21:37:32.3985447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.3985939Z layer_outputs = layer_module( 2025-08-14T21:37:32.3986361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.3986703Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.3987175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.3987609Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.3987972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.3988331Z return func(*args, **kwargs) 2025-08-14T21:37:32.3988700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.3989072Z self_outputs = self.self( 2025-08-14T21:37:32.3989411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.3989763Z return func(*args, **kwargs) 2025-08-14T21:37:32.3990124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.3990604Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.3990748Z 2025-08-14T21:37:32.3990853Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.3991282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.3991595Z return mod(**inputs) 2025-08-14T21:37:32.3991944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.3992339Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.3992721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.3993099Z hidden_states = self.encoder( 2025-08-14T21:37:32.3993456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.3993825Z layer_outputs = layer_module( 2025-08-14T21:37:32.3994150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.3994485Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.3994860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.3995237Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.3995593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.3995934Z return func(*args, **kwargs) 2025-08-14T21:37:32.3996293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.3996803Z self_outputs = self.self( 2025-08-14T21:37:32.3997180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.3997652Z return func(*args, **kwargs) 2025-08-14T21:37:32.3998012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.3998392Z key_layer = self.key(current_states) 2025-08-14T21:37:32.3998519Z 2025-08-14T21:37:32.3998619Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.3998972Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.3999286Z return mod(**inputs) 2025-08-14T21:37:32.3999642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4000046Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4000586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4001054Z hidden_states = self.encoder( 2025-08-14T21:37:32.4001413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4001885Z layer_outputs = layer_module( 2025-08-14T21:37:32.4002202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4002530Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4002893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4003263Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4003615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4003946Z return func(*args, **kwargs) 2025-08-14T21:37:32.4004394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4004759Z self_outputs = self.self( 2025-08-14T21:37:32.4005100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4005440Z return func(*args, **kwargs) 2025-08-14T21:37:32.4005789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4006166Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4006303Z 2025-08-14T21:37:32.4006386Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4006577Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4006800Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4007134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4007428Z return mod(**inputs) 2025-08-14T21:37:32.4007772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4008151Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4008610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4008965Z hidden_states = self.encoder( 2025-08-14T21:37:32.4009406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4009764Z layer_outputs = layer_module( 2025-08-14T21:37:32.4010075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4010437Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4010815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4011184Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4011542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4011884Z return func(*args, **kwargs) 2025-08-14T21:37:32.4012264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4012945Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4013429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4013829Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4013958Z 2025-08-14T21:37:32.4014063Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4014391Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4014693Z return mod(**inputs) 2025-08-14T21:37:32.4015040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4015421Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4015791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4016150Z hidden_states = self.encoder( 2025-08-14T21:37:32.4016505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4016867Z layer_outputs = layer_module( 2025-08-14T21:37:32.4017185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4017521Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4017892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4018265Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4018639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4019004Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4019400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4019829Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4020267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4020637Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4020773Z 2025-08-14T21:37:32.4020871Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4021207Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4021509Z return mod(**inputs) 2025-08-14T21:37:32.4021849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4022232Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4022606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4022959Z hidden_states = self.encoder( 2025-08-14T21:37:32.4023314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4023693Z layer_outputs = layer_module( 2025-08-14T21:37:32.4024028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4024354Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4024743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4025210Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4025594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4025960Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4026368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4026804Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4027237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4027638Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4027991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4028306Z return self.act(input) 2025-08-14T21:37:32.4028409Z 2025-08-14T21:37:32.4028504Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4028836Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4029132Z return mod(**inputs) 2025-08-14T21:37:32.4029472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4029843Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4030216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4030578Z hidden_states = self.encoder( 2025-08-14T21:37:32.4030922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4031285Z layer_outputs = layer_module( 2025-08-14T21:37:32.4031602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4031934Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4032292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4032661Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4033028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4033381Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4033773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4034211Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4034625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4034987Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4035119Z 2025-08-14T21:37:32.4035213Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4035542Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4035840Z return mod(**inputs) 2025-08-14T21:37:32.4036173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4036570Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4037008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4037362Z hidden_states = self.encoder( 2025-08-14T21:37:32.4037727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4038086Z layer_outputs = layer_module( 2025-08-14T21:37:32.4038399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4038719Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4039107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4039473Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4039820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4040174Z return func(*args, **kwargs) 2025-08-14T21:37:32.4040530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4040890Z self_outputs = self.self( 2025-08-14T21:37:32.4041216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4041553Z return func(*args, **kwargs) 2025-08-14T21:37:32.4041905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4042271Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4042395Z 2025-08-14T21:37:32.4042492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4042826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4043130Z return mod(**inputs) 2025-08-14T21:37:32.4043467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4043847Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4044226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4044585Z hidden_states = self.encoder( 2025-08-14T21:37:32.4044931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4045289Z layer_outputs = layer_module( 2025-08-14T21:37:32.4045602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4045933Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4046290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4046664Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4047016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4047349Z return func(*args, **kwargs) 2025-08-14T21:37:32.4047698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4048058Z self_outputs = self.self( 2025-08-14T21:37:32.4048388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4048720Z return func(*args, **kwargs) 2025-08-14T21:37:32.4049072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4049488Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4049612Z 2025-08-14T21:37:32.4049707Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4050057Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4050355Z return mod(**inputs) 2025-08-14T21:37:32.4050715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4051100Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4051485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4051857Z hidden_states = self.encoder( 2025-08-14T21:37:32.4052224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4052587Z layer_outputs = layer_module( 2025-08-14T21:37:32.4052920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4053252Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4053606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4053978Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4054326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4054663Z return func(*args, **kwargs) 2025-08-14T21:37:32.4055003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4055360Z self_outputs = self.self( 2025-08-14T21:37:32.4055689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4056025Z return func(*args, **kwargs) 2025-08-14T21:37:32.4056375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4056744Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4056867Z 2025-08-14T21:37:32.4056948Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4057138Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4057354Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4057684Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4057972Z return mod(**inputs) 2025-08-14T21:37:32.4058314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4058689Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4059056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4059412Z hidden_states = self.encoder( 2025-08-14T21:37:32.4059761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4060118Z layer_outputs = layer_module( 2025-08-14T21:37:32.4060432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4060755Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4061115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4061477Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4061816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4062156Z return func(*args, **kwargs) 2025-08-14T21:37:32.4062528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4062953Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4063371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4063744Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4063869Z 2025-08-14T21:37:32.4063971Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4064299Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4064589Z return mod(**inputs) 2025-08-14T21:37:32.4065045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4065436Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4065828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4066191Z hidden_states = self.encoder( 2025-08-14T21:37:32.4066550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4066912Z layer_outputs = layer_module( 2025-08-14T21:37:32.4067226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4067558Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4067925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4068293Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4068664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4069028Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4069416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4069841Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4070244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4070613Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4070737Z 2025-08-14T21:37:32.4070839Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4071164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4071461Z return mod(**inputs) 2025-08-14T21:37:32.4071805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4072179Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4072554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4072911Z hidden_states = self.encoder( 2025-08-14T21:37:32.4073261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4073613Z layer_outputs = layer_module( 2025-08-14T21:37:32.4073929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4074258Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4074616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4074986Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4075376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4075735Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4076132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4076594Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4076994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4077396Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4077742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4078054Z return self.act(input) 2025-08-14T21:37:32.4078156Z 2025-08-14T21:37:32.4078259Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4078605Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4078910Z return mod(**inputs) 2025-08-14T21:37:32.4079259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4079649Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4080021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4080385Z hidden_states = self.encoder( 2025-08-14T21:37:32.4080750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4081115Z layer_outputs = layer_module( 2025-08-14T21:37:32.4081433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4081779Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4082157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4082529Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4082902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4083271Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4083668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4084112Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4084540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4085035Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4085167Z 2025-08-14T21:37:32.4085274Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4085602Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4085905Z return mod(**inputs) 2025-08-14T21:37:32.4086272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4086649Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4087028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4087397Z hidden_states = self.encoder( 2025-08-14T21:37:32.4087751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4088104Z layer_outputs = layer_module( 2025-08-14T21:37:32.4088418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4088799Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4089189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4089579Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4089937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4090278Z return func(*args, **kwargs) 2025-08-14T21:37:32.4090625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4090987Z self_outputs = self.self( 2025-08-14T21:37:32.4091324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4091661Z return func(*args, **kwargs) 2025-08-14T21:37:32.4092032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4092401Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4092525Z 2025-08-14T21:37:32.4092626Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4092949Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4093246Z return mod(**inputs) 2025-08-14T21:37:32.4093586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4093959Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4094324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4094680Z hidden_states = self.encoder( 2025-08-14T21:37:32.4095033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4095393Z layer_outputs = layer_module( 2025-08-14T21:37:32.4095699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4096031Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4096392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4096752Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4097100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4097438Z return func(*args, **kwargs) 2025-08-14T21:37:32.4097785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4098138Z self_outputs = self.self( 2025-08-14T21:37:32.4098470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4098812Z return func(*args, **kwargs) 2025-08-14T21:37:32.4099152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4099520Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4099647Z 2025-08-14T21:37:32.4099745Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4100074Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4100364Z return mod(**inputs) 2025-08-14T21:37:32.4100704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4101079Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4101461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4101836Z hidden_states = self.encoder( 2025-08-14T21:37:32.4102193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4102570Z layer_outputs = layer_module( 2025-08-14T21:37:32.4102880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4103208Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4103571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4103937Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4104279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4104636Z return func(*args, **kwargs) 2025-08-14T21:37:32.4105081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4105444Z self_outputs = self.self( 2025-08-14T21:37:32.4105779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4106122Z return func(*args, **kwargs) 2025-08-14T21:37:32.4106474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4106838Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4106968Z 2025-08-14T21:37:32.4107042Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4107242Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4107453Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4107787Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4108086Z return mod(**inputs) 2025-08-14T21:37:32.4108428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4108801Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4109170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4109524Z hidden_states = self.encoder( 2025-08-14T21:37:32.4109867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4110222Z layer_outputs = layer_module( 2025-08-14T21:37:32.4110537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4110862Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4111218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4111590Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4111939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4112279Z return func(*args, **kwargs) 2025-08-14T21:37:32.4112620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4113032Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4113441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4113802Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4113934Z 2025-08-14T21:37:32.4114028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4114385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4114700Z return mod(**inputs) 2025-08-14T21:37:32.4115038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4115431Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4115807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4116164Z hidden_states = self.encoder( 2025-08-14T21:37:32.4116508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4116863Z layer_outputs = layer_module( 2025-08-14T21:37:32.4117177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4117522Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4117889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4118264Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4118644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4118997Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4119384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4119818Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4120225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4120590Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4120725Z 2025-08-14T21:37:32.4120821Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4121152Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4121445Z return mod(**inputs) 2025-08-14T21:37:32.4121791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4122175Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4122547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4122898Z hidden_states = self.encoder( 2025-08-14T21:37:32.4123247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4123607Z layer_outputs = layer_module( 2025-08-14T21:37:32.4123919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4124251Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4124617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4124988Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4125345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4125703Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4126097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4126529Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4126923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4127339Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4127715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4128021Z return self.act(input) 2025-08-14T21:37:32.4128127Z 2025-08-14T21:37:32.4128234Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4128566Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4128865Z return mod(**inputs) 2025-08-14T21:37:32.4129201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4129577Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4129950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4130324Z hidden_states = self.encoder( 2025-08-14T21:37:32.4130672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4131031Z layer_outputs = layer_module( 2025-08-14T21:37:32.4131347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4131670Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4132039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4132414Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4132779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4133132Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4133515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4133959Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4134370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4134736Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4134867Z 2025-08-14T21:37:32.4134962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4135293Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4135585Z return mod(**inputs) 2025-08-14T21:37:32.4135927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4136303Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4136674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4137029Z hidden_states = self.encoder( 2025-08-14T21:37:32.4137383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4137742Z layer_outputs = layer_module( 2025-08-14T21:37:32.4138048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4138379Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4138743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4139113Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4139456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4139820Z return func(*args, **kwargs) 2025-08-14T21:37:32.4140172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4140547Z self_outputs = self.self( 2025-08-14T21:37:32.4140893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4141236Z return func(*args, **kwargs) 2025-08-14T21:37:32.4141583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4141945Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4142076Z 2025-08-14T21:37:32.4142172Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4142498Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4142793Z return mod(**inputs) 2025-08-14T21:37:32.4143127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4143523Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4143900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4144255Z hidden_states = self.encoder( 2025-08-14T21:37:32.4144609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4145043Z layer_outputs = layer_module( 2025-08-14T21:37:32.4145367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4145692Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4146060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4146437Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4146795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4147134Z return func(*args, **kwargs) 2025-08-14T21:37:32.4147488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4147852Z self_outputs = self.self( 2025-08-14T21:37:32.4148182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4148532Z return func(*args, **kwargs) 2025-08-14T21:37:32.4148885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4149254Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4149376Z 2025-08-14T21:37:32.4149470Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4149803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4150100Z return mod(**inputs) 2025-08-14T21:37:32.4150439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4150819Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4151205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4151561Z hidden_states = self.encoder( 2025-08-14T21:37:32.4151903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4152263Z layer_outputs = layer_module( 2025-08-14T21:37:32.4152581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4152939Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4153308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4153680Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4154045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4154389Z return func(*args, **kwargs) 2025-08-14T21:37:32.4154754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4155124Z self_outputs = self.self( 2025-08-14T21:37:32.4155468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4155815Z return func(*args, **kwargs) 2025-08-14T21:37:32.4156178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4156574Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4156700Z 2025-08-14T21:37:32.4156778Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4156983Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4157211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4157551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4157852Z return mod(**inputs) 2025-08-14T21:37:32.4158205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4158593Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4158971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4159342Z hidden_states = self.encoder( 2025-08-14T21:37:32.4159705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4160075Z layer_outputs = layer_module( 2025-08-14T21:37:32.4160397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4160743Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4161120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4161500Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4161856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4162206Z return func(*args, **kwargs) 2025-08-14T21:37:32.4162567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4162989Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4163409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4163793Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4163927Z 2025-08-14T21:37:32.4164033Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4164366Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4164677Z return mod(**inputs) 2025-08-14T21:37:32.4165033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4165423Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4165803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4166182Z hidden_states = self.encoder( 2025-08-14T21:37:32.4166550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4166904Z layer_outputs = layer_module( 2025-08-14T21:37:32.4167238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4167578Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4167948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4168320Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4168693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4169059Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4181594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4182088Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4182521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4182904Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4183049Z 2025-08-14T21:37:32.4183153Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4183498Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4183812Z return mod(**inputs) 2025-08-14T21:37:32.4184162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4184550Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4185318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4185687Z hidden_states = self.encoder( 2025-08-14T21:37:32.4186059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4186426Z layer_outputs = layer_module( 2025-08-14T21:37:32.4186752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4187089Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4187462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4187838Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4188205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4188576Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4188974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4189412Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4189812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4190214Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4190565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4190883Z return self.act(input) 2025-08-14T21:37:32.4190987Z 2025-08-14T21:37:32.4191087Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4191424Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4191851Z return mod(**inputs) 2025-08-14T21:37:32.4192239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4192631Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4193049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4193413Z hidden_states = self.encoder( 2025-08-14T21:37:32.4193763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4194126Z layer_outputs = layer_module( 2025-08-14T21:37:32.4194446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4194784Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4195144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4195557Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4195934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4196294Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4196691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4197144Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4197562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4197929Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4198065Z 2025-08-14T21:37:32.4198164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4198510Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4198817Z return mod(**inputs) 2025-08-14T21:37:32.4199160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4199646Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4200134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4200637Z hidden_states = self.encoder( 2025-08-14T21:37:32.4201042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4201414Z layer_outputs = layer_module( 2025-08-14T21:37:32.4201742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4202082Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4202472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4202860Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4203225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4203588Z return func(*args, **kwargs) 2025-08-14T21:37:32.4203952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4204333Z self_outputs = self.self( 2025-08-14T21:37:32.4204673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4205032Z return func(*args, **kwargs) 2025-08-14T21:37:32.4205397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4205824Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4205955Z 2025-08-14T21:37:32.4206074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4206429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4206784Z return mod(**inputs) 2025-08-14T21:37:32.4207129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4207516Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4207896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4208263Z hidden_states = self.encoder( 2025-08-14T21:37:32.4208614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4209009Z layer_outputs = layer_module( 2025-08-14T21:37:32.4209335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4209673Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4210163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4210644Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4211064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4211408Z return func(*args, **kwargs) 2025-08-14T21:37:32.4211769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4212137Z self_outputs = self.self( 2025-08-14T21:37:32.4212475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4212825Z return func(*args, **kwargs) 2025-08-14T21:37:32.4213187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4213565Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4213690Z 2025-08-14T21:37:32.4213789Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4214126Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4214430Z return mod(**inputs) 2025-08-14T21:37:32.4214781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4215160Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4215542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4215912Z hidden_states = self.encoder( 2025-08-14T21:37:32.4216269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4216638Z layer_outputs = layer_module( 2025-08-14T21:37:32.4216959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4217303Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4217668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4218046Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4218402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4218753Z return func(*args, **kwargs) 2025-08-14T21:37:32.4219103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4219495Z self_outputs = self.self( 2025-08-14T21:37:32.4219868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4220204Z return func(*args, **kwargs) 2025-08-14T21:37:32.4220573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4220950Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4221075Z 2025-08-14T21:37:32.4221149Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4221348Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4221565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4221889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4222186Z return mod(**inputs) 2025-08-14T21:37:32.4222546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4222926Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4223293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4223652Z hidden_states = self.encoder( 2025-08-14T21:37:32.4224002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4224351Z layer_outputs = layer_module( 2025-08-14T21:37:32.4224675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4225138Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4225539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4225939Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4226320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4226682Z return func(*args, **kwargs) 2025-08-14T21:37:32.4227028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4227446Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4227860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4228236Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4228363Z 2025-08-14T21:37:32.4228458Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4228788Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4229090Z return mod(**inputs) 2025-08-14T21:37:32.4229435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4229808Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4230182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4230541Z hidden_states = self.encoder( 2025-08-14T21:37:32.4230882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4231241Z layer_outputs = layer_module( 2025-08-14T21:37:32.4231554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4231884Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4232268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4232643Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4233030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4233407Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4233792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4234229Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4234636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4234999Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4235133Z 2025-08-14T21:37:32.4235229Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4235573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4235870Z return mod(**inputs) 2025-08-14T21:37:32.4236203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4236577Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4236950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4237305Z hidden_states = self.encoder( 2025-08-14T21:37:32.4237649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4238003Z layer_outputs = layer_module( 2025-08-14T21:37:32.4238317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4238642Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4239004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4239375Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4239742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4240095Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4240480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4240910Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4241309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4241701Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4242050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4242362Z return self.act(input) 2025-08-14T21:37:32.4242463Z 2025-08-14T21:37:32.4242560Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4242892Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4243187Z return mod(**inputs) 2025-08-14T21:37:32.4243528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4243897Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4244269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4244627Z hidden_states = self.encoder( 2025-08-14T21:37:32.4244971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4245349Z layer_outputs = layer_module( 2025-08-14T21:37:32.4245773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4246121Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4246500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4246880Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4247253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4247618Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4247999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4248466Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4248881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4249245Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4249381Z 2025-08-14T21:37:32.4249478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4249810Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4250107Z return mod(**inputs) 2025-08-14T21:37:32.4250443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4250824Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4251197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4251554Z hidden_states = self.encoder( 2025-08-14T21:37:32.4251899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4252257Z layer_outputs = layer_module( 2025-08-14T21:37:32.4252575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4252899Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4253263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4253633Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4253985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4254321Z return func(*args, **kwargs) 2025-08-14T21:37:32.4254670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4255030Z self_outputs = self.self( 2025-08-14T21:37:32.4255358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4255698Z return func(*args, **kwargs) 2025-08-14T21:37:32.4256047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4256414Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4256538Z 2025-08-14T21:37:32.4256634Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4256966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4257263Z return mod(**inputs) 2025-08-14T21:37:32.4257604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4257994Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4258383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4258748Z hidden_states = self.encoder( 2025-08-14T21:37:32.4259109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4259468Z layer_outputs = layer_module( 2025-08-14T21:37:32.4259783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4260114Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4260471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4260842Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4261192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4261545Z return func(*args, **kwargs) 2025-08-14T21:37:32.4261896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4262254Z self_outputs = self.self( 2025-08-14T21:37:32.4262587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4262921Z return func(*args, **kwargs) 2025-08-14T21:37:32.4263270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4263633Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4263754Z 2025-08-14T21:37:32.4263855Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4264173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4264472Z return mod(**inputs) 2025-08-14T21:37:32.4264929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4265352Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4265768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4266168Z hidden_states = self.encoder( 2025-08-14T21:37:32.4266524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4266883Z layer_outputs = layer_module( 2025-08-14T21:37:32.4267205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4267541Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4267904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4268281Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4268635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4268983Z return func(*args, **kwargs) 2025-08-14T21:37:32.4269331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4269695Z self_outputs = self.self( 2025-08-14T21:37:32.4270033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4270378Z return func(*args, **kwargs) 2025-08-14T21:37:32.4270728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4271100Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4271262Z 2025-08-14T21:37:32.4271344Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4271534Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4271765Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4272114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4272418Z return mod(**inputs) 2025-08-14T21:37:32.4272756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4273136Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4273509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4273866Z hidden_states = self.encoder( 2025-08-14T21:37:32.4274220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4274602Z layer_outputs = layer_module( 2025-08-14T21:37:32.4274922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4275256Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4275618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4275990Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4276343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4276684Z return func(*args, **kwargs) 2025-08-14T21:37:32.4277028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4277442Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4277858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4278226Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4278360Z 2025-08-14T21:37:32.4278453Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4278788Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4279088Z return mod(**inputs) 2025-08-14T21:37:32.4279428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4279809Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4280182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4280543Z hidden_states = self.encoder( 2025-08-14T21:37:32.4280892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4281254Z layer_outputs = layer_module( 2025-08-14T21:37:32.4281570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4281899Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4282264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4282637Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4283005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4283355Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4283742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4284197Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4284866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4285262Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4285398Z 2025-08-14T21:37:32.4285528Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4285909Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4286242Z return mod(**inputs) 2025-08-14T21:37:32.4286635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4287066Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4287436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4287823Z hidden_states = self.encoder( 2025-08-14T21:37:32.4288214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4288629Z layer_outputs = layer_module( 2025-08-14T21:37:32.4288985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4289371Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4289790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4290214Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4290626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4291039Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4291486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4291987Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4292445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4292904Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4293303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4293662Z return self.act(input) 2025-08-14T21:37:32.4293770Z 2025-08-14T21:37:32.4293864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4294193Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4294492Z return mod(**inputs) 2025-08-14T21:37:32.4294830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4295215Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4295592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4295951Z hidden_states = self.encoder( 2025-08-14T21:37:32.4296299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4296659Z layer_outputs = layer_module( 2025-08-14T21:37:32.4296974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4297299Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4297667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4298037Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4298437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4298799Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4299203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4299652Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4300072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4300441Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4300575Z 2025-08-14T21:37:32.4300673Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4301010Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4301309Z return mod(**inputs) 2025-08-14T21:37:32.4301673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4302058Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4302439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4302793Z hidden_states = self.encoder( 2025-08-14T21:37:32.4303153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4303518Z layer_outputs = layer_module( 2025-08-14T21:37:32.4303833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4304182Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4304547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4304967Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4305332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4305680Z return func(*args, **kwargs) 2025-08-14T21:37:32.4306029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4306385Z self_outputs = self.self( 2025-08-14T21:37:32.4306717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4307056Z return func(*args, **kwargs) 2025-08-14T21:37:32.4307398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4307771Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4307908Z 2025-08-14T21:37:32.4308001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4308330Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4308618Z return mod(**inputs) 2025-08-14T21:37:32.4308963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4309340Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4309711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4310062Z hidden_states = self.encoder( 2025-08-14T21:37:32.4310412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4310767Z layer_outputs = layer_module( 2025-08-14T21:37:32.4311074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4311427Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4311806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4312178Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4312533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4312880Z return func(*args, **kwargs) 2025-08-14T21:37:32.4313232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4313591Z self_outputs = self.self( 2025-08-14T21:37:32.4313918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4314260Z return func(*args, **kwargs) 2025-08-14T21:37:32.4314639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4315002Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4315132Z 2025-08-14T21:37:32.4315229Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4315561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4315858Z return mod(**inputs) 2025-08-14T21:37:32.4316191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4316568Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4316940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4317290Z hidden_states = self.encoder( 2025-08-14T21:37:32.4317647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4318010Z layer_outputs = layer_module( 2025-08-14T21:37:32.4318328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4318662Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4319023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4319395Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4319742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4320074Z return func(*args, **kwargs) 2025-08-14T21:37:32.4320420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4320780Z self_outputs = self.self( 2025-08-14T21:37:32.4321101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4321440Z return func(*args, **kwargs) 2025-08-14T21:37:32.4321787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4322157Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4322280Z 2025-08-14T21:37:32.4322353Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4322549Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4322764Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4323085Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4323383Z return mod(**inputs) 2025-08-14T21:37:32.4323726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4324123Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4324504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4324865Z hidden_states = self.encoder( 2025-08-14T21:37:32.4325227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4325578Z layer_outputs = layer_module( 2025-08-14T21:37:32.4325895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4326222Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4326583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4326941Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4327310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4327654Z return func(*args, **kwargs) 2025-08-14T21:37:32.4327997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4328414Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4328826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4329197Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4329322Z 2025-08-14T21:37:32.4329416Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4329743Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4330043Z return mod(**inputs) 2025-08-14T21:37:32.4330389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4330764Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4331143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4331502Z hidden_states = self.encoder( 2025-08-14T21:37:32.4331850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4332213Z layer_outputs = layer_module( 2025-08-14T21:37:32.4332532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4332864Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4333227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4333606Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4333982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4334343Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4334730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4335166Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4335570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4335933Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4336065Z 2025-08-14T21:37:32.4336160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4336486Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4336799Z return mod(**inputs) 2025-08-14T21:37:32.4337146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4337525Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4337913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4338274Z hidden_states = self.encoder( 2025-08-14T21:37:32.4338618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4338974Z layer_outputs = layer_module( 2025-08-14T21:37:32.4339289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4339612Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4339976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4340366Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4340727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4341076Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4341458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4341884Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4342279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4342669Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4343014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4343326Z return self.act(input) 2025-08-14T21:37:32.4343425Z 2025-08-14T21:37:32.4343517Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4343847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4344142Z return mod(**inputs) 2025-08-14T21:37:32.4344481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4344936Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4345323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4345691Z hidden_states = self.encoder( 2025-08-14T21:37:32.4346054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4346413Z layer_outputs = layer_module( 2025-08-14T21:37:32.4346738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4347082Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4347452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4347832Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4348206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4348574Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4348963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4349415Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4349840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4350237Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4350387Z 2025-08-14T21:37:32.4350488Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4350837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4351146Z return mod(**inputs) 2025-08-14T21:37:32.4351485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4351868Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4352247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4352612Z hidden_states = self.encoder( 2025-08-14T21:37:32.4352961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4353340Z layer_outputs = layer_module( 2025-08-14T21:37:32.4353660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4353991Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4354364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4354739Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4355098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4355438Z return func(*args, **kwargs) 2025-08-14T21:37:32.4355790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4356153Z self_outputs = self.self( 2025-08-14T21:37:32.4356382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4356453Z return func(*args, **kwargs) 2025-08-14T21:37:32.4356697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4356775Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4356779Z 2025-08-14T21:37:32.4356884Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4357070Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4357131Z return mod(**inputs) 2025-08-14T21:37:32.4357384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4357465Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4357713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4357780Z hidden_states = self.encoder( 2025-08-14T21:37:32.4358019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4358091Z layer_outputs = layer_module( 2025-08-14T21:37:32.4358308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4358380Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4358626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4358702Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4358933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4358996Z return func(*args, **kwargs) 2025-08-14T21:37:32.4359268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4359353Z self_outputs = self.self( 2025-08-14T21:37:32.4359573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4359663Z return func(*args, **kwargs) 2025-08-14T21:37:32.4359903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4359976Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4359980Z 2025-08-14T21:37:32.4360081Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4360263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4360321Z return mod(**inputs) 2025-08-14T21:37:32.4360567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4360667Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4360910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4360973Z hidden_states = self.encoder( 2025-08-14T21:37:32.4361209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4361277Z layer_outputs = layer_module( 2025-08-14T21:37:32.4361477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4361554Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4361788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4361862Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4362086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4362148Z return func(*args, **kwargs) 2025-08-14T21:37:32.4362386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4362454Z self_outputs = self.self( 2025-08-14T21:37:32.4362669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4362737Z return func(*args, **kwargs) 2025-08-14T21:37:32.4362972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4363043Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4363046Z 2025-08-14T21:37:32.4363125Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4363199Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4363293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4363482Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4363542Z return mod(**inputs) 2025-08-14T21:37:32.4363791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4363869Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4364105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4364175Z hidden_states = self.encoder( 2025-08-14T21:37:32.4364409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4364479Z layer_outputs = layer_module( 2025-08-14T21:37:32.4364698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4364769Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4365028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4365120Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4365339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4365409Z return func(*args, **kwargs) 2025-08-14T21:37:32.4365643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4365767Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4366002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4366111Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4366114Z 2025-08-14T21:37:32.4366218Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4366399Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4366464Z return mod(**inputs) 2025-08-14T21:37:32.4366706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4366783Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4367027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4367090Z hidden_states = self.encoder( 2025-08-14T21:37:32.4367327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4367406Z layer_outputs = layer_module( 2025-08-14T21:37:32.4367609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4367685Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4367923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4368000Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4368247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4368314Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4368582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4368697Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4368938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4369020Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4369023Z 2025-08-14T21:37:32.4369116Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4369295Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4369363Z return mod(**inputs) 2025-08-14T21:37:32.4369603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4369686Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4369920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4369983Z hidden_states = self.encoder( 2025-08-14T21:37:32.4370225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4370304Z layer_outputs = layer_module( 2025-08-14T21:37:32.4370521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4370599Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4370852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4370934Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4371167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4371234Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4371508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4371636Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4371877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4371979Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4372171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4372240Z return self.act(input) 2025-08-14T21:37:32.4372243Z 2025-08-14T21:37:32.4372336Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4372520Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4372578Z return mod(**inputs) 2025-08-14T21:37:32.4372820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4372903Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4373142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4373205Z hidden_states = self.encoder( 2025-08-14T21:37:32.4373449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4373511Z layer_outputs = layer_module( 2025-08-14T21:37:32.4373714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4373783Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4374018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4374099Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4374332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4374402Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4374673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4374793Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4375034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4375104Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4375107Z 2025-08-14T21:37:32.4375198Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4375381Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4375437Z return mod(**inputs) 2025-08-14T21:37:32.4375681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4375778Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4376030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4376103Z hidden_states = self.encoder( 2025-08-14T21:37:32.4376357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4376422Z layer_outputs = layer_module( 2025-08-14T21:37:32.4376629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4376701Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4376941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4377015Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4377251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4377321Z return func(*args, **kwargs) 2025-08-14T21:37:32.4377557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4377627Z self_outputs = self.self( 2025-08-14T21:37:32.4377845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4377908Z return func(*args, **kwargs) 2025-08-14T21:37:32.4378151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4378221Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4378224Z 2025-08-14T21:37:32.4378317Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4378504Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4378564Z return mod(**inputs) 2025-08-14T21:37:32.4378814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4378890Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4379127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4379196Z hidden_states = self.encoder( 2025-08-14T21:37:32.4379431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4379499Z layer_outputs = layer_module( 2025-08-14T21:37:32.4379700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4379771Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4380019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4380093Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4380312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4380381Z return func(*args, **kwargs) 2025-08-14T21:37:32.4380615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4380685Z self_outputs = self.self( 2025-08-14T21:37:32.4380904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4380964Z return func(*args, **kwargs) 2025-08-14T21:37:32.4381206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4381292Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4381295Z 2025-08-14T21:37:32.4381411Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4381596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4381670Z return mod(**inputs) 2025-08-14T21:37:32.4381919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4381996Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4382230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4382298Z hidden_states = self.encoder( 2025-08-14T21:37:32.4382533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4382621Z layer_outputs = layer_module( 2025-08-14T21:37:32.4382818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4382890Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4383134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4383207Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4383425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4383493Z return func(*args, **kwargs) 2025-08-14T21:37:32.4383730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4383797Z self_outputs = self.self( 2025-08-14T21:37:32.4384013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4384077Z return func(*args, **kwargs) 2025-08-14T21:37:32.4384322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4384392Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4384396Z 2025-08-14T21:37:32.4384475Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4384545Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4384881Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4385115Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4385179Z return mod(**inputs) 2025-08-14T21:37:32.4385442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4385533Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4385797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4385871Z hidden_states = self.encoder( 2025-08-14T21:37:32.4386128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4386197Z layer_outputs = layer_module( 2025-08-14T21:37:32.4386421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4386493Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4386746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4386825Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4387047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4387156Z return func(*args, **kwargs) 2025-08-14T21:37:32.4387419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4387539Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4387827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4387909Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4387913Z 2025-08-14T21:37:32.4388019Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4388210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4388273Z return mod(**inputs) 2025-08-14T21:37:32.4388540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4389572Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4389835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4389913Z hidden_states = self.encoder( 2025-08-14T21:37:32.4390172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4390247Z layer_outputs = layer_module( 2025-08-14T21:37:32.4390465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4390539Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4390800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4390881Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4391134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4391217Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4391507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4391632Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4391892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4391972Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4391976Z 2025-08-14T21:37:32.4392081Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4392276Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4392347Z return mod(**inputs) 2025-08-14T21:37:32.4392608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4392694Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4392960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4393029Z hidden_states = self.encoder( 2025-08-14T21:37:32.4393283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4393357Z layer_outputs = layer_module( 2025-08-14T21:37:32.4393574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4393655Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4393910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4394010Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4394270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4394361Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4394675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4394796Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4395052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4395161Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4395353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4395416Z return self.act(input) 2025-08-14T21:37:32.4395427Z 2025-08-14T21:37:32.4395538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4395722Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4395789Z return mod(**inputs) 2025-08-14T21:37:32.4396034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4396113Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4396358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4396421Z hidden_states = self.encoder( 2025-08-14T21:37:32.4396665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4396728Z layer_outputs = layer_module( 2025-08-14T21:37:32.4396929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4397010Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4397248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4397321Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4397563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4397630Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4397902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4398020Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4398262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4398346Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4398349Z 2025-08-14T21:37:32.4398442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4398631Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4398690Z return mod(**inputs) 2025-08-14T21:37:32.4398934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4399018Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4399257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4399319Z hidden_states = self.encoder( 2025-08-14T21:37:32.4399563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4399625Z layer_outputs = layer_module( 2025-08-14T21:37:32.4399876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4399962Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4400202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4400299Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4400520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4400590Z return func(*args, **kwargs) 2025-08-14T21:37:32.4400824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4400888Z self_outputs = self.self( 2025-08-14T21:37:32.4401113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4401191Z return func(*args, **kwargs) 2025-08-14T21:37:32.4401435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4401516Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4401519Z 2025-08-14T21:37:32.4401613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4401801Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4401860Z return mod(**inputs) 2025-08-14T21:37:32.4402106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4402191Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4402430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4402500Z hidden_states = self.encoder( 2025-08-14T21:37:32.4402742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4402805Z layer_outputs = layer_module( 2025-08-14T21:37:32.4403015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4403085Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4403326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4403404Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4403626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4403693Z return func(*args, **kwargs) 2025-08-14T21:37:32.4403933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4403997Z self_outputs = self.self( 2025-08-14T21:37:32.4404228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4404287Z return func(*args, **kwargs) 2025-08-14T21:37:32.4404531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4404605Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4404609Z 2025-08-14T21:37:32.4404702Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4404889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4404945Z return mod(**inputs) 2025-08-14T21:37:32.4405188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4405301Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4405551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4405621Z hidden_states = self.encoder( 2025-08-14T21:37:32.4405870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4405935Z layer_outputs = layer_module( 2025-08-14T21:37:32.4406142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4406212Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4406450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4406528Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4406747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4406832Z return func(*args, **kwargs) 2025-08-14T21:37:32.4407078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4407141Z self_outputs = self.self( 2025-08-14T21:37:32.4407371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4407431Z return func(*args, **kwargs) 2025-08-14T21:37:32.4407672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4407750Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4407753Z 2025-08-14T21:37:32.4407826Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4407904Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4407998Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4408178Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4408245Z return mod(**inputs) 2025-08-14T21:37:32.4408487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4408571Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4408804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4408866Z hidden_states = self.encoder( 2025-08-14T21:37:32.4409108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4409170Z layer_outputs = layer_module( 2025-08-14T21:37:32.4409370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4409449Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4409683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4409762Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4409981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4410042Z return func(*args, **kwargs) 2025-08-14T21:37:32.4410282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4410396Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4410641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4410715Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4410734Z 2025-08-14T21:37:32.4410828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4411025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4411084Z return mod(**inputs) 2025-08-14T21:37:32.4411340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4411426Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4411661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4411732Z hidden_states = self.encoder( 2025-08-14T21:37:32.4411967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4412029Z layer_outputs = layer_module( 2025-08-14T21:37:32.4412235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4412319Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4412555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4412638Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4412868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4412943Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4413208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4413314Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4413556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4413632Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4413635Z 2025-08-14T21:37:32.4413731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4413911Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4413969Z return mod(**inputs) 2025-08-14T21:37:32.4414215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4414291Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4414534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4414596Z hidden_states = self.encoder( 2025-08-14T21:37:32.4414830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4414903Z layer_outputs = layer_module( 2025-08-14T21:37:32.4415102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4415171Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4415414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4415487Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4415725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4415793Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4416058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4416171Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4416406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4416524Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4416734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4416812Z return self.act(input) 2025-08-14T21:37:32.4416816Z 2025-08-14T21:37:32.4416916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4417097Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4417155Z return mod(**inputs) 2025-08-14T21:37:32.4417404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4417480Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4417722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4417801Z hidden_states = self.encoder( 2025-08-14T21:37:32.4418039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4418108Z layer_outputs = layer_module( 2025-08-14T21:37:32.4418313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4418383Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4418624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4418698Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4418938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4419005Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4419274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4419400Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4419639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4419720Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4419723Z 2025-08-14T21:37:32.4419816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4419994Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4420061Z return mod(**inputs) 2025-08-14T21:37:32.4420302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4420378Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4420622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4420685Z hidden_states = self.encoder( 2025-08-14T21:37:32.4420928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4420991Z layer_outputs = layer_module( 2025-08-14T21:37:32.4421190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4421267Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4421503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4421584Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4421806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4421905Z return func(*args, **kwargs) 2025-08-14T21:37:32.4422166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4422231Z self_outputs = self.self( 2025-08-14T21:37:32.4422467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4422539Z return func(*args, **kwargs) 2025-08-14T21:37:32.4422776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4422856Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4422860Z 2025-08-14T21:37:32.4422952Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4423133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4423201Z return mod(**inputs) 2025-08-14T21:37:32.4423464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4423549Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4423783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4423845Z hidden_states = self.encoder( 2025-08-14T21:37:32.4424088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4424151Z layer_outputs = layer_module( 2025-08-14T21:37:32.4424349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4424434Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4424667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4424751Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4425080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4425149Z return func(*args, **kwargs) 2025-08-14T21:37:32.4425401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4425466Z self_outputs = self.self( 2025-08-14T21:37:32.4425684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4425754Z return func(*args, **kwargs) 2025-08-14T21:37:32.4426067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4426145Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4426151Z 2025-08-14T21:37:32.4426245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4426423Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4426490Z return mod(**inputs) 2025-08-14T21:37:32.4426731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4426816Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4427051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4427116Z hidden_states = self.encoder( 2025-08-14T21:37:32.4427361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4427423Z layer_outputs = layer_module( 2025-08-14T21:37:32.4427623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4427726Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4427983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4428066Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4428299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4428361Z return func(*args, **kwargs) 2025-08-14T21:37:32.4428608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4428669Z self_outputs = self.self( 2025-08-14T21:37:32.4428896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4428956Z return func(*args, **kwargs) 2025-08-14T21:37:32.4429212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4429293Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4429297Z 2025-08-14T21:37:32.4429369Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4429441Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4429542Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4429723Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4429788Z return mod(**inputs) 2025-08-14T21:37:32.4430031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4430107Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4430350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4430416Z hidden_states = self.encoder( 2025-08-14T21:37:32.4430657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4430727Z layer_outputs = layer_module( 2025-08-14T21:37:32.4430929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4431004Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4431242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4431313Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4431536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4431596Z return func(*args, **kwargs) 2025-08-14T21:37:32.4431842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4431959Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4432200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4432282Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4432285Z 2025-08-14T21:37:32.4432377Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4432556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4432622Z return mod(**inputs) 2025-08-14T21:37:32.4432863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4432947Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4433184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4433265Z hidden_states = self.encoder( 2025-08-14T21:37:32.4433526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4433592Z layer_outputs = layer_module( 2025-08-14T21:37:32.4433814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4433885Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4434123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4434207Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4434441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4434511Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4434800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4434909Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4435156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4435229Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4435232Z 2025-08-14T21:37:32.4435324Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4435511Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4435569Z return mod(**inputs) 2025-08-14T21:37:32.4435816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4435893Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4436136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4436207Z hidden_states = self.encoder( 2025-08-14T21:37:32.4436446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4436510Z layer_outputs = layer_module( 2025-08-14T21:37:32.4436718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4436786Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4437029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4437102Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4437336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4437416Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4437682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4437795Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4438034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4438157Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4438355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4438418Z return self.act(input) 2025-08-14T21:37:32.4438421Z 2025-08-14T21:37:32.4438514Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4438702Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4438779Z return mod(**inputs) 2025-08-14T21:37:32.4439046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4439125Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4439372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4439443Z hidden_states = self.encoder( 2025-08-14T21:37:32.4439681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4439751Z layer_outputs = layer_module( 2025-08-14T21:37:32.4439952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4440023Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4440285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4440361Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4440594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4440670Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4440936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4441061Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4441298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4441370Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4441373Z 2025-08-14T21:37:32.4441469Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4441648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4441713Z return mod(**inputs) 2025-08-14T21:37:32.4441951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4442025Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4442262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4442323Z hidden_states = self.encoder( 2025-08-14T21:37:32.4442555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4442623Z layer_outputs = layer_module( 2025-08-14T21:37:32.4442821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4442897Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4443135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4443206Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4443430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4443489Z return func(*args, **kwargs) 2025-08-14T21:37:32.4443722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4443786Z self_outputs = self.self( 2025-08-14T21:37:32.4444000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4444061Z return func(*args, **kwargs) 2025-08-14T21:37:32.4444296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 241, in forward 2025-08-14T21:37:32.4444382Z query_layer = self.query(hidden_states) 2025-08-14T21:37:32.4444386Z 2025-08-14T21:37:32.4444494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4444693Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4444755Z return mod(**inputs) 2025-08-14T21:37:32.4444996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4445071Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4445311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4445371Z hidden_states = self.encoder( 2025-08-14T21:37:32.4445606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4445687Z layer_outputs = layer_module( 2025-08-14T21:37:32.4445887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4445958Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4446192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4446260Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4446480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4446539Z return func(*args, **kwargs) 2025-08-14T21:37:32.4446778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4446838Z self_outputs = self.self( 2025-08-14T21:37:32.4447059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4447128Z return func(*args, **kwargs) 2025-08-14T21:37:32.4447367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 270, in forward 2025-08-14T21:37:32.4447438Z key_layer = self.key(current_states) 2025-08-14T21:37:32.4447442Z 2025-08-14T21:37:32.4447541Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4447720Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4447787Z return mod(**inputs) 2025-08-14T21:37:32.4448028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4448105Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4448348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4448414Z hidden_states = self.encoder( 2025-08-14T21:37:32.4448661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4448722Z layer_outputs = layer_module( 2025-08-14T21:37:32.4448922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4449000Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4449237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4449310Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4449539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4449601Z return func(*args, **kwargs) 2025-08-14T21:37:32.4449859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 401, in forward 2025-08-14T21:37:32.4449935Z self_outputs = self.self( 2025-08-14T21:37:32.4450157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4450236Z return func(*args, **kwargs) 2025-08-14T21:37:32.4450474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 274, in forward 2025-08-14T21:37:32.4450545Z value_layer = self.value(current_states) 2025-08-14T21:37:32.4450556Z 2025-08-14T21:37:32.4450628Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4450699Z cudagraph partition due to non gpu ops 2025-08-14T21:37:32.4450797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4450973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4451048Z return mod(**inputs) 2025-08-14T21:37:32.4451307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4451384Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4451631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4451700Z hidden_states = self.encoder( 2025-08-14T21:37:32.4451946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4452013Z layer_outputs = layer_module( 2025-08-14T21:37:32.4452217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4452288Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4452536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 474, in forward 2025-08-14T21:37:32.4452609Z self_attention_outputs = self.attention( 2025-08-14T21:37:32.4452839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:32.4452901Z return func(*args, **kwargs) 2025-08-14T21:37:32.4453144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 411, in forward 2025-08-14T21:37:32.4453266Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:37:32.4453508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 348, in forward 2025-08-14T21:37:32.4453583Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4453594Z 2025-08-14T21:37:32.4453686Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4453869Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4453938Z return mod(**inputs) 2025-08-14T21:37:32.4454187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4454262Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4454513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4454575Z hidden_states = self.encoder( 2025-08-14T21:37:32.4454825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4454888Z layer_outputs = layer_module( 2025-08-14T21:37:32.4455095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4455172Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4455432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4455521Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4455777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4455847Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4456119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4456224Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4456459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 427, in forward 2025-08-14T21:37:32.4456538Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4456541Z 2025-08-14T21:37:32.4456650Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4456834Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4456894Z return mod(**inputs) 2025-08-14T21:37:32.4457137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4457220Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4457455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4457518Z hidden_states = self.encoder( 2025-08-14T21:37:32.4457761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4457822Z layer_outputs = layer_module( 2025-08-14T21:37:32.4458028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4458099Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4458336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4458416Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4458648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4458716Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4458980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 512, in feed_forward_chunk 2025-08-14T21:37:32.4459082Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:37:32.4459317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 428, in forward 2025-08-14T21:37:32.4459418Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:37:32.4459606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:37:32.4459672Z return self.act(input) 2025-08-14T21:37:32.4459675Z 2025-08-14T21:37:32.4459765Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4459947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4460006Z return mod(**inputs) 2025-08-14T21:37:32.4460244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1317, in forward 2025-08-14T21:37:32.4460326Z discriminator_hidden_states = self.electra( 2025-08-14T21:37:32.4460561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 799, in forward 2025-08-14T21:37:32.4460631Z hidden_states = self.encoder( 2025-08-14T21:37:32.4460884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 566, in forward 2025-08-14T21:37:32.4460965Z layer_outputs = layer_module( 2025-08-14T21:37:32.4461173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:32.4461262Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:32.4461501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 504, in forward 2025-08-14T21:37:32.4461583Z layer_output = apply_chunking_to_forward( 2025-08-14T21:37:32.4461814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:37:32.4461887Z return forward_fn(*input_tensors) 2025-08-14T21:37:32.4462153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 513, in feed_forward_chunk 2025-08-14T21:37:32.4462289Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:37:32.4462533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 441, in forward 2025-08-14T21:37:32.4462606Z hidden_states = self.dense(hidden_states) 2025-08-14T21:37:32.4462611Z 2025-08-14T21:37:32.4462703Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4462878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4462934Z return mod(**inputs) 2025-08-14T21:37:32.4463172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1330, in forward 2025-08-14T21:37:32.4463243Z logits = self.qa_outputs(sequence_output) 2025-08-14T21:37:32.4463246Z 2025-08-14T21:37:32.4463334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4463515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4463572Z return mod(**inputs) 2025-08-14T21:37:32.4463819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1348, in forward 2025-08-14T21:37:32.4463915Z start_loss = loss_fct(start_logits, start_positions) 2025-08-14T21:37:32.4463919Z 2025-08-14T21:37:32.4464009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:32.4464193Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:32.4464251Z return mod(**inputs) 2025-08-14T21:37:32.4464497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/electra/modeling_electra.py", line 1349, in forward 2025-08-14T21:37:32.4464581Z end_loss = loss_fct(end_logits, end_positions) 2025-08-14T21:37:32.4464585Z 2025-08-14T21:37:38.7856484Z Compilation time (from dynamo_timed): 12.72300855 2025-08-14T21:37:38.7860496Z pass 2025-08-14T21:37:38.7864270Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:38.7868780Z TIMING: _recursive_pre_grad_passes:0.0063 _recursive_joint_graph_passes:0.4057 _recursive_post_grad_passes:0.07625 async_compile.wait:0.00177 code_gen:5.88099 inductor_compile:6.96643 backend_compile:10.19951 gc:0.00015 entire_frame_compile:12.72301 total_wall_time:12.72301 2025-08-14T21:37:38.7873346Z STATS: call_* op count: 378 | FakeTensorMode.__torch_dispatch__:15006 | FakeTensor.__torch_dispatch__:4704 | ProxyTorchDispatchMode.__torch_dispatch__:5698 2025-08-14T21:37:38.7877738Z Dynamo produced 1 graphs covering 378 ops with 0 graph breaks (0 unique) 2025-08-14T21:37:42.9419790Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:37:42.9421112Z from pkg_resources import resource_filename 2025-08-14T21:37:43.4721067Z 2025-08-14T21:37:44.8427392Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:37:44.8427780Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:37:44.8440444Z cpu eval GPT2ForSequenceClassification 2025-08-14T21:37:45.3928759Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:45.6382698Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:45.8859631Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:37:52.0076993Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0081092Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0083425Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0084183Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0084488Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0084972Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0085181Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0085418Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0085618Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0085814Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0086002Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0086184Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0086406Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0086769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0087080Z return mod(**inputs) 2025-08-14T21:37:52.0087454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1509, in forward 2025-08-14T21:37:52.0087884Z last_non_pad_token = (token_indices * non_pad_mask).argmax(-1) 2025-08-14T21:37:52.0088051Z 2025-08-14T21:37:52.0088159Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0088489Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0088808Z return mod(**inputs) 2025-08-14T21:37:52.0089155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0089529Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0089894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0090242Z outputs = block( 2025-08-14T21:37:52.0090547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0090882Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0091246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0091600Z return func(*args, **kwargs) 2025-08-14T21:37:52.0091948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0092317Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0092681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0093031Z return func(*args, **kwargs) 2025-08-14T21:37:52.0093368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0093830Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0094262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0094740Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0094906Z 2025-08-14T21:37:52.0095034Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0095238Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0095430Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0095657Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0095881Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0096221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0096525Z return mod(**inputs) 2025-08-14T21:37:52.0096852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0097219Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0097580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0097957Z outputs = block( 2025-08-14T21:37:52.0098261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0098604Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0098964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0099298Z return func(*args, **kwargs) 2025-08-14T21:37:52.0099639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0099999Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0100352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0100682Z return func(*args, **kwargs) 2025-08-14T21:37:52.0101023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0101398Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0101804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0102249Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0102421Z 2025-08-14T21:37:52.0102517Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0102848Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0103141Z return mod(**inputs) 2025-08-14T21:37:52.0103471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0103833Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0104191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0104523Z outputs = block( 2025-08-14T21:37:52.0104926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0105269Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0105620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0105962Z return func(*args, **kwargs) 2025-08-14T21:37:52.0106305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0106671Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0107023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0107503Z return func(*args, **kwargs) 2025-08-14T21:37:52.0107869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0108251Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0108683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0109115Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0109268Z 2025-08-14T21:37:52.0109372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0109703Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0110007Z return mod(**inputs) 2025-08-14T21:37:52.0110344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0110711Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0111086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0111424Z outputs = block( 2025-08-14T21:37:52.0111718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0112042Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0112386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0112725Z return func(*args, **kwargs) 2025-08-14T21:37:52.0113058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0113409Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0113759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0114095Z return func(*args, **kwargs) 2025-08-14T21:37:52.0114420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0114776Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0115105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0115471Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0115630Z 2025-08-14T21:37:52.0115724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0116063Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0116358Z return mod(**inputs) 2025-08-14T21:37:52.0116688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0117040Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0117392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0117730Z outputs = block( 2025-08-14T21:37:52.0118017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0118348Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0118690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0119027Z return func(*args, **kwargs) 2025-08-14T21:37:52.0119358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0119731Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0120106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0120487Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0120824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0121192Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0121349Z 2025-08-14T21:37:52.0121497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0121825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0122125Z return mod(**inputs) 2025-08-14T21:37:52.0122457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0122823Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0123167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0123553Z outputs = block( 2025-08-14T21:37:52.0123846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0124169Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0124515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0124855Z return func(*args, **kwargs) 2025-08-14T21:37:52.0125188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0125554Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0125931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0126286Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0126609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0127029Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0127252Z 2025-08-14T21:37:52.0127351Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0127681Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0127972Z return mod(**inputs) 2025-08-14T21:37:52.0128315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0128678Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0129031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0129360Z outputs = block( 2025-08-14T21:37:52.0129652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0129982Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0130320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0130660Z return func(*args, **kwargs) 2025-08-14T21:37:52.0130996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0131367Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0131732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0132092Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0132422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0132783Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0132961Z 2025-08-14T21:37:52.0133058Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0133404Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0133703Z return mod(**inputs) 2025-08-14T21:37:52.0134041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0134410Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0134765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0135103Z outputs = block( 2025-08-14T21:37:52.0135391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0135722Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0136067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0136468Z return func(*args, **kwargs) 2025-08-14T21:37:52.0136805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0137169Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0137524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0137854Z return func(*args, **kwargs) 2025-08-14T21:37:52.0138193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0138645Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0139069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0139424Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0139592Z 2025-08-14T21:37:52.0139667Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0139871Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0140058Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0140251Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0140470Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0140799Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0141092Z return mod(**inputs) 2025-08-14T21:37:52.0141426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0141790Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0142137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0142484Z outputs = block( 2025-08-14T21:37:52.0142779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0143110Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0143450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0143789Z return func(*args, **kwargs) 2025-08-14T21:37:52.0144122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0144476Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0144926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0145279Z return func(*args, **kwargs) 2025-08-14T21:37:52.0145618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0146008Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0146432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0146876Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0147062Z 2025-08-14T21:37:52.0147169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0147494Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0147793Z return mod(**inputs) 2025-08-14T21:37:52.0148126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0148480Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0148837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0149193Z outputs = block( 2025-08-14T21:37:52.0149488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0149813Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0150162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0150499Z return func(*args, **kwargs) 2025-08-14T21:37:52.0150830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0151193Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0151546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0151887Z return func(*args, **kwargs) 2025-08-14T21:37:52.0152218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0152591Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0152999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0153421Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0153569Z 2025-08-14T21:37:52.0153665Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0153992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0154290Z return mod(**inputs) 2025-08-14T21:37:52.0154610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0154970Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0155323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0155665Z outputs = block( 2025-08-14T21:37:52.0155951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0156280Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0156625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0156958Z return func(*args, **kwargs) 2025-08-14T21:37:52.0157290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0157646Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0157997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0158327Z return func(*args, **kwargs) 2025-08-14T21:37:52.0158663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0159041Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0159386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0159764Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0159931Z 2025-08-14T21:37:52.0160026Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0160354Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0160645Z return mod(**inputs) 2025-08-14T21:37:52.0160980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0161344Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0161702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0162051Z outputs = block( 2025-08-14T21:37:52.0162346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0162678Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0163017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0163357Z return func(*args, **kwargs) 2025-08-14T21:37:52.0163694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0164070Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0164435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0164792Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0165125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0165488Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0165646Z 2025-08-14T21:37:52.0165741Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0166075Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0166373Z return mod(**inputs) 2025-08-14T21:37:52.0166695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0167055Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0167412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0167753Z outputs = block( 2025-08-14T21:37:52.0168043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0168377Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0168726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0169058Z return func(*args, **kwargs) 2025-08-14T21:37:52.0169394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0169768Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0170142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0170488Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0170811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0171234Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0171470Z 2025-08-14T21:37:52.0171575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0171944Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0172243Z return mod(**inputs) 2025-08-14T21:37:52.0172601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0172964Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0173334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0173682Z outputs = block( 2025-08-14T21:37:52.0173989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0174324Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0174691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0175031Z return func(*args, **kwargs) 2025-08-14T21:37:52.0175363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0175739Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0176109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0176469Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0176792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0177153Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0177310Z 2025-08-14T21:37:52.0177412Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0177743Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0178034Z return mod(**inputs) 2025-08-14T21:37:52.0178363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0178725Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0179069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0179406Z outputs = block( 2025-08-14T21:37:52.0179700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0180030Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0180369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0180709Z return func(*args, **kwargs) 2025-08-14T21:37:52.0181049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0181404Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0181759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0182099Z return func(*args, **kwargs) 2025-08-14T21:37:52.0182437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0182881Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0183305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0183670Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0183828Z 2025-08-14T21:37:52.0183933Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0184127Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0184322Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0184524Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0184949Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0185345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0185653Z return mod(**inputs) 2025-08-14T21:37:52.0185982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0186347Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0186704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0187043Z outputs = block( 2025-08-14T21:37:52.0187334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0187697Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0188048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0188393Z return func(*args, **kwargs) 2025-08-14T21:37:52.0188728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0189092Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0189446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0189777Z return func(*args, **kwargs) 2025-08-14T21:37:52.0190114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0190488Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0190898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0191331Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0191507Z 2025-08-14T21:37:52.0191604Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0191932Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0192228Z return mod(**inputs) 2025-08-14T21:37:52.0192550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0192908Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0193261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0193592Z outputs = block( 2025-08-14T21:37:52.0193887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0194216Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0194560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0194892Z return func(*args, **kwargs) 2025-08-14T21:37:52.0195229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0195588Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0195934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0196274Z return func(*args, **kwargs) 2025-08-14T21:37:52.0196610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0197011Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0197435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0197859Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0198010Z 2025-08-14T21:37:52.0198130Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0198464Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0198757Z return mod(**inputs) 2025-08-14T21:37:52.0199089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0199456Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0199807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0200160Z outputs = block( 2025-08-14T21:37:52.0200467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0200801Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0201139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0201475Z return func(*args, **kwargs) 2025-08-14T21:37:52.0201811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0202165Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0202517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0202852Z return func(*args, **kwargs) 2025-08-14T21:37:52.0203187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0203541Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0203868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0204235Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0204390Z 2025-08-14T21:37:52.0204492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0204817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0205117Z return mod(**inputs) 2025-08-14T21:37:52.0205445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0205796Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0206151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0206490Z outputs = block( 2025-08-14T21:37:52.0206784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0207110Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0207456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0207791Z return func(*args, **kwargs) 2025-08-14T21:37:52.0208121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0208494Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0208868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0209219Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0209534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0209920Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0210077Z 2025-08-14T21:37:52.0210194Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0210526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0210833Z return mod(**inputs) 2025-08-14T21:37:52.0211166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0211524Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0211869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0212208Z outputs = block( 2025-08-14T21:37:52.0212502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0212854Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0213197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0213542Z return func(*args, **kwargs) 2025-08-14T21:37:52.0213885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0214255Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0214633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0214992Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0215319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0215732Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0215961Z 2025-08-14T21:37:52.0216057Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0216394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0216694Z return mod(**inputs) 2025-08-14T21:37:52.0217022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0217386Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0217742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0218077Z outputs = block( 2025-08-14T21:37:52.0218373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0218703Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0219052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0219387Z return func(*args, **kwargs) 2025-08-14T21:37:52.0219728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0220103Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0220470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0220834Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0221167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0221530Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0221688Z 2025-08-14T21:37:52.0221784Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0222115Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0222429Z return mod(**inputs) 2025-08-14T21:37:52.0222782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0223141Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0223510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0223851Z outputs = block( 2025-08-14T21:37:52.0224138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0224466Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0224878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0225228Z return func(*args, **kwargs) 2025-08-14T21:37:52.0225560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 442, in forward 2025-08-14T21:37:52.0225964Z hidden_states = residual + feed_forward_hidden_states 2025-08-14T21:37:52.0226110Z 2025-08-14T21:37:52.0226213Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0226538Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0226836Z return mod(**inputs) 2025-08-14T21:37:52.0227167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0227531Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0227881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0228219Z outputs = block( 2025-08-14T21:37:52.0228512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0228841Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0229187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0229525Z return func(*args, **kwargs) 2025-08-14T21:37:52.0229863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0230219Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0230575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0230914Z return func(*args, **kwargs) 2025-08-14T21:37:52.0231254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0231699Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0232129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0232492Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0232650Z 2025-08-14T21:37:52.0232726Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0232927Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0233120Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0233311Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0233515Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0233847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0234145Z return mod(**inputs) 2025-08-14T21:37:52.0234469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0234830Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0235213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0235557Z outputs = block( 2025-08-14T21:37:52.0235854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0236207Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0236559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0236894Z return func(*args, **kwargs) 2025-08-14T21:37:52.0237233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0237598Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0237957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0238308Z return func(*args, **kwargs) 2025-08-14T21:37:52.0238645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0239012Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0239410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0239848Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0240023Z 2025-08-14T21:37:52.0240119Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0240451Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0240741Z return mod(**inputs) 2025-08-14T21:37:52.0241072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0241436Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0241793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0242125Z outputs = block( 2025-08-14T21:37:52.0242419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0242747Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0243085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0243425Z return func(*args, **kwargs) 2025-08-14T21:37:52.0243764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0244125Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0244474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0244816Z return func(*args, **kwargs) 2025-08-14T21:37:52.0245153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0245524Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0245925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0246344Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0246494Z 2025-08-14T21:37:52.0246598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0246919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0247218Z return mod(**inputs) 2025-08-14T21:37:52.0247548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0247929Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0248294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0248636Z outputs = block( 2025-08-14T21:37:52.0248943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0249269Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0249616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0249955Z return func(*args, **kwargs) 2025-08-14T21:37:52.0250288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0250641Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0250996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0251357Z return func(*args, **kwargs) 2025-08-14T21:37:52.0251687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0252045Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0252380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0252751Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0252911Z 2025-08-14T21:37:52.0253005Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0253339Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0253638Z return mod(**inputs) 2025-08-14T21:37:52.0253967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0254322Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0254676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0255017Z outputs = block( 2025-08-14T21:37:52.0255304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0255634Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0255976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0256313Z return func(*args, **kwargs) 2025-08-14T21:37:52.0256643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0257018Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0257391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0257745Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0258076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0258442Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0258598Z 2025-08-14T21:37:52.0258701Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0259028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0259326Z return mod(**inputs) 2025-08-14T21:37:52.0259653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0260013Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0260360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0260718Z outputs = block( 2025-08-14T21:37:52.0261023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0261350Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0261720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0262067Z return func(*args, **kwargs) 2025-08-14T21:37:52.0262415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0262789Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0263176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0263541Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0263866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0264298Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0264521Z 2025-08-14T21:37:52.0264618Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0265036Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0265340Z return mod(**inputs) 2025-08-14T21:37:52.0265675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0266040Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0266397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0266734Z outputs = block( 2025-08-14T21:37:52.0267034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0267373Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0267715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0268063Z return func(*args, **kwargs) 2025-08-14T21:37:52.0268407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0268785Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0269153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0269519Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0269854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0270220Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0270381Z 2025-08-14T21:37:52.0270479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0270812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0271111Z return mod(**inputs) 2025-08-14T21:37:52.0271438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0271799Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0272155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0272492Z outputs = block( 2025-08-14T21:37:52.0272777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0273108Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0273491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0273824Z return func(*args, **kwargs) 2025-08-14T21:37:52.0274185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0274565Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0274923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0275259Z return func(*args, **kwargs) 2025-08-14T21:37:52.0275598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0276049Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0276473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0276856Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0277019Z 2025-08-14T21:37:52.0277098Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0277299Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0277484Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0277679Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0277896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0278227Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0278524Z return mod(**inputs) 2025-08-14T21:37:52.0278858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0279224Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0279572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0279916Z outputs = block( 2025-08-14T21:37:52.0280215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0280548Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0280895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0281240Z return func(*args, **kwargs) 2025-08-14T21:37:52.0281578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0281938Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0282295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0282635Z return func(*args, **kwargs) 2025-08-14T21:37:52.0282971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0283341Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0283754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0284199Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0284366Z 2025-08-14T21:37:52.0284467Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0284907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0285214Z return mod(**inputs) 2025-08-14T21:37:52.0285547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0285906Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0286263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0286652Z outputs = block( 2025-08-14T21:37:52.0286976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0287305Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0287710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0288054Z return func(*args, **kwargs) 2025-08-14T21:37:52.0288385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0288751Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0289107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0289444Z return func(*args, **kwargs) 2025-08-14T21:37:52.0289801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0290176Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0290584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0290999Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0291156Z 2025-08-14T21:37:52.0291253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0291588Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0291884Z return mod(**inputs) 2025-08-14T21:37:52.0292208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0292570Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0292926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0293266Z outputs = block( 2025-08-14T21:37:52.0293553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0293884Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0294229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0294560Z return func(*args, **kwargs) 2025-08-14T21:37:52.0294898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0295263Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0295621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0295954Z return func(*args, **kwargs) 2025-08-14T21:37:52.0296294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0296652Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0296974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0297340Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0297504Z 2025-08-14T21:37:52.0297599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0297929Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0298217Z return mod(**inputs) 2025-08-14T21:37:52.0298546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0298907Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0299283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0299616Z outputs = block( 2025-08-14T21:37:52.0299924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0300276Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0300616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0300959Z return func(*args, **kwargs) 2025-08-14T21:37:52.0301298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0301675Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0302044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0302417Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0302744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0303101Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0303263Z 2025-08-14T21:37:52.0303360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0303687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0303983Z return mod(**inputs) 2025-08-14T21:37:52.0304304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0304665Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0305077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0305419Z outputs = block( 2025-08-14T21:37:52.0305711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0306042Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0306398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0306739Z return func(*args, **kwargs) 2025-08-14T21:37:52.0307093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0307484Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0307867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0308229Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0308558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0308991Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0309235Z 2025-08-14T21:37:52.0309339Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0309665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0309964Z return mod(**inputs) 2025-08-14T21:37:52.0310296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0310648Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0311002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0311344Z outputs = block( 2025-08-14T21:37:52.0311638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0311984Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0312346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0312687Z return func(*args, **kwargs) 2025-08-14T21:37:52.0313031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0313407Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0313780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0314139Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0314463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0314827Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0314985Z 2025-08-14T21:37:52.0315117Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0315443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0315733Z return mod(**inputs) 2025-08-14T21:37:52.0316065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0316424Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0316769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0317107Z outputs = block( 2025-08-14T21:37:52.0317399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0317727Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0318061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0318401Z return func(*args, **kwargs) 2025-08-14T21:37:52.0318738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 442, in forward 2025-08-14T21:37:52.0319107Z hidden_states = residual + feed_forward_hidden_states 2025-08-14T21:37:52.0319259Z 2025-08-14T21:37:52.0319353Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0319682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0319977Z return mod(**inputs) 2025-08-14T21:37:52.0320298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0320654Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0321006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0321336Z outputs = block( 2025-08-14T21:37:52.0321626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0321954Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0322298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0322629Z return func(*args, **kwargs) 2025-08-14T21:37:52.0322966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0323326Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0323678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0324007Z return func(*args, **kwargs) 2025-08-14T21:37:52.0324343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0324811Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0325275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0325655Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0325821Z 2025-08-14T21:37:52.0325899Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0326098Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0326287Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0326486Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0326702Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0327025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0327325Z return mod(**inputs) 2025-08-14T21:37:52.0327657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0328042Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0328391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0328733Z outputs = block( 2025-08-14T21:37:52.0329029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0329356Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0329704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0330047Z return func(*args, **kwargs) 2025-08-14T21:37:52.0330387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0330741Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0331101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0331442Z return func(*args, **kwargs) 2025-08-14T21:37:52.0331773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0332146Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0332559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0333001Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0333170Z 2025-08-14T21:37:52.0333266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0333596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0333896Z return mod(**inputs) 2025-08-14T21:37:52.0334233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0334591Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0334945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0335289Z outputs = block( 2025-08-14T21:37:52.0335575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0335906Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0336251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0336590Z return func(*args, **kwargs) 2025-08-14T21:37:52.0336922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0337305Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0337662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0338010Z return func(*args, **kwargs) 2025-08-14T21:37:52.0338371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0338747Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0339155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0339567Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0339722Z 2025-08-14T21:37:52.0339816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0340150Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0340464Z return mod(**inputs) 2025-08-14T21:37:52.0340788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0341156Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0341514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0341848Z outputs = block( 2025-08-14T21:37:52.0342147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0342480Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0342827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0343159Z return func(*args, **kwargs) 2025-08-14T21:37:52.0343497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0343861Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0344208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0344548Z return func(*args, **kwargs) 2025-08-14T21:37:52.0344957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0345325Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0345651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0346021Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0346181Z 2025-08-14T21:37:52.0346283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0346622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0346922Z return mod(**inputs) 2025-08-14T21:37:52.0347257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0347628Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0347981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0348329Z outputs = block( 2025-08-14T21:37:52.0348629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0348963Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0349304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0349646Z return func(*args, **kwargs) 2025-08-14T21:37:52.0349985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0350376Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0350773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0351133Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0351475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0351838Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0352004Z 2025-08-14T21:37:52.0352098Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0352430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0352730Z return mod(**inputs) 2025-08-14T21:37:52.0353054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0353438Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0353804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0354145Z outputs = block( 2025-08-14T21:37:52.0354448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0354790Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0355142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0355480Z return func(*args, **kwargs) 2025-08-14T21:37:52.0355827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0356210Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0356582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0356949Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0357274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0357692Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0357908Z 2025-08-14T21:37:52.0358001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0358335Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0358638Z return mod(**inputs) 2025-08-14T21:37:52.0358976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0359339Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0359701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0360051Z outputs = block( 2025-08-14T21:37:52.0360347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0360686Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0361040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0361388Z return func(*args, **kwargs) 2025-08-14T21:37:52.0361726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0362110Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0362490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0362855Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0363213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0363600Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0363763Z 2025-08-14T21:37:52.0363867Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0364218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0364520Z return mod(**inputs) 2025-08-14T21:37:52.0364855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0365217Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0365568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0365911Z outputs = block( 2025-08-14T21:37:52.0366208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0366554Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0366902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0367242Z return func(*args, **kwargs) 2025-08-14T21:37:52.0367585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0367941Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0368298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0368639Z return func(*args, **kwargs) 2025-08-14T21:37:52.0368971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0369426Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0369857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0370222Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0370380Z 2025-08-14T21:37:52.0370457Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0370656Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0370847Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0371036Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0371246Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0371579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0371882Z return mod(**inputs) 2025-08-14T21:37:52.0372208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0372574Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0372933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0373275Z outputs = block( 2025-08-14T21:37:52.0373567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0373903Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0374252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0374585Z return func(*args, **kwargs) 2025-08-14T21:37:52.0374924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0375284Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0375637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0375988Z return func(*args, **kwargs) 2025-08-14T21:37:52.0376338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0376719Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0377137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0377581Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0377757Z 2025-08-14T21:37:52.0377852Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0378184Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0378476Z return mod(**inputs) 2025-08-14T21:37:52.0378809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0379190Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0379548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0379880Z outputs = block( 2025-08-14T21:37:52.0380178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0380510Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0380848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0381189Z return func(*args, **kwargs) 2025-08-14T21:37:52.0381528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0381888Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0382233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0382575Z return func(*args, **kwargs) 2025-08-14T21:37:52.0382915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0383281Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0383685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0384107Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0384254Z 2025-08-14T21:37:52.0384356Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0384835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0385149Z return mod(**inputs) 2025-08-14T21:37:52.0385484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0385852Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0386209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0386552Z outputs = block( 2025-08-14T21:37:52.0386848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0387173Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0387526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0387872Z return func(*args, **kwargs) 2025-08-14T21:37:52.0388212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0388573Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0388978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0389322Z return func(*args, **kwargs) 2025-08-14T21:37:52.0389678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0390064Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0390399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0390778Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0390937Z 2025-08-14T21:37:52.0391167Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0391496Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0391790Z return mod(**inputs) 2025-08-14T21:37:52.0392119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0392507Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0392869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0393215Z outputs = block( 2025-08-14T21:37:52.0393508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0393845Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0394194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0394535Z return func(*args, **kwargs) 2025-08-14T21:37:52.0394867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0395246Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0395627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0395981Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0396313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0396682Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0396839Z 2025-08-14T21:37:52.0396943Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0397266Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0397564Z return mod(**inputs) 2025-08-14T21:37:52.0397891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0398250Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0398605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0398946Z outputs = block( 2025-08-14T21:37:52.0399239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0399565Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0399913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0400252Z return func(*args, **kwargs) 2025-08-14T21:37:52.0400594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0400965Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0401340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0401717Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0402032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0402461Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0402685Z 2025-08-14T21:37:52.0402797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0403130Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0403419Z return mod(**inputs) 2025-08-14T21:37:52.0403748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0404106Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0404459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0404808Z outputs = block( 2025-08-14T21:37:52.0405104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0405437Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0405777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0406119Z return func(*args, **kwargs) 2025-08-14T21:37:52.0406460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0406836Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0407201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0407565Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0407898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0408264Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0408420Z 2025-08-14T21:37:52.0408516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0408848Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0409149Z return mod(**inputs) 2025-08-14T21:37:52.0409474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0409834Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0410189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0410524Z outputs = block( 2025-08-14T21:37:52.0410810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0411145Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0411493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0411825Z return func(*args, **kwargs) 2025-08-14T21:37:52.0412165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 442, in forward 2025-08-14T21:37:52.0412544Z hidden_states = residual + feed_forward_hidden_states 2025-08-14T21:37:52.0412691Z 2025-08-14T21:37:52.0412791Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0413114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0413411Z return mod(**inputs) 2025-08-14T21:37:52.0413739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0414096Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0414464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0414803Z outputs = block( 2025-08-14T21:37:52.0415123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0415463Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0415813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0416152Z return func(*args, **kwargs) 2025-08-14T21:37:52.0416488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0416845Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0417199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0417559Z return func(*args, **kwargs) 2025-08-14T21:37:52.0417887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0418341Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0418771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0419136Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0419293Z 2025-08-14T21:37:52.0419368Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0419565Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0419758Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0419940Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0420156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0420488Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0420792Z return mod(**inputs) 2025-08-14T21:37:52.0421120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0421481Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0421840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0422176Z outputs = block( 2025-08-14T21:37:52.0422474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0422806Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0423152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0423486Z return func(*args, **kwargs) 2025-08-14T21:37:52.0423826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0424194Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0424542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0424978Z return func(*args, **kwargs) 2025-08-14T21:37:52.0425323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0425702Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0426104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0426549Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0426728Z 2025-08-14T21:37:52.0426824Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0427189Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0427483Z return mod(**inputs) 2025-08-14T21:37:52.0427835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0428220Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0428575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0428920Z outputs = block( 2025-08-14T21:37:52.0429222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0429557Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0429902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0429977Z return func(*args, **kwargs) 2025-08-14T21:37:52.0430222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0430306Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0430533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0430598Z return func(*args, **kwargs) 2025-08-14T21:37:52.0430829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0430916Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0431179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0431287Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0431291Z 2025-08-14T21:37:52.0431387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0431581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0431640Z return mod(**inputs) 2025-08-14T21:37:52.0431870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0431955Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0432180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0432238Z outputs = block( 2025-08-14T21:37:52.0432448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0432521Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0432747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0432812Z return func(*args, **kwargs) 2025-08-14T21:37:52.0433038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0433129Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0433350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0433412Z return func(*args, **kwargs) 2025-08-14T21:37:52.0433645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0433720Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0433927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0434036Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0434040Z 2025-08-14T21:37:52.0434136Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0434342Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0434418Z return mod(**inputs) 2025-08-14T21:37:52.0434656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0434750Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0434978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0435044Z outputs = block( 2025-08-14T21:37:52.0435244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0435316Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0435544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0435627Z return func(*args, **kwargs) 2025-08-14T21:37:52.0435865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0435961Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0436194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0436277Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0436482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0436601Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0436605Z 2025-08-14T21:37:52.0436699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0436883Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0436953Z return mod(**inputs) 2025-08-14T21:37:52.0437187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0437262Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0437502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0437559Z outputs = block( 2025-08-14T21:37:52.0437774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0437843Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0438065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0438135Z return func(*args, **kwargs) 2025-08-14T21:37:52.0438364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0438461Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0438699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0438770Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0438975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0439140Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0439143Z 2025-08-14T21:37:52.0439238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0439429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0439487Z return mod(**inputs) 2025-08-14T21:37:52.0439726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0439818Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0440057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0440124Z outputs = block( 2025-08-14T21:37:52.0440340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0440415Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0440643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0440704Z return func(*args, **kwargs) 2025-08-14T21:37:52.0440938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0441030Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0441253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0441358Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0441562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0441681Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0441684Z 2025-08-14T21:37:52.0441780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0441964Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0442034Z return mod(**inputs) 2025-08-14T21:37:52.0442269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0442347Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0442584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0442645Z outputs = block( 2025-08-14T21:37:52.0442859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0442932Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0443158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0443229Z return func(*args, **kwargs) 2025-08-14T21:37:52.0443457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0443538Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0443767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0443833Z return func(*args, **kwargs) 2025-08-14T21:37:52.0444067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0444246Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0444447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0444564Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0444567Z 2025-08-14T21:37:52.0444643Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0444725Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0444796Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0444868Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0444970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0445154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0445238Z return mod(**inputs) 2025-08-14T21:37:52.0445483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0445570Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0445816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0445876Z outputs = block( 2025-08-14T21:37:52.0446077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0446156Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0446375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0446436Z return func(*args, **kwargs) 2025-08-14T21:37:52.0446666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0446766Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0446995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0447058Z return func(*args, **kwargs) 2025-08-14T21:37:52.0447283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0447378Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0447646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0447769Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0447772Z 2025-08-14T21:37:52.0447864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0448045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0448116Z return mod(**inputs) 2025-08-14T21:37:52.0448347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0448420Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0448650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0448707Z outputs = block( 2025-08-14T21:37:52.0448913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0448985Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0449201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0449270Z return func(*args, **kwargs) 2025-08-14T21:37:52.0449492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0449575Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0449800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0449860Z return func(*args, **kwargs) 2025-08-14T21:37:52.0450091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0450185Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0450449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0450556Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0450559Z 2025-08-14T21:37:52.0450651Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0450839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0450918Z return mod(**inputs) 2025-08-14T21:37:52.0451162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0451249Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0451484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0451545Z outputs = block( 2025-08-14T21:37:52.0451758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0451830Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0452062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0452124Z return func(*args, **kwargs) 2025-08-14T21:37:52.0452353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0452456Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0452675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0452744Z return func(*args, **kwargs) 2025-08-14T21:37:52.0452968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0453042Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0453249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0453354Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0453357Z 2025-08-14T21:37:52.0453448Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0453635Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0453696Z return mod(**inputs) 2025-08-14T21:37:52.0453932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0454006Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0454228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0454293Z outputs = block( 2025-08-14T21:37:52.0454491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0454561Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0454786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0454847Z return func(*args, **kwargs) 2025-08-14T21:37:52.0455078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0455171Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0455397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0455480Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0455678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0455789Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0455792Z 2025-08-14T21:37:52.0455884Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0456064Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0456129Z return mod(**inputs) 2025-08-14T21:37:52.0456355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0456450Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0456695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0456753Z outputs = block( 2025-08-14T21:37:52.0456981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0457056Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0457275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0457344Z return func(*args, **kwargs) 2025-08-14T21:37:52.0457569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0457660Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0457908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0457979Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0458178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0458345Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0458348Z 2025-08-14T21:37:52.0458441Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0458629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0458688Z return mod(**inputs) 2025-08-14T21:37:52.0458924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0458996Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0459225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0459289Z outputs = block( 2025-08-14T21:37:52.0459492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0459562Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0459791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0459853Z return func(*args, **kwargs) 2025-08-14T21:37:52.0460085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0460177Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0460402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0460492Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0460691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0460804Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0460808Z 2025-08-14T21:37:52.0460900Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0461082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0461148Z return mod(**inputs) 2025-08-14T21:37:52.0461377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0461451Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0461686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0461757Z outputs = block( 2025-08-14T21:37:52.0461968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0462058Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0462279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0462365Z return func(*args, **kwargs) 2025-08-14T21:37:52.0462594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 442, in forward 2025-08-14T21:37:52.0462699Z hidden_states = residual + feed_forward_hidden_states 2025-08-14T21:37:52.0462703Z 2025-08-14T21:37:52.0462795Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0462979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0463047Z return mod(**inputs) 2025-08-14T21:37:52.0463278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0463378Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0463614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0463672Z outputs = block( 2025-08-14T21:37:52.0463883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0463955Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0464175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0464245Z return func(*args, **kwargs) 2025-08-14T21:37:52.0464470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0464549Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0464848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0464924Z return func(*args, **kwargs) 2025-08-14T21:37:52.0465161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0465334Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0465537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0465655Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0465659Z 2025-08-14T21:37:52.0465733Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0465814Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0465884Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0465954Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0466060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0466244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0466305Z return mod(**inputs) 2025-08-14T21:37:52.0466546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0466620Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0466855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0466913Z outputs = block( 2025-08-14T21:37:52.0467119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0467198Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0467418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0467503Z return func(*args, **kwargs) 2025-08-14T21:37:52.0467752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0467833Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0468071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0468135Z return func(*args, **kwargs) 2025-08-14T21:37:52.0468358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0468453Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0468717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0468833Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0468858Z 2025-08-14T21:37:52.0468952Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0469133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0469201Z return mod(**inputs) 2025-08-14T21:37:52.0469431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0469506Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0469741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0469799Z outputs = block( 2025-08-14T21:37:52.0470005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0470075Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0470295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0470365Z return func(*args, **kwargs) 2025-08-14T21:37:52.0470591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0470670Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0470898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0470960Z return func(*args, **kwargs) 2025-08-14T21:37:52.0471189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0471275Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0471543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0471653Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0471657Z 2025-08-14T21:37:52.0471750Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0471940Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0471999Z return mod(**inputs) 2025-08-14T21:37:52.0472227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0472309Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0472537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0472593Z outputs = block( 2025-08-14T21:37:52.0472800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0472873Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0473142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0473215Z return func(*args, **kwargs) 2025-08-14T21:37:52.0473447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0474184Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0474412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0474475Z return func(*args, **kwargs) 2025-08-14T21:37:52.0474711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0474787Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0474997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0475125Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0475128Z 2025-08-14T21:37:52.0475223Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0475415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0475476Z return mod(**inputs) 2025-08-14T21:37:52.0475715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0475791Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0476015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0476080Z outputs = block( 2025-08-14T21:37:52.0476280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0476352Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0476581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0476644Z return func(*args, **kwargs) 2025-08-14T21:37:52.0476873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0476968Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0477194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0477275Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0477473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0477577Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0477588Z 2025-08-14T21:37:52.0477681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0477864Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0477932Z return mod(**inputs) 2025-08-14T21:37:52.0478161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0478238Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0478467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0478524Z outputs = block( 2025-08-14T21:37:52.0478731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0478802Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0479020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0479106Z return func(*args, **kwargs) 2025-08-14T21:37:52.0479337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0479443Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0479690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0479767Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0479970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0480135Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0480138Z 2025-08-14T21:37:52.0480232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0480422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0480499Z return mod(**inputs) 2025-08-14T21:37:52.0480735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0480810Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0481036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0481101Z outputs = block( 2025-08-14T21:37:52.0481299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0481372Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0481599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0481660Z return func(*args, **kwargs) 2025-08-14T21:37:52.0481895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0481990Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0482214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0482304Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0482502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0482613Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0482618Z 2025-08-14T21:37:52.0482711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0482892Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0482958Z return mod(**inputs) 2025-08-14T21:37:52.0483187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0483265Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0483496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0483555Z outputs = block( 2025-08-14T21:37:52.0483763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0483837Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0484055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0484125Z return func(*args, **kwargs) 2025-08-14T21:37:52.0484347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0484427Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0484876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0484986Z return func(*args, **kwargs) 2025-08-14T21:37:52.0485250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0485456Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0485657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0485771Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0485774Z 2025-08-14T21:37:52.0485848Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0485928Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0485997Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0486066Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0486165Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0486371Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0486431Z return mod(**inputs) 2025-08-14T21:37:52.0486668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0486743Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0486974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0487031Z outputs = block( 2025-08-14T21:37:52.0487231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0487310Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0487528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0487590Z return func(*args, **kwargs) 2025-08-14T21:37:52.0487827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0487906Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0488131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0488191Z return func(*args, **kwargs) 2025-08-14T21:37:52.0488413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0488510Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0488774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0488889Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0488901Z 2025-08-14T21:37:52.0488996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0489179Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0489245Z return mod(**inputs) 2025-08-14T21:37:52.0489474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0489551Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0489781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0489839Z outputs = block( 2025-08-14T21:37:52.0490048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0490120Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0490362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0490455Z return func(*args, **kwargs) 2025-08-14T21:37:52.0490710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0490797Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0491053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0491120Z return func(*args, **kwargs) 2025-08-14T21:37:52.0491373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0491465Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0491749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0491863Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0491884Z 2025-08-14T21:37:52.0491985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0492185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0492246Z return mod(**inputs) 2025-08-14T21:37:52.0492489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0492574Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0492810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0492870Z outputs = block( 2025-08-14T21:37:52.0493088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0493162Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0493398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0493465Z return func(*args, **kwargs) 2025-08-14T21:37:52.0493702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0493792Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0494073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0494137Z return func(*args, **kwargs) 2025-08-14T21:37:52.0494378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0494455Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0494668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0494780Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0494786Z 2025-08-14T21:37:52.0494884Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0495085Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0495148Z return mod(**inputs) 2025-08-14T21:37:52.0495439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0495520Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0495752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0495820Z outputs = block( 2025-08-14T21:37:52.0496028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0496102Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0496339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0496421Z return func(*args, **kwargs) 2025-08-14T21:37:52.0496683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0496786Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0497040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0497126Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0497334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0497445Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0497457Z 2025-08-14T21:37:52.0498099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0498486Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0498757Z return mod(**inputs) 2025-08-14T21:37:52.0499071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0499165Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0499458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0499522Z outputs = block( 2025-08-14T21:37:52.0499767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0499866Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0500249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0500352Z return func(*args, **kwargs) 2025-08-14T21:37:52.0500740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0500855Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0501122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0501220Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0501482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0501681Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0501690Z 2025-08-14T21:37:52.0501801Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0502049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0502116Z return mod(**inputs) 2025-08-14T21:37:52.0502388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0502477Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0502736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0502806Z outputs = block( 2025-08-14T21:37:52.0503040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0503118Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0503397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0503468Z return func(*args, **kwargs) 2025-08-14T21:37:52.0503743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0503843Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0504142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0504259Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0504502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0504639Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0504643Z 2025-08-14T21:37:52.0504751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0505082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0505161Z return mod(**inputs) 2025-08-14T21:37:52.0505417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0505503Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0505804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0505866Z outputs = block( 2025-08-14T21:37:52.0506105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0506185Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0506444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0506521Z return func(*args, **kwargs) 2025-08-14T21:37:52.0506781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 442, in forward 2025-08-14T21:37:52.0506885Z hidden_states = residual + feed_forward_hidden_states 2025-08-14T21:37:52.0506897Z 2025-08-14T21:37:52.0506998Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0507216Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0507291Z return mod(**inputs) 2025-08-14T21:37:52.0507570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0507653Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0507906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0507962Z outputs = block( 2025-08-14T21:37:52.0508168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0508253Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0508471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0508542Z return func(*args, **kwargs) 2025-08-14T21:37:52.0508763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0508848Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0509074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0509135Z return func(*args, **kwargs) 2025-08-14T21:37:52.0509366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 294, in forward 2025-08-14T21:37:52.0509539Z query_states, key_states, value_states = self.c_attn(hidden_states).split(self.split_size, dim=2) 2025-08-14T21:37:52.0509747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0509867Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0509871Z 2025-08-14T21:37:52.0509947Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0510048Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0510119Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0510189Z cudagraph partition due to non gpu ops 2025-08-14T21:37:52.0510322Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0510528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0510593Z return mod(**inputs) 2025-08-14T21:37:52.0510837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0510913Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0511144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0511210Z outputs = block( 2025-08-14T21:37:52.0511414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0511521Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0511746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0511812Z return func(*args, **kwargs) 2025-08-14T21:37:52.0512049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0512134Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0512369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0512431Z return func(*args, **kwargs) 2025-08-14T21:37:52.0512654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0512752Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0513017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:37:52.0513144Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:37:52.0513154Z 2025-08-14T21:37:52.0513249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0513429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0513498Z return mod(**inputs) 2025-08-14T21:37:52.0513724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0513799Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0514030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0514087Z outputs = block( 2025-08-14T21:37:52.0514292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0514364Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0514581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0514647Z return func(*args, **kwargs) 2025-08-14T21:37:52.0514870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0514947Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0515170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0515231Z return func(*args, **kwargs) 2025-08-14T21:37:52.0515461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 336, in forward 2025-08-14T21:37:52.0515546Z attn_output, attn_weights = attention_interface( 2025-08-14T21:37:52.0515828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:37:52.0515952Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:37:52.0515956Z 2025-08-14T21:37:52.0516049Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0516251Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0516311Z return mod(**inputs) 2025-08-14T21:37:52.0516539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0516620Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0516845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0516902Z outputs = block( 2025-08-14T21:37:52.0517110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0517201Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0517431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0517492Z return func(*args, **kwargs) 2025-08-14T21:37:52.0517722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 404, in forward 2025-08-14T21:37:52.0517806Z attn_output, self_attn_weights = self.attn( 2025-08-14T21:37:52.0518026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0518086Z return func(*args, **kwargs) 2025-08-14T21:37:52.0518322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 349, in forward 2025-08-14T21:37:52.0518395Z attn_output = self.c_proj(attn_output) 2025-08-14T21:37:52.0518606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0518714Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0518717Z 2025-08-14T21:37:52.0518810Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0518999Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0519059Z return mod(**inputs) 2025-08-14T21:37:52.0519296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0519371Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0519595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0519659Z outputs = block( 2025-08-14T21:37:52.0519861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0519934Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0520163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0520225Z return func(*args, **kwargs) 2025-08-14T21:37:52.0520458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0520552Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0520776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 365, in forward 2025-08-14T21:37:52.0520857Z hidden_states = self.c_fc(hidden_states) 2025-08-14T21:37:52.0521056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0521184Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0521195Z 2025-08-14T21:37:52.0521288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0521482Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0521551Z return mod(**inputs) 2025-08-14T21:37:52.0521794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0521870Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0522099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0522154Z outputs = block( 2025-08-14T21:37:52.0522362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0522431Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0522665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0522732Z return func(*args, **kwargs) 2025-08-14T21:37:52.0522955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0523047Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0523277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 366, in forward 2025-08-14T21:37:52.0523347Z hidden_states = self.act(hidden_states) 2025-08-14T21:37:52.0523545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:37:52.0523709Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:37:52.0523713Z 2025-08-14T21:37:52.0523806Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0523996Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0524055Z return mod(**inputs) 2025-08-14T21:37:52.0524291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1480, in forward 2025-08-14T21:37:52.0524366Z transformer_outputs = self.transformer( 2025-08-14T21:37:52.0524587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 917, in forward 2025-08-14T21:37:52.0524652Z outputs = block( 2025-08-14T21:37:52.0524851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:37:52.0524922Z return super().__call__(*args, **kwargs) 2025-08-14T21:37:52.0525147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:37:52.0525211Z return func(*args, **kwargs) 2025-08-14T21:37:52.0525442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 440, in forward 2025-08-14T21:37:52.0525534Z feed_forward_hidden_states = self.mlp(hidden_states) 2025-08-14T21:37:52.0525757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 367, in forward 2025-08-14T21:37:52.0525843Z hidden_states = self.c_proj(hidden_states) 2025-08-14T21:37:52.0526039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 116, in forward 2025-08-14T21:37:52.0526143Z x = torch.addmm(self.bias, x.view(-1, x.size(-1)), self.weight) 2025-08-14T21:37:52.0526152Z 2025-08-14T21:37:52.0526243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0526422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0526506Z return mod(**inputs) 2025-08-14T21:37:52.0526733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1494, in forward 2025-08-14T21:37:52.0526815Z logits = self.score(hidden_states) 2025-08-14T21:37:52.0526820Z 2025-08-14T21:37:52.0526921Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0527111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0527179Z return mod(**inputs) 2025-08-14T21:37:52.0527409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1537, in forward 2025-08-14T21:37:52.0527541Z loss = loss_fct(pooled_logits.view(-1, self.num_labels), labels.view(-1)) 2025-08-14T21:37:52.0527544Z 2025-08-14T21:37:52.0527644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:37:52.0527821Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:37:52.0527900Z return mod(**inputs) 2025-08-14T21:37:52.0528139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/gpt2/modeling_gpt2.py", line 1537, in forward 2025-08-14T21:37:52.0528265Z loss = loss_fct(pooled_logits.view(-1, self.num_labels), labels.view(-1)) 2025-08-14T21:37:52.0528269Z 2025-08-14T21:38:02.1461750Z Compilation time (from dynamo_timed): 15.084142595 2025-08-14T21:38:02.1462038Z pass 2025-08-14T21:38:02.1462311Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:02.1463033Z TIMING: _recursive_pre_grad_passes:0.01196 _recursive_joint_graph_passes:0.49198 _recursive_post_grad_passes:0.0716 async_compile.wait:0.70639 code_gen:7.5365 inductor_compile:8.56415 backend_compile:11.43091 gc:0.00042 entire_frame_compile:15.08414 total_wall_time:15.08414 2025-08-14T21:38:02.1464001Z STATS: call_* op count: 1138 | FakeTensorMode.__torch_dispatch__:12461 | FakeTensor.__torch_dispatch__:4654 | ProxyTorchDispatchMode.__torch_dispatch__:4144 2025-08-14T21:38:02.1464491Z Dynamo produced 2 graphs covering 1138 ops with 0 graph breaks (0 unique) 2025-08-14T21:38:06.4460138Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:38:06.4461002Z from pkg_resources import resource_filename 2025-08-14T21:38:07.0554045Z 2025-08-14T21:38:08.1373746Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:38:08.1378019Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:38:08.1382933Z cpu eval GoogleFnet 2025-08-14T21:38:08.5006330Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:08.6236396Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:08.7417830Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:13.6269146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6272766Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6277025Z return mod(**inputs) 2025-08-14T21:38:13.6279441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6279931Z outputs = self.fnet( 2025-08-14T21:38:13.6280276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6280643Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6281001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6281698Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6282088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6282435Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6282849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6283233Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6283602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6283963Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6284324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6284964Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6285190Z 2025-08-14T21:38:13.6285293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6285638Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6285954Z return mod(**inputs) 2025-08-14T21:38:13.6286286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6286636Z outputs = self.fnet( 2025-08-14T21:38:13.6286958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6287306Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6287641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6288003Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6288342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6288672Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6289026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6289487Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6289872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6290224Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6290620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6291003Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6291148Z 2025-08-14T21:38:13.6291252Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6291594Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6291915Z return mod(**inputs) 2025-08-14T21:38:13.6292242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6292576Z outputs = self.fnet( 2025-08-14T21:38:13.6292905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6293250Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6293626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6293982Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6294317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6294649Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6294991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6295453Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6295845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6296239Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6296595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6296966Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6297121Z 2025-08-14T21:38:13.6297222Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6297560Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6297864Z return mod(**inputs) 2025-08-14T21:38:13.6298194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6298598Z outputs = self.fnet( 2025-08-14T21:38:13.6298920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6299266Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6299602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6299961Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6300300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6300621Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6300970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6301341Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6301706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6302052Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6302400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6302770Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6302912Z 2025-08-14T21:38:13.6303008Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6303334Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6303629Z return mod(**inputs) 2025-08-14T21:38:13.6303951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6304285Z outputs = self.fnet( 2025-08-14T21:38:13.6304604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6305047Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6305388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6305751Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6306090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6306429Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6306775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6307147Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6307514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6307899Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6308250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6308649Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6308795Z 2025-08-14T21:38:13.6308942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6309275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6309563Z return mod(**inputs) 2025-08-14T21:38:13.6309886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6310224Z outputs = self.fnet( 2025-08-14T21:38:13.6310539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6310886Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6311256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6311617Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6311946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6312277Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6326101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6326526Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6326912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6327286Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6327655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6328054Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6328210Z 2025-08-14T21:38:13.6328315Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6328657Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6328974Z return mod(**inputs) 2025-08-14T21:38:13.6329299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6329657Z outputs = self.fnet( 2025-08-14T21:38:13.6329984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6330340Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6330685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6331054Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6331402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6331737Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6332095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6332472Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6332844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6333197Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6333553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6333930Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6334071Z 2025-08-14T21:38:13.6334170Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6334580Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6334907Z return mod(**inputs) 2025-08-14T21:38:13.6335240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6335605Z outputs = self.fnet( 2025-08-14T21:38:13.6335935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6336291Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6336637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6337007Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6337352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6337721Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6338072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6338453Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6338824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6339182Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6339529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6339910Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6340050Z 2025-08-14T21:38:13.6340155Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6340479Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6340784Z return mod(**inputs) 2025-08-14T21:38:13.6341109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6341456Z outputs = self.fnet( 2025-08-14T21:38:13.6341774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6342123Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6342468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6342829Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6343159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6343497Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6343849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6344214Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6344584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6345041Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6345404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6345777Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6345931Z 2025-08-14T21:38:13.6346028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6346360Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6346655Z return mod(**inputs) 2025-08-14T21:38:13.6346988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6347362Z outputs = self.fnet( 2025-08-14T21:38:13.6347705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6348048Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6348407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6348775Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6349111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6349434Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6349786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6350153Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6350511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6350890Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6351247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6351622Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6351764Z 2025-08-14T21:38:13.6351860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6352187Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6352486Z return mod(**inputs) 2025-08-14T21:38:13.6352799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6353141Z outputs = self.fnet( 2025-08-14T21:38:13.6353462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6353813Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6354152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6354513Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6354852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6355184Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6355526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6355895Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6356264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6356611Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6356969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6357345Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6357488Z 2025-08-14T21:38:13.6357590Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6357913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6358212Z return mod(**inputs) 2025-08-14T21:38:13.6358535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6358869Z outputs = self.fnet( 2025-08-14T21:38:13.6359193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6359541Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6359888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6360265Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6360618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6360951Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6361317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6361686Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6362054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6362411Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6362756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6363139Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6363301Z 2025-08-14T21:38:13.6363403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6363727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6364025Z return mod(**inputs) 2025-08-14T21:38:13.6364351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6364684Z outputs = self.fnet( 2025-08-14T21:38:13.6365008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 512, in forward 2025-08-14T21:38:13.6365362Z embedding_output = self.embeddings( 2025-08-14T21:38:13.6365715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 142, in forward 2025-08-14T21:38:13.6366070Z embeddings = self.projection(embeddings) 2025-08-14T21:38:13.6366206Z 2025-08-14T21:38:13.6366281Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6366503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6366835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6367123Z return mod(**inputs) 2025-08-14T21:38:13.6367450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6367794Z outputs = self.fnet( 2025-08-14T21:38:13.6368108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6368458Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6368801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6369160Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6369488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6369821Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6370171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6370532Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6370898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6371250Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6371600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6371967Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6372115Z 2025-08-14T21:38:13.6372208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6372557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6372856Z return mod(**inputs) 2025-08-14T21:38:13.6373189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6373538Z outputs = self.fnet( 2025-08-14T21:38:13.6373884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6374229Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6374571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6374934Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6375270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6375594Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6375963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6376331Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6376687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6377046Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6377395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6377769Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6377911Z 2025-08-14T21:38:13.6378005Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6378332Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6378628Z return mod(**inputs) 2025-08-14T21:38:13.6378954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6379291Z outputs = self.fnet( 2025-08-14T21:38:13.6379613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6379961Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6380297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6380652Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6380985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6381316Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6381658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6382030Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6382398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6382746Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6383100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6383475Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6383616Z 2025-08-14T21:38:13.6383719Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6384038Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6384337Z return mod(**inputs) 2025-08-14T21:38:13.6384875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6385233Z outputs = self.fnet( 2025-08-14T21:38:13.6385622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6386005Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6386352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6386733Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6387075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6387405Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6387752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6388112Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6388475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6388873Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6389217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6389590Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6389739Z 2025-08-14T21:38:13.6389835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6390161Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6390450Z return mod(**inputs) 2025-08-14T21:38:13.6390769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6391112Z outputs = self.fnet( 2025-08-14T21:38:13.6391434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6391778Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6392124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6392486Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6392814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6393146Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6393500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6393861Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6394229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6394598Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6394975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6395389Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6395777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6396140Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6396267Z 2025-08-14T21:38:13.6396369Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6396693Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6396992Z return mod(**inputs) 2025-08-14T21:38:13.6397312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6397653Z outputs = self.fnet( 2025-08-14T21:38:13.6397968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6398337Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6398730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6399087Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6399455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6399800Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6400166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6400536Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6400922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6401307Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6401693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6402143Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6402541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6402941Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6403296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6403718Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6403939Z 2025-08-14T21:38:13.6404035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6404363Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6404655Z return mod(**inputs) 2025-08-14T21:38:13.6404984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6405326Z outputs = self.fnet( 2025-08-14T21:38:13.6405643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6405992Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6406335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6406694Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6407019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6407351Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6407703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6408066Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6408430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6408790Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6409167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6409586Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6409981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6410339Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6410464Z 2025-08-14T21:38:13.6410549Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6410764Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6411094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6411415Z return mod(**inputs) 2025-08-14T21:38:13.6411751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6412099Z outputs = self.fnet( 2025-08-14T21:38:13.6412439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6412790Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6413123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6413482Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6413817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6414145Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6414507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6414877Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6415244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6415592Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6415948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6416320Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6416463Z 2025-08-14T21:38:13.6416564Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6416886Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6417181Z return mod(**inputs) 2025-08-14T21:38:13.6417503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6417847Z outputs = self.fnet( 2025-08-14T21:38:13.6418164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6418516Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6418854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6419212Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6419550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6419877Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6420221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6420590Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6420958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6421304Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6421656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6422025Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6422165Z 2025-08-14T21:38:13.6422267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6422586Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6422881Z return mod(**inputs) 2025-08-14T21:38:13.6423200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6423532Z outputs = self.fnet( 2025-08-14T21:38:13.6423872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6424222Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6424592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6425044Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6425394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6425734Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6426095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6426466Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6426842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6427221Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6427568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6427944Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6428095Z 2025-08-14T21:38:13.6428192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6428519Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6428809Z return mod(**inputs) 2025-08-14T21:38:13.6429129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6429474Z outputs = self.fnet( 2025-08-14T21:38:13.6429785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6430135Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6430476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6430836Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6431165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6431494Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6431845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6432214Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6432570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6432926Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6433496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6433866Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6434015Z 2025-08-14T21:38:13.6434111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6434443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6434749Z return mod(**inputs) 2025-08-14T21:38:13.6435066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6435412Z outputs = self.fnet( 2025-08-14T21:38:13.6435736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6436084Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6436419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6436817Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6437151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6437491Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6437858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6438222Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6438592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6438947Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6439325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6439742Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6440122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6440503Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6440636Z 2025-08-14T21:38:13.6440731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6441061Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6441353Z return mod(**inputs) 2025-08-14T21:38:13.6441670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6442012Z outputs = self.fnet( 2025-08-14T21:38:13.6442336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6442673Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6443014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6443371Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6443698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6444026Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6444374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6444729Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6445089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6445451Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6445834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6446240Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6446627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6447011Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6447352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6447768Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6447988Z 2025-08-14T21:38:13.6448083Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6448415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6448707Z return mod(**inputs) 2025-08-14T21:38:13.6449034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6449377Z outputs = self.fnet( 2025-08-14T21:38:13.6449725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6450085Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6450431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6450812Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6451142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6451473Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6451821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6452176Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6452533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6452916Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6453288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6453708Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6454095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6454450Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6454573Z 2025-08-14T21:38:13.6454652Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6454864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6455192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6455489Z return mod(**inputs) 2025-08-14T21:38:13.6455813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6456150Z outputs = self.fnet( 2025-08-14T21:38:13.6456474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6456821Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6457159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6457522Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6457857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6458190Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6458531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6458904Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6459275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6459632Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6459978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6460353Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6460497Z 2025-08-14T21:38:13.6460598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6460917Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6461214Z return mod(**inputs) 2025-08-14T21:38:13.6461536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6461874Z outputs = self.fnet( 2025-08-14T21:38:13.6462205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6462551Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6462905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6463275Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6463612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6463943Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6464292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6464654Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6465107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6465496Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6465855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6466228Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6466378Z 2025-08-14T21:38:13.6466474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6466809Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6467103Z return mod(**inputs) 2025-08-14T21:38:13.6467432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6467775Z outputs = self.fnet( 2025-08-14T21:38:13.6468097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6468437Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6468782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6469143Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6469472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6469803Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6470154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6470522Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6470879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6471230Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6471582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6471956Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6472098Z 2025-08-14T21:38:13.6472194Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6472522Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6472821Z return mod(**inputs) 2025-08-14T21:38:13.6473137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6473483Z outputs = self.fnet( 2025-08-14T21:38:13.6473807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6474155Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6474489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6474882Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6475218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6475559Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6475931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6476300Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6476667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6477012Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6477366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6477737Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6477880Z 2025-08-14T21:38:13.6478001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6478322Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6478620Z return mod(**inputs) 2025-08-14T21:38:13.6478945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6479278Z outputs = self.fnet( 2025-08-14T21:38:13.6479600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6479947Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6480288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6480638Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6480970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6481303Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6481649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6482006Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6482377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6482739Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6483104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6483515Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6483898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6484255Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6484383Z 2025-08-14T21:38:13.6484476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6484936Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6485240Z return mod(**inputs) 2025-08-14T21:38:13.6485564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6485912Z outputs = self.fnet( 2025-08-14T21:38:13.6486238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6486592Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6486935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6487299Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6487643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6488027Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6488396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6488758Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6489154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6489505Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6489868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6490273Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6490650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6491051Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6491389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6491792Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6492001Z 2025-08-14T21:38:13.6492099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6492418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6492710Z return mod(**inputs) 2025-08-14T21:38:13.6493029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6493360Z outputs = self.fnet( 2025-08-14T21:38:13.6493678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6494027Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6494365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6494715Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6495047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6495371Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6495708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6496063Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6496425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6496781Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6497145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6497566Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6497954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6498302Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6498422Z 2025-08-14T21:38:13.6498492Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6498702Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6499021Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6499307Z return mod(**inputs) 2025-08-14T21:38:13.6499623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6499956Z outputs = self.fnet( 2025-08-14T21:38:13.6500273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6500626Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6500977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6501351Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6501681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6502008Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6502354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6502719Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6503080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6504146Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6504490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6504934Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6505081Z 2025-08-14T21:38:13.6505179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6505505Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6505804Z return mod(**inputs) 2025-08-14T21:38:13.6506121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6506464Z outputs = self.fnet( 2025-08-14T21:38:13.6506786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6507137Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6507477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6507837Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6508161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6508486Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6508836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6509204Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6509568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6509912Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6510263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6510645Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6510789Z 2025-08-14T21:38:13.6510891Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6511211Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6511507Z return mod(**inputs) 2025-08-14T21:38:13.6511830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6512161Z outputs = self.fnet( 2025-08-14T21:38:13.6512484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6512831Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6513177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6513530Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6513894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6514252Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6514612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6514986Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6515352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6515705Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6516048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6516420Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6516568Z 2025-08-14T21:38:13.6516663Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6517023Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6517313Z return mod(**inputs) 2025-08-14T21:38:13.6517631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6517976Z outputs = self.fnet( 2025-08-14T21:38:13.6518286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6518636Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6518974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6519333Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6519661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6519994Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6520346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6520715Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6521074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6521428Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6521779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6522142Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6522292Z 2025-08-14T21:38:13.6522387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6522716Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6523019Z return mod(**inputs) 2025-08-14T21:38:13.6523336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6523677Z outputs = self.fnet( 2025-08-14T21:38:13.6524001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6524340Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6524682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6525040Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6525374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6525698Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6526046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6526429Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6526817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6527175Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6527569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6527983Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6528360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6528719Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6528853Z 2025-08-14T21:38:13.6528946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6529273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6529587Z return mod(**inputs) 2025-08-14T21:38:13.6529914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6530261Z outputs = self.fnet( 2025-08-14T21:38:13.6530582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6530933Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6531276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6531641Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6531971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6532302Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6532651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6533015Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6533380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6533744Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6534120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6534527Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6534913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6535300Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6535649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6536066Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6536291Z 2025-08-14T21:38:13.6536388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6536723Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6537024Z return mod(**inputs) 2025-08-14T21:38:13.6537344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6537690Z outputs = self.fnet( 2025-08-14T21:38:13.6538017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6538359Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6538705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6539088Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6539444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6539771Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6540152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6540525Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6540899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6541274Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6541662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6542105Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6542526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6542895Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6543033Z 2025-08-14T21:38:13.6543110Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6543333Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6543664Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6543967Z return mod(**inputs) 2025-08-14T21:38:13.6544298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6544637Z outputs = self.fnet( 2025-08-14T21:38:13.6545050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6545423Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6545786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6546156Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6546508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6546855Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6547213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6547599Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6547984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6548355Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6548718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6549110Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6549256Z 2025-08-14T21:38:13.6549362Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6549701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6550004Z return mod(**inputs) 2025-08-14T21:38:13.6550335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6550685Z outputs = self.fnet( 2025-08-14T21:38:13.6551008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6551368Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6551717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6552084Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6552444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6552798Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6553177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6553550Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6553925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6554289Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6554649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6555026Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6555178Z 2025-08-14T21:38:13.6555276Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6555636Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6555939Z return mod(**inputs) 2025-08-14T21:38:13.6556265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6556622Z outputs = self.fnet( 2025-08-14T21:38:13.6556946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6557285Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6557630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6557987Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6558322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6558650Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6558999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6559368Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6559726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6560079Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6560429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6560797Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6560937Z 2025-08-14T21:38:13.6561029Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6561357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6561654Z return mod(**inputs) 2025-08-14T21:38:13.6561979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6562310Z outputs = self.fnet( 2025-08-14T21:38:13.6562633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6562979Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6563312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6563673Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6564007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6564331Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6564671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6565059Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6565441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6565796Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6566161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6566535Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6566678Z 2025-08-14T21:38:13.6566778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6567100Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6567393Z return mod(**inputs) 2025-08-14T21:38:13.6567713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6568076Z outputs = self.fnet( 2025-08-14T21:38:13.6568390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6568739Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6569084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6569435Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6569773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6570101Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6570450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6570800Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6571171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6571539Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6571913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6572323Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6572708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6573066Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6573194Z 2025-08-14T21:38:13.6573288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6573619Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6573915Z return mod(**inputs) 2025-08-14T21:38:13.6574237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6574575Z outputs = self.fnet( 2025-08-14T21:38:13.6574899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6575256Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6575591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6575950Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6576287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6576618Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6576961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6577319Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6577716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6578084Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6578468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6578904Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6579289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6579667Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6580014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6580427Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6580637Z 2025-08-14T21:38:13.6580757Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6581084Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6581385Z return mod(**inputs) 2025-08-14T21:38:13.6581718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6582064Z outputs = self.fnet( 2025-08-14T21:38:13.6582382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6582732Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6583076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6583434Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6583771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6584109Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6584462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6584976Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6585364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6585741Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6586122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6586570Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6586969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6587405Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6587538Z 2025-08-14T21:38:13.6587616Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6587845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6588192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6588502Z return mod(**inputs) 2025-08-14T21:38:13.6588833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6589186Z outputs = self.fnet( 2025-08-14T21:38:13.6589523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6589875Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6590233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6590606Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6591004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6591359Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6591722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6592133Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6592509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6592877Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6593245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6593630Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6593777Z 2025-08-14T21:38:13.6593876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6594242Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6594546Z return mod(**inputs) 2025-08-14T21:38:13.6594881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6595229Z outputs = self.fnet( 2025-08-14T21:38:13.6595563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6595921Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6596263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6596634Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6596977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6597315Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6597670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6598049Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6598426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6598788Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6599141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6599535Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6599674Z 2025-08-14T21:38:13.6599774Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6600094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6600391Z return mod(**inputs) 2025-08-14T21:38:13.6600717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6601058Z outputs = self.fnet( 2025-08-14T21:38:13.6601371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6601719Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6602061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6602420Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6602753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6603086Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6603435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6603845Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6604228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6604593Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6604967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6605339Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6605491Z 2025-08-14T21:38:13.6605588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6605920Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6606213Z return mod(**inputs) 2025-08-14T21:38:13.6606542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6606909Z outputs = self.fnet( 2025-08-14T21:38:13.6607233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6607576Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6607922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6608285Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6608614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6608944Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6609295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6609664Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6610022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6610381Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6610733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6611106Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6611250Z 2025-08-14T21:38:13.6611345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6611672Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6611969Z return mod(**inputs) 2025-08-14T21:38:13.6612285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6612625Z outputs = self.fnet( 2025-08-14T21:38:13.6612946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6613296Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6613632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6613992Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6614329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6614652Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6614998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6615355Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6615724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6616078Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6616449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6616880Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6617286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6617656Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6617792Z 2025-08-14T21:38:13.6617888Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6618221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6618513Z return mod(**inputs) 2025-08-14T21:38:13.6618839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6619181Z outputs = self.fnet( 2025-08-14T21:38:13.6619505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6619869Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6620216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6620580Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6620910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6621244Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6621598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6621961Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6622323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6622690Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6623070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6623485Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6623866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6624250Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6624601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6625085Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6625311Z 2025-08-14T21:38:13.6625408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6625745Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6626049Z return mod(**inputs) 2025-08-14T21:38:13.6626368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6626717Z outputs = self.fnet( 2025-08-14T21:38:13.6627043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6627392Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6627727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6628088Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6628429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6628752Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6629103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6629489Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6629871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6630232Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6630624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6631053Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6631449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6631807Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6631942Z 2025-08-14T21:38:13.6632019Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6632242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6632584Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6632885Z return mod(**inputs) 2025-08-14T21:38:13.6633212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6633555Z outputs = self.fnet( 2025-08-14T21:38:13.6633873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6634223Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6634568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6634923Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6635259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6635593Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6635949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6636318Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6636692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6637051Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6637398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6637780Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6637930Z 2025-08-14T21:38:13.6638026Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6638359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6638651Z return mod(**inputs) 2025-08-14T21:38:13.6638980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6639321Z outputs = self.fnet( 2025-08-14T21:38:13.6639645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6639989Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6640340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6640701Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6641028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6641360Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6641710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6642098Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6642477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6642835Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6643202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6643583Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6643728Z 2025-08-14T21:38:13.6643823Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6644156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6644457Z return mod(**inputs) 2025-08-14T21:38:13.6644773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6645120Z outputs = self.fnet( 2025-08-14T21:38:13.6645464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6645816Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6646154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6646515Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6646720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6646800Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6647023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6647117Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6647339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6647414Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6647646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6647736Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6647740Z 2025-08-14T21:38:13.6647833Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6648020Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6648079Z return mod(**inputs) 2025-08-14T21:38:13.6648307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6648365Z outputs = self.fnet( 2025-08-14T21:38:13.6648586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6648662Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6648885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6648968Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6649171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6649242Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6649467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6649553Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6649773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6649851Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6650071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6650189Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6650211Z 2025-08-14T21:38:13.6650307Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6650505Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6650575Z return mod(**inputs) 2025-08-14T21:38:13.6650803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6650862Z outputs = self.fnet( 2025-08-14T21:38:13.6651093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6651158Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6651391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6651488Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6651691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6651769Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6651991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6652075Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6652312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6652381Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6652642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6652745Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6652970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6653056Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6653059Z 2025-08-14T21:38:13.6653152Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6653343Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6653402Z return mod(**inputs) 2025-08-14T21:38:13.6653624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6653690Z outputs = self.fnet( 2025-08-14T21:38:13.6653912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6653986Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6654206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6654284Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6654491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6654563Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6654783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6654869Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6655104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6655180Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6655430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6655561Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6655810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6655912Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6656124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6656291Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6656294Z 2025-08-14T21:38:13.6656390Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6656580Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6656640Z return mod(**inputs) 2025-08-14T21:38:13.6656870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6656955Z outputs = self.fnet( 2025-08-14T21:38:13.6657184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6657258Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6657486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6657562Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6657772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6657843Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6658077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6658153Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6658392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6658470Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6658728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6658846Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6659080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6659154Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6659157Z 2025-08-14T21:38:13.6659238Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6659331Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6659515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6659582Z return mod(**inputs) 2025-08-14T21:38:13.6659808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6659867Z outputs = self.fnet( 2025-08-14T21:38:13.6660100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6660167Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6660402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6660477Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6660680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6660758Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6660982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6661094Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6661335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6661408Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6661652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6661747Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6661751Z 2025-08-14T21:38:13.6661844Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6662035Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6662094Z return mod(**inputs) 2025-08-14T21:38:13.6662322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6662380Z outputs = self.fnet( 2025-08-14T21:38:13.6662622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6662697Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6662920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6662997Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6663206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6663276Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6663504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6663593Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6663816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6663900Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6664124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6664223Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6664226Z 2025-08-14T21:38:13.6664320Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6664502Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6664569Z return mod(**inputs) 2025-08-14T21:38:13.6664860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6664928Z outputs = self.fnet( 2025-08-14T21:38:13.6665160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6665230Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6665461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6665540Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6665741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6665822Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6666045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6666140Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6666361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6666436Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6666667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6666783Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6666787Z 2025-08-14T21:38:13.6666896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6667105Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6667166Z return mod(**inputs) 2025-08-14T21:38:13.6667396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6667456Z outputs = self.fnet( 2025-08-14T21:38:13.6667678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6667751Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6667974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6668082Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6668283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6668355Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6668587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6668673Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6668893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6668973Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6669194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6669292Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6669299Z 2025-08-14T21:38:13.6669390Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6669570Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6669637Z return mod(**inputs) 2025-08-14T21:38:13.6669860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6669919Z outputs = self.fnet( 2025-08-14T21:38:13.6670150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6670217Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6670447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6670522Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6670721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6670801Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6671024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6671108Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6671344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6671414Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6671672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6671776Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6671999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6672104Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6672108Z 2025-08-14T21:38:13.6672201Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6672406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6672467Z return mod(**inputs) 2025-08-14T21:38:13.6672710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6672778Z outputs = self.fnet( 2025-08-14T21:38:13.6673003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6673075Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6673298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6673375Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6673589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6673688Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6673911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6673996Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6674231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6674305Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6674555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6674657Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6674884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6674986Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6675188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6675352Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6675356Z 2025-08-14T21:38:13.6675449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6675639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6675697Z return mod(**inputs) 2025-08-14T21:38:13.6675920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6675986Z outputs = self.fnet( 2025-08-14T21:38:13.6676205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6676278Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6676501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6676578Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6676784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6676854Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6677074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6677157Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6677389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6677490Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6677738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6677885Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6678117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6678204Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6678209Z 2025-08-14T21:38:13.6678293Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6678386Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6678570Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6678638Z return mod(**inputs) 2025-08-14T21:38:13.6678861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6678921Z outputs = self.fnet( 2025-08-14T21:38:13.6679165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6679231Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6679459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6679538Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6679738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6679818Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6680040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6680136Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6680357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6680432Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6680665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6680757Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6680760Z 2025-08-14T21:38:13.6680852Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6681040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6681100Z return mod(**inputs) 2025-08-14T21:38:13.6681329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6681389Z outputs = self.fnet( 2025-08-14T21:38:13.6681609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6681685Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6681909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6681985Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6682195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6682267Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6682496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6682583Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6682804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6682884Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6683103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6683220Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6683223Z 2025-08-14T21:38:13.6683331Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6683529Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6683599Z return mod(**inputs) 2025-08-14T21:38:13.6683822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6683882Z outputs = self.fnet( 2025-08-14T21:38:13.6684111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6684177Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6684412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6684511Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6684831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6684918Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6685152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6685249Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6685480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6685554Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6685789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6685884Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6685890Z 2025-08-14T21:38:13.6685984Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6686180Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6686241Z return mod(**inputs) 2025-08-14T21:38:13.6686486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6686545Z outputs = self.fnet( 2025-08-14T21:38:13.6686816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6686892Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6687120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6687206Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6687411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6687487Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6687723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6687811Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6688039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6688122Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6688349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6688448Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6688451Z 2025-08-14T21:38:13.6688544Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6688727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6688837Z return mod(**inputs) 2025-08-14T21:38:13.6689093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6689154Z outputs = self.fnet( 2025-08-14T21:38:13.6689453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6689523Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6689761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6689837Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6690041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6690118Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6690348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6690460Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6690707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6690781Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6691052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6691161Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6691396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6691483Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6691486Z 2025-08-14T21:38:13.6691584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6691782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6691850Z return mod(**inputs) 2025-08-14T21:38:13.6692085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6692158Z outputs = self.fnet( 2025-08-14T21:38:13.6692393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6692471Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6692703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6692784Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6693002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6693078Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6693314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6693405Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6693654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6693737Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6693998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6694107Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6694351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6694455Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6694662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6694851Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6694871Z 2025-08-14T21:38:13.6694969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6695188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6695250Z return mod(**inputs) 2025-08-14T21:38:13.6695483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6695552Z outputs = self.fnet( 2025-08-14T21:38:13.6695782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6695856Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6696083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6696182Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6696398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6696473Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6696720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6696800Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6697045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6697125Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6697387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6697507Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6697752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6697833Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6697836Z 2025-08-14T21:38:13.6697925Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6698027Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6698220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6698295Z return mod(**inputs) 2025-08-14T21:38:13.6698531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6698596Z outputs = self.fnet( 2025-08-14T21:38:13.6698837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6698907Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6699151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6699245Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6699450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6699534Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6699761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6699860Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6700084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6700160Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6700392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6700502Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6700505Z 2025-08-14T21:38:13.6700613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6700802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6700876Z return mod(**inputs) 2025-08-14T21:38:13.6701109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6701167Z outputs = self.fnet( 2025-08-14T21:38:13.6701387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6701457Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6701677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6701772Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6701980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6702053Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6702281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6702369Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6702590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6702669Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6702889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6702988Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6702991Z 2025-08-14T21:38:13.6703087Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6703268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6703333Z return mod(**inputs) 2025-08-14T21:38:13.6703557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6703615Z outputs = self.fnet( 2025-08-14T21:38:13.6703843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6703909Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6704136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6704212Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6704412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6704493Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6704716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6704868Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6705104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6705177Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6705410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6705502Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6705505Z 2025-08-14T21:38:13.6705598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6705785Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6705866Z return mod(**inputs) 2025-08-14T21:38:13.6706117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6706179Z outputs = self.fnet( 2025-08-14T21:38:13.6706417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6706495Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6706719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6706803Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6707005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6707076Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6707306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6707412Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6707636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6707716Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6707940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6708038Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6708042Z 2025-08-14T21:38:13.6708135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6708316Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6708383Z return mod(**inputs) 2025-08-14T21:38:13.6708608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6708669Z outputs = self.fnet( 2025-08-14T21:38:13.6708902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6708967Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6709198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6709273Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6709473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6709551Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6709775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6709857Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6710098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6710168Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6710433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6710537Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6710759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6710841Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6710844Z 2025-08-14T21:38:13.6710936Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6711124Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6711183Z return mod(**inputs) 2025-08-14T21:38:13.6711406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6711493Z outputs = self.fnet( 2025-08-14T21:38:13.6711732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6711809Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6712051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6712129Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6712336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6712407Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6712630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6712711Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6712987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6713066Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6713325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6713428Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6713661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6713760Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6713963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6714130Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6714136Z 2025-08-14T21:38:13.6714229Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6714423Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6714483Z return mod(**inputs) 2025-08-14T21:38:13.6714709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6714776Z outputs = self.fnet( 2025-08-14T21:38:13.6715002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6715074Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6715298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6715373Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6715582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6715656Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6715880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6715963Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6716201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6716275Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6716526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6716639Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6716870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6716963Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6716966Z 2025-08-14T21:38:13.6717046Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6717184Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6717394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6717477Z return mod(**inputs) 2025-08-14T21:38:13.6717700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6717757Z outputs = self.fnet( 2025-08-14T21:38:13.6717984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6718047Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6718273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6718351Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6718572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6718653Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6718878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6718974Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6719198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6719268Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6719498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6719591Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6719594Z 2025-08-14T21:38:13.6719690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6719879Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6719937Z return mod(**inputs) 2025-08-14T21:38:13.6720173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6720232Z outputs = self.fnet( 2025-08-14T21:38:13.6720455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6720529Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6720755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6720829Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6721038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6721111Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6721343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6721432Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6721659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6721739Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6721965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6722064Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6722067Z 2025-08-14T21:38:13.6722160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6722345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6722431Z return mod(**inputs) 2025-08-14T21:38:13.6722661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6722736Z outputs = self.fnet( 2025-08-14T21:38:13.6722986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6723055Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6723292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6723368Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6723568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6723647Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6723868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6723980Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6724204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6724276Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6724505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6724594Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6724598Z 2025-08-14T21:38:13.6724688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6724874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6724934Z return mod(**inputs) 2025-08-14T21:38:13.6725164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6725224Z outputs = self.fnet( 2025-08-14T21:38:13.6725447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6725520Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6725742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6725825Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6726023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6726093Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6726320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6726405Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6726625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6726705Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6726927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6727022Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6727026Z 2025-08-14T21:38:13.6727118Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6727297Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6727363Z return mod(**inputs) 2025-08-14T21:38:13.6727584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6727642Z outputs = self.fnet( 2025-08-14T21:38:13.6727873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6727954Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6728201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6728280Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6728498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6728579Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6728809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6728893Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6729130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6729201Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6729480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6729582Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6729809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6729890Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6729894Z 2025-08-14T21:38:13.6729985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6730172Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6730231Z return mod(**inputs) 2025-08-14T21:38:13.6730454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6730520Z outputs = self.fnet( 2025-08-14T21:38:13.6730745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6730819Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6731042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6731117Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6731321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6731390Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6731610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6731692Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6731928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6732008Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6732260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6732362Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6732594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6732691Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6732891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6733054Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6733057Z 2025-08-14T21:38:13.6733150Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6733341Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6733417Z return mod(**inputs) 2025-08-14T21:38:13.6733661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6733729Z outputs = self.fnet( 2025-08-14T21:38:13.6733965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6734040Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6734266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6734342Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6734552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6734623Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6734847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6734948Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6735187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6735267Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6735521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6735635Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6735869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6735943Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6735946Z 2025-08-14T21:38:13.6736027Z cudagraph partition due to non gpu ops 2025-08-14T21:38:13.6736121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6736307Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6736374Z return mod(**inputs) 2025-08-14T21:38:13.6736597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6736659Z outputs = self.fnet( 2025-08-14T21:38:13.6736893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6736958Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6737188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6737264Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6737462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6737544Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6737765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6737858Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6738082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6738154Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6740561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6740675Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6740678Z 2025-08-14T21:38:13.6740776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6740977Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6741059Z return mod(**inputs) 2025-08-14T21:38:13.6741291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6741361Z outputs = self.fnet( 2025-08-14T21:38:13.6741605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6741682Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6741910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6742022Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6742228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6742307Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6742533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6742641Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6742874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6742946Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6743178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6743270Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6743273Z 2025-08-14T21:38:13.6743367Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6743556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6743616Z return mod(**inputs) 2025-08-14T21:38:13.6743840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6743909Z outputs = self.fnet( 2025-08-14T21:38:13.6744132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6744204Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6744429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6744505Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6744713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6744871Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6745113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6745201Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6745423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6745507Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6745731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6745825Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6745838Z 2025-08-14T21:38:13.6745931Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6746113Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6746245Z return mod(**inputs) 2025-08-14T21:38:13.6746472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6746533Z outputs = self.fnet( 2025-08-14T21:38:13.6746764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6746848Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6747072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6747154Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6747372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6747454Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6747678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 249, in forward 2025-08-14T21:38:13.6747763Z self_fourier_outputs = self.fourier(hidden_states) 2025-08-14T21:38:13.6747993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 202, in forward 2025-08-14T21:38:13.6748063Z self_outputs = self.self(hidden_states) 2025-08-14T21:38:13.6748314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 181, in forward 2025-08-14T21:38:13.6748404Z outputs = self.fourier_transform(hidden_states).real 2025-08-14T21:38:13.6748408Z 2025-08-14T21:38:13.6748500Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6748693Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6748751Z return mod(**inputs) 2025-08-14T21:38:13.6748977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6749044Z outputs = self.fnet( 2025-08-14T21:38:13.6749268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6749341Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6749565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6749644Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6749853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6749923Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6750155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6750232Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6750471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6750547Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6750802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6750908Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6751143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 219, in forward 2025-08-14T21:38:13.6751217Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6751220Z 2025-08-14T21:38:13.6751318Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6751499Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6751556Z return mod(**inputs) 2025-08-14T21:38:13.6751817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6751879Z outputs = self.fnet( 2025-08-14T21:38:13.6752113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6752178Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6752420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6752502Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6752700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6752786Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6753017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6753089Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6753331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6753398Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6753662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 261, in feed_forward_chunk 2025-08-14T21:38:13.6753783Z intermediate_output = self.intermediate(fourier_output) 2025-08-14T21:38:13.6754007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 220, in forward 2025-08-14T21:38:13.6754112Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:13.6754311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 47, in forward 2025-08-14T21:38:13.6754486Z return 0.5 * input * (1.0 + torch.tanh(math.sqrt(2.0 / math.pi) * (input + 0.044715 * torch.pow(input, 3.0)))) 2025-08-14T21:38:13.6754489Z 2025-08-14T21:38:13.6754585Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6754764Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6754830Z return mod(**inputs) 2025-08-14T21:38:13.6755054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 671, in forward 2025-08-14T21:38:13.6755122Z outputs = self.fnet( 2025-08-14T21:38:13.6755346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 518, in forward 2025-08-14T21:38:13.6755411Z encoder_outputs = self.encoder( 2025-08-14T21:38:13.6755642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 280, in forward 2025-08-14T21:38:13.6755717Z layer_outputs = layer_module(hidden_states) 2025-08-14T21:38:13.6755919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:13.6755998Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:13.6756220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 252, in forward 2025-08-14T21:38:13.6756301Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:13.6756540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:13.6756609Z return forward_fn(*input_tensors) 2025-08-14T21:38:13.6756867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 262, in feed_forward_chunk 2025-08-14T21:38:13.6756982Z layer_output = self.output(intermediate_output, fourier_output) 2025-08-14T21:38:13.6757208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 233, in forward 2025-08-14T21:38:13.6757337Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6757341Z 2025-08-14T21:38:13.6757440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6757632Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6757696Z return mod(**inputs) 2025-08-14T21:38:13.6757922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 681, in forward 2025-08-14T21:38:13.6758035Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:38:13.6758260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 359, in forward 2025-08-14T21:38:13.6758388Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:38:13.6758615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 340, in forward 2025-08-14T21:38:13.6758697Z hidden_states = self.transform(hidden_states) 2025-08-14T21:38:13.6758928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 321, in forward 2025-08-14T21:38:13.6759002Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:13.6759005Z 2025-08-14T21:38:13.6759104Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6759316Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6759375Z return mod(**inputs) 2025-08-14T21:38:13.6759608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 681, in forward 2025-08-14T21:38:13.6759689Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:38:13.6759912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 359, in forward 2025-08-14T21:38:13.6760019Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:38:13.6760244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 341, in forward 2025-08-14T21:38:13.6760329Z hidden_states = self.decoder(hidden_states) 2025-08-14T21:38:13.6760332Z 2025-08-14T21:38:13.6760422Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:13.6760601Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:13.6760671Z return mod(**inputs) 2025-08-14T21:38:13.6760894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/fnet/modeling_fnet.py", line 686, in forward 2025-08-14T21:38:13.6761069Z masked_lm_loss = loss_fct(prediction_scores.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:38:13.6761078Z 2025-08-14T21:38:20.7044643Z Compilation time (from dynamo_timed): 10.964175002 2025-08-14T21:38:20.7095951Z pass 2025-08-14T21:38:20.7099888Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:20.7104545Z TIMING: _recursive_pre_grad_passes:0.00525 _recursive_joint_graph_passes:0.18714 _recursive_post_grad_passes:0.06854 async_compile.wait:0.6403 code_gen:6.71444 inductor_compile:7.72366 backend_compile:9.41736 gc:0.00011 entire_frame_compile:10.96418 total_wall_time:10.96418 2025-08-14T21:38:20.7105868Z STATS: call_* op count: 232 | FakeTensorMode.__torch_dispatch__:7521 | FakeTensor.__torch_dispatch__:3660 | ProxyTorchDispatchMode.__torch_dispatch__:2859 2025-08-14T21:38:20.7106352Z Dynamo produced 1 graphs covering 232 ops with 0 graph breaks (0 unique) 2025-08-14T21:38:24.7816846Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:38:24.7817685Z from pkg_resources import resource_filename 2025-08-14T21:38:25.3476410Z 2025-08-14T21:38:26.5579494Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:38:26.5579933Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:38:26.5592208Z cpu eval LayoutLMForMaskedLM 2025-08-14T21:38:27.0902787Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:27.2671270Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:27.4655156Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:34.9881199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:34.9882120Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:34.9882583Z return mod(**inputs) 2025-08-14T21:38:34.9883666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9884101Z return func(*args, **kwargs) 2025-08-14T21:38:34.9884456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9885000Z return func(*args, **kwargs) 2025-08-14T21:38:34.9885326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9885906Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9886426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:34.9887042Z outputs = self.layoutlm( 2025-08-14T21:38:34.9887478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9887912Z return func(*args, **kwargs) 2025-08-14T21:38:34.9888253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9888607Z return func(*args, **kwargs) 2025-08-14T21:38:34.9888927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9889263Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9889652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:34.9890043Z encoder_outputs = self.encoder( 2025-08-14T21:38:34.9890398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9890743Z return func(*args, **kwargs) 2025-08-14T21:38:34.9891176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9891615Z return func(*args, **kwargs) 2025-08-14T21:38:34.9891956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9892300Z return func(*args, **kwargs) 2025-08-14T21:38:34.9892485Z [Previous line repeated 1 more time] 2025-08-14T21:38:34.9892821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9893157Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9893540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:34.9894024Z layer_outputs = layer_module( 2025-08-14T21:38:34.9894371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:34.9894875Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:34.9895242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9895686Z return func(*args, **kwargs) 2025-08-14T21:38:34.9896029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9896390Z return func(*args, **kwargs) 2025-08-14T21:38:34.9896736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9897132Z return func(*args, **kwargs) 2025-08-14T21:38:34.9897644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:34.9898041Z self_attention_outputs = self.attention( 2025-08-14T21:38:34.9898446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9898938Z return func(*args, **kwargs) 2025-08-14T21:38:34.9899278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9899746Z return func(*args, **kwargs) 2025-08-14T21:38:34.9900195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9900540Z return func(*args, **kwargs) 2025-08-14T21:38:34.9901024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:34.9901393Z self_outputs = self.self( 2025-08-14T21:38:34.9901727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9902208Z return func(*args, **kwargs) 2025-08-14T21:38:34.9902728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9903078Z return func(*args, **kwargs) 2025-08-14T21:38:34.9903403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9903742Z return func(*args, **kwargs) 2025-08-14T21:38:34.9904098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:34.9904549Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:34.9904744Z 2025-08-14T21:38:34.9904938Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:34.9905305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:34.9905631Z return mod(**inputs) 2025-08-14T21:38:34.9905977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9906441Z return func(*args, **kwargs) 2025-08-14T21:38:34.9906877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9907230Z return func(*args, **kwargs) 2025-08-14T21:38:34.9907535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9907869Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9908390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:34.9908763Z outputs = self.layoutlm( 2025-08-14T21:38:34.9909084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9909446Z return func(*args, **kwargs) 2025-08-14T21:38:34.9909781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9910124Z return func(*args, **kwargs) 2025-08-14T21:38:34.9910468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9910797Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9911158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:34.9911549Z encoder_outputs = self.encoder( 2025-08-14T21:38:34.9911888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9912219Z return func(*args, **kwargs) 2025-08-14T21:38:34.9912561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9912900Z return func(*args, **kwargs) 2025-08-14T21:38:34.9913262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9913740Z return func(*args, **kwargs) 2025-08-14T21:38:34.9913918Z [Previous line repeated 1 more time] 2025-08-14T21:38:34.9914243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9914560Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9914925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:34.9915352Z layer_outputs = layer_module( 2025-08-14T21:38:34.9915730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:34.9916054Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:34.9916400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9916734Z return func(*args, **kwargs) 2025-08-14T21:38:34.9917114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9917607Z return func(*args, **kwargs) 2025-08-14T21:38:34.9918044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9918377Z return func(*args, **kwargs) 2025-08-14T21:38:34.9918727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:34.9919189Z self_attention_outputs = self.attention( 2025-08-14T21:38:34.9919545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9919877Z return func(*args, **kwargs) 2025-08-14T21:38:34.9920204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9920540Z return func(*args, **kwargs) 2025-08-14T21:38:34.9920873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9921373Z return func(*args, **kwargs) 2025-08-14T21:38:34.9921826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:34.9922397Z self_outputs = self.self( 2025-08-14T21:38:34.9922916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9923269Z return func(*args, **kwargs) 2025-08-14T21:38:34.9923713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9924050Z return func(*args, **kwargs) 2025-08-14T21:38:34.9924371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9924745Z return func(*args, **kwargs) 2025-08-14T21:38:34.9925105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:34.9925536Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:34.9925714Z 2025-08-14T21:38:34.9925837Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:34.9926176Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:34.9926507Z return mod(**inputs) 2025-08-14T21:38:34.9926879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9927234Z return func(*args, **kwargs) 2025-08-14T21:38:34.9927562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9927899Z return func(*args, **kwargs) 2025-08-14T21:38:34.9928333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9928882Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9929489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:34.9929883Z outputs = self.layoutlm( 2025-08-14T21:38:34.9930204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9930537Z return func(*args, **kwargs) 2025-08-14T21:38:34.9930860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9931186Z return func(*args, **kwargs) 2025-08-14T21:38:34.9931492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9931817Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9932185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:34.9932547Z encoder_outputs = self.encoder( 2025-08-14T21:38:34.9932992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9933460Z return func(*args, **kwargs) 2025-08-14T21:38:34.9933782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9934120Z return func(*args, **kwargs) 2025-08-14T21:38:34.9934449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9934786Z return func(*args, **kwargs) 2025-08-14T21:38:34.9934960Z [Previous line repeated 1 more time] 2025-08-14T21:38:34.9935293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9935673Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9936072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:34.9936441Z layer_outputs = layer_module( 2025-08-14T21:38:34.9936767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:34.9937105Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:34.9937448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9937788Z return func(*args, **kwargs) 2025-08-14T21:38:34.9938115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9938581Z return func(*args, **kwargs) 2025-08-14T21:38:34.9939053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9939390Z return func(*args, **kwargs) 2025-08-14T21:38:34.9939741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:34.9940154Z self_attention_outputs = self.attention( 2025-08-14T21:38:34.9940610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9940973Z return func(*args, **kwargs) 2025-08-14T21:38:34.9941520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9941896Z return func(*args, **kwargs) 2025-08-14T21:38:34.9942346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9942697Z return func(*args, **kwargs) 2025-08-14T21:38:34.9943044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:34.9943414Z self_outputs = self.self( 2025-08-14T21:38:34.9943745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9944111Z return func(*args, **kwargs) 2025-08-14T21:38:34.9944435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9944834Z return func(*args, **kwargs) 2025-08-14T21:38:34.9945174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9945525Z return func(*args, **kwargs) 2025-08-14T21:38:34.9945953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:34.9946415Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:34.9946609Z 2025-08-14T21:38:34.9946692Z cudagraph partition due to non gpu ops 2025-08-14T21:38:34.9946894Z cudagraph partition due to non gpu ops 2025-08-14T21:38:34.9947133Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:34.9947473Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:34.9947769Z return mod(**inputs) 2025-08-14T21:38:34.9948145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9948511Z return func(*args, **kwargs) 2025-08-14T21:38:34.9948855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9949203Z return func(*args, **kwargs) 2025-08-14T21:38:34.9949531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9949877Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9950260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:34.9950658Z outputs = self.layoutlm( 2025-08-14T21:38:34.9951007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9951364Z return func(*args, **kwargs) 2025-08-14T21:38:34.9951705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9952062Z return func(*args, **kwargs) 2025-08-14T21:38:34.9952391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9952760Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9953146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:34.9953539Z encoder_outputs = self.encoder( 2025-08-14T21:38:34.9953901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9954272Z return func(*args, **kwargs) 2025-08-14T21:38:34.9954616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9954974Z return func(*args, **kwargs) 2025-08-14T21:38:34.9955333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9955688Z return func(*args, **kwargs) 2025-08-14T21:38:34.9955885Z [Previous line repeated 1 more time] 2025-08-14T21:38:34.9956224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9956540Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9956907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:34.9957275Z layer_outputs = layer_module( 2025-08-14T21:38:34.9957615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:34.9957942Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:34.9958289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9958627Z return func(*args, **kwargs) 2025-08-14T21:38:34.9958949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9959285Z return func(*args, **kwargs) 2025-08-14T21:38:34.9959611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9959944Z return func(*args, **kwargs) 2025-08-14T21:38:34.9960290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:34.9960671Z self_attention_outputs = self.attention( 2025-08-14T21:38:34.9961016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9961349Z return func(*args, **kwargs) 2025-08-14T21:38:34.9961676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9962016Z return func(*args, **kwargs) 2025-08-14T21:38:34.9962343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9962672Z return func(*args, **kwargs) 2025-08-14T21:38:34.9963027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:34.9963449Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:34.9963867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:34.9964243Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:34.9964379Z 2025-08-14T21:38:34.9964476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:34.9964814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:34.9965108Z return mod(**inputs) 2025-08-14T21:38:34.9965438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9965785Z return func(*args, **kwargs) 2025-08-14T21:38:34.9966137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9966476Z return func(*args, **kwargs) 2025-08-14T21:38:34.9966803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9967163Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9967518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:34.9967882Z outputs = self.layoutlm( 2025-08-14T21:38:34.9968223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9968569Z return func(*args, **kwargs) 2025-08-14T21:38:34.9968889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9969225Z return func(*args, **kwargs) 2025-08-14T21:38:34.9969535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9969861Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9970218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:34.9970610Z encoder_outputs = self.encoder( 2025-08-14T21:38:34.9970956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9971288Z return func(*args, **kwargs) 2025-08-14T21:38:34.9971618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9971953Z return func(*args, **kwargs) 2025-08-14T21:38:34.9972279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9972604Z return func(*args, **kwargs) 2025-08-14T21:38:34.9972781Z [Previous line repeated 1 more time] 2025-08-14T21:38:34.9973102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9973416Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9973789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:34.9974153Z layer_outputs = layer_module( 2025-08-14T21:38:34.9974469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:34.9974795Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:34.9975223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9975651Z return func(*args, **kwargs) 2025-08-14T21:38:34.9976060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9976479Z return func(*args, **kwargs) 2025-08-14T21:38:34.9976806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9977150Z return func(*args, **kwargs) 2025-08-14T21:38:34.9977500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:34.9977884Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:34.9978259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:34.9978628Z return forward_fn(*input_tensors) 2025-08-14T21:38:34.9979037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:34.9979490Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:34.9979910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:34.9980288Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:34.9980447Z 2025-08-14T21:38:34.9980543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:34.9980872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:34.9981173Z return mod(**inputs) 2025-08-14T21:38:34.9981503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9981844Z return func(*args, **kwargs) 2025-08-14T21:38:34.9982172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9982514Z return func(*args, **kwargs) 2025-08-14T21:38:34.9982816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9983144Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9983511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:34.9983894Z outputs = self.layoutlm( 2025-08-14T21:38:34.9984222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9984700Z return func(*args, **kwargs) 2025-08-14T21:38:34.9985163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9985510Z return func(*args, **kwargs) 2025-08-14T21:38:34.9985837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9986179Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9986554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:34.9986946Z encoder_outputs = self.encoder( 2025-08-14T21:38:34.9987305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9987660Z return func(*args, **kwargs) 2025-08-14T21:38:34.9987991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9988341Z return func(*args, **kwargs) 2025-08-14T21:38:34.9988676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9989017Z return func(*args, **kwargs) 2025-08-14T21:38:34.9989200Z [Previous line repeated 1 more time] 2025-08-14T21:38:34.9989537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:34.9989871Z output = func(self, *args, **kwargs) 2025-08-14T21:38:34.9990242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:34.9990700Z layer_outputs = layer_module( 2025-08-14T21:38:34.9991032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:34.9991373Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:34.9991736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9992083Z return func(*args, **kwargs) 2025-08-14T21:38:34.9992422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9992837Z return func(*args, **kwargs) 2025-08-14T21:38:34.9993179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9993529Z return func(*args, **kwargs) 2025-08-14T21:38:34.9993888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:34.9994317Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:34.9994711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:34.9995095Z return forward_fn(*input_tensors) 2025-08-14T21:38:34.9995523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:34.9995982Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:34.9996408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:34.9996823Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:34.9997178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:34.9997527Z return self.act(input) 2025-08-14T21:38:34.9997634Z 2025-08-14T21:38:34.9997742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:34.9998077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:34.9998384Z return mod(**inputs) 2025-08-14T21:38:34.9998714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9999059Z return func(*args, **kwargs) 2025-08-14T21:38:34.9999387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:34.9999731Z return func(*args, **kwargs) 2025-08-14T21:38:35.0000049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0000382Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0000750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0001129Z outputs = self.layoutlm( 2025-08-14T21:38:35.0001505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0001835Z return func(*args, **kwargs) 2025-08-14T21:38:35.0002159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0002496Z return func(*args, **kwargs) 2025-08-14T21:38:35.0002803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0003120Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0003481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0003846Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0004179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0004515Z return func(*args, **kwargs) 2025-08-14T21:38:35.0004835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0005166Z return func(*args, **kwargs) 2025-08-14T21:38:35.0005485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0005827Z return func(*args, **kwargs) 2025-08-14T21:38:35.0006024Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0006356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0006680Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0007045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0007433Z layer_outputs = layer_module( 2025-08-14T21:38:35.0007746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0008081Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0008443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0008775Z return func(*args, **kwargs) 2025-08-14T21:38:35.0009103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0009439Z return func(*args, **kwargs) 2025-08-14T21:38:35.0009768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0010097Z return func(*args, **kwargs) 2025-08-14T21:38:35.0010454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0010849Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0011217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0011573Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0011966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0012415Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0012826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0013205Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0013339Z 2025-08-14T21:38:35.0013436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0013771Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0014064Z return mod(**inputs) 2025-08-14T21:38:35.0014381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0014720Z return func(*args, **kwargs) 2025-08-14T21:38:35.0015040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0015374Z return func(*args, **kwargs) 2025-08-14T21:38:35.0015682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0016006Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0016364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0016733Z outputs = self.layoutlm( 2025-08-14T21:38:35.0017059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0017395Z return func(*args, **kwargs) 2025-08-14T21:38:35.0017713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0018047Z return func(*args, **kwargs) 2025-08-14T21:38:35.0018354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0018670Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0019051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0019424Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0019761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0020108Z return func(*args, **kwargs) 2025-08-14T21:38:35.0020431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0020763Z return func(*args, **kwargs) 2025-08-14T21:38:35.0021091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0021428Z return func(*args, **kwargs) 2025-08-14T21:38:35.0021604Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0021928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0022245Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0022606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0022971Z layer_outputs = layer_module( 2025-08-14T21:38:35.0023298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0023630Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0023970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0024306Z return func(*args, **kwargs) 2025-08-14T21:38:35.0024624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0025051Z return func(*args, **kwargs) 2025-08-14T21:38:35.0025398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0025740Z return func(*args, **kwargs) 2025-08-14T21:38:35.0026109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0026509Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0026875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0027208Z return func(*args, **kwargs) 2025-08-14T21:38:35.0027596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0027953Z return func(*args, **kwargs) 2025-08-14T21:38:35.0028292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0028636Z return func(*args, **kwargs) 2025-08-14T21:38:35.0029004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0029384Z self_outputs = self.self( 2025-08-14T21:38:35.0029719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0030079Z return func(*args, **kwargs) 2025-08-14T21:38:35.0030416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0030762Z return func(*args, **kwargs) 2025-08-14T21:38:35.0031094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0031439Z return func(*args, **kwargs) 2025-08-14T21:38:35.0031801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0032268Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0032466Z 2025-08-14T21:38:35.0032567Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0032913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0033242Z return mod(**inputs) 2025-08-14T21:38:35.0033567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0033916Z return func(*args, **kwargs) 2025-08-14T21:38:35.0034267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0034616Z return func(*args, **kwargs) 2025-08-14T21:38:35.0034924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0035261Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0035639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0036004Z outputs = self.layoutlm( 2025-08-14T21:38:35.0036336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0036709Z return func(*args, **kwargs) 2025-08-14T21:38:35.0037041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0037380Z return func(*args, **kwargs) 2025-08-14T21:38:35.0037693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0038025Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0038394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0038774Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0039121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0039471Z return func(*args, **kwargs) 2025-08-14T21:38:35.0039790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0040130Z return func(*args, **kwargs) 2025-08-14T21:38:35.0040454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0040784Z return func(*args, **kwargs) 2025-08-14T21:38:35.0040962Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0041286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0041609Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0041970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0042341Z layer_outputs = layer_module( 2025-08-14T21:38:35.0042666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0042995Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0043342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0043681Z return func(*args, **kwargs) 2025-08-14T21:38:35.0044009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0044340Z return func(*args, **kwargs) 2025-08-14T21:38:35.0044664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0045021Z return func(*args, **kwargs) 2025-08-14T21:38:35.0045370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0045745Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0046090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0046445Z return func(*args, **kwargs) 2025-08-14T21:38:35.0046761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0047098Z return func(*args, **kwargs) 2025-08-14T21:38:35.0047435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0047772Z return func(*args, **kwargs) 2025-08-14T21:38:35.0048119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0048486Z self_outputs = self.self( 2025-08-14T21:38:35.0048817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0049149Z return func(*args, **kwargs) 2025-08-14T21:38:35.0049474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0049827Z return func(*args, **kwargs) 2025-08-14T21:38:35.0050153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0050478Z return func(*args, **kwargs) 2025-08-14T21:38:35.0050837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0051269Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0051447Z 2025-08-14T21:38:35.0051552Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0051884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0052185Z return mod(**inputs) 2025-08-14T21:38:35.0052508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0052839Z return func(*args, **kwargs) 2025-08-14T21:38:35.0053164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0053497Z return func(*args, **kwargs) 2025-08-14T21:38:35.0053805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0054123Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0054492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0054856Z outputs = self.layoutlm( 2025-08-14T21:38:35.0055181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0055519Z return func(*args, **kwargs) 2025-08-14T21:38:35.0055848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0056187Z return func(*args, **kwargs) 2025-08-14T21:38:35.0056489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0056815Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0057183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0057547Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0057933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0058279Z return func(*args, **kwargs) 2025-08-14T21:38:35.0058608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0058941Z return func(*args, **kwargs) 2025-08-14T21:38:35.0059286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0059622Z return func(*args, **kwargs) 2025-08-14T21:38:35.0059792Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0060130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0060458Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0060825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0061186Z layer_outputs = layer_module( 2025-08-14T21:38:35.0061507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0061841Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0062176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0062530Z return func(*args, **kwargs) 2025-08-14T21:38:35.0062854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0063193Z return func(*args, **kwargs) 2025-08-14T21:38:35.0063513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0063849Z return func(*args, **kwargs) 2025-08-14T21:38:35.0064201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0064609Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0065035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0065390Z return func(*args, **kwargs) 2025-08-14T21:38:35.0065731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0066104Z return func(*args, **kwargs) 2025-08-14T21:38:35.0066470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0066822Z return func(*args, **kwargs) 2025-08-14T21:38:35.0067186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0067559Z self_outputs = self.self( 2025-08-14T21:38:35.0067903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0068249Z return func(*args, **kwargs) 2025-08-14T21:38:35.0068576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0068924Z return func(*args, **kwargs) 2025-08-14T21:38:35.0069261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0069605Z return func(*args, **kwargs) 2025-08-14T21:38:35.0069961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0070410Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0070600Z 2025-08-14T21:38:35.0070684Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0070878Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0071121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0071464Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0071774Z return mod(**inputs) 2025-08-14T21:38:35.0072092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0072458Z return func(*args, **kwargs) 2025-08-14T21:38:35.0072793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0073132Z return func(*args, **kwargs) 2025-08-14T21:38:35.0073462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0073803Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0074185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0074558Z outputs = self.layoutlm( 2025-08-14T21:38:35.0074901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0075251Z return func(*args, **kwargs) 2025-08-14T21:38:35.0075585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0075948Z return func(*args, **kwargs) 2025-08-14T21:38:35.0076265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0076601Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0076970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0077347Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0077699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0078043Z return func(*args, **kwargs) 2025-08-14T21:38:35.0078368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0078709Z return func(*args, **kwargs) 2025-08-14T21:38:35.0079057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0079383Z return func(*args, **kwargs) 2025-08-14T21:38:35.0079562Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0079884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0080206Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0080564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0080931Z layer_outputs = layer_module( 2025-08-14T21:38:35.0081249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0081574Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0081922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0082259Z return func(*args, **kwargs) 2025-08-14T21:38:35.0082583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0082908Z return func(*args, **kwargs) 2025-08-14T21:38:35.0083235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0083570Z return func(*args, **kwargs) 2025-08-14T21:38:35.0083933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0084316Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0084825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0085175Z return func(*args, **kwargs) 2025-08-14T21:38:35.0085504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0085888Z return func(*args, **kwargs) 2025-08-14T21:38:35.0086211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0086545Z return func(*args, **kwargs) 2025-08-14T21:38:35.0086927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0087353Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0087772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0088147Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0088284Z 2025-08-14T21:38:35.0088380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0088712Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0089038Z return mod(**inputs) 2025-08-14T21:38:35.0089352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0089697Z return func(*args, **kwargs) 2025-08-14T21:38:35.0090023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0090354Z return func(*args, **kwargs) 2025-08-14T21:38:35.0090668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0090996Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0091367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0091730Z outputs = self.layoutlm( 2025-08-14T21:38:35.0092059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0092399Z return func(*args, **kwargs) 2025-08-14T21:38:35.0092714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0093052Z return func(*args, **kwargs) 2025-08-14T21:38:35.0093361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0093690Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0094053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0094424Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0094762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0095097Z return func(*args, **kwargs) 2025-08-14T21:38:35.0095417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0095752Z return func(*args, **kwargs) 2025-08-14T21:38:35.0096077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0096408Z return func(*args, **kwargs) 2025-08-14T21:38:35.0096587Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0096912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0097260Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0097623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0097988Z layer_outputs = layer_module( 2025-08-14T21:38:35.0098307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0098651Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0098997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0099334Z return func(*args, **kwargs) 2025-08-14T21:38:35.0099670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0099999Z return func(*args, **kwargs) 2025-08-14T21:38:35.0100325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0100659Z return func(*args, **kwargs) 2025-08-14T21:38:35.0101002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0101384Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0101776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0102139Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0102528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0102971Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0103385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0103765Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0103897Z 2025-08-14T21:38:35.0103996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0104331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0104635Z return mod(**inputs) 2025-08-14T21:38:35.0105009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0105353Z return func(*args, **kwargs) 2025-08-14T21:38:35.0105682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0106026Z return func(*args, **kwargs) 2025-08-14T21:38:35.0106330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0106659Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0107032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0107401Z outputs = self.layoutlm( 2025-08-14T21:38:35.0107736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0108079Z return func(*args, **kwargs) 2025-08-14T21:38:35.0108412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0108742Z return func(*args, **kwargs) 2025-08-14T21:38:35.0109054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0109385Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0109750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0110125Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0110484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0110823Z return func(*args, **kwargs) 2025-08-14T21:38:35.0111141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0111497Z return func(*args, **kwargs) 2025-08-14T21:38:35.0111821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0112156Z return func(*args, **kwargs) 2025-08-14T21:38:35.0112327Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0112665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0112995Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0113360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0113727Z layer_outputs = layer_module( 2025-08-14T21:38:35.0114047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0114381Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0114774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0115110Z return func(*args, **kwargs) 2025-08-14T21:38:35.0115433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0115761Z return func(*args, **kwargs) 2025-08-14T21:38:35.0116086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0116416Z return func(*args, **kwargs) 2025-08-14T21:38:35.0116767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0117138Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0117507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0117870Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0118258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0118696Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0119104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0119505Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0119849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0120160Z return self.act(input) 2025-08-14T21:38:35.0120268Z 2025-08-14T21:38:35.0120363Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0120693Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0120988Z return mod(**inputs) 2025-08-14T21:38:35.0121308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0121646Z return func(*args, **kwargs) 2025-08-14T21:38:35.0121966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0122303Z return func(*args, **kwargs) 2025-08-14T21:38:35.0122610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0122935Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0123310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0123688Z outputs = self.layoutlm( 2025-08-14T21:38:35.0124024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0124375Z return func(*args, **kwargs) 2025-08-14T21:38:35.0124704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0125041Z return func(*args, **kwargs) 2025-08-14T21:38:35.0125369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0125692Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0126057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0126429Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0126770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0127097Z return func(*args, **kwargs) 2025-08-14T21:38:35.0127425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0127783Z return func(*args, **kwargs) 2025-08-14T21:38:35.0128104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0128439Z return func(*args, **kwargs) 2025-08-14T21:38:35.0128622Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0128948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0129267Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0129637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0130006Z layer_outputs = layer_module( 2025-08-14T21:38:35.0130320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0130657Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0131009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0131345Z return func(*args, **kwargs) 2025-08-14T21:38:35.0131665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0132004Z return func(*args, **kwargs) 2025-08-14T21:38:35.0132331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0132657Z return func(*args, **kwargs) 2025-08-14T21:38:35.0133013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0133397Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0133767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0134125Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0134519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0134969Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0135391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0135763Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0135897Z 2025-08-14T21:38:35.0136011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0136343Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0136634Z return mod(**inputs) 2025-08-14T21:38:35.0136957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0137326Z return func(*args, **kwargs) 2025-08-14T21:38:35.0137653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0137984Z return func(*args, **kwargs) 2025-08-14T21:38:35.0138305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0138640Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0139004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0139379Z outputs = self.layoutlm( 2025-08-14T21:38:35.0139712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0140053Z return func(*args, **kwargs) 2025-08-14T21:38:35.0140377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0140732Z return func(*args, **kwargs) 2025-08-14T21:38:35.0141040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0141360Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0141729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0142101Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0142438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0142767Z return func(*args, **kwargs) 2025-08-14T21:38:35.0143091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0143426Z return func(*args, **kwargs) 2025-08-14T21:38:35.0143753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0144084Z return func(*args, **kwargs) 2025-08-14T21:38:35.0144264Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0144587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0144977Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0145350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0145726Z layer_outputs = layer_module( 2025-08-14T21:38:35.0146050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0146379Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0146726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0147064Z return func(*args, **kwargs) 2025-08-14T21:38:35.0147385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0147725Z return func(*args, **kwargs) 2025-08-14T21:38:35.0148056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0148393Z return func(*args, **kwargs) 2025-08-14T21:38:35.0148740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0149143Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0149490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0149819Z return func(*args, **kwargs) 2025-08-14T21:38:35.0150144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0150498Z return func(*args, **kwargs) 2025-08-14T21:38:35.0150821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0151149Z return func(*args, **kwargs) 2025-08-14T21:38:35.0151516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0151886Z self_outputs = self.self( 2025-08-14T21:38:35.0152213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0152550Z return func(*args, **kwargs) 2025-08-14T21:38:35.0152878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0153213Z return func(*args, **kwargs) 2025-08-14T21:38:35.0153549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0153884Z return func(*args, **kwargs) 2025-08-14T21:38:35.0154238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0154674Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0154860Z 2025-08-14T21:38:35.0154956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0155290Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0155590Z return mod(**inputs) 2025-08-14T21:38:35.0155903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0156243Z return func(*args, **kwargs) 2025-08-14T21:38:35.0156569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0156909Z return func(*args, **kwargs) 2025-08-14T21:38:35.0157209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0157540Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0157908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0158267Z outputs = self.layoutlm( 2025-08-14T21:38:35.0158596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0158934Z return func(*args, **kwargs) 2025-08-14T21:38:35.0159258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0159585Z return func(*args, **kwargs) 2025-08-14T21:38:35.0159894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0160219Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0160577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0160947Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0161286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0161621Z return func(*args, **kwargs) 2025-08-14T21:38:35.0161953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0162290Z return func(*args, **kwargs) 2025-08-14T21:38:35.0162619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0162970Z return func(*args, **kwargs) 2025-08-14T21:38:35.0163140Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0163462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0163787Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0164159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0164530Z layer_outputs = layer_module( 2025-08-14T21:38:35.0164847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0165181Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0165523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0165857Z return func(*args, **kwargs) 2025-08-14T21:38:35.0166184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0166529Z return func(*args, **kwargs) 2025-08-14T21:38:35.0166855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0167190Z return func(*args, **kwargs) 2025-08-14T21:38:35.0167545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0167918Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0168269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0168605Z return func(*args, **kwargs) 2025-08-14T21:38:35.0168924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0169260Z return func(*args, **kwargs) 2025-08-14T21:38:35.0169589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0169924Z return func(*args, **kwargs) 2025-08-14T21:38:35.0170270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0170636Z self_outputs = self.self( 2025-08-14T21:38:35.0170968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0171305Z return func(*args, **kwargs) 2025-08-14T21:38:35.0171629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0172071Z return func(*args, **kwargs) 2025-08-14T21:38:35.0172541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0172904Z return func(*args, **kwargs) 2025-08-14T21:38:35.0173286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0173724Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0173905Z 2025-08-14T21:38:35.0174011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0174841Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0175152Z return mod(**inputs) 2025-08-14T21:38:35.0175504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0175849Z return func(*args, **kwargs) 2025-08-14T21:38:35.0176189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0176539Z return func(*args, **kwargs) 2025-08-14T21:38:35.0176873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0177201Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0177649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0178109Z outputs = self.layoutlm( 2025-08-14T21:38:35.0178450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0178842Z return func(*args, **kwargs) 2025-08-14T21:38:35.0179188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0179537Z return func(*args, **kwargs) 2025-08-14T21:38:35.0179844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0180198Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0180567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0180942Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0181282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0181622Z return func(*args, **kwargs) 2025-08-14T21:38:35.0181957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0182302Z return func(*args, **kwargs) 2025-08-14T21:38:35.0182645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0182993Z return func(*args, **kwargs) 2025-08-14T21:38:35.0183175Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0183504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0183848Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0184229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0184846Z layer_outputs = layer_module( 2025-08-14T21:38:35.0185259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0185643Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0186080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0186426Z return func(*args, **kwargs) 2025-08-14T21:38:35.0186769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0187123Z return func(*args, **kwargs) 2025-08-14T21:38:35.0187459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0187810Z return func(*args, **kwargs) 2025-08-14T21:38:35.0188181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0188576Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0188934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0189283Z return func(*args, **kwargs) 2025-08-14T21:38:35.0189681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0190037Z return func(*args, **kwargs) 2025-08-14T21:38:35.0190367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0190745Z return func(*args, **kwargs) 2025-08-14T21:38:35.0191113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0191489Z self_outputs = self.self( 2025-08-14T21:38:35.0191856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0192203Z return func(*args, **kwargs) 2025-08-14T21:38:35.0192539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0192886Z return func(*args, **kwargs) 2025-08-14T21:38:35.0193211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0193547Z return func(*args, **kwargs) 2025-08-14T21:38:35.0193891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0194357Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0194546Z 2025-08-14T21:38:35.0194620Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0194814Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0195028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0195360Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0195657Z return mod(**inputs) 2025-08-14T21:38:35.0195974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0196308Z return func(*args, **kwargs) 2025-08-14T21:38:35.0196633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0196966Z return func(*args, **kwargs) 2025-08-14T21:38:35.0197268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0197591Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0197957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0198317Z outputs = self.layoutlm( 2025-08-14T21:38:35.0198641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0198972Z return func(*args, **kwargs) 2025-08-14T21:38:35.0199298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0199625Z return func(*args, **kwargs) 2025-08-14T21:38:35.0199929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0200253Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0200614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0200981Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0201319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0201653Z return func(*args, **kwargs) 2025-08-14T21:38:35.0201969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0202321Z return func(*args, **kwargs) 2025-08-14T21:38:35.0202647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0202985Z return func(*args, **kwargs) 2025-08-14T21:38:35.0203156Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0203478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0203820Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0204181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0204548Z layer_outputs = layer_module( 2025-08-14T21:38:35.0204884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0205217Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0205555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0205893Z return func(*args, **kwargs) 2025-08-14T21:38:35.0206216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0206544Z return func(*args, **kwargs) 2025-08-14T21:38:35.0206888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0207220Z return func(*args, **kwargs) 2025-08-14T21:38:35.0207572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0207942Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0208287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0208621Z return func(*args, **kwargs) 2025-08-14T21:38:35.0208935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0209271Z return func(*args, **kwargs) 2025-08-14T21:38:35.0209593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0209930Z return func(*args, **kwargs) 2025-08-14T21:38:35.0210273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0210688Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0211101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0211477Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0211603Z 2025-08-14T21:38:35.0211699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0212025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0212323Z return mod(**inputs) 2025-08-14T21:38:35.0212633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0212969Z return func(*args, **kwargs) 2025-08-14T21:38:35.0213293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0213626Z return func(*args, **kwargs) 2025-08-14T21:38:35.0213924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0214247Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0214610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0214964Z outputs = self.layoutlm( 2025-08-14T21:38:35.0215311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0215653Z return func(*args, **kwargs) 2025-08-14T21:38:35.0215980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0216329Z return func(*args, **kwargs) 2025-08-14T21:38:35.0216636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0216958Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0217333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0217705Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0218043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0218382Z return func(*args, **kwargs) 2025-08-14T21:38:35.0218706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0219047Z return func(*args, **kwargs) 2025-08-14T21:38:35.0219374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0219727Z return func(*args, **kwargs) 2025-08-14T21:38:35.0219901Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0220225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0220553Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0220911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0221284Z layer_outputs = layer_module( 2025-08-14T21:38:35.0221606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0221941Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0222278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0222614Z return func(*args, **kwargs) 2025-08-14T21:38:35.0222942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0223268Z return func(*args, **kwargs) 2025-08-14T21:38:35.0223594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0223930Z return func(*args, **kwargs) 2025-08-14T21:38:35.0224283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0224669Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0225129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0225509Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0225932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0226380Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0226853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0227250Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0227381Z 2025-08-14T21:38:35.0227479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0227825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0228135Z return mod(**inputs) 2025-08-14T21:38:35.0228484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0228828Z return func(*args, **kwargs) 2025-08-14T21:38:35.0229166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0229532Z return func(*args, **kwargs) 2025-08-14T21:38:35.0229841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0230174Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0230568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0230950Z outputs = self.layoutlm( 2025-08-14T21:38:35.0231280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0231628Z return func(*args, **kwargs) 2025-08-14T21:38:35.0231962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0232300Z return func(*args, **kwargs) 2025-08-14T21:38:35.0232616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0232980Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0233358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0233731Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0234082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0234429Z return func(*args, **kwargs) 2025-08-14T21:38:35.0234757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0235106Z return func(*args, **kwargs) 2025-08-14T21:38:35.0235440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0235784Z return func(*args, **kwargs) 2025-08-14T21:38:35.0235956Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0236292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0236625Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0236992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0237369Z layer_outputs = layer_module( 2025-08-14T21:38:35.0237695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0254290Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0254728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0255097Z return func(*args, **kwargs) 2025-08-14T21:38:35.0255448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0255802Z return func(*args, **kwargs) 2025-08-14T21:38:35.0256137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0256475Z return func(*args, **kwargs) 2025-08-14T21:38:35.0256845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0256933Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0257248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0257331Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0257607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0257722Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0258031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0258137Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0258345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0258434Z return self.act(input) 2025-08-14T21:38:35.0258440Z 2025-08-14T21:38:35.0258545Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0258748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0258814Z return mod(**inputs) 2025-08-14T21:38:35.0259046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0259110Z return func(*args, **kwargs) 2025-08-14T21:38:35.0259331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0259438Z return func(*args, **kwargs) 2025-08-14T21:38:35.0259645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0259717Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0259979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0260047Z outputs = self.layoutlm( 2025-08-14T21:38:35.0260279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0260341Z return func(*args, **kwargs) 2025-08-14T21:38:35.0260558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0260629Z return func(*args, **kwargs) 2025-08-14T21:38:35.0260827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0260898Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0261152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0261222Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0261445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0261506Z return func(*args, **kwargs) 2025-08-14T21:38:35.0261722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0261790Z return func(*args, **kwargs) 2025-08-14T21:38:35.0262006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0262074Z return func(*args, **kwargs) 2025-08-14T21:38:35.0262150Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0262345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0262423Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0262669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0262736Z layer_outputs = layer_module( 2025-08-14T21:38:35.0262948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0263039Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0263269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0263330Z return func(*args, **kwargs) 2025-08-14T21:38:35.0263546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0263632Z return func(*args, **kwargs) 2025-08-14T21:38:35.0263851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0263912Z return func(*args, **kwargs) 2025-08-14T21:38:35.0264181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0264264Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0264517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0264589Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0264959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0265106Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0265392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0265483Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0265487Z 2025-08-14T21:38:35.0265594Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0265794Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0265867Z return mod(**inputs) 2025-08-14T21:38:35.0266101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0266164Z return func(*args, **kwargs) 2025-08-14T21:38:35.0266429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0266495Z return func(*args, **kwargs) 2025-08-14T21:38:35.0266717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0266791Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0267052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0267135Z outputs = self.layoutlm( 2025-08-14T21:38:35.0267366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0267432Z return func(*args, **kwargs) 2025-08-14T21:38:35.0267670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0267735Z return func(*args, **kwargs) 2025-08-14T21:38:35.0267955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0268029Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0268293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0268375Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0268609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0268682Z return func(*args, **kwargs) 2025-08-14T21:38:35.0268913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0268977Z return func(*args, **kwargs) 2025-08-14T21:38:35.0269236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0269301Z return func(*args, **kwargs) 2025-08-14T21:38:35.0269377Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0269591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0269683Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0269950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0270018Z layer_outputs = layer_module( 2025-08-14T21:38:35.0270248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0270336Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0270571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0270636Z return func(*args, **kwargs) 2025-08-14T21:38:35.0270874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0270939Z return func(*args, **kwargs) 2025-08-14T21:38:35.0271196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0271258Z return func(*args, **kwargs) 2025-08-14T21:38:35.0271516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0271605Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0271834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0271897Z return func(*args, **kwargs) 2025-08-14T21:38:35.0272133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0272195Z return func(*args, **kwargs) 2025-08-14T21:38:35.0272433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0272499Z return func(*args, **kwargs) 2025-08-14T21:38:35.0272758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0272835Z self_outputs = self.self( 2025-08-14T21:38:35.0273067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0273130Z return func(*args, **kwargs) 2025-08-14T21:38:35.0273366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0273430Z return func(*args, **kwargs) 2025-08-14T21:38:35.0273665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0273728Z return func(*args, **kwargs) 2025-08-14T21:38:35.0274052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0274201Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0274204Z 2025-08-14T21:38:35.0274301Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0274495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0274558Z return mod(**inputs) 2025-08-14T21:38:35.0274774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0274842Z return func(*args, **kwargs) 2025-08-14T21:38:35.0275071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0275133Z return func(*args, **kwargs) 2025-08-14T21:38:35.0275339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0275425Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0275677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0275743Z outputs = self.layoutlm( 2025-08-14T21:38:35.0275984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0276057Z return func(*args, **kwargs) 2025-08-14T21:38:35.0276278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0276345Z return func(*args, **kwargs) 2025-08-14T21:38:35.0276554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0276625Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0276882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0276968Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0277182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0277252Z return func(*args, **kwargs) 2025-08-14T21:38:35.0277467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0277537Z return func(*args, **kwargs) 2025-08-14T21:38:35.0277753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0277816Z return func(*args, **kwargs) 2025-08-14T21:38:35.0277894Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0278089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0278154Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0278408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0278473Z layer_outputs = layer_module( 2025-08-14T21:38:35.0278681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0278755Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0278971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0279040Z return func(*args, **kwargs) 2025-08-14T21:38:35.0279258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0279318Z return func(*args, **kwargs) 2025-08-14T21:38:35.0279540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0279603Z return func(*args, **kwargs) 2025-08-14T21:38:35.0279856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0279932Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0280150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0280220Z return func(*args, **kwargs) 2025-08-14T21:38:35.0280435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0280512Z return func(*args, **kwargs) 2025-08-14T21:38:35.0280736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0280796Z return func(*args, **kwargs) 2025-08-14T21:38:35.0281048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0281135Z self_outputs = self.self( 2025-08-14T21:38:35.0281351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0281420Z return func(*args, **kwargs) 2025-08-14T21:38:35.0281651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0281719Z return func(*args, **kwargs) 2025-08-14T21:38:35.0281940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0282003Z return func(*args, **kwargs) 2025-08-14T21:38:35.0282259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0282388Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0282409Z 2025-08-14T21:38:35.0282510Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0282703Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0282764Z return mod(**inputs) 2025-08-14T21:38:35.0282990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0283050Z return func(*args, **kwargs) 2025-08-14T21:38:35.0283268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0283337Z return func(*args, **kwargs) 2025-08-14T21:38:35.0283537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0283605Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0283858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0283927Z outputs = self.layoutlm( 2025-08-14T21:38:35.0284157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0284218Z return func(*args, **kwargs) 2025-08-14T21:38:35.0284437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0284504Z return func(*args, **kwargs) 2025-08-14T21:38:35.0284997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0285088Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0285365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0285444Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0285700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0285775Z return func(*args, **kwargs) 2025-08-14T21:38:35.0286022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0286097Z return func(*args, **kwargs) 2025-08-14T21:38:35.0286334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0286410Z return func(*args, **kwargs) 2025-08-14T21:38:35.0286494Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0286745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0286816Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0287067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0287160Z layer_outputs = layer_module( 2025-08-14T21:38:35.0287364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0287431Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0287675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0287745Z return func(*args, **kwargs) 2025-08-14T21:38:35.0287958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0288018Z return func(*args, **kwargs) 2025-08-14T21:38:35.0288237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0288298Z return func(*args, **kwargs) 2025-08-14T21:38:35.0288547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0288648Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0288862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0288929Z return func(*args, **kwargs) 2025-08-14T21:38:35.0289146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0289204Z return func(*args, **kwargs) 2025-08-14T21:38:35.0289425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0289484Z return func(*args, **kwargs) 2025-08-14T21:38:35.0289732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0289795Z self_outputs = self.self( 2025-08-14T21:38:35.0290007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0290078Z return func(*args, **kwargs) 2025-08-14T21:38:35.0290293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0290352Z return func(*args, **kwargs) 2025-08-14T21:38:35.0290574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0290634Z return func(*args, **kwargs) 2025-08-14T21:38:35.0290883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0291017Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0291022Z 2025-08-14T21:38:35.0291094Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0291171Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0291268Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0291457Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0291518Z return mod(**inputs) 2025-08-14T21:38:35.0291733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0291800Z return func(*args, **kwargs) 2025-08-14T21:38:35.0292012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0292084Z return func(*args, **kwargs) 2025-08-14T21:38:35.0292288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0292355Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0292606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0292689Z outputs = self.layoutlm( 2025-08-14T21:38:35.0292904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0292971Z return func(*args, **kwargs) 2025-08-14T21:38:35.0293199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0293262Z return func(*args, **kwargs) 2025-08-14T21:38:35.0293466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0293533Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0293787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0293854Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0294075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0294161Z return func(*args, **kwargs) 2025-08-14T21:38:35.0294381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0294442Z return func(*args, **kwargs) 2025-08-14T21:38:35.0294666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0294727Z return func(*args, **kwargs) 2025-08-14T21:38:35.0294802Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0295001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0295067Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0295318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0295384Z layer_outputs = layer_module( 2025-08-14T21:38:35.0295594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0295666Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0295884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0295951Z return func(*args, **kwargs) 2025-08-14T21:38:35.0296166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0296227Z return func(*args, **kwargs) 2025-08-14T21:38:35.0296450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0296510Z return func(*args, **kwargs) 2025-08-14T21:38:35.0296758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0296837Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0297052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0297118Z return func(*args, **kwargs) 2025-08-14T21:38:35.0297335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0297395Z return func(*args, **kwargs) 2025-08-14T21:38:35.0297631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0297692Z return func(*args, **kwargs) 2025-08-14T21:38:35.0297943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0298063Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0298330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0298412Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0298415Z 2025-08-14T21:38:35.0298509Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0298712Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0298773Z return mod(**inputs) 2025-08-14T21:38:35.0298994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0299063Z return func(*args, **kwargs) 2025-08-14T21:38:35.0299280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0299338Z return func(*args, **kwargs) 2025-08-14T21:38:35.0299544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0299630Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0299877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0299942Z outputs = self.layoutlm( 2025-08-14T21:38:35.0300159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0300226Z return func(*args, **kwargs) 2025-08-14T21:38:35.0300444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0300505Z return func(*args, **kwargs) 2025-08-14T21:38:35.0300706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0300772Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0301019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0301089Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0301305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0301375Z return func(*args, **kwargs) 2025-08-14T21:38:35.0301590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0301649Z return func(*args, **kwargs) 2025-08-14T21:38:35.0301873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0301931Z return func(*args, **kwargs) 2025-08-14T21:38:35.0302007Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0302205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0302272Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0302524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0302587Z layer_outputs = layer_module( 2025-08-14T21:38:35.0302796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0302868Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0303084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0303164Z return func(*args, **kwargs) 2025-08-14T21:38:35.0303379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0303440Z return func(*args, **kwargs) 2025-08-14T21:38:35.0303661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0303745Z return func(*args, **kwargs) 2025-08-14T21:38:35.0303999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0304076Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0304332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0304410Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0304691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0304866Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0305122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0305222Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0305226Z 2025-08-14T21:38:35.0305328Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0305511Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0305570Z return mod(**inputs) 2025-08-14T21:38:35.0305796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0305857Z return func(*args, **kwargs) 2025-08-14T21:38:35.0306082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0306144Z return func(*args, **kwargs) 2025-08-14T21:38:35.0306340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0306416Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0306661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0306727Z outputs = self.layoutlm( 2025-08-14T21:38:35.0306954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0307016Z return func(*args, **kwargs) 2025-08-14T21:38:35.0307238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0307300Z return func(*args, **kwargs) 2025-08-14T21:38:35.0307496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0307571Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0307813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0307878Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0308102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0308161Z return func(*args, **kwargs) 2025-08-14T21:38:35.0308380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0308442Z return func(*args, **kwargs) 2025-08-14T21:38:35.0308654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0308722Z return func(*args, **kwargs) 2025-08-14T21:38:35.0308806Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0309004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0309077Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0309320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0309409Z layer_outputs = layer_module( 2025-08-14T21:38:35.0309618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0309690Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0309928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0309991Z return func(*args, **kwargs) 2025-08-14T21:38:35.0310216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0310277Z return func(*args, **kwargs) 2025-08-14T21:38:35.0310492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0310560Z return func(*args, **kwargs) 2025-08-14T21:38:35.0310809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0310900Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0311148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0311218Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0311500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0311612Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0311858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0311967Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0312161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0312232Z return self.act(input) 2025-08-14T21:38:35.0312235Z 2025-08-14T21:38:35.0312328Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0312508Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0312574Z return mod(**inputs) 2025-08-14T21:38:35.0312791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0312852Z return func(*args, **kwargs) 2025-08-14T21:38:35.0313077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0313137Z return func(*args, **kwargs) 2025-08-14T21:38:35.0313339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0313405Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0313652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0313723Z outputs = self.layoutlm( 2025-08-14T21:38:35.0313946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0314007Z return func(*args, **kwargs) 2025-08-14T21:38:35.0314230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0314290Z return func(*args, **kwargs) 2025-08-14T21:38:35.0314508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0314579Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0314823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0314912Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0315132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0315192Z return func(*args, **kwargs) 2025-08-14T21:38:35.0315434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0315497Z return func(*args, **kwargs) 2025-08-14T21:38:35.0315724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0315787Z return func(*args, **kwargs) 2025-08-14T21:38:35.0315858Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0316064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0316130Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0316381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0316466Z layer_outputs = layer_module( 2025-08-14T21:38:35.0316671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0316753Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0316972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0317032Z return func(*args, **kwargs) 2025-08-14T21:38:35.0317257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0317318Z return func(*args, **kwargs) 2025-08-14T21:38:35.0317542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0317603Z return func(*args, **kwargs) 2025-08-14T21:38:35.0317850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0317932Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0318169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0318238Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0318521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0318645Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0318898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0318973Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0318976Z 2025-08-14T21:38:35.0319074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0319263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0319322Z return mod(**inputs) 2025-08-14T21:38:35.0319550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0319611Z return func(*args, **kwargs) 2025-08-14T21:38:35.0319828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0319896Z return func(*args, **kwargs) 2025-08-14T21:38:35.0320119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0320187Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0320438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0320520Z outputs = self.layoutlm( 2025-08-14T21:38:35.0320744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0320804Z return func(*args, **kwargs) 2025-08-14T21:38:35.0321037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0321105Z return func(*args, **kwargs) 2025-08-14T21:38:35.0321303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0321372Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0321624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0321691Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0321915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0321994Z return func(*args, **kwargs) 2025-08-14T21:38:35.0322211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0322278Z return func(*args, **kwargs) 2025-08-14T21:38:35.0322498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0322565Z return func(*args, **kwargs) 2025-08-14T21:38:35.0322635Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0322837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0322911Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0323156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0323222Z layer_outputs = layer_module( 2025-08-14T21:38:35.0323434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0323508Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0323735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0323796Z return func(*args, **kwargs) 2025-08-14T21:38:35.0324012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0324078Z return func(*args, **kwargs) 2025-08-14T21:38:35.0324298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0324358Z return func(*args, **kwargs) 2025-08-14T21:38:35.0324614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0324693Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0324919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0324979Z return func(*args, **kwargs) 2025-08-14T21:38:35.0325197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0325264Z return func(*args, **kwargs) 2025-08-14T21:38:35.0325479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0325555Z return func(*args, **kwargs) 2025-08-14T21:38:35.0325809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0325873Z self_outputs = self.self( 2025-08-14T21:38:35.0326097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0326173Z return func(*args, **kwargs) 2025-08-14T21:38:35.0326388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0326455Z return func(*args, **kwargs) 2025-08-14T21:38:35.0326689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0326753Z return func(*args, **kwargs) 2025-08-14T21:38:35.0327008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0327144Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0327148Z 2025-08-14T21:38:35.0327248Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0327431Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0327512Z return mod(**inputs) 2025-08-14T21:38:35.0327737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0327796Z return func(*args, **kwargs) 2025-08-14T21:38:35.0328023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0328082Z return func(*args, **kwargs) 2025-08-14T21:38:35.0328279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0328354Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0328598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0328662Z outputs = self.layoutlm( 2025-08-14T21:38:35.0328887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0328952Z return func(*args, **kwargs) 2025-08-14T21:38:35.0329176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0329236Z return func(*args, **kwargs) 2025-08-14T21:38:35.0329435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0329510Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0329756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0329830Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0330046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0330106Z return func(*args, **kwargs) 2025-08-14T21:38:35.0330331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0330390Z return func(*args, **kwargs) 2025-08-14T21:38:35.0330607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0330675Z return func(*args, **kwargs) 2025-08-14T21:38:35.0330744Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0330948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0331027Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0331271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0331342Z layer_outputs = layer_module( 2025-08-14T21:38:35.0331544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0331632Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0331855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0331913Z return func(*args, **kwargs) 2025-08-14T21:38:35.0332147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0332210Z return func(*args, **kwargs) 2025-08-14T21:38:35.0332429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0332497Z return func(*args, **kwargs) 2025-08-14T21:38:35.0332744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0332818Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0333040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0333118Z return func(*args, **kwargs) 2025-08-14T21:38:35.0333349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0333409Z return func(*args, **kwargs) 2025-08-14T21:38:35.0333632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0333700Z return func(*args, **kwargs) 2025-08-14T21:38:35.0333952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0334019Z self_outputs = self.self( 2025-08-14T21:38:35.0334248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0334308Z return func(*args, **kwargs) 2025-08-14T21:38:35.0334540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0334599Z return func(*args, **kwargs) 2025-08-14T21:38:35.0334823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0334893Z return func(*args, **kwargs) 2025-08-14T21:38:35.0335145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0335282Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0335286Z 2025-08-14T21:38:35.0335381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0335568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0335636Z return mod(**inputs) 2025-08-14T21:38:35.0335861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0335925Z return func(*args, **kwargs) 2025-08-14T21:38:35.0336158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0336219Z return func(*args, **kwargs) 2025-08-14T21:38:35.0336433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0336501Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0336763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0336838Z outputs = self.layoutlm( 2025-08-14T21:38:35.0337054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0337115Z return func(*args, **kwargs) 2025-08-14T21:38:35.0337358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0337419Z return func(*args, **kwargs) 2025-08-14T21:38:35.0337625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0337707Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0337953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0338027Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0338248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0338315Z return func(*args, **kwargs) 2025-08-14T21:38:35.0338532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0338620Z return func(*args, **kwargs) 2025-08-14T21:38:35.0338846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0338905Z return func(*args, **kwargs) 2025-08-14T21:38:35.0338974Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0339181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0339248Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0339499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0339563Z layer_outputs = layer_module( 2025-08-14T21:38:35.0339766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0339846Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0340063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0340126Z return func(*args, **kwargs) 2025-08-14T21:38:35.0340353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0340412Z return func(*args, **kwargs) 2025-08-14T21:38:35.0340635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0340706Z return func(*args, **kwargs) 2025-08-14T21:38:35.0340954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0341030Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0341255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0341316Z return func(*args, **kwargs) 2025-08-14T21:38:35.0341535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0341604Z return func(*args, **kwargs) 2025-08-14T21:38:35.0341823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0341889Z return func(*args, **kwargs) 2025-08-14T21:38:35.0342136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0342199Z self_outputs = self.self( 2025-08-14T21:38:35.0342441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0342504Z return func(*args, **kwargs) 2025-08-14T21:38:35.0342720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0342804Z return func(*args, **kwargs) 2025-08-14T21:38:35.0343018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0343085Z return func(*args, **kwargs) 2025-08-14T21:38:35.0343342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0343479Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0343482Z 2025-08-14T21:38:35.0343562Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0343636Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0343737Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0343919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0343980Z return mod(**inputs) 2025-08-14T21:38:35.0344204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0344284Z return func(*args, **kwargs) 2025-08-14T21:38:35.0344501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0344569Z return func(*args, **kwargs) 2025-08-14T21:38:35.0344833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0344917Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0345163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0345228Z outputs = self.layoutlm( 2025-08-14T21:38:35.0345491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0345552Z return func(*args, **kwargs) 2025-08-14T21:38:35.0345780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0345851Z return func(*args, **kwargs) 2025-08-14T21:38:35.0346052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0346130Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0346382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0346450Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0346709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0346769Z return func(*args, **kwargs) 2025-08-14T21:38:35.0346985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0347055Z return func(*args, **kwargs) 2025-08-14T21:38:35.0347273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0347340Z return func(*args, **kwargs) 2025-08-14T21:38:35.0347409Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0347608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0347683Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0347949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0348014Z layer_outputs = layer_module( 2025-08-14T21:38:35.0348220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0348293Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0348523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0348600Z return func(*args, **kwargs) 2025-08-14T21:38:35.0348826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0348895Z return func(*args, **kwargs) 2025-08-14T21:38:35.0349133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0349204Z return func(*args, **kwargs) 2025-08-14T21:38:35.0349460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0349536Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0349767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0349828Z return func(*args, **kwargs) 2025-08-14T21:38:35.0350069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0350138Z return func(*args, **kwargs) 2025-08-14T21:38:35.0350362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0350431Z return func(*args, **kwargs) 2025-08-14T21:38:35.0350682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0350804Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0351063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0351143Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0351146Z 2025-08-14T21:38:35.0351249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0351438Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0351500Z return mod(**inputs) 2025-08-14T21:38:35.0351731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0351795Z return func(*args, **kwargs) 2025-08-14T21:38:35.0352018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0352088Z return func(*args, **kwargs) 2025-08-14T21:38:35.0352292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0352367Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0352618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0352684Z outputs = self.layoutlm( 2025-08-14T21:38:35.0352940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0353002Z return func(*args, **kwargs) 2025-08-14T21:38:35.0353248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0353318Z return func(*args, **kwargs) 2025-08-14T21:38:35.0353539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0353614Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0353896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0353969Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0354217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0354300Z return func(*args, **kwargs) 2025-08-14T21:38:35.0354550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0354621Z return func(*args, **kwargs) 2025-08-14T21:38:35.0354889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0354960Z return func(*args, **kwargs) 2025-08-14T21:38:35.0355033Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0355241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0355316Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0355583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0355651Z layer_outputs = layer_module( 2025-08-14T21:38:35.0355871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0355965Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0356217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0356281Z return func(*args, **kwargs) 2025-08-14T21:38:35.0356507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0356577Z return func(*args, **kwargs) 2025-08-14T21:38:35.0356825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0356895Z return func(*args, **kwargs) 2025-08-14T21:38:35.0357173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0357252Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0357523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0357594Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0357870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0357986Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0358233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0358315Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0358318Z 2025-08-14T21:38:35.0358411Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0358592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0358663Z return mod(**inputs) 2025-08-14T21:38:35.0358883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0358952Z return func(*args, **kwargs) 2025-08-14T21:38:35.0359171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0359231Z return func(*args, **kwargs) 2025-08-14T21:38:35.0359434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0359501Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0359768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0359842Z outputs = self.layoutlm( 2025-08-14T21:38:35.0360061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0360148Z return func(*args, **kwargs) 2025-08-14T21:38:35.0360368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0360429Z return func(*args, **kwargs) 2025-08-14T21:38:35.0360659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0360729Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0360970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0361046Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0361261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0361329Z return func(*args, **kwargs) 2025-08-14T21:38:35.0361544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0361623Z return func(*args, **kwargs) 2025-08-14T21:38:35.0361844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0361902Z return func(*args, **kwargs) 2025-08-14T21:38:35.0361974Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0362179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0362244Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0362495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0362558Z layer_outputs = layer_module( 2025-08-14T21:38:35.0362758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0362836Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0363055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0363119Z return func(*args, **kwargs) 2025-08-14T21:38:35.0363335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0363395Z return func(*args, **kwargs) 2025-08-14T21:38:35.0363617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0363676Z return func(*args, **kwargs) 2025-08-14T21:38:35.0363919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0364002Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0364239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0364317Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0364597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0364706Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0364954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0365056Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0365269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0365343Z return self.act(input) 2025-08-14T21:38:35.0365347Z 2025-08-14T21:38:35.0365442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0365624Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0365706Z return mod(**inputs) 2025-08-14T21:38:35.0365926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0365992Z return func(*args, **kwargs) 2025-08-14T21:38:35.0366224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0366285Z return func(*args, **kwargs) 2025-08-14T21:38:35.0366493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0366560Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0366805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0366875Z outputs = self.layoutlm( 2025-08-14T21:38:35.0367094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0367183Z return func(*args, **kwargs) 2025-08-14T21:38:35.0367406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0367466Z return func(*args, **kwargs) 2025-08-14T21:38:35.0367676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0367743Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0368001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0368069Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0368290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0368356Z return func(*args, **kwargs) 2025-08-14T21:38:35.0368577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0368639Z return func(*args, **kwargs) 2025-08-14T21:38:35.0368867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0368927Z return func(*args, **kwargs) 2025-08-14T21:38:35.0369001Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0369203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0369267Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0369523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0369586Z layer_outputs = layer_module( 2025-08-14T21:38:35.0369790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0369872Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0370090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0370155Z return func(*args, **kwargs) 2025-08-14T21:38:35.0370375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0370434Z return func(*args, **kwargs) 2025-08-14T21:38:35.0370658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0370734Z return func(*args, **kwargs) 2025-08-14T21:38:35.0370978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0371059Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0371298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0371390Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0371665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0371798Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0372048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0372121Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0372126Z 2025-08-14T21:38:35.0372224Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0372404Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0372464Z return mod(**inputs) 2025-08-14T21:38:35.0372688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0372765Z return func(*args, **kwargs) 2025-08-14T21:38:35.0372985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0373052Z return func(*args, **kwargs) 2025-08-14T21:38:35.0373253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0373326Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0373576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0373639Z outputs = self.layoutlm( 2025-08-14T21:38:35.0373862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0373923Z return func(*args, **kwargs) 2025-08-14T21:38:35.0374142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0374212Z return func(*args, **kwargs) 2025-08-14T21:38:35.0374411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0374485Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0374731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0374796Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0375024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0375083Z return func(*args, **kwargs) 2025-08-14T21:38:35.0375307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0375367Z return func(*args, **kwargs) 2025-08-14T21:38:35.0375587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0375654Z return func(*args, **kwargs) 2025-08-14T21:38:35.0375723Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0375923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0375995Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0376242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0376325Z layer_outputs = layer_module( 2025-08-14T21:38:35.0376528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0376601Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0376826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0376903Z return func(*args, **kwargs) 2025-08-14T21:38:35.0377122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0377188Z return func(*args, **kwargs) 2025-08-14T21:38:35.0377422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0377491Z return func(*args, **kwargs) 2025-08-14T21:38:35.0377738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0377812Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0378036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0378095Z return func(*args, **kwargs) 2025-08-14T21:38:35.0378329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0378395Z return func(*args, **kwargs) 2025-08-14T21:38:35.0378610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0378678Z return func(*args, **kwargs) 2025-08-14T21:38:35.0378918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0378980Z self_outputs = self.self( 2025-08-14T21:38:35.0379200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0379258Z return func(*args, **kwargs) 2025-08-14T21:38:35.0379479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0379540Z return func(*args, **kwargs) 2025-08-14T21:38:35.0379759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0379828Z return func(*args, **kwargs) 2025-08-14T21:38:35.0380071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0380205Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0380209Z 2025-08-14T21:38:35.0380309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0380490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0380556Z return mod(**inputs) 2025-08-14T21:38:35.0380772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0380832Z return func(*args, **kwargs) 2025-08-14T21:38:35.0381056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0381116Z return func(*args, **kwargs) 2025-08-14T21:38:35.0381312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0381386Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0381629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0381701Z outputs = self.layoutlm( 2025-08-14T21:38:35.0381933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0381995Z return func(*args, **kwargs) 2025-08-14T21:38:35.0382219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0382302Z return func(*args, **kwargs) 2025-08-14T21:38:35.0382512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0382578Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0382842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0382916Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0383132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0383192Z return func(*args, **kwargs) 2025-08-14T21:38:35.0383419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0383478Z return func(*args, **kwargs) 2025-08-14T21:38:35.0383702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0383777Z return func(*args, **kwargs) 2025-08-14T21:38:35.0383846Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0384049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0384113Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0384356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0384427Z layer_outputs = layer_module( 2025-08-14T21:38:35.0384895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0384997Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0385227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0385290Z return func(*args, **kwargs) 2025-08-14T21:38:35.0385539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0385608Z return func(*args, **kwargs) 2025-08-14T21:38:35.0385861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0385939Z return func(*args, **kwargs) 2025-08-14T21:38:35.0386234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0386320Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0386553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0386616Z return func(*args, **kwargs) 2025-08-14T21:38:35.0386857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0386919Z return func(*args, **kwargs) 2025-08-14T21:38:35.0387201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0387266Z return func(*args, **kwargs) 2025-08-14T21:38:35.0387532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0387605Z self_outputs = self.self( 2025-08-14T21:38:35.0387838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0387942Z return func(*args, **kwargs) 2025-08-14T21:38:35.0388184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0388249Z return func(*args, **kwargs) 2025-08-14T21:38:35.0388486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0388575Z return func(*args, **kwargs) 2025-08-14T21:38:35.0388840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0388980Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0389007Z 2025-08-14T21:38:35.0389109Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0389304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0389376Z return mod(**inputs) 2025-08-14T21:38:35.0389608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0389678Z return func(*args, **kwargs) 2025-08-14T21:38:35.0389906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0389996Z return func(*args, **kwargs) 2025-08-14T21:38:35.0390211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0390281Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0390545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0390612Z outputs = self.layoutlm( 2025-08-14T21:38:35.0390840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0390911Z return func(*args, **kwargs) 2025-08-14T21:38:35.0391139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0391201Z return func(*args, **kwargs) 2025-08-14T21:38:35.0391414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0391486Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0391747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0391815Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0392044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0392117Z return func(*args, **kwargs) 2025-08-14T21:38:35.0392345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0392410Z return func(*args, **kwargs) 2025-08-14T21:38:35.0392645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0392707Z return func(*args, **kwargs) 2025-08-14T21:38:35.0392785Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0392994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0393062Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0393322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0393390Z layer_outputs = layer_module( 2025-08-14T21:38:35.0393601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0393684Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0393927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0393998Z return func(*args, **kwargs) 2025-08-14T21:38:35.0394228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0394312Z return func(*args, **kwargs) 2025-08-14T21:38:35.0394550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0394613Z return func(*args, **kwargs) 2025-08-14T21:38:35.0394902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0394984Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0395215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0395288Z return func(*args, **kwargs) 2025-08-14T21:38:35.0395517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0395582Z return func(*args, **kwargs) 2025-08-14T21:38:35.0395816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0395898Z return func(*args, **kwargs) 2025-08-14T21:38:35.0396174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0396236Z self_outputs = self.self( 2025-08-14T21:38:35.0396456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0396522Z return func(*args, **kwargs) 2025-08-14T21:38:35.0396741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0396799Z return func(*args, **kwargs) 2025-08-14T21:38:35.0397025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0397085Z return func(*args, **kwargs) 2025-08-14T21:38:35.0397337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0397475Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0397479Z 2025-08-14T21:38:35.0397551Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0397627Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0397721Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0397905Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0397971Z return mod(**inputs) 2025-08-14T21:38:35.0398192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0398257Z return func(*args, **kwargs) 2025-08-14T21:38:35.0398475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0398537Z return func(*args, **kwargs) 2025-08-14T21:38:35.0398741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0398808Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0399060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0399123Z outputs = self.layoutlm( 2025-08-14T21:38:35.0399340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0399406Z return func(*args, **kwargs) 2025-08-14T21:38:35.0399635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0399696Z return func(*args, **kwargs) 2025-08-14T21:38:35.0399899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0399981Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0400230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0400295Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0400523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0400590Z return func(*args, **kwargs) 2025-08-14T21:38:35.0400805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0400867Z return func(*args, **kwargs) 2025-08-14T21:38:35.0401094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0401154Z return func(*args, **kwargs) 2025-08-14T21:38:35.0401228Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0401441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0401506Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0401754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0401820Z layer_outputs = layer_module( 2025-08-14T21:38:35.0402019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0402097Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0402314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0402380Z return func(*args, **kwargs) 2025-08-14T21:38:35.0402596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0402658Z return func(*args, **kwargs) 2025-08-14T21:38:35.0402877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0402938Z return func(*args, **kwargs) 2025-08-14T21:38:35.0403183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0403264Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0403481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0403550Z return func(*args, **kwargs) 2025-08-14T21:38:35.0403765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0403823Z return func(*args, **kwargs) 2025-08-14T21:38:35.0404044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0404107Z return func(*args, **kwargs) 2025-08-14T21:38:35.0404355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0404472Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0404716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0404797Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0404800Z 2025-08-14T21:38:35.0404914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0405096Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0405163Z return mod(**inputs) 2025-08-14T21:38:35.0405388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0405474Z return func(*args, **kwargs) 2025-08-14T21:38:35.0405698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0405759Z return func(*args, **kwargs) 2025-08-14T21:38:35.0405980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0406050Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0406313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0406376Z outputs = self.layoutlm( 2025-08-14T21:38:35.0406593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0406659Z return func(*args, **kwargs) 2025-08-14T21:38:35.0406872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0406950Z return func(*args, **kwargs) 2025-08-14T21:38:35.0407159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0407225Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0407482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0407547Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0407770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0407836Z return func(*args, **kwargs) 2025-08-14T21:38:35.0408056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0408116Z return func(*args, **kwargs) 2025-08-14T21:38:35.0408342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0408405Z return func(*args, **kwargs) 2025-08-14T21:38:35.0408481Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0408679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0408747Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0409002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0409067Z layer_outputs = layer_module( 2025-08-14T21:38:35.0409272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0409350Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0409571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0409640Z return func(*args, **kwargs) 2025-08-14T21:38:35.0409861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0409921Z return func(*args, **kwargs) 2025-08-14T21:38:35.0410149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0410207Z return func(*args, **kwargs) 2025-08-14T21:38:35.0410456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0410552Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0410794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0410869Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0411143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0411270Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0411519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0411605Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0411609Z 2025-08-14T21:38:35.0411711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0411891Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0411950Z return mod(**inputs) 2025-08-14T21:38:35.0412172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0412232Z return func(*args, **kwargs) 2025-08-14T21:38:35.0412448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0412536Z return func(*args, **kwargs) 2025-08-14T21:38:35.0412732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0412804Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0413046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0413109Z outputs = self.layoutlm( 2025-08-14T21:38:35.0413332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0413393Z return func(*args, **kwargs) 2025-08-14T21:38:35.0413615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0413674Z return func(*args, **kwargs) 2025-08-14T21:38:35.0413871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0413945Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0414186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0414253Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0414476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0414536Z return func(*args, **kwargs) 2025-08-14T21:38:35.0414759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0414820Z return func(*args, **kwargs) 2025-08-14T21:38:35.0415035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0415102Z return func(*args, **kwargs) 2025-08-14T21:38:35.0415173Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0415368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0415443Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0415687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0415758Z layer_outputs = layer_module( 2025-08-14T21:38:35.0415957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0416045Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0416270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0416331Z return func(*args, **kwargs) 2025-08-14T21:38:35.0416545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0416630Z return func(*args, **kwargs) 2025-08-14T21:38:35.0416845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0416912Z return func(*args, **kwargs) 2025-08-14T21:38:35.0417171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0417250Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0417495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0417564Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0417841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0417951Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0418212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0418320Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0418515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0418579Z return self.act(input) 2025-08-14T21:38:35.0418590Z 2025-08-14T21:38:35.0418685Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0418868Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0418943Z return mod(**inputs) 2025-08-14T21:38:35.0419162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0419222Z return func(*args, **kwargs) 2025-08-14T21:38:35.0419447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0419507Z return func(*args, **kwargs) 2025-08-14T21:38:35.0419705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0419779Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0420024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0420093Z outputs = self.layoutlm( 2025-08-14T21:38:35.0420313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0420373Z return func(*args, **kwargs) 2025-08-14T21:38:35.0420597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0420656Z return func(*args, **kwargs) 2025-08-14T21:38:35.0420863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0420929Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0421170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0421245Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0421461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0421520Z return func(*args, **kwargs) 2025-08-14T21:38:35.0421760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0421821Z return func(*args, **kwargs) 2025-08-14T21:38:35.0422044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0422122Z return func(*args, **kwargs) 2025-08-14T21:38:35.0422191Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0422396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0422463Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0422722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0422795Z layer_outputs = layer_module( 2025-08-14T21:38:35.0422996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0423073Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0423287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0423345Z return func(*args, **kwargs) 2025-08-14T21:38:35.0423567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0423653Z return func(*args, **kwargs) 2025-08-14T21:38:35.0423870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0423937Z return func(*args, **kwargs) 2025-08-14T21:38:35.0424181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0424263Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0424500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0424569Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0424924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0425058Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0425316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0425393Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0425397Z 2025-08-14T21:38:35.0425495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0425689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0425751Z return mod(**inputs) 2025-08-14T21:38:35.0425978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0426048Z return func(*args, **kwargs) 2025-08-14T21:38:35.0426279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0426348Z return func(*args, **kwargs) 2025-08-14T21:38:35.0426546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0426614Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0426869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0426934Z outputs = self.layoutlm( 2025-08-14T21:38:35.0427162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0427224Z return func(*args, **kwargs) 2025-08-14T21:38:35.0427467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0427537Z return func(*args, **kwargs) 2025-08-14T21:38:35.0427739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0427824Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0428083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0428148Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0428394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0428458Z return func(*args, **kwargs) 2025-08-14T21:38:35.0428681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0428750Z return func(*args, **kwargs) 2025-08-14T21:38:35.0428973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0429034Z return func(*args, **kwargs) 2025-08-14T21:38:35.0429112Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0429315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0429409Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0429663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0429729Z layer_outputs = layer_module( 2025-08-14T21:38:35.0429943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0430018Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0430243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0430311Z return func(*args, **kwargs) 2025-08-14T21:38:35.0430535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0430604Z return func(*args, **kwargs) 2025-08-14T21:38:35.0430830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0430891Z return func(*args, **kwargs) 2025-08-14T21:38:35.0431149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0431227Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0431457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0431518Z return func(*args, **kwargs) 2025-08-14T21:38:35.0431742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0431812Z return func(*args, **kwargs) 2025-08-14T21:38:35.0432037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0432101Z return func(*args, **kwargs) 2025-08-14T21:38:35.0432357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0432421Z self_outputs = self.self( 2025-08-14T21:38:35.0432654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0432714Z return func(*args, **kwargs) 2025-08-14T21:38:35.0432938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0433021Z return func(*args, **kwargs) 2025-08-14T21:38:35.0433244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0433306Z return func(*args, **kwargs) 2025-08-14T21:38:35.0433559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0433715Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0433719Z 2025-08-14T21:38:35.0433821Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0434021Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0434083Z return mod(**inputs) 2025-08-14T21:38:35.0434318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0434382Z return func(*args, **kwargs) 2025-08-14T21:38:35.0434616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0434678Z return func(*args, **kwargs) 2025-08-14T21:38:35.0434880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0434972Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0435222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0435287Z outputs = self.layoutlm( 2025-08-14T21:38:35.0435521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0435582Z return func(*args, **kwargs) 2025-08-14T21:38:35.0435812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0435874Z return func(*args, **kwargs) 2025-08-14T21:38:35.0436076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0436150Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0436400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0436471Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0436703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0436764Z return func(*args, **kwargs) 2025-08-14T21:38:35.0436994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0437055Z return func(*args, **kwargs) 2025-08-14T21:38:35.0437278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0437347Z return func(*args, **kwargs) 2025-08-14T21:38:35.0437417Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0437618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0437694Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0437945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0438014Z layer_outputs = layer_module( 2025-08-14T21:38:35.0438221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0438293Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0438520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0438603Z return func(*args, **kwargs) 2025-08-14T21:38:35.0438818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0438884Z return func(*args, **kwargs) 2025-08-14T21:38:35.0439101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0439185Z return func(*args, **kwargs) 2025-08-14T21:38:35.0439428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0439501Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0439738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0439799Z return func(*args, **kwargs) 2025-08-14T21:38:35.0440024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0440082Z return func(*args, **kwargs) 2025-08-14T21:38:35.0440295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0440361Z return func(*args, **kwargs) 2025-08-14T21:38:35.0440603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0440681Z self_outputs = self.self( 2025-08-14T21:38:35.0440906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0440965Z return func(*args, **kwargs) 2025-08-14T21:38:35.0441189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0441248Z return func(*args, **kwargs) 2025-08-14T21:38:35.0441466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0441531Z return func(*args, **kwargs) 2025-08-14T21:38:35.0441775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0441901Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0441915Z 2025-08-14T21:38:35.0442009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0442189Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0442255Z return mod(**inputs) 2025-08-14T21:38:35.0442474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0442533Z return func(*args, **kwargs) 2025-08-14T21:38:35.0442758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0442818Z return func(*args, **kwargs) 2025-08-14T21:38:35.0443024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0443091Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0443336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0443409Z outputs = self.layoutlm( 2025-08-14T21:38:35.0443626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0443686Z return func(*args, **kwargs) 2025-08-14T21:38:35.0443911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0443972Z return func(*args, **kwargs) 2025-08-14T21:38:35.0444193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0444262Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0444506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0444581Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0444814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0444874Z return func(*args, **kwargs) 2025-08-14T21:38:35.0445096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0445178Z return func(*args, **kwargs) 2025-08-14T21:38:35.0445405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0445465Z return func(*args, **kwargs) 2025-08-14T21:38:35.0445534Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0445738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0445803Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0446049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0446140Z layer_outputs = layer_module( 2025-08-14T21:38:35.0446342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0446420Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0446637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0446698Z return func(*args, **kwargs) 2025-08-14T21:38:35.0446923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0446985Z return func(*args, **kwargs) 2025-08-14T21:38:35.0447209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0447268Z return func(*args, **kwargs) 2025-08-14T21:38:35.0447510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0447596Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0447811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0447872Z return func(*args, **kwargs) 2025-08-14T21:38:35.0448098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0448159Z return func(*args, **kwargs) 2025-08-14T21:38:35.0448384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0448444Z return func(*args, **kwargs) 2025-08-14T21:38:35.0448688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0448758Z self_outputs = self.self( 2025-08-14T21:38:35.0448979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0449040Z return func(*args, **kwargs) 2025-08-14T21:38:35.0449263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0449326Z return func(*args, **kwargs) 2025-08-14T21:38:35.0449549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0449609Z return func(*args, **kwargs) 2025-08-14T21:38:35.0449901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0450044Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0450047Z 2025-08-14T21:38:35.0450119Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0450213Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0450309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0450492Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0450560Z return mod(**inputs) 2025-08-14T21:38:35.0450795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0450856Z return func(*args, **kwargs) 2025-08-14T21:38:35.0451083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0451143Z return func(*args, **kwargs) 2025-08-14T21:38:35.0451350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0451416Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0451660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0451750Z outputs = self.layoutlm( 2025-08-14T21:38:35.0451966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0452026Z return func(*args, **kwargs) 2025-08-14T21:38:35.0452255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0452315Z return func(*args, **kwargs) 2025-08-14T21:38:35.0452522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0452589Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0452832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0452906Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0453124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0453184Z return func(*args, **kwargs) 2025-08-14T21:38:35.0453410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0453471Z return func(*args, **kwargs) 2025-08-14T21:38:35.0453694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0453753Z return func(*args, **kwargs) 2025-08-14T21:38:35.0453821Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0454026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0454089Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0454332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0454404Z layer_outputs = layer_module( 2025-08-14T21:38:35.0454604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0454682Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0454899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0454958Z return func(*args, **kwargs) 2025-08-14T21:38:35.0455179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0455256Z return func(*args, **kwargs) 2025-08-14T21:38:35.0455478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0455537Z return func(*args, **kwargs) 2025-08-14T21:38:35.0455781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0455882Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0456096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0456156Z return func(*args, **kwargs) 2025-08-14T21:38:35.0456393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0456454Z return func(*args, **kwargs) 2025-08-14T21:38:35.0456680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0456739Z return func(*args, **kwargs) 2025-08-14T21:38:35.0456983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0457108Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0457368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0457444Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0457454Z 2025-08-14T21:38:35.0457547Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0457727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0457795Z return mod(**inputs) 2025-08-14T21:38:35.0458013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0458073Z return func(*args, **kwargs) 2025-08-14T21:38:35.0458294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0458355Z return func(*args, **kwargs) 2025-08-14T21:38:35.0458560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0458629Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0458873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0458945Z outputs = self.layoutlm( 2025-08-14T21:38:35.0459160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0459219Z return func(*args, **kwargs) 2025-08-14T21:38:35.0459446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0459505Z return func(*args, **kwargs) 2025-08-14T21:38:35.0459708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0459774Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0460022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0460094Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0460309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0460371Z return func(*args, **kwargs) 2025-08-14T21:38:35.0460596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0460656Z return func(*args, **kwargs) 2025-08-14T21:38:35.0460894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0460955Z return func(*args, **kwargs) 2025-08-14T21:38:35.0461022Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0461226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0461307Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0461552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0461622Z layer_outputs = layer_module( 2025-08-14T21:38:35.0461838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0461920Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0462136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0462197Z return func(*args, **kwargs) 2025-08-14T21:38:35.0462420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0462481Z return func(*args, **kwargs) 2025-08-14T21:38:35.0462705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0463010Z return func(*args, **kwargs) 2025-08-14T21:38:35.0463256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0463340Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0463581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0463650Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0463934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0464046Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0464299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0464376Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0464379Z 2025-08-14T21:38:35.0464474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0464673Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0464735Z return mod(**inputs) 2025-08-14T21:38:35.0465032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0465099Z return func(*args, **kwargs) 2025-08-14T21:38:35.0465327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0465399Z return func(*args, **kwargs) 2025-08-14T21:38:35.0465603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0465671Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0465944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0466009Z outputs = self.layoutlm( 2025-08-14T21:38:35.0466235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0466297Z return func(*args, **kwargs) 2025-08-14T21:38:35.0466513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0466581Z return func(*args, **kwargs) 2025-08-14T21:38:35.0466797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0466866Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0467125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0467219Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0467450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0467512Z return func(*args, **kwargs) 2025-08-14T21:38:35.0467752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0467823Z return func(*args, **kwargs) 2025-08-14T21:38:35.0468046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0468111Z return func(*args, **kwargs) 2025-08-14T21:38:35.0468187Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0468388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0468462Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0468711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0468797Z layer_outputs = layer_module( 2025-08-14T21:38:35.0469011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0469087Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0469309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0469378Z return func(*args, **kwargs) 2025-08-14T21:38:35.0469603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0469672Z return func(*args, **kwargs) 2025-08-14T21:38:35.0469895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0469956Z return func(*args, **kwargs) 2025-08-14T21:38:35.0470218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0470294Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0470547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0470618Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0470902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0471023Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0471275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0471381Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0471590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0471659Z return self.act(input) 2025-08-14T21:38:35.0471662Z 2025-08-14T21:38:35.0471765Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0471955Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0472016Z return mod(**inputs) 2025-08-14T21:38:35.0472249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0472310Z return func(*args, **kwargs) 2025-08-14T21:38:35.0472558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0472621Z return func(*args, **kwargs) 2025-08-14T21:38:35.0472826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0472921Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0473172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0473236Z outputs = self.layoutlm( 2025-08-14T21:38:35.0473480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0473551Z return func(*args, **kwargs) 2025-08-14T21:38:35.0473783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0473844Z return func(*args, **kwargs) 2025-08-14T21:38:35.0474045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0474119Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0474369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0474454Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0474687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0474747Z return func(*args, **kwargs) 2025-08-14T21:38:35.0474979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0475039Z return func(*args, **kwargs) 2025-08-14T21:38:35.0475266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0475333Z return func(*args, **kwargs) 2025-08-14T21:38:35.0475402Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0475606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0475680Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0475934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0476005Z layer_outputs = layer_module( 2025-08-14T21:38:35.0476210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0476285Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0476516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0476580Z return func(*args, **kwargs) 2025-08-14T21:38:35.0476814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0476876Z return func(*args, **kwargs) 2025-08-14T21:38:35.0477101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0477172Z return func(*args, **kwargs) 2025-08-14T21:38:35.0477424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0477503Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0477756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0477826Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0478141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0478263Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0478506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0478587Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0478607Z 2025-08-14T21:38:35.0478701Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0478889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0478948Z return mod(**inputs) 2025-08-14T21:38:35.0479178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0479247Z return func(*args, **kwargs) 2025-08-14T21:38:35.0479462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0479523Z return func(*args, **kwargs) 2025-08-14T21:38:35.0479725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0479789Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0480034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0480115Z outputs = self.layoutlm( 2025-08-14T21:38:35.0480333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0480398Z return func(*args, **kwargs) 2025-08-14T21:38:35.0480615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0480674Z return func(*args, **kwargs) 2025-08-14T21:38:35.0480879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0480943Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0481195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0481260Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0481479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0481549Z return func(*args, **kwargs) 2025-08-14T21:38:35.0481767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0481827Z return func(*args, **kwargs) 2025-08-14T21:38:35.0482049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0482109Z return func(*args, **kwargs) 2025-08-14T21:38:35.0482185Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0482385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0482450Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0482700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0482766Z layer_outputs = layer_module( 2025-08-14T21:38:35.0482967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0483045Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0483262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0483327Z return func(*args, **kwargs) 2025-08-14T21:38:35.0483544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0483618Z return func(*args, **kwargs) 2025-08-14T21:38:35.0483842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0483902Z return func(*args, **kwargs) 2025-08-14T21:38:35.0484153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0484246Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0484462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0484528Z return func(*args, **kwargs) 2025-08-14T21:38:35.0485079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0485158Z return func(*args, **kwargs) 2025-08-14T21:38:35.0485405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0485470Z return func(*args, **kwargs) 2025-08-14T21:38:35.0485745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0485815Z self_outputs = self.self( 2025-08-14T21:38:35.0486077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0486151Z return func(*args, **kwargs) 2025-08-14T21:38:35.0486385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0486451Z return func(*args, **kwargs) 2025-08-14T21:38:35.0486690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0486750Z return func(*args, **kwargs) 2025-08-14T21:38:35.0487003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0487138Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0487142Z 2025-08-14T21:38:35.0487235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0487424Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0487484Z return mod(**inputs) 2025-08-14T21:38:35.0487708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0487768Z return func(*args, **kwargs) 2025-08-14T21:38:35.0487984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0488048Z return func(*args, **kwargs) 2025-08-14T21:38:35.0488243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0488310Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0488561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0488623Z outputs = self.layoutlm( 2025-08-14T21:38:35.0488846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0488908Z return func(*args, **kwargs) 2025-08-14T21:38:35.0489123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0489191Z return func(*args, **kwargs) 2025-08-14T21:38:35.0489387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0489454Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0489731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0489800Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0490023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0490105Z return func(*args, **kwargs) 2025-08-14T21:38:35.0490320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0490389Z return func(*args, **kwargs) 2025-08-14T21:38:35.0490619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0490688Z return func(*args, **kwargs) 2025-08-14T21:38:35.0490758Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0490955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0491028Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0491271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0491334Z layer_outputs = layer_module( 2025-08-14T21:38:35.0491538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0491627Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0491851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0491911Z return func(*args, **kwargs) 2025-08-14T21:38:35.0492125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0492191Z return func(*args, **kwargs) 2025-08-14T21:38:35.0492409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0492468Z return func(*args, **kwargs) 2025-08-14T21:38:35.0492715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0492789Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0493013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0493074Z return func(*args, **kwargs) 2025-08-14T21:38:35.0493289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0493354Z return func(*args, **kwargs) 2025-08-14T21:38:35.0493569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0493629Z return func(*args, **kwargs) 2025-08-14T21:38:35.0493878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0493941Z self_outputs = self.self( 2025-08-14T21:38:35.0494162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0494225Z return func(*args, **kwargs) 2025-08-14T21:38:35.0494437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0494501Z return func(*args, **kwargs) 2025-08-14T21:38:35.0494715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0494773Z return func(*args, **kwargs) 2025-08-14T21:38:35.0495023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0495162Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0495166Z 2025-08-14T21:38:35.0495267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0495450Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0495509Z return mod(**inputs) 2025-08-14T21:38:35.0495749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0495808Z return func(*args, **kwargs) 2025-08-14T21:38:35.0496029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0496103Z return func(*args, **kwargs) 2025-08-14T21:38:35.0496302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0496375Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0496619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0496682Z outputs = self.layoutlm( 2025-08-14T21:38:35.0496906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0496988Z return func(*args, **kwargs) 2025-08-14T21:38:35.0497214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0497273Z return func(*args, **kwargs) 2025-08-14T21:38:35.0497475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0497547Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0497796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0497863Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0498090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0498150Z return func(*args, **kwargs) 2025-08-14T21:38:35.0498378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0498441Z return func(*args, **kwargs) 2025-08-14T21:38:35.0498660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0498727Z return func(*args, **kwargs) 2025-08-14T21:38:35.0498795Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0499004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0499070Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0499318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0499387Z layer_outputs = layer_module( 2025-08-14T21:38:35.0499590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0499664Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0499896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0499956Z return func(*args, **kwargs) 2025-08-14T21:38:35.0500182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0500243Z return func(*args, **kwargs) 2025-08-14T21:38:35.0500461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0500528Z return func(*args, **kwargs) 2025-08-14T21:38:35.0500790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0500868Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0501090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0501168Z return func(*args, **kwargs) 2025-08-14T21:38:35.0501390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0501449Z return func(*args, **kwargs) 2025-08-14T21:38:35.0501678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0501747Z return func(*args, **kwargs) 2025-08-14T21:38:35.0501994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0502057Z self_outputs = self.self( 2025-08-14T21:38:35.0502281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0502339Z return func(*args, **kwargs) 2025-08-14T21:38:35.0502561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0502639Z return func(*args, **kwargs) 2025-08-14T21:38:35.0502857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0502923Z return func(*args, **kwargs) 2025-08-14T21:38:35.0503167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0503308Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0503311Z 2025-08-14T21:38:35.0503383Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0503454Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0503557Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0503738Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0503798Z return mod(**inputs) 2025-08-14T21:38:35.0504025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0504086Z return func(*args, **kwargs) 2025-08-14T21:38:35.0504310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0504371Z return func(*args, **kwargs) 2025-08-14T21:38:35.0504572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0504648Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0504952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0505023Z outputs = self.layoutlm( 2025-08-14T21:38:35.0505256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0505318Z return func(*args, **kwargs) 2025-08-14T21:38:35.0505547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0505607Z return func(*args, **kwargs) 2025-08-14T21:38:35.0505806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0505880Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0506138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0506203Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0506447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0506511Z return func(*args, **kwargs) 2025-08-14T21:38:35.0506747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0506830Z return func(*args, **kwargs) 2025-08-14T21:38:35.0507055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0507123Z return func(*args, **kwargs) 2025-08-14T21:38:35.0507191Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0507409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0507485Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0507735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0507805Z layer_outputs = layer_module( 2025-08-14T21:38:35.0508012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0508083Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0508337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0508398Z return func(*args, **kwargs) 2025-08-14T21:38:35.0508629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0508690Z return func(*args, **kwargs) 2025-08-14T21:38:35.0508910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0508977Z return func(*args, **kwargs) 2025-08-14T21:38:35.0509227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0509302Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0509529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0509593Z return func(*args, **kwargs) 2025-08-14T21:38:35.0509821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0509881Z return func(*args, **kwargs) 2025-08-14T21:38:35.0510102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0510170Z return func(*args, **kwargs) 2025-08-14T21:38:35.0510419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0510541Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0510795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0510872Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0510875Z 2025-08-14T21:38:35.0510980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0511166Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0511227Z return mod(**inputs) 2025-08-14T21:38:35.0511457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0511519Z return func(*args, **kwargs) 2025-08-14T21:38:35.0511747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0511808Z return func(*args, **kwargs) 2025-08-14T21:38:35.0512022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0512102Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0512352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0512436Z outputs = self.layoutlm( 2025-08-14T21:38:35.0512667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0512728Z return func(*args, **kwargs) 2025-08-14T21:38:35.0512969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0513034Z return func(*args, **kwargs) 2025-08-14T21:38:35.0513235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0513310Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0513563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0513630Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0513861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0513945Z return func(*args, **kwargs) 2025-08-14T21:38:35.0514182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0514244Z return func(*args, **kwargs) 2025-08-14T21:38:35.0514471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0514537Z return func(*args, **kwargs) 2025-08-14T21:38:35.0514607Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0514812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0514886Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0515135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0515206Z layer_outputs = layer_module( 2025-08-14T21:38:35.0515414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0515488Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0515717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0515780Z return func(*args, **kwargs) 2025-08-14T21:38:35.0516009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0516071Z return func(*args, **kwargs) 2025-08-14T21:38:35.0516295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0516362Z return func(*args, **kwargs) 2025-08-14T21:38:35.0516611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0516693Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0516945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0517016Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0517306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0517419Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0517697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0517786Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0517789Z 2025-08-14T21:38:35.0517886Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0518075Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0518155Z return mod(**inputs) 2025-08-14T21:38:35.0518376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0518444Z return func(*args, **kwargs) 2025-08-14T21:38:35.0518677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0518738Z return func(*args, **kwargs) 2025-08-14T21:38:35.0518942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0519009Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0519264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0519328Z outputs = self.layoutlm( 2025-08-14T21:38:35.0519544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0519627Z return func(*args, **kwargs) 2025-08-14T21:38:35.0519841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0519901Z return func(*args, **kwargs) 2025-08-14T21:38:35.0520107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0520174Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0520422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0520487Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0520704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0520769Z return func(*args, **kwargs) 2025-08-14T21:38:35.0520984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0521045Z return func(*args, **kwargs) 2025-08-14T21:38:35.0521265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0521324Z return func(*args, **kwargs) 2025-08-14T21:38:35.0521400Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0521595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0521660Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0521910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0521974Z layer_outputs = layer_module( 2025-08-14T21:38:35.0522180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0522253Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0522469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0522535Z return func(*args, **kwargs) 2025-08-14T21:38:35.0522750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0522810Z return func(*args, **kwargs) 2025-08-14T21:38:35.0523033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0523109Z return func(*args, **kwargs) 2025-08-14T21:38:35.0523363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0523439Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0523677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0523770Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0524045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0524167Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0524421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0524522Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0524722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0524784Z return self.act(input) 2025-08-14T21:38:35.0524788Z 2025-08-14T21:38:35.0524881Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0525068Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0525147Z return mod(**inputs) 2025-08-14T21:38:35.0525375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0525437Z return func(*args, **kwargs) 2025-08-14T21:38:35.0525656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0525723Z return func(*args, **kwargs) 2025-08-14T21:38:35.0525919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0525986Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0526234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0526299Z outputs = self.layoutlm( 2025-08-14T21:38:35.0526521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0526584Z return func(*args, **kwargs) 2025-08-14T21:38:35.0526799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0526866Z return func(*args, **kwargs) 2025-08-14T21:38:35.0527064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0527130Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0527379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0527445Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0527670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0527730Z return func(*args, **kwargs) 2025-08-14T21:38:35.0527948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0528020Z return func(*args, **kwargs) 2025-08-14T21:38:35.0528238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0528305Z return func(*args, **kwargs) 2025-08-14T21:38:35.0528373Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0528568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0528654Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0528898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0528962Z layer_outputs = layer_module( 2025-08-14T21:38:35.0529165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0529253Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0529475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0529536Z return func(*args, **kwargs) 2025-08-14T21:38:35.0529776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0529841Z return func(*args, **kwargs) 2025-08-14T21:38:35.0530059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0530118Z return func(*args, **kwargs) 2025-08-14T21:38:35.0530372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0530446Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0530695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0530780Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0531052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0531181Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0531424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0531507Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0531511Z 2025-08-14T21:38:35.0531605Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0531785Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0531851Z return mod(**inputs) 2025-08-14T21:38:35.0532069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0532128Z return func(*args, **kwargs) 2025-08-14T21:38:35.0532351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0532412Z return func(*args, **kwargs) 2025-08-14T21:38:35.0532615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0532681Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0532925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0532994Z outputs = self.layoutlm( 2025-08-14T21:38:35.0533208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0533266Z return func(*args, **kwargs) 2025-08-14T21:38:35.0533489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0533547Z return func(*args, **kwargs) 2025-08-14T21:38:35.0533749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0533816Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0534057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0534125Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0534352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0534419Z return func(*args, **kwargs) 2025-08-14T21:38:35.0534634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0534711Z return func(*args, **kwargs) 2025-08-14T21:38:35.0534933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0534991Z return func(*args, **kwargs) 2025-08-14T21:38:35.0535060Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0535275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0535342Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0535591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0535654Z layer_outputs = layer_module( 2025-08-14T21:38:35.0535854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0535931Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0536152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0536227Z return func(*args, **kwargs) 2025-08-14T21:38:35.0536453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0536514Z return func(*args, **kwargs) 2025-08-14T21:38:35.0536741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0536797Z return func(*args, **kwargs) 2025-08-14T21:38:35.0537041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0537123Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0537336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0537398Z return func(*args, **kwargs) 2025-08-14T21:38:35.0537615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0537675Z return func(*args, **kwargs) 2025-08-14T21:38:35.0537898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0537956Z return func(*args, **kwargs) 2025-08-14T21:38:35.0538198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0538268Z self_outputs = self.self( 2025-08-14T21:38:35.0538488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0538550Z return func(*args, **kwargs) 2025-08-14T21:38:35.0538774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0538837Z return func(*args, **kwargs) 2025-08-14T21:38:35.0539059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0539119Z return func(*args, **kwargs) 2025-08-14T21:38:35.0539366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0539508Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0539511Z 2025-08-14T21:38:35.0539605Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0539815Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0539877Z return mod(**inputs) 2025-08-14T21:38:35.0540096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0540181Z return func(*args, **kwargs) 2025-08-14T21:38:35.0540396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0540456Z return func(*args, **kwargs) 2025-08-14T21:38:35.0540677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0540745Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0540996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0541060Z outputs = self.layoutlm( 2025-08-14T21:38:35.0541279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0541345Z return func(*args, **kwargs) 2025-08-14T21:38:35.0541562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0541640Z return func(*args, **kwargs) 2025-08-14T21:38:35.0541846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0541911Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0542168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0542234Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0542455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0542523Z return func(*args, **kwargs) 2025-08-14T21:38:35.0542745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0542811Z return func(*args, **kwargs) 2025-08-14T21:38:35.0543029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0543091Z return func(*args, **kwargs) 2025-08-14T21:38:35.0543166Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0543365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0543431Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0543687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0543749Z layer_outputs = layer_module( 2025-08-14T21:38:35.0543961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0544032Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0544252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0544320Z return func(*args, **kwargs) 2025-08-14T21:38:35.0544546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0544607Z return func(*args, **kwargs) 2025-08-14T21:38:35.0544935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0545005Z return func(*args, **kwargs) 2025-08-14T21:38:35.0545268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0545364Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0545591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0545660Z return func(*args, **kwargs) 2025-08-14T21:38:35.0545893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0545972Z return func(*args, **kwargs) 2025-08-14T21:38:35.0546193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0546253Z return func(*args, **kwargs) 2025-08-14T21:38:35.0546519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0546583Z self_outputs = self.self( 2025-08-14T21:38:35.0546800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0546869Z return func(*args, **kwargs) 2025-08-14T21:38:35.0547084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0547151Z return func(*args, **kwargs) 2025-08-14T21:38:35.0547367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0547443Z return func(*args, **kwargs) 2025-08-14T21:38:35.0547698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0547824Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0547828Z 2025-08-14T21:38:35.0547920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0548106Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0548166Z return mod(**inputs) 2025-08-14T21:38:35.0548393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0548455Z return func(*args, **kwargs) 2025-08-14T21:38:35.0548670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0548738Z return func(*args, **kwargs) 2025-08-14T21:38:35.0548934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0549001Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0549253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0549316Z outputs = self.layoutlm( 2025-08-14T21:38:35.0549537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0549599Z return func(*args, **kwargs) 2025-08-14T21:38:35.0549814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0549880Z return func(*args, **kwargs) 2025-08-14T21:38:35.0550077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0550153Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0550397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0550463Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0550685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0550745Z return func(*args, **kwargs) 2025-08-14T21:38:35.0550974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0551045Z return func(*args, **kwargs) 2025-08-14T21:38:35.0551262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0551328Z return func(*args, **kwargs) 2025-08-14T21:38:35.0551426Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0551625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0551699Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0551958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0552025Z layer_outputs = layer_module( 2025-08-14T21:38:35.0552234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0552310Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0552532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0552596Z return func(*args, **kwargs) 2025-08-14T21:38:35.0552814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0552900Z return func(*args, **kwargs) 2025-08-14T21:38:35.0553120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0553181Z return func(*args, **kwargs) 2025-08-14T21:38:35.0553438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0553515Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0553743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0553805Z return func(*args, **kwargs) 2025-08-14T21:38:35.0554026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0554094Z return func(*args, **kwargs) 2025-08-14T21:38:35.0554315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0554386Z return func(*args, **kwargs) 2025-08-14T21:38:35.0554637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0554702Z self_outputs = self.self( 2025-08-14T21:38:35.0554933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0554992Z return func(*args, **kwargs) 2025-08-14T21:38:35.0555215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0555283Z return func(*args, **kwargs) 2025-08-14T21:38:35.0555505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0555572Z return func(*args, **kwargs) 2025-08-14T21:38:35.0555824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0555958Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0555962Z 2025-08-14T21:38:35.0556041Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0556113Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0556208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0556397Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0556473Z return mod(**inputs) 2025-08-14T21:38:35.0556698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0556757Z return func(*args, **kwargs) 2025-08-14T21:38:35.0556971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0557055Z return func(*args, **kwargs) 2025-08-14T21:38:35.0557257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0557324Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0557596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0557661Z outputs = self.layoutlm( 2025-08-14T21:38:35.0557885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0557946Z return func(*args, **kwargs) 2025-08-14T21:38:35.0558160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0558226Z return func(*args, **kwargs) 2025-08-14T21:38:35.0558422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0558510Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0558754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0558819Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0559041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0559101Z return func(*args, **kwargs) 2025-08-14T21:38:35.0559316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0559383Z return func(*args, **kwargs) 2025-08-14T21:38:35.0559596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0559660Z return func(*args, **kwargs) 2025-08-14T21:38:35.0559728Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0559927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0559998Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0560241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0560304Z layer_outputs = layer_module( 2025-08-14T21:38:35.0560511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0560583Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0560806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0560865Z return func(*args, **kwargs) 2025-08-14T21:38:35.0561079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0561148Z return func(*args, **kwargs) 2025-08-14T21:38:35.0561363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0561423Z return func(*args, **kwargs) 2025-08-14T21:38:35.0561673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0561746Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0561968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0562043Z return func(*args, **kwargs) 2025-08-14T21:38:35.0562259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0562325Z return func(*args, **kwargs) 2025-08-14T21:38:35.0562539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0562619Z return func(*args, **kwargs) 2025-08-14T21:38:35.0562872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0563003Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0563256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0563332Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0563335Z 2025-08-14T21:38:35.0563430Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0563617Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0563677Z return mod(**inputs) 2025-08-14T21:38:35.0563897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0563980Z return func(*args, **kwargs) 2025-08-14T21:38:35.0564198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0564262Z return func(*args, **kwargs) 2025-08-14T21:38:35.0564459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0564526Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0564777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0564840Z outputs = self.layoutlm( 2025-08-14T21:38:35.0565065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0565123Z return func(*args, **kwargs) 2025-08-14T21:38:35.0565340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0565408Z return func(*args, **kwargs) 2025-08-14T21:38:35.0565607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0565679Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0565929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0565994Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0566222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0566284Z return func(*args, **kwargs) 2025-08-14T21:38:35.0566502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0566570Z return func(*args, **kwargs) 2025-08-14T21:38:35.0566788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0566857Z return func(*args, **kwargs) 2025-08-14T21:38:35.0566926Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0567125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0567198Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0567442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0567521Z layer_outputs = layer_module( 2025-08-14T21:38:35.0567727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0567798Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0568019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0568094Z return func(*args, **kwargs) 2025-08-14T21:38:35.0568314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0568381Z return func(*args, **kwargs) 2025-08-14T21:38:35.0568616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0568678Z return func(*args, **kwargs) 2025-08-14T21:38:35.0568933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0569008Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0569254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0569321Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0569609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0569726Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0569972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0570053Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0570056Z 2025-08-14T21:38:35.0570148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0570331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0570398Z return mod(**inputs) 2025-08-14T21:38:35.0570613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0570674Z return func(*args, **kwargs) 2025-08-14T21:38:35.0570896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0570958Z return func(*args, **kwargs) 2025-08-14T21:38:35.0571160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0571228Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0571471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0571543Z outputs = self.layoutlm( 2025-08-14T21:38:35.0571761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0571821Z return func(*args, **kwargs) 2025-08-14T21:38:35.0572045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0572105Z return func(*args, **kwargs) 2025-08-14T21:38:35.0572315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0572381Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0572625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0572698Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0572917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0572984Z return func(*args, **kwargs) 2025-08-14T21:38:35.0573223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0573284Z return func(*args, **kwargs) 2025-08-14T21:38:35.0573506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0573584Z return func(*args, **kwargs) 2025-08-14T21:38:35.0573653Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0573858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0573924Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0574188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0574254Z layer_outputs = layer_module( 2025-08-14T21:38:35.0574454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0574534Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0574747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0574806Z return func(*args, **kwargs) 2025-08-14T21:38:35.0575026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0575104Z return func(*args, **kwargs) 2025-08-14T21:38:35.0575324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0575383Z return func(*args, **kwargs) 2025-08-14T21:38:35.0575626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0575707Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0575942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0576009Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0576288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0576399Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0576647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0576750Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0576944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0577014Z return self.act(input) 2025-08-14T21:38:35.0577017Z 2025-08-14T21:38:35.0577111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0577301Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0577362Z return mod(**inputs) 2025-08-14T21:38:35.0577580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0577647Z return func(*args, **kwargs) 2025-08-14T21:38:35.0577865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0577925Z return func(*args, **kwargs) 2025-08-14T21:38:35.0578131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0578200Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0578451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0578514Z outputs = self.layoutlm( 2025-08-14T21:38:35.0578746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0578816Z return func(*args, **kwargs) 2025-08-14T21:38:35.0579031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0579115Z return func(*args, **kwargs) 2025-08-14T21:38:35.0579317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0579384Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0579652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0579720Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0579938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0580010Z return func(*args, **kwargs) 2025-08-14T21:38:35.0580228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0580296Z return func(*args, **kwargs) 2025-08-14T21:38:35.0580513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0580590Z return func(*args, **kwargs) 2025-08-14T21:38:35.0580666Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0580863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0580927Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0581182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0581245Z layer_outputs = layer_module( 2025-08-14T21:38:35.0581457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0581534Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0581752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0581816Z return func(*args, **kwargs) 2025-08-14T21:38:35.0582037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0582095Z return func(*args, **kwargs) 2025-08-14T21:38:35.0582316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0582375Z return func(*args, **kwargs) 2025-08-14T21:38:35.0582623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0582694Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0582929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0583000Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0583269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0583394Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0583637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0583707Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0583711Z 2025-08-14T21:38:35.0583808Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0583989Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0584048Z return mod(**inputs) 2025-08-14T21:38:35.0584292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0584357Z return func(*args, **kwargs) 2025-08-14T21:38:35.0584845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0584989Z return func(*args, **kwargs) 2025-08-14T21:38:35.0585223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0585307Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0585621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0585699Z outputs = self.layoutlm( 2025-08-14T21:38:35.0585961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0586032Z return func(*args, **kwargs) 2025-08-14T21:38:35.0586292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0586354Z return func(*args, **kwargs) 2025-08-14T21:38:35.0586549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0586651Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0586902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0586979Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0587210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0587273Z return func(*args, **kwargs) 2025-08-14T21:38:35.0587507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0587568Z return func(*args, **kwargs) 2025-08-14T21:38:35.0587796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0587865Z return func(*args, **kwargs) 2025-08-14T21:38:35.0587937Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0588151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0588221Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0588481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0588553Z layer_outputs = layer_module( 2025-08-14T21:38:35.0588766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0588842Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0589077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0589140Z return func(*args, **kwargs) 2025-08-14T21:38:35.0589373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0589438Z return func(*args, **kwargs) 2025-08-14T21:38:35.0589665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0589734Z return func(*args, **kwargs) 2025-08-14T21:38:35.0589992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0590070Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0590298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0590382Z return func(*args, **kwargs) 2025-08-14T21:38:35.0590619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0590683Z return func(*args, **kwargs) 2025-08-14T21:38:35.0590911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0591001Z return func(*args, **kwargs) 2025-08-14T21:38:35.0591262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0591337Z self_outputs = self.self( 2025-08-14T21:38:35.0591585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0591650Z return func(*args, **kwargs) 2025-08-14T21:38:35.0591888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0591950Z return func(*args, **kwargs) 2025-08-14T21:38:35.0592181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0592250Z return func(*args, **kwargs) 2025-08-14T21:38:35.0592509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0592686Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0592690Z 2025-08-14T21:38:35.0592789Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0592983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0593049Z return mod(**inputs) 2025-08-14T21:38:35.0593280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0593340Z return func(*args, **kwargs) 2025-08-14T21:38:35.0593574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0593632Z return func(*args, **kwargs) 2025-08-14T21:38:35.0593837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0593906Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0594159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0594229Z outputs = self.layoutlm( 2025-08-14T21:38:35.0594445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0594513Z return func(*args, **kwargs) 2025-08-14T21:38:35.0594730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0594788Z return func(*args, **kwargs) 2025-08-14T21:38:35.0594992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0595056Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0595301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0595375Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0595591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0595658Z return func(*args, **kwargs) 2025-08-14T21:38:35.0595873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0595932Z return func(*args, **kwargs) 2025-08-14T21:38:35.0596166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0596226Z return func(*args, **kwargs) 2025-08-14T21:38:35.0596292Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0596491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0596576Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0596821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0596881Z layer_outputs = layer_module( 2025-08-14T21:38:35.0597097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0597176Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0597395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0597457Z return func(*args, **kwargs) 2025-08-14T21:38:35.0597680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0597739Z return func(*args, **kwargs) 2025-08-14T21:38:35.0597960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0598037Z return func(*args, **kwargs) 2025-08-14T21:38:35.0598279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0598361Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0598576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0598641Z return func(*args, **kwargs) 2025-08-14T21:38:35.0598859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0598919Z return func(*args, **kwargs) 2025-08-14T21:38:35.0599141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0599201Z return func(*args, **kwargs) 2025-08-14T21:38:35.0599447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0599518Z self_outputs = self.self( 2025-08-14T21:38:35.0599737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0599804Z return func(*args, **kwargs) 2025-08-14T21:38:35.0600018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0600073Z return func(*args, **kwargs) 2025-08-14T21:38:35.0600295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0600353Z return func(*args, **kwargs) 2025-08-14T21:38:35.0600591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0600723Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0600727Z 2025-08-14T21:38:35.0600820Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0601009Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0601068Z return mod(**inputs) 2025-08-14T21:38:35.0601286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0601353Z return func(*args, **kwargs) 2025-08-14T21:38:35.0601584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0601653Z return func(*args, **kwargs) 2025-08-14T21:38:35.0601851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0601916Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0602180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0602243Z outputs = self.layoutlm( 2025-08-14T21:38:35.0602457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0602539Z return func(*args, **kwargs) 2025-08-14T21:38:35.0602755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0602821Z return func(*args, **kwargs) 2025-08-14T21:38:35.0603019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0603083Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0603329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0603414Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0603636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0603703Z return func(*args, **kwargs) 2025-08-14T21:38:35.0603924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0603990Z return func(*args, **kwargs) 2025-08-14T21:38:35.0604211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0604270Z return func(*args, **kwargs) 2025-08-14T21:38:35.0604349Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0604550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0604614Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0604866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0604932Z layer_outputs = layer_module( 2025-08-14T21:38:35.0605141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0605212Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0605432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0605496Z return func(*args, **kwargs) 2025-08-14T21:38:35.0605710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0605770Z return func(*args, **kwargs) 2025-08-14T21:38:35.0605992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0606048Z return func(*args, **kwargs) 2025-08-14T21:38:35.0606296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0606367Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0606582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0606647Z return func(*args, **kwargs) 2025-08-14T21:38:35.0606862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0606929Z return func(*args, **kwargs) 2025-08-14T21:38:35.0607163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0607225Z return func(*args, **kwargs) 2025-08-14T21:38:35.0607476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0607557Z self_outputs = self.self( 2025-08-14T21:38:35.0607781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0607848Z return func(*args, **kwargs) 2025-08-14T21:38:35.0608082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0608150Z return func(*args, **kwargs) 2025-08-14T21:38:35.0608372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0608433Z return func(*args, **kwargs) 2025-08-14T21:38:35.0608690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0608824Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0608827Z 2025-08-14T21:38:35.0608908Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0608996Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0609090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0609277Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0609336Z return mod(**inputs) 2025-08-14T21:38:35.0609556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0609624Z return func(*args, **kwargs) 2025-08-14T21:38:35.0609842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0609904Z return func(*args, **kwargs) 2025-08-14T21:38:35.0610108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0610175Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0610428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0610491Z outputs = self.layoutlm( 2025-08-14T21:38:35.0610702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0610771Z return func(*args, **kwargs) 2025-08-14T21:38:35.0610985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0611053Z return func(*args, **kwargs) 2025-08-14T21:38:35.0611250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0611316Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0611565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0611632Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0611850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0611919Z return func(*args, **kwargs) 2025-08-14T21:38:35.0612135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0612201Z return func(*args, **kwargs) 2025-08-14T21:38:35.0612417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0612475Z return func(*args, **kwargs) 2025-08-14T21:38:35.0612563Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0612761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0612825Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0613073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0613150Z layer_outputs = layer_module( 2025-08-14T21:38:35.0613354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0613420Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0613659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0613725Z return func(*args, **kwargs) 2025-08-14T21:38:35.0613939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0613996Z return func(*args, **kwargs) 2025-08-14T21:38:35.0614214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0614273Z return func(*args, **kwargs) 2025-08-14T21:38:35.0614541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0614614Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0614832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0614900Z return func(*args, **kwargs) 2025-08-14T21:38:35.0615117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0615183Z return func(*args, **kwargs) 2025-08-14T21:38:35.0615401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0615460Z return func(*args, **kwargs) 2025-08-14T21:38:35.0615714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0615834Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0616077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0616162Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0616165Z 2025-08-14T21:38:35.0616260Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0616448Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0616507Z return mod(**inputs) 2025-08-14T21:38:35.0616726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0616793Z return func(*args, **kwargs) 2025-08-14T21:38:35.0617009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0617067Z return func(*args, **kwargs) 2025-08-14T21:38:35.0617273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0617339Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0617584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0617648Z outputs = self.layoutlm( 2025-08-14T21:38:35.0617863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0617921Z return func(*args, **kwargs) 2025-08-14T21:38:35.0618147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0618207Z return func(*args, **kwargs) 2025-08-14T21:38:35.0618406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0618502Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0618755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0618821Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0619055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0619123Z return func(*args, **kwargs) 2025-08-14T21:38:35.0619338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0619407Z return func(*args, **kwargs) 2025-08-14T21:38:35.0619622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0619683Z return func(*args, **kwargs) 2025-08-14T21:38:35.0619760Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0619955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0620039Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0620292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0620356Z layer_outputs = layer_module( 2025-08-14T21:38:35.0620562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0620633Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0620849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0620916Z return func(*args, **kwargs) 2025-08-14T21:38:35.0621131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0621190Z return func(*args, **kwargs) 2025-08-14T21:38:35.0621415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0621477Z return func(*args, **kwargs) 2025-08-14T21:38:35.0621723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0621801Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0622036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0622110Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0622385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0622500Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0622742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0622819Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0622822Z 2025-08-14T21:38:35.0622923Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0623106Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0623165Z return mod(**inputs) 2025-08-14T21:38:35.0623391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0623452Z return func(*args, **kwargs) 2025-08-14T21:38:35.0623689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0623751Z return func(*args, **kwargs) 2025-08-14T21:38:35.0623951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0624044Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0624287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0624354Z outputs = self.layoutlm( 2025-08-14T21:38:35.0624587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0624646Z return func(*args, **kwargs) 2025-08-14T21:38:35.0624938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0625007Z return func(*args, **kwargs) 2025-08-14T21:38:35.0625204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0625273Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0625554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0625651Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0625878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0625939Z return func(*args, **kwargs) 2025-08-14T21:38:35.0626212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0626284Z return func(*args, **kwargs) 2025-08-14T21:38:35.0626504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0626575Z return func(*args, **kwargs) 2025-08-14T21:38:35.0626644Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0626849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0626916Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0627163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0627235Z layer_outputs = layer_module( 2025-08-14T21:38:35.0627436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0627509Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0627731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0627791Z return func(*args, **kwargs) 2025-08-14T21:38:35.0628014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0628073Z return func(*args, **kwargs) 2025-08-14T21:38:35.0628289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0628359Z return func(*args, **kwargs) 2025-08-14T21:38:35.0628604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0628681Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0628927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0628998Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0629295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0629405Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0629647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0629757Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0629968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0630036Z return self.act(input) 2025-08-14T21:38:35.0630039Z 2025-08-14T21:38:35.0630133Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0630326Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0630389Z return mod(**inputs) 2025-08-14T21:38:35.0630605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0630662Z return func(*args, **kwargs) 2025-08-14T21:38:35.0630878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0630934Z return func(*args, **kwargs) 2025-08-14T21:38:35.0631132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0631213Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0631450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0631515Z outputs = self.layoutlm( 2025-08-14T21:38:35.0631729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0631797Z return func(*args, **kwargs) 2025-08-14T21:38:35.0632013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0632073Z return func(*args, **kwargs) 2025-08-14T21:38:35.0632277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0632343Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0632586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0632663Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0632879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0632948Z return func(*args, **kwargs) 2025-08-14T21:38:35.0633163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0633223Z return func(*args, **kwargs) 2025-08-14T21:38:35.0633450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0633509Z return func(*args, **kwargs) 2025-08-14T21:38:35.0633578Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0633784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0633852Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0634103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0634169Z layer_outputs = layer_module( 2025-08-14T21:38:35.0634371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0634452Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0634665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0634740Z return func(*args, **kwargs) 2025-08-14T21:38:35.0634966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0635026Z return func(*args, **kwargs) 2025-08-14T21:38:35.0635250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0635333Z return func(*args, **kwargs) 2025-08-14T21:38:35.0635579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0635661Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0635912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0635998Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0636271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0636391Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0636641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0636730Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0636734Z 2025-08-14T21:38:35.0636827Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0637016Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0637076Z return mod(**inputs) 2025-08-14T21:38:35.0637302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0637362Z return func(*args, **kwargs) 2025-08-14T21:38:35.0637579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0637648Z return func(*args, **kwargs) 2025-08-14T21:38:35.0637844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0637917Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0638160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0638226Z outputs = self.layoutlm( 2025-08-14T21:38:35.0638449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0638508Z return func(*args, **kwargs) 2025-08-14T21:38:35.0638723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0638791Z return func(*args, **kwargs) 2025-08-14T21:38:35.0638987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0639059Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0639300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0639366Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0639592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0639652Z return func(*args, **kwargs) 2025-08-14T21:38:35.0639867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0639936Z return func(*args, **kwargs) 2025-08-14T21:38:35.0640151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0640219Z return func(*args, **kwargs) 2025-08-14T21:38:35.0640303Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0640502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0640577Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0640819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0640902Z layer_outputs = layer_module( 2025-08-14T21:38:35.0641111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0641182Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0641416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0641478Z return func(*args, **kwargs) 2025-08-14T21:38:35.0641695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0641762Z return func(*args, **kwargs) 2025-08-14T21:38:35.0641978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0642038Z return func(*args, **kwargs) 2025-08-14T21:38:35.0642290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0642385Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0642609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0642670Z return func(*args, **kwargs) 2025-08-14T21:38:35.0642885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0642953Z return func(*args, **kwargs) 2025-08-14T21:38:35.0643169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0643236Z return func(*args, **kwargs) 2025-08-14T21:38:35.0643477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0643542Z self_outputs = self.self( 2025-08-14T21:38:35.0643766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0643825Z return func(*args, **kwargs) 2025-08-14T21:38:35.0644041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0644111Z return func(*args, **kwargs) 2025-08-14T21:38:35.0644324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0644391Z return func(*args, **kwargs) 2025-08-14T21:38:35.0644636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0644769Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0644772Z 2025-08-14T21:38:35.0644875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0645059Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0645125Z return mod(**inputs) 2025-08-14T21:38:35.0645345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0645407Z return func(*args, **kwargs) 2025-08-14T21:38:35.0645631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0645691Z return func(*args, **kwargs) 2025-08-14T21:38:35.0645903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0645981Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0646228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0646321Z outputs = self.layoutlm( 2025-08-14T21:38:35.0646542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0646604Z return func(*args, **kwargs) 2025-08-14T21:38:35.0646841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0646903Z return func(*args, **kwargs) 2025-08-14T21:38:35.0647102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0647178Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0647426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0647501Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0647723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0647799Z return func(*args, **kwargs) 2025-08-14T21:38:35.0648025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0648085Z return func(*args, **kwargs) 2025-08-14T21:38:35.0648304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0648371Z return func(*args, **kwargs) 2025-08-14T21:38:35.0648441Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0648649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0648714Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0648956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0649026Z layer_outputs = layer_module( 2025-08-14T21:38:35.0649228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0649299Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0649523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0649584Z return func(*args, **kwargs) 2025-08-14T21:38:35.0649808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0649868Z return func(*args, **kwargs) 2025-08-14T21:38:35.0650086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0650151Z return func(*args, **kwargs) 2025-08-14T21:38:35.0650395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0650479Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0650696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0650755Z return func(*args, **kwargs) 2025-08-14T21:38:35.0650982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0651042Z return func(*args, **kwargs) 2025-08-14T21:38:35.0651260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0651326Z return func(*args, **kwargs) 2025-08-14T21:38:35.0651588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0651662Z self_outputs = self.self( 2025-08-14T21:38:35.0651880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0651957Z return func(*args, **kwargs) 2025-08-14T21:38:35.0652181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0652240Z return func(*args, **kwargs) 2025-08-14T21:38:35.0652476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0652544Z return func(*args, **kwargs) 2025-08-14T21:38:35.0652790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0652921Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0652925Z 2025-08-14T21:38:35.0653018Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0653203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0653293Z return mod(**inputs) 2025-08-14T21:38:35.0653510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0653577Z return func(*args, **kwargs) 2025-08-14T21:38:35.0653795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0653856Z return func(*args, **kwargs) 2025-08-14T21:38:35.0654062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0654129Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0654375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0654446Z outputs = self.layoutlm( 2025-08-14T21:38:35.0654662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0654730Z return func(*args, **kwargs) 2025-08-14T21:38:35.0654947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0655007Z return func(*args, **kwargs) 2025-08-14T21:38:35.0655212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0655278Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0655522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0655598Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0655816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0655883Z return func(*args, **kwargs) 2025-08-14T21:38:35.0656101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0656162Z return func(*args, **kwargs) 2025-08-14T21:38:35.0656385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0656445Z return func(*args, **kwargs) 2025-08-14T21:38:35.0656515Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0656718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0656781Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0657058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0657122Z layer_outputs = layer_module( 2025-08-14T21:38:35.0657320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0657416Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0657633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0657702Z return func(*args, **kwargs) 2025-08-14T21:38:35.0657931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0657992Z return func(*args, **kwargs) 2025-08-14T21:38:35.0658215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0658276Z return func(*args, **kwargs) 2025-08-14T21:38:35.0658518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0658602Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0658819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0658906Z return func(*args, **kwargs) 2025-08-14T21:38:35.0659125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0659185Z return func(*args, **kwargs) 2025-08-14T21:38:35.0659413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0659473Z return func(*args, **kwargs) 2025-08-14T21:38:35.0659724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0659795Z self_outputs = self.self( 2025-08-14T21:38:35.0660017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0660084Z return func(*args, **kwargs) 2025-08-14T21:38:35.0660303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0660366Z return func(*args, **kwargs) 2025-08-14T21:38:35.0660591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0660651Z return func(*args, **kwargs) 2025-08-14T21:38:35.0660901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0661042Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0661045Z 2025-08-14T21:38:35.0661118Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0661196Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0661292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0661477Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0661546Z return mod(**inputs) 2025-08-14T21:38:35.0661767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0661833Z return func(*args, **kwargs) 2025-08-14T21:38:35.0662053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0662112Z return func(*args, **kwargs) 2025-08-14T21:38:35.0662321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0662387Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0662651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0662725Z outputs = self.layoutlm( 2025-08-14T21:38:35.0662938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0663022Z return func(*args, **kwargs) 2025-08-14T21:38:35.0663237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0663297Z return func(*args, **kwargs) 2025-08-14T21:38:35.0663521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0663591Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0663839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0663913Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0664133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0664200Z return func(*args, **kwargs) 2025-08-14T21:38:35.0664417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0664494Z return func(*args, **kwargs) 2025-08-14T21:38:35.0664725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0664850Z return func(*args, **kwargs) 2025-08-14T21:38:35.0664933Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0665144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0665210Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0665467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0665533Z layer_outputs = layer_module( 2025-08-14T21:38:35.0665741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0665822Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0666052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0666118Z return func(*args, **kwargs) 2025-08-14T21:38:35.0666348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0666406Z return func(*args, **kwargs) 2025-08-14T21:38:35.0666633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0666693Z return func(*args, **kwargs) 2025-08-14T21:38:35.0666937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0667022Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0667237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0667308Z return func(*args, **kwargs) 2025-08-14T21:38:35.0667531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0667591Z return func(*args, **kwargs) 2025-08-14T21:38:35.0667820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0667882Z return func(*args, **kwargs) 2025-08-14T21:38:35.0668132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0668276Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0668531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0668614Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0668632Z 2025-08-14T21:38:35.0668729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0668914Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0668982Z return mod(**inputs) 2025-08-14T21:38:35.0669218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0669287Z return func(*args, **kwargs) 2025-08-14T21:38:35.0669514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0669577Z return func(*args, **kwargs) 2025-08-14T21:38:35.0669788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0669855Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0670108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0670197Z outputs = self.layoutlm( 2025-08-14T21:38:35.0670421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0670487Z return func(*args, **kwargs) 2025-08-14T21:38:35.0670711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0670772Z return func(*args, **kwargs) 2025-08-14T21:38:35.0670981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0671051Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0671301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0671376Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0671598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0671667Z return func(*args, **kwargs) 2025-08-14T21:38:35.0671891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0671953Z return func(*args, **kwargs) 2025-08-14T21:38:35.0672180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0672241Z return func(*args, **kwargs) 2025-08-14T21:38:35.0672312Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0672522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0672590Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0672846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0672916Z layer_outputs = layer_module( 2025-08-14T21:38:35.0673123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0673205Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0673430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0673498Z return func(*args, **kwargs) 2025-08-14T21:38:35.0673720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0673797Z return func(*args, **kwargs) 2025-08-14T21:38:35.0674029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0674092Z return func(*args, **kwargs) 2025-08-14T21:38:35.0674349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0674455Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0674702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0674780Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0675076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0675191Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0675449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0675523Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0675527Z 2025-08-14T21:38:35.0675622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0675805Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0675889Z return mod(**inputs) 2025-08-14T21:38:35.0676116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0676175Z return func(*args, **kwargs) 2025-08-14T21:38:35.0676401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0676468Z return func(*args, **kwargs) 2025-08-14T21:38:35.0676670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0676741Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0676987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0677049Z outputs = self.layoutlm( 2025-08-14T21:38:35.0677271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0677333Z return func(*args, **kwargs) 2025-08-14T21:38:35.0677551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0677615Z return func(*args, **kwargs) 2025-08-14T21:38:35.0677815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0677889Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0678134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0678200Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0678424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0678481Z return func(*args, **kwargs) 2025-08-14T21:38:35.0678711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0678778Z return func(*args, **kwargs) 2025-08-14T21:38:35.0678992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0679060Z return func(*args, **kwargs) 2025-08-14T21:38:35.0679129Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0679328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0679416Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0679660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0679725Z layer_outputs = layer_module( 2025-08-14T21:38:35.0679932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0680020Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0680248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0680308Z return func(*args, **kwargs) 2025-08-14T21:38:35.0680728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0680802Z return func(*args, **kwargs) 2025-08-14T21:38:35.0681021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0681090Z return func(*args, **kwargs) 2025-08-14T21:38:35.0681337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0681414Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0681659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0681744Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0682018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0682138Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0682382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0682493Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0682685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0682748Z return self.act(input) 2025-08-14T21:38:35.0682752Z 2025-08-14T21:38:35.0682854Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0683039Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0683109Z return mod(**inputs) 2025-08-14T21:38:35.0683327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0683387Z return func(*args, **kwargs) 2025-08-14T21:38:35.0683610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0683670Z return func(*args, **kwargs) 2025-08-14T21:38:35.0683868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0683941Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0684184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0684254Z outputs = self.layoutlm( 2025-08-14T21:38:35.0684471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0684527Z return func(*args, **kwargs) 2025-08-14T21:38:35.0684990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0685064Z return func(*args, **kwargs) 2025-08-14T21:38:35.0685277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0685356Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0685673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0685757Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0686005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0686102Z return func(*args, **kwargs) 2025-08-14T21:38:35.0686350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0686410Z return func(*args, **kwargs) 2025-08-14T21:38:35.0686659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0686737Z return func(*args, **kwargs) 2025-08-14T21:38:35.0686806Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0687007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0687073Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0687315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0687387Z layer_outputs = layer_module( 2025-08-14T21:38:35.0687589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0687687Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0687913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0687973Z return func(*args, **kwargs) 2025-08-14T21:38:35.0688202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0688262Z return func(*args, **kwargs) 2025-08-14T21:38:35.0688483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0688552Z return func(*args, **kwargs) 2025-08-14T21:38:35.0688795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0688879Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0689116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0689185Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0689466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0689588Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0689830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0689914Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0689918Z 2025-08-14T21:38:35.0690012Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0690201Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0690263Z return mod(**inputs) 2025-08-14T21:38:35.0690478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0690550Z return func(*args, **kwargs) 2025-08-14T21:38:35.0690764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0690833Z return func(*args, **kwargs) 2025-08-14T21:38:35.0691029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0691095Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0691361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0691428Z outputs = self.layoutlm( 2025-08-14T21:38:35.0691647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0691732Z return func(*args, **kwargs) 2025-08-14T21:38:35.0691948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0692016Z return func(*args, **kwargs) 2025-08-14T21:38:35.0692230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0692298Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0692547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0692613Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0692828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0692895Z return func(*args, **kwargs) 2025-08-14T21:38:35.0693108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0693194Z return func(*args, **kwargs) 2025-08-14T21:38:35.0693412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0693473Z return func(*args, **kwargs) 2025-08-14T21:38:35.0693552Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0693752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0693819Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0694071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0694135Z layer_outputs = layer_module( 2025-08-14T21:38:35.0694343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0694415Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0694636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0694704Z return func(*args, **kwargs) 2025-08-14T21:38:35.0694923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0694991Z return func(*args, **kwargs) 2025-08-14T21:38:35.0695208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0695270Z return func(*args, **kwargs) 2025-08-14T21:38:35.0695521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0695598Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0695814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0695885Z return func(*args, **kwargs) 2025-08-14T21:38:35.0696102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0696168Z return func(*args, **kwargs) 2025-08-14T21:38:35.0696386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0696446Z return func(*args, **kwargs) 2025-08-14T21:38:35.0696700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0696781Z self_outputs = self.self( 2025-08-14T21:38:35.0696996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0697063Z return func(*args, **kwargs) 2025-08-14T21:38:35.0697276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0697360Z return func(*args, **kwargs) 2025-08-14T21:38:35.0697574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0697633Z return func(*args, **kwargs) 2025-08-14T21:38:35.0697903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:35.0698039Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0698043Z 2025-08-14T21:38:35.0698142Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0698326Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0698385Z return mod(**inputs) 2025-08-14T21:38:35.0698606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0698684Z return func(*args, **kwargs) 2025-08-14T21:38:35.0698900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0698966Z return func(*args, **kwargs) 2025-08-14T21:38:35.0699165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0699239Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0699484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0699547Z outputs = self.layoutlm( 2025-08-14T21:38:35.0699770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0699829Z return func(*args, **kwargs) 2025-08-14T21:38:35.0700044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0700113Z return func(*args, **kwargs) 2025-08-14T21:38:35.0700310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0700384Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0700630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0700696Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0700925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0700987Z return func(*args, **kwargs) 2025-08-14T21:38:35.0701202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0701269Z return func(*args, **kwargs) 2025-08-14T21:38:35.0701491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0701560Z return func(*args, **kwargs) 2025-08-14T21:38:35.0701628Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0701830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0701905Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0702150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0702222Z layer_outputs = layer_module( 2025-08-14T21:38:35.0702438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0702512Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0702736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0702815Z return func(*args, **kwargs) 2025-08-14T21:38:35.0703032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0703100Z return func(*args, **kwargs) 2025-08-14T21:38:35.0703329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0703399Z return func(*args, **kwargs) 2025-08-14T21:38:35.0703645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0703721Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0703947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0704007Z return func(*args, **kwargs) 2025-08-14T21:38:35.0704223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0704308Z return func(*args, **kwargs) 2025-08-14T21:38:35.0704524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0704591Z return func(*args, **kwargs) 2025-08-14T21:38:35.0704903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0704973Z self_outputs = self.self( 2025-08-14T21:38:35.0705213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0705275Z return func(*args, **kwargs) 2025-08-14T21:38:35.0705503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0705572Z return func(*args, **kwargs) 2025-08-14T21:38:35.0705801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0705872Z return func(*args, **kwargs) 2025-08-14T21:38:35.0706129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:35.0706274Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0706278Z 2025-08-14T21:38:35.0706379Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0706561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0706628Z return mod(**inputs) 2025-08-14T21:38:35.0706850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0706914Z return func(*args, **kwargs) 2025-08-14T21:38:35.0707149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0707214Z return func(*args, **kwargs) 2025-08-14T21:38:35.0707423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0707502Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0707759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0707860Z outputs = self.layoutlm( 2025-08-14T21:38:35.0708112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0708178Z return func(*args, **kwargs) 2025-08-14T21:38:35.0708415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0708477Z return func(*args, **kwargs) 2025-08-14T21:38:35.0708706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0708784Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0709043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0709132Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0709361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0709423Z return func(*args, **kwargs) 2025-08-14T21:38:35.0709657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0709719Z return func(*args, **kwargs) 2025-08-14T21:38:35.0709956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0710019Z return func(*args, **kwargs) 2025-08-14T21:38:35.0710108Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0710326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0710394Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0710655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0710729Z layer_outputs = layer_module( 2025-08-14T21:38:35.0710938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0711020Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0711248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0711311Z return func(*args, **kwargs) 2025-08-14T21:38:35.0711547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0711613Z return func(*args, **kwargs) 2025-08-14T21:38:35.0711841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0711910Z return func(*args, **kwargs) 2025-08-14T21:38:35.0712168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0712255Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0712487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0712550Z return func(*args, **kwargs) 2025-08-14T21:38:35.0712788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0712851Z return func(*args, **kwargs) 2025-08-14T21:38:35.0713083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0713152Z return func(*args, **kwargs) 2025-08-14T21:38:35.0713409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:35.0713485Z self_outputs = self.self( 2025-08-14T21:38:35.0713715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0713778Z return func(*args, **kwargs) 2025-08-14T21:38:35.0714031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0714096Z return func(*args, **kwargs) 2025-08-14T21:38:35.0714330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0714412Z return func(*args, **kwargs) 2025-08-14T21:38:35.0714670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:35.0714826Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:35.0714830Z 2025-08-14T21:38:35.0714916Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0714989Z cudagraph partition due to non gpu ops 2025-08-14T21:38:35.0715090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0715271Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0715338Z return mod(**inputs) 2025-08-14T21:38:35.0715552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0715612Z return func(*args, **kwargs) 2025-08-14T21:38:35.0715835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0715915Z return func(*args, **kwargs) 2025-08-14T21:38:35.0716111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0716186Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0716430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0716501Z outputs = self.layoutlm( 2025-08-14T21:38:35.0716720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0716780Z return func(*args, **kwargs) 2025-08-14T21:38:35.0717001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0717061Z return func(*args, **kwargs) 2025-08-14T21:38:35.0717258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0717333Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0717575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0717647Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0717862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0717920Z return func(*args, **kwargs) 2025-08-14T21:38:35.0718141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0718200Z return func(*args, **kwargs) 2025-08-14T21:38:35.0718422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0718481Z return func(*args, **kwargs) 2025-08-14T21:38:35.0718551Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0718753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0718819Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0719062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0719135Z layer_outputs = layer_module( 2025-08-14T21:38:35.0719335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0719430Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0719651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0719713Z return func(*args, **kwargs) 2025-08-14T21:38:35.0719941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0720030Z return func(*args, **kwargs) 2025-08-14T21:38:35.0720250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0720319Z return func(*args, **kwargs) 2025-08-14T21:38:35.0720586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:35.0720672Z self_attention_outputs = self.attention( 2025-08-14T21:38:35.0720899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0720959Z return func(*args, **kwargs) 2025-08-14T21:38:35.0721192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0721253Z return func(*args, **kwargs) 2025-08-14T21:38:35.0721491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0721561Z return func(*args, **kwargs) 2025-08-14T21:38:35.0721809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:35.0721940Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:35.0722190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:35.0722267Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0722271Z 2025-08-14T21:38:35.0722374Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0722559Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0722625Z return mod(**inputs) 2025-08-14T21:38:35.0722848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0722912Z return func(*args, **kwargs) 2025-08-14T21:38:35.0723140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0723200Z return func(*args, **kwargs) 2025-08-14T21:38:35.0723404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0723476Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0723726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0723797Z outputs = self.layoutlm( 2025-08-14T21:38:35.0724020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0724081Z return func(*args, **kwargs) 2025-08-14T21:38:35.0724310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0724372Z return func(*args, **kwargs) 2025-08-14T21:38:35.0724571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0724646Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0724895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0724967Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0725203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0725266Z return func(*args, **kwargs) 2025-08-14T21:38:35.0725500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0725585Z return func(*args, **kwargs) 2025-08-14T21:38:35.0725819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0725881Z return func(*args, **kwargs) 2025-08-14T21:38:35.0725953Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0726181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0726254Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0726513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0726587Z layer_outputs = layer_module( 2025-08-14T21:38:35.0726806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0726883Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0727098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0727175Z return func(*args, **kwargs) 2025-08-14T21:38:35.0727403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0727463Z return func(*args, **kwargs) 2025-08-14T21:38:35.0727801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0727891Z return func(*args, **kwargs) 2025-08-14T21:38:35.0728311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0728426Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0728734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0728805Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0729087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0729198Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0729443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:35.0729527Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0729531Z 2025-08-14T21:38:35.0729626Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0729817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0729877Z return mod(**inputs) 2025-08-14T21:38:35.0730094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0730162Z return func(*args, **kwargs) 2025-08-14T21:38:35.0730381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0730449Z return func(*args, **kwargs) 2025-08-14T21:38:35.0730646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0730716Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0730965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0731028Z outputs = self.layoutlm( 2025-08-14T21:38:35.0731268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0731335Z return func(*args, **kwargs) 2025-08-14T21:38:35.0731552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0731638Z return func(*args, **kwargs) 2025-08-14T21:38:35.0731841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0731907Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0732171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0732239Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0732455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0732523Z return func(*args, **kwargs) 2025-08-14T21:38:35.0732738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0732804Z return func(*args, **kwargs) 2025-08-14T21:38:35.0733019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0733096Z return func(*args, **kwargs) 2025-08-14T21:38:35.0733173Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0733375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0733440Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0733692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0733757Z layer_outputs = layer_module( 2025-08-14T21:38:35.0733967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0734039Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0734258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0734326Z return func(*args, **kwargs) 2025-08-14T21:38:35.0734545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0734613Z return func(*args, **kwargs) 2025-08-14T21:38:35.0734829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0734892Z return func(*args, **kwargs) 2025-08-14T21:38:35.0735144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0735221Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0735461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0735537Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0735811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:35.0735929Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:35.0736173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:35.0736275Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:35.0736477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:35.0736542Z return self.act(input) 2025-08-14T21:38:35.0736545Z 2025-08-14T21:38:35.0736664Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0736850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0736909Z return mod(**inputs) 2025-08-14T21:38:35.0737135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0737220Z return func(*args, **kwargs) 2025-08-14T21:38:35.0737441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0737507Z return func(*args, **kwargs) 2025-08-14T21:38:35.0737724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0737799Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0738040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 757, in forward 2025-08-14T21:38:35.0738104Z outputs = self.layoutlm( 2025-08-14T21:38:35.0738325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0738383Z return func(*args, **kwargs) 2025-08-14T21:38:35.0738600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0738682Z return func(*args, **kwargs) 2025-08-14T21:38:35.0738881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0738952Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0739198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:35.0739263Z encoder_outputs = self.encoder( 2025-08-14T21:38:35.0739488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0739549Z return func(*args, **kwargs) 2025-08-14T21:38:35.0739766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0739833Z return func(*args, **kwargs) 2025-08-14T21:38:35.0740052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0740121Z return func(*args, **kwargs) 2025-08-14T21:38:35.0740191Z [Previous line repeated 1 more time] 2025-08-14T21:38:35.0740394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0740469Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0740722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:35.0740793Z layer_outputs = layer_module( 2025-08-14T21:38:35.0741002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:35.0741074Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:35.0741306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0741369Z return func(*args, **kwargs) 2025-08-14T21:38:35.0741593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0741663Z return func(*args, **kwargs) 2025-08-14T21:38:35.0741887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0741954Z return func(*args, **kwargs) 2025-08-14T21:38:35.0742209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:35.0742312Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:35.0742562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:35.0742631Z return forward_fn(*input_tensors) 2025-08-14T21:38:35.0742908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:35.0743058Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:35.0743307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:35.0743407Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0743411Z 2025-08-14T21:38:35.0743510Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0743696Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0743766Z return mod(**inputs) 2025-08-14T21:38:35.0743989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0744059Z return func(*args, **kwargs) 2025-08-14T21:38:35.0744283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0744364Z return func(*args, **kwargs) 2025-08-14T21:38:35.0744577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0744647Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0744967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 771, in forward 2025-08-14T21:38:35.0745068Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:38:35.0745329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 484, in forward 2025-08-14T21:38:35.0745453Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:38:35.0745730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 472, in forward 2025-08-14T21:38:35.0745832Z hidden_states = self.transform(hidden_states) 2025-08-14T21:38:35.0746111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 447, in forward 2025-08-14T21:38:35.0746189Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:35.0746192Z 2025-08-14T21:38:35.0746297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0746483Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0746545Z return mod(**inputs) 2025-08-14T21:38:35.0746779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0746843Z return func(*args, **kwargs) 2025-08-14T21:38:35.0747068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0747141Z return func(*args, **kwargs) 2025-08-14T21:38:35.0747343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0747422Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0747672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 771, in forward 2025-08-14T21:38:35.0747789Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:38:35.0748043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 484, in forward 2025-08-14T21:38:35.0748145Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:38:35.0748420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 473, in forward 2025-08-14T21:38:35.0748508Z hidden_states = self.decoder(hidden_states) 2025-08-14T21:38:35.0748511Z 2025-08-14T21:38:35.0748608Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:35.0748817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:35.0748879Z return mod(**inputs) 2025-08-14T21:38:35.0749109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0749180Z return func(*args, **kwargs) 2025-08-14T21:38:35.0749422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:35.0749495Z return func(*args, **kwargs) 2025-08-14T21:38:35.0749712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:35.0749781Z output = func(self, *args, **kwargs) 2025-08-14T21:38:35.0750040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 776, in forward 2025-08-14T21:38:35.0750105Z masked_lm_loss = loss_fct( 2025-08-14T21:38:35.0750124Z 2025-08-14T21:38:42.4990255Z Compilation time (from dynamo_timed): 13.955746799 2025-08-14T21:38:42.5022984Z pass 2025-08-14T21:38:42.5025444Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:42.5030846Z TIMING: _recursive_pre_grad_passes:0.00706 _recursive_joint_graph_passes:0.41528 _recursive_post_grad_passes:0.07277 async_compile.wait:0.62852 code_gen:6.75335 inductor_compile:7.84711 backend_compile:11.15832 gc:0.00017 entire_frame_compile:13.95575 total_wall_time:13.95575 2025-08-14T21:38:42.5031854Z STATS: call_* op count: 432 | FakeTensorMode.__torch_dispatch__:15442 | FakeTensor.__torch_dispatch__:4798 | ProxyTorchDispatchMode.__torch_dispatch__:5848 2025-08-14T21:38:42.5032334Z Dynamo produced 1 graphs covering 432 ops with 0 graph breaks (0 unique) 2025-08-14T21:38:46.6127650Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:38:46.6128668Z from pkg_resources import resource_filename 2025-08-14T21:38:47.2029657Z 2025-08-14T21:38:48.2580424Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:38:48.2580722Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:38:48.2596474Z cpu eval LayoutLMForSequenceClassification 2025-08-14T21:38:48.7055552Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:48.8532617Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:48.9953537Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:38:56.3807355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3811571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3813693Z return mod(**inputs) 2025-08-14T21:38:56.3818341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3822924Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3823476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3823877Z outputs = self.layoutlm( 2025-08-14T21:38:56.3824523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3825001Z return func(*args, **kwargs) 2025-08-14T21:38:56.3825350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3825693Z return func(*args, **kwargs) 2025-08-14T21:38:56.3826007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3826408Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3826785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3827231Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3827574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3827917Z return func(*args, **kwargs) 2025-08-14T21:38:56.3828253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3828586Z return func(*args, **kwargs) 2025-08-14T21:38:56.3828916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3829253Z return func(*args, **kwargs) 2025-08-14T21:38:56.3829493Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3829832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3830180Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3830570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3830946Z layer_outputs = layer_module( 2025-08-14T21:38:56.3831281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3831629Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3831989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3832328Z return func(*args, **kwargs) 2025-08-14T21:38:56.3832669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3833016Z return func(*args, **kwargs) 2025-08-14T21:38:56.3833352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3833687Z return func(*args, **kwargs) 2025-08-14T21:38:56.3834050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.3834436Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.3834788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3835134Z return func(*args, **kwargs) 2025-08-14T21:38:56.3835468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3835808Z return func(*args, **kwargs) 2025-08-14T21:38:56.3836137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3836481Z return func(*args, **kwargs) 2025-08-14T21:38:56.3836843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.3837213Z self_outputs = self.self( 2025-08-14T21:38:56.3837556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3837899Z return func(*args, **kwargs) 2025-08-14T21:38:56.3838260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3838595Z return func(*args, **kwargs) 2025-08-14T21:38:56.3838923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3839257Z return func(*args, **kwargs) 2025-08-14T21:38:56.3839634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.3840080Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.3840279Z 2025-08-14T21:38:56.3840404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3840746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3841041Z return mod(**inputs) 2025-08-14T21:38:56.3841353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3841682Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3842052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3842412Z outputs = self.layoutlm( 2025-08-14T21:38:56.3842775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3843116Z return func(*args, **kwargs) 2025-08-14T21:38:56.3843438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3843781Z return func(*args, **kwargs) 2025-08-14T21:38:56.3844089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3844422Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3844789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3845169Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3845512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3845849Z return func(*args, **kwargs) 2025-08-14T21:38:56.3846183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3846520Z return func(*args, **kwargs) 2025-08-14T21:38:56.3846850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3847183Z return func(*args, **kwargs) 2025-08-14T21:38:56.3847362Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3847690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3848015Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3848387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3848761Z layer_outputs = layer_module( 2025-08-14T21:38:56.3849093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3849561Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3849912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3850253Z return func(*args, **kwargs) 2025-08-14T21:38:56.3850586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3850921Z return func(*args, **kwargs) 2025-08-14T21:38:56.3851270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3851611Z return func(*args, **kwargs) 2025-08-14T21:38:56.3851960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.3852343Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.3852744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3853077Z return func(*args, **kwargs) 2025-08-14T21:38:56.3853394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3853744Z return func(*args, **kwargs) 2025-08-14T21:38:56.3854074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3854401Z return func(*args, **kwargs) 2025-08-14T21:38:56.3854756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.3855122Z self_outputs = self.self( 2025-08-14T21:38:56.3855458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3855806Z return func(*args, **kwargs) 2025-08-14T21:38:56.3856135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3856467Z return func(*args, **kwargs) 2025-08-14T21:38:56.3856792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3857140Z return func(*args, **kwargs) 2025-08-14T21:38:56.3857501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.3857934Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.3858111Z 2025-08-14T21:38:56.3858212Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3858548Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3858849Z return mod(**inputs) 2025-08-14T21:38:56.3859156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3859475Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3859842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3860213Z outputs = self.layoutlm( 2025-08-14T21:38:56.3860541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3860896Z return func(*args, **kwargs) 2025-08-14T21:38:56.3861225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3861559Z return func(*args, **kwargs) 2025-08-14T21:38:56.3861864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3862196Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3862566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3862935Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3863266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3863598Z return func(*args, **kwargs) 2025-08-14T21:38:56.3863918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3864262Z return func(*args, **kwargs) 2025-08-14T21:38:56.3864592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3865024Z return func(*args, **kwargs) 2025-08-14T21:38:56.3865209Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3865532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3865880Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3866254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3866618Z layer_outputs = layer_module( 2025-08-14T21:38:56.3866960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3867310Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3867672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3868003Z return func(*args, **kwargs) 2025-08-14T21:38:56.3868330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3868665Z return func(*args, **kwargs) 2025-08-14T21:38:56.3869019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3869363Z return func(*args, **kwargs) 2025-08-14T21:38:56.3869722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.3870103Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.3870444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3870780Z return func(*args, **kwargs) 2025-08-14T21:38:56.3871117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3871455Z return func(*args, **kwargs) 2025-08-14T21:38:56.3871773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3872112Z return func(*args, **kwargs) 2025-08-14T21:38:56.3872466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.3872825Z self_outputs = self.self( 2025-08-14T21:38:56.3873158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3873493Z return func(*args, **kwargs) 2025-08-14T21:38:56.3873822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3874152Z return func(*args, **kwargs) 2025-08-14T21:38:56.3874479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3874814Z return func(*args, **kwargs) 2025-08-14T21:38:56.3875159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.3875608Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.3875803Z 2025-08-14T21:38:56.3875878Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.3876073Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.3876286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3876620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3876919Z return mod(**inputs) 2025-08-14T21:38:56.3877235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3877566Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3877933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3878299Z outputs = self.layoutlm( 2025-08-14T21:38:56.3878642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3878989Z return func(*args, **kwargs) 2025-08-14T21:38:56.3879328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3879682Z return func(*args, **kwargs) 2025-08-14T21:38:56.3879990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3880316Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3880688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3881055Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3881396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3881754Z return func(*args, **kwargs) 2025-08-14T21:38:56.3882085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3882415Z return func(*args, **kwargs) 2025-08-14T21:38:56.3882748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3883098Z return func(*args, **kwargs) 2025-08-14T21:38:56.3883274Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3883610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3883936Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3884305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3884967Z layer_outputs = layer_module( 2025-08-14T21:38:56.3885299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3885647Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3885993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3886341Z return func(*args, **kwargs) 2025-08-14T21:38:56.3886674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3887014Z return func(*args, **kwargs) 2025-08-14T21:38:56.3887339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3887678Z return func(*args, **kwargs) 2025-08-14T21:38:56.3888038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.3888413Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.3888772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3889106Z return func(*args, **kwargs) 2025-08-14T21:38:56.3889435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3889766Z return func(*args, **kwargs) 2025-08-14T21:38:56.3890094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3890432Z return func(*args, **kwargs) 2025-08-14T21:38:56.3890837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.3891259Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.3891686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.3892122Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.3892257Z 2025-08-14T21:38:56.3892361Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3892702Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3893040Z return mod(**inputs) 2025-08-14T21:38:56.3893350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3893673Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3894050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3894425Z outputs = self.layoutlm( 2025-08-14T21:38:56.3894751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3895096Z return func(*args, **kwargs) 2025-08-14T21:38:56.3895451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3895787Z return func(*args, **kwargs) 2025-08-14T21:38:56.3896097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3896425Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3896792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3897160Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3897493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3897834Z return func(*args, **kwargs) 2025-08-14T21:38:56.3898163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3898496Z return func(*args, **kwargs) 2025-08-14T21:38:56.3898817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3899150Z return func(*args, **kwargs) 2025-08-14T21:38:56.3899327Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3899643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3899965Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3900330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3900692Z layer_outputs = layer_module( 2025-08-14T21:38:56.3901010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3901341Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3901692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3902028Z return func(*args, **kwargs) 2025-08-14T21:38:56.3902352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3902691Z return func(*args, **kwargs) 2025-08-14T21:38:56.3903011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3903344Z return func(*args, **kwargs) 2025-08-14T21:38:56.3903724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.3904111Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.3904480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.3904919Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.3905321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.3905768Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.3906199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.3906584Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.3906713Z 2025-08-14T21:38:56.3906818Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3907151Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3907454Z return mod(**inputs) 2025-08-14T21:38:56.3907759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3908098Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3908490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3908872Z outputs = self.layoutlm( 2025-08-14T21:38:56.3909209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3909555Z return func(*args, **kwargs) 2025-08-14T21:38:56.3909900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3910247Z return func(*args, **kwargs) 2025-08-14T21:38:56.3910564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3910887Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3911260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3911643Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3911989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3912323Z return func(*args, **kwargs) 2025-08-14T21:38:56.3912659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3913005Z return func(*args, **kwargs) 2025-08-14T21:38:56.3913330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3913677Z return func(*args, **kwargs) 2025-08-14T21:38:56.3913862Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3914193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3914520Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3914898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3915275Z layer_outputs = layer_module( 2025-08-14T21:38:56.3915595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3915936Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3916287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3916628Z return func(*args, **kwargs) 2025-08-14T21:38:56.3916972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3917312Z return func(*args, **kwargs) 2025-08-14T21:38:56.3917640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3917988Z return func(*args, **kwargs) 2025-08-14T21:38:56.3918339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.3918718Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.3919106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.3919470Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.3919872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.3920322Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.3920741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.3921149Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.3921523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.3921838Z return self.act(input) 2025-08-14T21:38:56.3921942Z 2025-08-14T21:38:56.3922038Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3922376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3922677Z return mod(**inputs) 2025-08-14T21:38:56.3922981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3923303Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3923677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3924045Z outputs = self.layoutlm( 2025-08-14T21:38:56.3924366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3924708Z return func(*args, **kwargs) 2025-08-14T21:38:56.3925035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3925372Z return func(*args, **kwargs) 2025-08-14T21:38:56.3925673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3926002Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3926368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3926736Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3927066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3927397Z return func(*args, **kwargs) 2025-08-14T21:38:56.3927725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3928057Z return func(*args, **kwargs) 2025-08-14T21:38:56.3928384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3928720Z return func(*args, **kwargs) 2025-08-14T21:38:56.3928900Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3929217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3929541Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3929929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3930297Z layer_outputs = layer_module( 2025-08-14T21:38:56.3930619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3931008Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3931358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3931690Z return func(*args, **kwargs) 2025-08-14T21:38:56.3932032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3932369Z return func(*args, **kwargs) 2025-08-14T21:38:56.3932687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3933022Z return func(*args, **kwargs) 2025-08-14T21:38:56.3933376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.3933753Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.3934115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.3934494Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.3934892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.3935339Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.3935753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.3936134Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.3936260Z 2025-08-14T21:38:56.3936366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3936693Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3936995Z return mod(**inputs) 2025-08-14T21:38:56.3937300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3937631Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3937994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3938366Z outputs = self.layoutlm( 2025-08-14T21:38:56.3938698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3939028Z return func(*args, **kwargs) 2025-08-14T21:38:56.3939361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3939701Z return func(*args, **kwargs) 2025-08-14T21:38:56.3940015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3940334Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3940706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3941083Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3941425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3941759Z return func(*args, **kwargs) 2025-08-14T21:38:56.3942089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3942427Z return func(*args, **kwargs) 2025-08-14T21:38:56.3942766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3943110Z return func(*args, **kwargs) 2025-08-14T21:38:56.3943298Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3943623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3943972Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3944340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3944708Z layer_outputs = layer_module( 2025-08-14T21:38:56.3945100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3945440Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3945785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3946125Z return func(*args, **kwargs) 2025-08-14T21:38:56.3946444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3946780Z return func(*args, **kwargs) 2025-08-14T21:38:56.3947113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3947471Z return func(*args, **kwargs) 2025-08-14T21:38:56.3947832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.3948218Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.3948575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3948907Z return func(*args, **kwargs) 2025-08-14T21:38:56.3949238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3949577Z return func(*args, **kwargs) 2025-08-14T21:38:56.3949898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3950233Z return func(*args, **kwargs) 2025-08-14T21:38:56.3950587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.3950958Z self_outputs = self.self( 2025-08-14T21:38:56.3951285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3951623Z return func(*args, **kwargs) 2025-08-14T21:38:56.3951950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3952286Z return func(*args, **kwargs) 2025-08-14T21:38:56.3952606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3952940Z return func(*args, **kwargs) 2025-08-14T21:38:56.3953299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.3953732Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.3953929Z 2025-08-14T21:38:56.3954028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3954364Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3954666Z return mod(**inputs) 2025-08-14T21:38:56.3954962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3955288Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3955717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3956085Z outputs = self.layoutlm( 2025-08-14T21:38:56.3956421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3956763Z return func(*args, **kwargs) 2025-08-14T21:38:56.3957108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3957438Z return func(*args, **kwargs) 2025-08-14T21:38:56.3957747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3958089Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3958457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3958830Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3959172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3959513Z return func(*args, **kwargs) 2025-08-14T21:38:56.3959834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3960175Z return func(*args, **kwargs) 2025-08-14T21:38:56.3960519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3960855Z return func(*args, **kwargs) 2025-08-14T21:38:56.3961024Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3961350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3961675Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3962037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3962407Z layer_outputs = layer_module( 2025-08-14T21:38:56.3962728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3963062Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3963399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3963742Z return func(*args, **kwargs) 2025-08-14T21:38:56.3964070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3964396Z return func(*args, **kwargs) 2025-08-14T21:38:56.3964725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3965058Z return func(*args, **kwargs) 2025-08-14T21:38:56.3965409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.3965783Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.3966129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3966464Z return func(*args, **kwargs) 2025-08-14T21:38:56.3966785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3967119Z return func(*args, **kwargs) 2025-08-14T21:38:56.3967445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3967779Z return func(*args, **kwargs) 2025-08-14T21:38:56.3968123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.3968487Z self_outputs = self.self( 2025-08-14T21:38:56.3968839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3969170Z return func(*args, **kwargs) 2025-08-14T21:38:56.3969500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3969859Z return func(*args, **kwargs) 2025-08-14T21:38:56.3970185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3970514Z return func(*args, **kwargs) 2025-08-14T21:38:56.3970884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.3971325Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.3971509Z 2025-08-14T21:38:56.3971619Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3971954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3972264Z return mod(**inputs) 2025-08-14T21:38:56.3972575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3972908Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3973303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3973685Z outputs = self.layoutlm( 2025-08-14T21:38:56.3974017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3974354Z return func(*args, **kwargs) 2025-08-14T21:38:56.3974684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3975022Z return func(*args, **kwargs) 2025-08-14T21:38:56.3975329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3975661Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3976037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3976421Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3976755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3977096Z return func(*args, **kwargs) 2025-08-14T21:38:56.3977427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3977769Z return func(*args, **kwargs) 2025-08-14T21:38:56.3978092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3978430Z return func(*args, **kwargs) 2025-08-14T21:38:56.3978612Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3978932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3979262Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3979641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3980017Z layer_outputs = layer_module( 2025-08-14T21:38:56.3980333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3980673Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3981022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3981355Z return func(*args, **kwargs) 2025-08-14T21:38:56.3981703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3982040Z return func(*args, **kwargs) 2025-08-14T21:38:56.3982363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3982691Z return func(*args, **kwargs) 2025-08-14T21:38:56.3983059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.3983434Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.3983776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3984127Z return func(*args, **kwargs) 2025-08-14T21:38:56.3984459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3984996Z return func(*args, **kwargs) 2025-08-14T21:38:56.3985325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3985670Z return func(*args, **kwargs) 2025-08-14T21:38:56.3986027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.3986439Z self_outputs = self.self( 2025-08-14T21:38:56.3986770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3987109Z return func(*args, **kwargs) 2025-08-14T21:38:56.3987439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3987770Z return func(*args, **kwargs) 2025-08-14T21:38:56.3988100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3988442Z return func(*args, **kwargs) 2025-08-14T21:38:56.3988789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.3989231Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.3989422Z 2025-08-14T21:38:56.3989499Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.3989699Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.3989913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.3990249Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.3990545Z return mod(**inputs) 2025-08-14T21:38:56.3990848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3991167Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3991541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.3991909Z outputs = self.layoutlm( 2025-08-14T21:38:56.3992231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3992568Z return func(*args, **kwargs) 2025-08-14T21:38:56.3992900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3993232Z return func(*args, **kwargs) 2025-08-14T21:38:56.3993533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3993860Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3994228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.3994593Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.3994963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3995300Z return func(*args, **kwargs) 2025-08-14T21:38:56.3995621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3995986Z return func(*args, **kwargs) 2025-08-14T21:38:56.3996310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3996646Z return func(*args, **kwargs) 2025-08-14T21:38:56.3996818Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.3997175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.3997508Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.3997881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.3998242Z layer_outputs = layer_module( 2025-08-14T21:38:56.3998563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.3998899Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.3999235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.3999591Z return func(*args, **kwargs) 2025-08-14T21:38:56.3999918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4000253Z return func(*args, **kwargs) 2025-08-14T21:38:56.4000574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4000909Z return func(*args, **kwargs) 2025-08-14T21:38:56.4001267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4001650Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4001989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4002324Z return func(*args, **kwargs) 2025-08-14T21:38:56.4002652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4002982Z return func(*args, **kwargs) 2025-08-14T21:38:56.4003307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4003644Z return func(*args, **kwargs) 2025-08-14T21:38:56.4003999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4004414Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4004835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4005214Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4005341Z 2025-08-14T21:38:56.4005438Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4005777Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4006075Z return mod(**inputs) 2025-08-14T21:38:56.4006374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4006699Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4007067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4007432Z outputs = self.layoutlm( 2025-08-14T21:38:56.4007772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4008114Z return func(*args, **kwargs) 2025-08-14T21:38:56.4008445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4008787Z return func(*args, **kwargs) 2025-08-14T21:38:56.4009111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4009438Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4009828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4010198Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4010530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4010869Z return func(*args, **kwargs) 2025-08-14T21:38:56.4011195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4011524Z return func(*args, **kwargs) 2025-08-14T21:38:56.4011851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4012209Z return func(*args, **kwargs) 2025-08-14T21:38:56.4012384Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4012702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4013026Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4013398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4013759Z layer_outputs = layer_module( 2025-08-14T21:38:56.4014080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4014412Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4014757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4015086Z return func(*args, **kwargs) 2025-08-14T21:38:56.4015415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4015750Z return func(*args, **kwargs) 2025-08-14T21:38:56.4016071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4016409Z return func(*args, **kwargs) 2025-08-14T21:38:56.4016766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4017147Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4017516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4017878Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4018278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4018722Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4019126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4019504Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4019630Z 2025-08-14T21:38:56.4019737Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4020065Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4020367Z return mod(**inputs) 2025-08-14T21:38:56.4020688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4021016Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4021380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4021768Z outputs = self.layoutlm( 2025-08-14T21:38:56.4022103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4022437Z return func(*args, **kwargs) 2025-08-14T21:38:56.4022786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4023124Z return func(*args, **kwargs) 2025-08-14T21:38:56.4023433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4023750Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4024120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4024491Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4024895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4025265Z return func(*args, **kwargs) 2025-08-14T21:38:56.4025602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4025943Z return func(*args, **kwargs) 2025-08-14T21:38:56.4026268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4026611Z return func(*args, **kwargs) 2025-08-14T21:38:56.4026793Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4027127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4027453Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4027828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4028199Z layer_outputs = layer_module( 2025-08-14T21:38:56.4028518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4028855Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4029200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4029539Z return func(*args, **kwargs) 2025-08-14T21:38:56.4029862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4030198Z return func(*args, **kwargs) 2025-08-14T21:38:56.4030529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4030862Z return func(*args, **kwargs) 2025-08-14T21:38:56.4031219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4031605Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4031974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4032329Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4032728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4033173Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4033600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4034008Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4034363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4034681Z return self.act(input) 2025-08-14T21:38:56.4034803Z 2025-08-14T21:38:56.4034901Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4035238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4035537Z return mod(**inputs) 2025-08-14T21:38:56.4035855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4036176Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4036548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4036918Z outputs = self.layoutlm( 2025-08-14T21:38:56.4037247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4037596Z return func(*args, **kwargs) 2025-08-14T21:38:56.4037925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4038279Z return func(*args, **kwargs) 2025-08-14T21:38:56.4038582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4038908Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4039278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4039640Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4039979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4040317Z return func(*args, **kwargs) 2025-08-14T21:38:56.4040647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4040977Z return func(*args, **kwargs) 2025-08-14T21:38:56.4041302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4041641Z return func(*args, **kwargs) 2025-08-14T21:38:56.4041815Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4042139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4042464Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4042834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4043194Z layer_outputs = layer_module( 2025-08-14T21:38:56.4043519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4043849Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4044195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4044529Z return func(*args, **kwargs) 2025-08-14T21:38:56.4044855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4045194Z return func(*args, **kwargs) 2025-08-14T21:38:56.4063526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4063912Z return func(*args, **kwargs) 2025-08-14T21:38:56.4064305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4064853Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4065252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4065625Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4066031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4066529Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4066960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4067387Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4067522Z 2025-08-14T21:38:56.4067633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4067970Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4068286Z return mod(**inputs) 2025-08-14T21:38:56.4068604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4068942Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4069310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4069724Z outputs = self.layoutlm( 2025-08-14T21:38:56.4070070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4070415Z return func(*args, **kwargs) 2025-08-14T21:38:56.4070756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4071103Z return func(*args, **kwargs) 2025-08-14T21:38:56.4071417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4071743Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4072120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4072497Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4072836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4073183Z return func(*args, **kwargs) 2025-08-14T21:38:56.4073521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4073865Z return func(*args, **kwargs) 2025-08-14T21:38:56.4074191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4074634Z return func(*args, **kwargs) 2025-08-14T21:38:56.4074822Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4075146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4075476Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4075850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4076224Z layer_outputs = layer_module( 2025-08-14T21:38:56.4076545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4076890Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4077250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4077592Z return func(*args, **kwargs) 2025-08-14T21:38:56.4077917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4078278Z return func(*args, **kwargs) 2025-08-14T21:38:56.4078612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4078939Z return func(*args, **kwargs) 2025-08-14T21:38:56.4079301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4079703Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4080056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4080384Z return func(*args, **kwargs) 2025-08-14T21:38:56.4080728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4081072Z return func(*args, **kwargs) 2025-08-14T21:38:56.4081395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4081737Z return func(*args, **kwargs) 2025-08-14T21:38:56.4082098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4082470Z self_outputs = self.self( 2025-08-14T21:38:56.4082801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4083160Z return func(*args, **kwargs) 2025-08-14T21:38:56.4083488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4083814Z return func(*args, **kwargs) 2025-08-14T21:38:56.4084143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4084483Z return func(*args, **kwargs) 2025-08-14T21:38:56.4084970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4085412Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4085611Z 2025-08-14T21:38:56.4085714Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4086056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4086369Z return mod(**inputs) 2025-08-14T21:38:56.4086674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4087008Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4087389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4087755Z outputs = self.layoutlm( 2025-08-14T21:38:56.4088093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4088438Z return func(*args, **kwargs) 2025-08-14T21:38:56.4088769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4089101Z return func(*args, **kwargs) 2025-08-14T21:38:56.4089413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4089738Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4090099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4090472Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4090814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4091149Z return func(*args, **kwargs) 2025-08-14T21:38:56.4091526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4091867Z return func(*args, **kwargs) 2025-08-14T21:38:56.4092193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4092552Z return func(*args, **kwargs) 2025-08-14T21:38:56.4092775Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4093118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4093460Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4093864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4094252Z layer_outputs = layer_module( 2025-08-14T21:38:56.4094587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4094925Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4095288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4095641Z return func(*args, **kwargs) 2025-08-14T21:38:56.4095978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4096355Z return func(*args, **kwargs) 2025-08-14T21:38:56.4096749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4097105Z return func(*args, **kwargs) 2025-08-14T21:38:56.4097477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4097865Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4098235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4098587Z return func(*args, **kwargs) 2025-08-14T21:38:56.4098926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4099277Z return func(*args, **kwargs) 2025-08-14T21:38:56.4099624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4100003Z return func(*args, **kwargs) 2025-08-14T21:38:56.4100370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4100751Z self_outputs = self.self( 2025-08-14T21:38:56.4101095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4101495Z return func(*args, **kwargs) 2025-08-14T21:38:56.4101840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4102189Z return func(*args, **kwargs) 2025-08-14T21:38:56.4102525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4102873Z return func(*args, **kwargs) 2025-08-14T21:38:56.4103248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4103695Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4103882Z 2025-08-14T21:38:56.4103989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4104340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4104657Z return mod(**inputs) 2025-08-14T21:38:56.4105054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4105433Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4105812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4106192Z outputs = self.layoutlm( 2025-08-14T21:38:56.4106547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4106896Z return func(*args, **kwargs) 2025-08-14T21:38:56.4107234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4107604Z return func(*args, **kwargs) 2025-08-14T21:38:56.4107932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4108282Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4108654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4109025Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4109370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4109722Z return func(*args, **kwargs) 2025-08-14T21:38:56.4110043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4110380Z return func(*args, **kwargs) 2025-08-14T21:38:56.4110705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4111038Z return func(*args, **kwargs) 2025-08-14T21:38:56.4111210Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4111536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4111862Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4112220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4112587Z layer_outputs = layer_module( 2025-08-14T21:38:56.4112907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4113245Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4113578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4113913Z return func(*args, **kwargs) 2025-08-14T21:38:56.4114239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4114577Z return func(*args, **kwargs) 2025-08-14T21:38:56.4114900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4115234Z return func(*args, **kwargs) 2025-08-14T21:38:56.4115585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4115961Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4116315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4116651Z return func(*args, **kwargs) 2025-08-14T21:38:56.4116977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4117308Z return func(*args, **kwargs) 2025-08-14T21:38:56.4117633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4117968Z return func(*args, **kwargs) 2025-08-14T21:38:56.4118329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4118699Z self_outputs = self.self( 2025-08-14T21:38:56.4119033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4119386Z return func(*args, **kwargs) 2025-08-14T21:38:56.4119701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4120035Z return func(*args, **kwargs) 2025-08-14T21:38:56.4120378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4120710Z return func(*args, **kwargs) 2025-08-14T21:38:56.4121065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4121508Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4121694Z 2025-08-14T21:38:56.4121778Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4121971Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4122194Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4122530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4122838Z return mod(**inputs) 2025-08-14T21:38:56.4123143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4123471Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4123841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4124200Z outputs = self.layoutlm( 2025-08-14T21:38:56.4124533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4124871Z return func(*args, **kwargs) 2025-08-14T21:38:56.4125199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4125529Z return func(*args, **kwargs) 2025-08-14T21:38:56.4125839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4126163Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4126521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4126894Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4127238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4127576Z return func(*args, **kwargs) 2025-08-14T21:38:56.4127896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4128232Z return func(*args, **kwargs) 2025-08-14T21:38:56.4128559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4128891Z return func(*args, **kwargs) 2025-08-14T21:38:56.4129075Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4129398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4129724Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4130084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4130452Z layer_outputs = layer_module( 2025-08-14T21:38:56.4130776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4131120Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4131465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4131800Z return func(*args, **kwargs) 2025-08-14T21:38:56.4132122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4132475Z return func(*args, **kwargs) 2025-08-14T21:38:56.4132800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4133138Z return func(*args, **kwargs) 2025-08-14T21:38:56.4133501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4133884Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4134232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4134564Z return func(*args, **kwargs) 2025-08-14T21:38:56.4134883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4135216Z return func(*args, **kwargs) 2025-08-14T21:38:56.4135570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4135910Z return func(*args, **kwargs) 2025-08-14T21:38:56.4136261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4136685Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4137105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4137482Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4137619Z 2025-08-14T21:38:56.4137716Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4138050Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4138350Z return mod(**inputs) 2025-08-14T21:38:56.4138651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4138981Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4139351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4139715Z outputs = self.layoutlm( 2025-08-14T21:38:56.4140049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4140386Z return func(*args, **kwargs) 2025-08-14T21:38:56.4140714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4141043Z return func(*args, **kwargs) 2025-08-14T21:38:56.4141350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4141677Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4142041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4142413Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4142756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4143098Z return func(*args, **kwargs) 2025-08-14T21:38:56.4143908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4144248Z return func(*args, **kwargs) 2025-08-14T21:38:56.4144600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4145026Z return func(*args, **kwargs) 2025-08-14T21:38:56.4145206Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4145541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4145903Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4146301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4146675Z layer_outputs = layer_module( 2025-08-14T21:38:56.4147023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4147373Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4147729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4148084Z return func(*args, **kwargs) 2025-08-14T21:38:56.4148426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4148763Z return func(*args, **kwargs) 2025-08-14T21:38:56.4149100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4149464Z return func(*args, **kwargs) 2025-08-14T21:38:56.4149826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4150214Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4150602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4150978Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4151387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4151838Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4152263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4152653Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4152796Z 2025-08-14T21:38:56.4152896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4153237Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4153550Z return mod(**inputs) 2025-08-14T21:38:56.4153859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4154199Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4154584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4154956Z outputs = self.layoutlm( 2025-08-14T21:38:56.4155297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4155646Z return func(*args, **kwargs) 2025-08-14T21:38:56.4155987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4156328Z return func(*args, **kwargs) 2025-08-14T21:38:56.4156645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4156984Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4157356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4157737Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4158144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4158489Z return func(*args, **kwargs) 2025-08-14T21:38:56.4158820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4159199Z return func(*args, **kwargs) 2025-08-14T21:38:56.4159529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4159863Z return func(*args, **kwargs) 2025-08-14T21:38:56.4160044Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4160385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4160711Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4161070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4161437Z layer_outputs = layer_module( 2025-08-14T21:38:56.4161755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4162087Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4162442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4162776Z return func(*args, **kwargs) 2025-08-14T21:38:56.4163101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4163429Z return func(*args, **kwargs) 2025-08-14T21:38:56.4163752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4164089Z return func(*args, **kwargs) 2025-08-14T21:38:56.4164440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4164813Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4165186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4165551Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4165939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4166384Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4166797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4167200Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4167543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4167856Z return self.act(input) 2025-08-14T21:38:56.4167960Z 2025-08-14T21:38:56.4168065Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4168398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4168694Z return mod(**inputs) 2025-08-14T21:38:56.4169001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4169329Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4169689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4170060Z outputs = self.layoutlm( 2025-08-14T21:38:56.4170389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4170731Z return func(*args, **kwargs) 2025-08-14T21:38:56.4171063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4171404Z return func(*args, **kwargs) 2025-08-14T21:38:56.4171713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4172056Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4172430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4172811Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4173174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4173508Z return func(*args, **kwargs) 2025-08-14T21:38:56.4173838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4174175Z return func(*args, **kwargs) 2025-08-14T21:38:56.4174500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4174836Z return func(*args, **kwargs) 2025-08-14T21:38:56.4175016Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4175357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4175676Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4176045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4176415Z layer_outputs = layer_module( 2025-08-14T21:38:56.4176729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4177059Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4177411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4177746Z return func(*args, **kwargs) 2025-08-14T21:38:56.4178065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4178402Z return func(*args, **kwargs) 2025-08-14T21:38:56.4178731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4179064Z return func(*args, **kwargs) 2025-08-14T21:38:56.4179414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4179793Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4180165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4180526Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4180921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4181369Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4181781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4182159Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4182293Z 2025-08-14T21:38:56.4182392Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4182727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4183018Z return mod(**inputs) 2025-08-14T21:38:56.4183322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4183666Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4184033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4184437Z outputs = self.layoutlm( 2025-08-14T21:38:56.4184989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4185405Z return func(*args, **kwargs) 2025-08-14T21:38:56.4185750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4186107Z return func(*args, **kwargs) 2025-08-14T21:38:56.4186461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4186802Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4187168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4187569Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4187919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4188257Z return func(*args, **kwargs) 2025-08-14T21:38:56.4188590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4188959Z return func(*args, **kwargs) 2025-08-14T21:38:56.4189289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4189627Z return func(*args, **kwargs) 2025-08-14T21:38:56.4189808Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4190135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4190458Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4190834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4191210Z layer_outputs = layer_module( 2025-08-14T21:38:56.4191533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4191870Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4192220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4192283Z return func(*args, **kwargs) 2025-08-14T21:38:56.4192511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4192573Z return func(*args, **kwargs) 2025-08-14T21:38:56.4192796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4192866Z return func(*args, **kwargs) 2025-08-14T21:38:56.4193120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4193207Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4193429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4193494Z return func(*args, **kwargs) 2025-08-14T21:38:56.4193722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4193783Z return func(*args, **kwargs) 2025-08-14T21:38:56.4194008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4194069Z return func(*args, **kwargs) 2025-08-14T21:38:56.4194346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4194424Z self_outputs = self.self( 2025-08-14T21:38:56.4194648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4194709Z return func(*args, **kwargs) 2025-08-14T21:38:56.4194939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4195022Z return func(*args, **kwargs) 2025-08-14T21:38:56.4195255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4195316Z return func(*args, **kwargs) 2025-08-14T21:38:56.4195582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4195734Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4195738Z 2025-08-14T21:38:56.4195842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4196039Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4196101Z return mod(**inputs) 2025-08-14T21:38:56.4196306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4196399Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4196657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4196723Z outputs = self.layoutlm( 2025-08-14T21:38:56.4196959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4197020Z return func(*args, **kwargs) 2025-08-14T21:38:56.4197254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4197315Z return func(*args, **kwargs) 2025-08-14T21:38:56.4197519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4197596Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4197847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4197919Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4198152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4198215Z return func(*args, **kwargs) 2025-08-14T21:38:56.4198447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4198511Z return func(*args, **kwargs) 2025-08-14T21:38:56.4198734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4198803Z return func(*args, **kwargs) 2025-08-14T21:38:56.4198883Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4199079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4199155Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4199398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4199468Z layer_outputs = layer_module( 2025-08-14T21:38:56.4199674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4199746Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4199969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4200052Z return func(*args, **kwargs) 2025-08-14T21:38:56.4200270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4200336Z return func(*args, **kwargs) 2025-08-14T21:38:56.4200551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4200636Z return func(*args, **kwargs) 2025-08-14T21:38:56.4200882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4200958Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4201197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4201258Z return func(*args, **kwargs) 2025-08-14T21:38:56.4201483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4201543Z return func(*args, **kwargs) 2025-08-14T21:38:56.4201762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4201828Z return func(*args, **kwargs) 2025-08-14T21:38:56.4202076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4202155Z self_outputs = self.self( 2025-08-14T21:38:56.4202379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4202438Z return func(*args, **kwargs) 2025-08-14T21:38:56.4202663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4202723Z return func(*args, **kwargs) 2025-08-14T21:38:56.4202939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4203004Z return func(*args, **kwargs) 2025-08-14T21:38:56.4203249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4203377Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4203389Z 2025-08-14T21:38:56.4203485Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4203669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4203733Z return mod(**inputs) 2025-08-14T21:38:56.4203935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4204002Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4204254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4204317Z outputs = self.layoutlm( 2025-08-14T21:38:56.4204541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4204600Z return func(*args, **kwargs) 2025-08-14T21:38:56.4204821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4204888Z return func(*args, **kwargs) 2025-08-14T21:38:56.4205085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4205152Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4205405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4205474Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4205714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4205776Z return func(*args, **kwargs) 2025-08-14T21:38:56.4205997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4206082Z return func(*args, **kwargs) 2025-08-14T21:38:56.4206301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4206361Z return func(*args, **kwargs) 2025-08-14T21:38:56.4206437Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4206647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4206723Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4206968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4207035Z layer_outputs = layer_module( 2025-08-14T21:38:56.4207247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4207319Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4207535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4207621Z return func(*args, **kwargs) 2025-08-14T21:38:56.4207840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4207909Z return func(*args, **kwargs) 2025-08-14T21:38:56.4208127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4208186Z return func(*args, **kwargs) 2025-08-14T21:38:56.4208442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4208519Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4208745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4208806Z return func(*args, **kwargs) 2025-08-14T21:38:56.4209027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4209097Z return func(*args, **kwargs) 2025-08-14T21:38:56.4209314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4209377Z return func(*args, **kwargs) 2025-08-14T21:38:56.4209628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4209694Z self_outputs = self.self( 2025-08-14T21:38:56.4209919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4209981Z return func(*args, **kwargs) 2025-08-14T21:38:56.4210200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4210271Z return func(*args, **kwargs) 2025-08-14T21:38:56.4210489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4210549Z return func(*args, **kwargs) 2025-08-14T21:38:56.4210802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4210935Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4210939Z 2025-08-14T21:38:56.4211019Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4211108Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4211208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4211398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4211457Z return mod(**inputs) 2025-08-14T21:38:56.4211659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4211750Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4211995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4212066Z outputs = self.layoutlm( 2025-08-14T21:38:56.4212296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4212359Z return func(*args, **kwargs) 2025-08-14T21:38:56.4212587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4212646Z return func(*args, **kwargs) 2025-08-14T21:38:56.4212852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4212917Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4213165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4213266Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4213490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4213552Z return func(*args, **kwargs) 2025-08-14T21:38:56.4213780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4213839Z return func(*args, **kwargs) 2025-08-14T21:38:56.4214065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4214124Z return func(*args, **kwargs) 2025-08-14T21:38:56.4214192Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4214399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4214468Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4214715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4214785Z layer_outputs = layer_module( 2025-08-14T21:38:56.4214992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4215071Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4215292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4215354Z return func(*args, **kwargs) 2025-08-14T21:38:56.4215581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4215643Z return func(*args, **kwargs) 2025-08-14T21:38:56.4215863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4215933Z return func(*args, **kwargs) 2025-08-14T21:38:56.4216181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4216264Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4216486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4216546Z return func(*args, **kwargs) 2025-08-14T21:38:56.4216787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4216849Z return func(*args, **kwargs) 2025-08-14T21:38:56.4217072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4217131Z return func(*args, **kwargs) 2025-08-14T21:38:56.4217393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4217518Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4217777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4217854Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4217857Z 2025-08-14T21:38:56.4217958Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4218142Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4218209Z return mod(**inputs) 2025-08-14T21:38:56.4218410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4218476Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4218727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4218820Z outputs = self.layoutlm( 2025-08-14T21:38:56.4219039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4219107Z return func(*args, **kwargs) 2025-08-14T21:38:56.4219325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4219391Z return func(*args, **kwargs) 2025-08-14T21:38:56.4219590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4219657Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4219906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4219972Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4220200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4220260Z return func(*args, **kwargs) 2025-08-14T21:38:56.4220477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4220544Z return func(*args, **kwargs) 2025-08-14T21:38:56.4220760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4220819Z return func(*args, **kwargs) 2025-08-14T21:38:56.4220894Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4221090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4221164Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4221406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4221474Z layer_outputs = layer_module( 2025-08-14T21:38:56.4221683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4221755Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4221972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4222039Z return func(*args, **kwargs) 2025-08-14T21:38:56.4222270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4222340Z return func(*args, **kwargs) 2025-08-14T21:38:56.4222556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4222615Z return func(*args, **kwargs) 2025-08-14T21:38:56.4222865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4222963Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4223202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4223292Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4223567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4223686Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4223930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4224007Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4224010Z 2025-08-14T21:38:56.4224117Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4224317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4224383Z return mod(**inputs) 2025-08-14T21:38:56.4224582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4224649Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4224965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4225035Z outputs = self.layoutlm( 2025-08-14T21:38:56.4225256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4225326Z return func(*args, **kwargs) 2025-08-14T21:38:56.4225543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4225611Z return func(*args, **kwargs) 2025-08-14T21:38:56.4225812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4225879Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4226138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4226205Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4226437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4226498Z return func(*args, **kwargs) 2025-08-14T21:38:56.4226723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4226793Z return func(*args, **kwargs) 2025-08-14T21:38:56.4227015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4227079Z return func(*args, **kwargs) 2025-08-14T21:38:56.4227156Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4227357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4227442Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4227694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4227760Z layer_outputs = layer_module( 2025-08-14T21:38:56.4227991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4228066Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4228281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4228348Z return func(*args, **kwargs) 2025-08-14T21:38:56.4228587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4228654Z return func(*args, **kwargs) 2025-08-14T21:38:56.4228875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4230066Z return func(*args, **kwargs) 2025-08-14T21:38:56.4230327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4230404Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4230649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4230726Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4231003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4231142Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4231388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4231493Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4231699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4231764Z return self.act(input) 2025-08-14T21:38:56.4231768Z 2025-08-14T21:38:56.4231868Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4232054Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4232114Z return mod(**inputs) 2025-08-14T21:38:56.4232320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4232387Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4232635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4232705Z outputs = self.layoutlm( 2025-08-14T21:38:56.4232926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4232994Z return func(*args, **kwargs) 2025-08-14T21:38:56.4233210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4233270Z return func(*args, **kwargs) 2025-08-14T21:38:56.4233476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4233542Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4233785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4233860Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4234077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4234143Z return func(*args, **kwargs) 2025-08-14T21:38:56.4234360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4234420Z return func(*args, **kwargs) 2025-08-14T21:38:56.4234643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4234715Z return func(*args, **kwargs) 2025-08-14T21:38:56.4234793Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4234990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4235056Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4235324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4235392Z layer_outputs = layer_module( 2025-08-14T21:38:56.4235592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4235685Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4235907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4235974Z return func(*args, **kwargs) 2025-08-14T21:38:56.4236194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4236255Z return func(*args, **kwargs) 2025-08-14T21:38:56.4236480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4236540Z return func(*args, **kwargs) 2025-08-14T21:38:56.4236803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4236885Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4237122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4237198Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4237473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4237597Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4237853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4237928Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4237933Z 2025-08-14T21:38:56.4238036Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4238220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4238278Z return mod(**inputs) 2025-08-14T21:38:56.4238482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4238547Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4238791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4238862Z outputs = self.layoutlm( 2025-08-14T21:38:56.4239081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4239148Z return func(*args, **kwargs) 2025-08-14T21:38:56.4239367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4239430Z return func(*args, **kwargs) 2025-08-14T21:38:56.4239635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4239701Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4239945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4240020Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4240239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4240325Z return func(*args, **kwargs) 2025-08-14T21:38:56.4240544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4240604Z return func(*args, **kwargs) 2025-08-14T21:38:56.4240829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4240906Z return func(*args, **kwargs) 2025-08-14T21:38:56.4240983Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4241179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4241270Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4241518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4241584Z layer_outputs = layer_module( 2025-08-14T21:38:56.4241785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4241866Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4242082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4242165Z return func(*args, **kwargs) 2025-08-14T21:38:56.4242385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4242446Z return func(*args, **kwargs) 2025-08-14T21:38:56.4242673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4242733Z return func(*args, **kwargs) 2025-08-14T21:38:56.4242979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4243063Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4243281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4243347Z return func(*args, **kwargs) 2025-08-14T21:38:56.4243565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4243629Z return func(*args, **kwargs) 2025-08-14T21:38:56.4243852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4243912Z return func(*args, **kwargs) 2025-08-14T21:38:56.4244159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4244229Z self_outputs = self.self( 2025-08-14T21:38:56.4244449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4244516Z return func(*args, **kwargs) 2025-08-14T21:38:56.4244736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4244795Z return func(*args, **kwargs) 2025-08-14T21:38:56.4245022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4245085Z return func(*args, **kwargs) 2025-08-14T21:38:56.4245336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4245473Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4245477Z 2025-08-14T21:38:56.4245572Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4245765Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4245838Z return mod(**inputs) 2025-08-14T21:38:56.4246038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4246112Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4246357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4246450Z outputs = self.layoutlm( 2025-08-14T21:38:56.4246666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4246725Z return func(*args, **kwargs) 2025-08-14T21:38:56.4246963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4247024Z return func(*args, **kwargs) 2025-08-14T21:38:56.4247221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4247294Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4247538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4247610Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4247826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4247903Z return func(*args, **kwargs) 2025-08-14T21:38:56.4248126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4248185Z return func(*args, **kwargs) 2025-08-14T21:38:56.4248410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4248471Z return func(*args, **kwargs) 2025-08-14T21:38:56.4248541Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4248747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4248816Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4249061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4249137Z layer_outputs = layer_module( 2025-08-14T21:38:56.4249340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4249419Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4249638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4249699Z return func(*args, **kwargs) 2025-08-14T21:38:56.4249924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4249987Z return func(*args, **kwargs) 2025-08-14T21:38:56.4250205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4250275Z return func(*args, **kwargs) 2025-08-14T21:38:56.4250517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4250604Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4250819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4250879Z return func(*args, **kwargs) 2025-08-14T21:38:56.4251103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4251164Z return func(*args, **kwargs) 2025-08-14T21:38:56.4251395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4251464Z return func(*args, **kwargs) 2025-08-14T21:38:56.4251710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4251779Z self_outputs = self.self( 2025-08-14T21:38:56.4252014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4252075Z return func(*args, **kwargs) 2025-08-14T21:38:56.4252296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4252370Z return func(*args, **kwargs) 2025-08-14T21:38:56.4252594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4252653Z return func(*args, **kwargs) 2025-08-14T21:38:56.4252900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4253033Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4253037Z 2025-08-14T21:38:56.4253132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4253314Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4253398Z return mod(**inputs) 2025-08-14T21:38:56.4253596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4253670Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4253913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4253976Z outputs = self.layoutlm( 2025-08-14T21:38:56.4254201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4254261Z return func(*args, **kwargs) 2025-08-14T21:38:56.4254477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4254545Z return func(*args, **kwargs) 2025-08-14T21:38:56.4254740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4254814Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4255054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4255122Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4255348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4255408Z return func(*args, **kwargs) 2025-08-14T21:38:56.4255632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4255692Z return func(*args, **kwargs) 2025-08-14T21:38:56.4255904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4255971Z return func(*args, **kwargs) 2025-08-14T21:38:56.4256043Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4256239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4256310Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4256552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4256622Z layer_outputs = layer_module( 2025-08-14T21:38:56.4256821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4256907Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4257133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4257194Z return func(*args, **kwargs) 2025-08-14T21:38:56.4257409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4257493Z return func(*args, **kwargs) 2025-08-14T21:38:56.4257710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4257776Z return func(*args, **kwargs) 2025-08-14T21:38:56.4258032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4258111Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4258337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4258395Z return func(*args, **kwargs) 2025-08-14T21:38:56.4258610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4258674Z return func(*args, **kwargs) 2025-08-14T21:38:56.4258904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4258970Z return func(*args, **kwargs) 2025-08-14T21:38:56.4259212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4259277Z self_outputs = self.self( 2025-08-14T21:38:56.4259499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4259558Z return func(*args, **kwargs) 2025-08-14T21:38:56.4259779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4259838Z return func(*args, **kwargs) 2025-08-14T21:38:56.4260054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4260122Z return func(*args, **kwargs) 2025-08-14T21:38:56.4260366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4260498Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4260501Z 2025-08-14T21:38:56.4260582Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4260653Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4260754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4260934Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4260995Z return mod(**inputs) 2025-08-14T21:38:56.4261201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4261270Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4261515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4261588Z outputs = self.layoutlm( 2025-08-14T21:38:56.4261804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4261871Z return func(*args, **kwargs) 2025-08-14T21:38:56.4262087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4262146Z return func(*args, **kwargs) 2025-08-14T21:38:56.4262367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4262435Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4262679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4262753Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4262993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4263064Z return func(*args, **kwargs) 2025-08-14T21:38:56.4263279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4263354Z return func(*args, **kwargs) 2025-08-14T21:38:56.4263583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4263643Z return func(*args, **kwargs) 2025-08-14T21:38:56.4263712Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4263919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4263987Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4264242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4264351Z layer_outputs = layer_module( 2025-08-14T21:38:56.4264554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4264633Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4264919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4264994Z return func(*args, **kwargs) 2025-08-14T21:38:56.4265212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4265276Z return func(*args, **kwargs) 2025-08-14T21:38:56.4265502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4265564Z return func(*args, **kwargs) 2025-08-14T21:38:56.4265814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4265902Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4266118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4266188Z return func(*args, **kwargs) 2025-08-14T21:38:56.4266405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4266467Z return func(*args, **kwargs) 2025-08-14T21:38:56.4266693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4266753Z return func(*args, **kwargs) 2025-08-14T21:38:56.4266994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4267122Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4267366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4267451Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4267454Z 2025-08-14T21:38:56.4267548Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4267728Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4267797Z return mod(**inputs) 2025-08-14T21:38:56.4268015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4268092Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4268333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4268395Z outputs = self.layoutlm( 2025-08-14T21:38:56.4268619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4268707Z return func(*args, **kwargs) 2025-08-14T21:38:56.4268927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4268996Z return func(*args, **kwargs) 2025-08-14T21:38:56.4269208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4269284Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4269527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4269593Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4269817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4269876Z return func(*args, **kwargs) 2025-08-14T21:38:56.4270108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4270174Z return func(*args, **kwargs) 2025-08-14T21:38:56.4270388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4270455Z return func(*args, **kwargs) 2025-08-14T21:38:56.4270524Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4270723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4270796Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4271040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4271104Z layer_outputs = layer_module( 2025-08-14T21:38:56.4271313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4271388Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4271612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4271672Z return func(*args, **kwargs) 2025-08-14T21:38:56.4271887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4271956Z return func(*args, **kwargs) 2025-08-14T21:38:56.4272171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4272237Z return func(*args, **kwargs) 2025-08-14T21:38:56.4272480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4272556Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4272800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4272871Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4273143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4273262Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4273505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4273601Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4273604Z 2025-08-14T21:38:56.4273700Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4273880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4273943Z return mod(**inputs) 2025-08-14T21:38:56.4274159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4274236Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4274482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4274560Z outputs = self.layoutlm( 2025-08-14T21:38:56.4274788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4274848Z return func(*args, **kwargs) 2025-08-14T21:38:56.4275067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4275136Z return func(*args, **kwargs) 2025-08-14T21:38:56.4275332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4275405Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4275666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4275732Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4275959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4276020Z return func(*args, **kwargs) 2025-08-14T21:38:56.4276240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4276307Z return func(*args, **kwargs) 2025-08-14T21:38:56.4276525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4276592Z return func(*args, **kwargs) 2025-08-14T21:38:56.4276661Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4276858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4276934Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4277179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4277245Z layer_outputs = layer_module( 2025-08-14T21:38:56.4277456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4277528Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4277756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4277817Z return func(*args, **kwargs) 2025-08-14T21:38:56.4278036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4278102Z return func(*args, **kwargs) 2025-08-14T21:38:56.4278323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4278392Z return func(*args, **kwargs) 2025-08-14T21:38:56.4278637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4278714Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4278957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4279028Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4279318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4279440Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4279688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4279817Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4280011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4280076Z return self.act(input) 2025-08-14T21:38:56.4280079Z 2025-08-14T21:38:56.4280197Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4280381Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4280447Z return mod(**inputs) 2025-08-14T21:38:56.4280646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4280715Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4280964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4281028Z outputs = self.layoutlm( 2025-08-14T21:38:56.4281261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4281331Z return func(*args, **kwargs) 2025-08-14T21:38:56.4281546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4281612Z return func(*args, **kwargs) 2025-08-14T21:38:56.4281808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4281875Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4282126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4282193Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4282411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4282481Z return func(*args, **kwargs) 2025-08-14T21:38:56.4282697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4282765Z return func(*args, **kwargs) 2025-08-14T21:38:56.4282983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4283042Z return func(*args, **kwargs) 2025-08-14T21:38:56.4283119Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4283315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4283380Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4283627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4283690Z layer_outputs = layer_module( 2025-08-14T21:38:56.4283902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4283976Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4284191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4284260Z return func(*args, **kwargs) 2025-08-14T21:38:56.4284476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4284535Z return func(*args, **kwargs) 2025-08-14T21:38:56.4284958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4285026Z return func(*args, **kwargs) 2025-08-14T21:38:56.4285287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4285368Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4285637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4285713Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4286019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4286162Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4286409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4286484Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4286488Z 2025-08-14T21:38:56.4286590Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4286774Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4286864Z return mod(**inputs) 2025-08-14T21:38:56.4287075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4287140Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4287459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4287523Z outputs = self.layoutlm( 2025-08-14T21:38:56.4287753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4287822Z return func(*args, **kwargs) 2025-08-14T21:38:56.4288050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4288118Z return func(*args, **kwargs) 2025-08-14T21:38:56.4288325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4288395Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4288656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4288723Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4288953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4289022Z return func(*args, **kwargs) 2025-08-14T21:38:56.4289251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4289319Z return func(*args, **kwargs) 2025-08-14T21:38:56.4289549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4289611Z return func(*args, **kwargs) 2025-08-14T21:38:56.4289687Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4289897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4289966Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4290227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4290296Z layer_outputs = layer_module( 2025-08-14T21:38:56.4290511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4290586Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4290831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4290901Z return func(*args, **kwargs) 2025-08-14T21:38:56.4291125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4291205Z return func(*args, **kwargs) 2025-08-14T21:38:56.4291444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4291504Z return func(*args, **kwargs) 2025-08-14T21:38:56.4291783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4291866Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4292089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4292156Z return func(*args, **kwargs) 2025-08-14T21:38:56.4292379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4292447Z return func(*args, **kwargs) 2025-08-14T21:38:56.4292670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4292750Z return func(*args, **kwargs) 2025-08-14T21:38:56.4293012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4293077Z self_outputs = self.self( 2025-08-14T21:38:56.4293306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4293375Z return func(*args, **kwargs) 2025-08-14T21:38:56.4293603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4293672Z return func(*args, **kwargs) 2025-08-14T21:38:56.4293900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4293961Z return func(*args, **kwargs) 2025-08-14T21:38:56.4294227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4294366Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4294370Z 2025-08-14T21:38:56.4294474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4294666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4294728Z return mod(**inputs) 2025-08-14T21:38:56.4294939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4295007Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4295256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4295320Z outputs = self.layoutlm( 2025-08-14T21:38:56.4295545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4295613Z return func(*args, **kwargs) 2025-08-14T21:38:56.4295836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4295899Z return func(*args, **kwargs) 2025-08-14T21:38:56.4296112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4296180Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4296455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4296533Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4296759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4296825Z return func(*args, **kwargs) 2025-08-14T21:38:56.4297050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4297129Z return func(*args, **kwargs) 2025-08-14T21:38:56.4297359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4297416Z return func(*args, **kwargs) 2025-08-14T21:38:56.4297498Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4297713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4297782Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4298042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4298105Z layer_outputs = layer_module( 2025-08-14T21:38:56.4298310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4298408Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4298633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4298695Z return func(*args, **kwargs) 2025-08-14T21:38:56.4298925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4298986Z return func(*args, **kwargs) 2025-08-14T21:38:56.4299222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4299283Z return func(*args, **kwargs) 2025-08-14T21:38:56.4299529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4299611Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4299829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4299899Z return func(*args, **kwargs) 2025-08-14T21:38:56.4300116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4300176Z return func(*args, **kwargs) 2025-08-14T21:38:56.4300401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4300460Z return func(*args, **kwargs) 2025-08-14T21:38:56.4300705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4300776Z self_outputs = self.self( 2025-08-14T21:38:56.4300996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4301062Z return func(*args, **kwargs) 2025-08-14T21:38:56.4301281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4301343Z return func(*args, **kwargs) 2025-08-14T21:38:56.4301570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4301628Z return func(*args, **kwargs) 2025-08-14T21:38:56.4301874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4302007Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4302011Z 2025-08-14T21:38:56.4302119Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4302301Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4302356Z return mod(**inputs) 2025-08-14T21:38:56.4302553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4302643Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4302887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4302954Z outputs = self.layoutlm( 2025-08-14T21:38:56.4303186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4303249Z return func(*args, **kwargs) 2025-08-14T21:38:56.4303467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4303523Z return func(*args, **kwargs) 2025-08-14T21:38:56.4303719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4303794Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4304040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4304136Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4304354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4304416Z return func(*args, **kwargs) 2025-08-14T21:38:56.4304641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4304701Z return func(*args, **kwargs) 2025-08-14T21:38:56.4304978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4305050Z return func(*args, **kwargs) 2025-08-14T21:38:56.4305117Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4305326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4305396Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4305641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4305712Z layer_outputs = layer_module( 2025-08-14T21:38:56.4305917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4305988Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4306216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4306280Z return func(*args, **kwargs) 2025-08-14T21:38:56.4306511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4306575Z return func(*args, **kwargs) 2025-08-14T21:38:56.4306788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4306859Z return func(*args, **kwargs) 2025-08-14T21:38:56.4307099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4307179Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4307401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4307463Z return func(*args, **kwargs) 2025-08-14T21:38:56.4307709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4307772Z return func(*args, **kwargs) 2025-08-14T21:38:56.4307994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4308062Z return func(*args, **kwargs) 2025-08-14T21:38:56.4308323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4308394Z self_outputs = self.self( 2025-08-14T21:38:56.4308607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4308680Z return func(*args, **kwargs) 2025-08-14T21:38:56.4308903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4308963Z return func(*args, **kwargs) 2025-08-14T21:38:56.4309178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4309245Z return func(*args, **kwargs) 2025-08-14T21:38:56.4309488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4309633Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4309654Z 2025-08-14T21:38:56.4309728Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4309799Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4309900Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4310083Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4310142Z return mod(**inputs) 2025-08-14T21:38:56.4310348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4310417Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4310669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4310730Z outputs = self.layoutlm( 2025-08-14T21:38:56.4310948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4311012Z return func(*args, **kwargs) 2025-08-14T21:38:56.4311228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4311290Z return func(*args, **kwargs) 2025-08-14T21:38:56.4311489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4311551Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4311800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4311867Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4312087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4312153Z return func(*args, **kwargs) 2025-08-14T21:38:56.4312371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4312440Z return func(*args, **kwargs) 2025-08-14T21:38:56.4312658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4312718Z return func(*args, **kwargs) 2025-08-14T21:38:56.4312795Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4312994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4313059Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4313326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4313393Z layer_outputs = layer_module( 2025-08-14T21:38:56.4313602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4313690Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4313910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4313976Z return func(*args, **kwargs) 2025-08-14T21:38:56.4314209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4314270Z return func(*args, **kwargs) 2025-08-14T21:38:56.4314497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4314559Z return func(*args, **kwargs) 2025-08-14T21:38:56.4314808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4314884Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4315102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4315180Z return func(*args, **kwargs) 2025-08-14T21:38:56.4315401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4315463Z return func(*args, **kwargs) 2025-08-14T21:38:56.4315686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4315745Z return func(*args, **kwargs) 2025-08-14T21:38:56.4316000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4316119Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4316363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4316444Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4316450Z 2025-08-14T21:38:56.4316545Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4316731Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4316790Z return mod(**inputs) 2025-08-14T21:38:56.4316990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4317064Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4317313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4317376Z outputs = self.layoutlm( 2025-08-14T21:38:56.4317602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4317660Z return func(*args, **kwargs) 2025-08-14T21:38:56.4317884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4317946Z return func(*args, **kwargs) 2025-08-14T21:38:56.4318142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4318215Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4318462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4318532Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4318764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4318823Z return func(*args, **kwargs) 2025-08-14T21:38:56.4319046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4319107Z return func(*args, **kwargs) 2025-08-14T21:38:56.4319337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4319400Z return func(*args, **kwargs) 2025-08-14T21:38:56.4319465Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4319682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4319750Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4319990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4320058Z layer_outputs = layer_module( 2025-08-14T21:38:56.4320256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4320329Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4320551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4320627Z return func(*args, **kwargs) 2025-08-14T21:38:56.4320854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4320913Z return func(*args, **kwargs) 2025-08-14T21:38:56.4321132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4321198Z return func(*args, **kwargs) 2025-08-14T21:38:56.4321446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4321524Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4321770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4321838Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4322123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4322234Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4322480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4322563Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4322566Z 2025-08-14T21:38:56.4322660Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4322852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4322913Z return mod(**inputs) 2025-08-14T21:38:56.4323112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4323181Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4323428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4323495Z outputs = self.layoutlm( 2025-08-14T21:38:56.4323726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4323787Z return func(*args, **kwargs) 2025-08-14T21:38:56.4324013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4324073Z return func(*args, **kwargs) 2025-08-14T21:38:56.4324286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4324365Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4324612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4324686Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4324920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4324980Z return func(*args, **kwargs) 2025-08-14T21:38:56.4325202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4325281Z return func(*args, **kwargs) 2025-08-14T21:38:56.4325500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4325566Z return func(*args, **kwargs) 2025-08-14T21:38:56.4325635Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4325839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4325905Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4326148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4326237Z layer_outputs = layer_module( 2025-08-14T21:38:56.4326437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4326509Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4326732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4326788Z return func(*args, **kwargs) 2025-08-14T21:38:56.4327007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4327063Z return func(*args, **kwargs) 2025-08-14T21:38:56.4327278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4327341Z return func(*args, **kwargs) 2025-08-14T21:38:56.4327581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4327657Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4327897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4327963Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4328240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4328347Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4328588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4328699Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4328895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4328968Z return self.act(input) 2025-08-14T21:38:56.4328971Z 2025-08-14T21:38:56.4329066Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4329250Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4329318Z return mod(**inputs) 2025-08-14T21:38:56.4329517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4329583Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4329849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4329914Z outputs = self.layoutlm( 2025-08-14T21:38:56.4330141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4330199Z return func(*args, **kwargs) 2025-08-14T21:38:56.4330436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4330503Z return func(*args, **kwargs) 2025-08-14T21:38:56.4330700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4330781Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4331040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4331107Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4331335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4331395Z return func(*args, **kwargs) 2025-08-14T21:38:56.4331614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4331699Z return func(*args, **kwargs) 2025-08-14T21:38:56.4331916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4331976Z return func(*args, **kwargs) 2025-08-14T21:38:56.4332046Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4332246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4332320Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4332567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4332631Z layer_outputs = layer_module( 2025-08-14T21:38:56.4332842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4332915Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4333140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4333203Z return func(*args, **kwargs) 2025-08-14T21:38:56.4333418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4333485Z return func(*args, **kwargs) 2025-08-14T21:38:56.4333701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4333758Z return func(*args, **kwargs) 2025-08-14T21:38:56.4334008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4334083Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4334326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4334397Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4334667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4334795Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4335041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4335121Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4335124Z 2025-08-14T21:38:56.4335221Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4335419Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4335485Z return mod(**inputs) 2025-08-14T21:38:56.4335684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4335750Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4336020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4336083Z outputs = self.layoutlm( 2025-08-14T21:38:56.4336323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4336386Z return func(*args, **kwargs) 2025-08-14T21:38:56.4336605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4336672Z return func(*args, **kwargs) 2025-08-14T21:38:56.4336870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4336937Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4337193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4337279Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4337501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4337556Z return func(*args, **kwargs) 2025-08-14T21:38:56.4337771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4337838Z return func(*args, **kwargs) 2025-08-14T21:38:56.4338053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4338120Z return func(*args, **kwargs) 2025-08-14T21:38:56.4338189Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4338385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4338457Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4338704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4338767Z layer_outputs = layer_module( 2025-08-14T21:38:56.4338975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4339049Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4339273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4339332Z return func(*args, **kwargs) 2025-08-14T21:38:56.4339548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4339616Z return func(*args, **kwargs) 2025-08-14T21:38:56.4339831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4339893Z return func(*args, **kwargs) 2025-08-14T21:38:56.4340144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4340219Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4340444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4340504Z return func(*args, **kwargs) 2025-08-14T21:38:56.4340720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4340804Z return func(*args, **kwargs) 2025-08-14T21:38:56.4341023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4341084Z return func(*args, **kwargs) 2025-08-14T21:38:56.4341335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4341417Z self_outputs = self.self( 2025-08-14T21:38:56.4341643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4341702Z return func(*args, **kwargs) 2025-08-14T21:38:56.4341935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4342003Z return func(*args, **kwargs) 2025-08-14T21:38:56.4342222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4342284Z return func(*args, **kwargs) 2025-08-14T21:38:56.4342537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4342666Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4342684Z 2025-08-14T21:38:56.4342790Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4342966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4343021Z return mod(**inputs) 2025-08-14T21:38:56.4343219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4343285Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4343526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4343588Z outputs = self.layoutlm( 2025-08-14T21:38:56.4343799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4343859Z return func(*args, **kwargs) 2025-08-14T21:38:56.4344071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4344136Z return func(*args, **kwargs) 2025-08-14T21:38:56.4344333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4344399Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4344643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4344709Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4344994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4345071Z return func(*args, **kwargs) 2025-08-14T21:38:56.4345290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4345351Z return func(*args, **kwargs) 2025-08-14T21:38:56.4345583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4345649Z return func(*args, **kwargs) 2025-08-14T21:38:56.4345735Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4345935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4346006Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4346262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4346329Z layer_outputs = layer_module( 2025-08-14T21:38:56.4346564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4346640Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4346859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4346954Z return func(*args, **kwargs) 2025-08-14T21:38:56.4347180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4347239Z return func(*args, **kwargs) 2025-08-14T21:38:56.4347483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4347545Z return func(*args, **kwargs) 2025-08-14T21:38:56.4347800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4347878Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4348095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4348161Z return func(*args, **kwargs) 2025-08-14T21:38:56.4348378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4348456Z return func(*args, **kwargs) 2025-08-14T21:38:56.4348689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4348749Z return func(*args, **kwargs) 2025-08-14T21:38:56.4349005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4349072Z self_outputs = self.self( 2025-08-14T21:38:56.4349294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4349362Z return func(*args, **kwargs) 2025-08-14T21:38:56.4349580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4349640Z return func(*args, **kwargs) 2025-08-14T21:38:56.4349867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4349930Z return func(*args, **kwargs) 2025-08-14T21:38:56.4350184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4350314Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4350317Z 2025-08-14T21:38:56.4350412Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4350605Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4350666Z return mod(**inputs) 2025-08-14T21:38:56.4350876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4350944Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4351191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4351264Z outputs = self.layoutlm( 2025-08-14T21:38:56.4351486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4351548Z return func(*args, **kwargs) 2025-08-14T21:38:56.4351774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4351835Z return func(*args, **kwargs) 2025-08-14T21:38:56.4352055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4352128Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4352375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4352447Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4352669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4352748Z return func(*args, **kwargs) 2025-08-14T21:38:56.4352975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4353034Z return func(*args, **kwargs) 2025-08-14T21:38:56.4353303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4353366Z return func(*args, **kwargs) 2025-08-14T21:38:56.4353435Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4353639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4353704Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4353955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4354037Z layer_outputs = layer_module( 2025-08-14T21:38:56.4354242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4354321Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4354538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4354598Z return func(*args, **kwargs) 2025-08-14T21:38:56.4354823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4354886Z return func(*args, **kwargs) 2025-08-14T21:38:56.4355110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4355170Z return func(*args, **kwargs) 2025-08-14T21:38:56.4355414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4355499Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4355716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4355777Z return func(*args, **kwargs) 2025-08-14T21:38:56.4356002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4356063Z return func(*args, **kwargs) 2025-08-14T21:38:56.4356289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4356349Z return func(*args, **kwargs) 2025-08-14T21:38:56.4356594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4356666Z self_outputs = self.self( 2025-08-14T21:38:56.4356890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4356950Z return func(*args, **kwargs) 2025-08-14T21:38:56.4357175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4357237Z return func(*args, **kwargs) 2025-08-14T21:38:56.4357462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4357522Z return func(*args, **kwargs) 2025-08-14T21:38:56.4357783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4357927Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4357931Z 2025-08-14T21:38:56.4358002Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4358080Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4358192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4358377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4358445Z return mod(**inputs) 2025-08-14T21:38:56.4358661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4358731Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4358983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4359049Z outputs = self.layoutlm( 2025-08-14T21:38:56.4359273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4359332Z return func(*args, **kwargs) 2025-08-14T21:38:56.4359548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4359634Z return func(*args, **kwargs) 2025-08-14T21:38:56.4359834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4359899Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4360153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4360218Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4360445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4360505Z return func(*args, **kwargs) 2025-08-14T21:38:56.4360723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4360793Z return func(*args, **kwargs) 2025-08-14T21:38:56.4361009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4361073Z return func(*args, **kwargs) 2025-08-14T21:38:56.4361148Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4361346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4361421Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4361665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4361730Z layer_outputs = layer_module( 2025-08-14T21:38:56.4361943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4362016Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4362242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4362305Z return func(*args, **kwargs) 2025-08-14T21:38:56.4362523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4362590Z return func(*args, **kwargs) 2025-08-14T21:38:56.4362808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4362868Z return func(*args, **kwargs) 2025-08-14T21:38:56.4363121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4363211Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4363438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4363498Z return func(*args, **kwargs) 2025-08-14T21:38:56.4363713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4363798Z return func(*args, **kwargs) 2025-08-14T21:38:56.4364017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4364076Z return func(*args, **kwargs) 2025-08-14T21:38:56.4364348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4364470Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4364722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4364801Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4364804Z 2025-08-14T21:38:56.4364899Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4365090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4365167Z return mod(**inputs) 2025-08-14T21:38:56.4365373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4365441Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4365685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4365755Z outputs = self.layoutlm( 2025-08-14T21:38:56.4365973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4366036Z return func(*args, **kwargs) 2025-08-14T21:38:56.4366263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4366323Z return func(*args, **kwargs) 2025-08-14T21:38:56.4366525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4366591Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4366832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4366899Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4367117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4367174Z return func(*args, **kwargs) 2025-08-14T21:38:56.4367394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4367452Z return func(*args, **kwargs) 2025-08-14T21:38:56.4367670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4367728Z return func(*args, **kwargs) 2025-08-14T21:38:56.4367798Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4367998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4368062Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4368304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4368372Z layer_outputs = layer_module( 2025-08-14T21:38:56.4368574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4368672Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4368891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4368951Z return func(*args, **kwargs) 2025-08-14T21:38:56.4369175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4369250Z return func(*args, **kwargs) 2025-08-14T21:38:56.4369477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4369536Z return func(*args, **kwargs) 2025-08-14T21:38:56.4369795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4369883Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4370125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4370194Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4370474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4370585Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4370853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4370926Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4370929Z 2025-08-14T21:38:56.4371023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4371215Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4371275Z return mod(**inputs) 2025-08-14T21:38:56.4371482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4371548Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4371790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4371861Z outputs = self.layoutlm( 2025-08-14T21:38:56.4372080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4372143Z return func(*args, **kwargs) 2025-08-14T21:38:56.4372369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4372430Z return func(*args, **kwargs) 2025-08-14T21:38:56.4372637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4372703Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4372947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4373022Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4373237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4373296Z return func(*args, **kwargs) 2025-08-14T21:38:56.4373517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4373573Z return func(*args, **kwargs) 2025-08-14T21:38:56.4373785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4373843Z return func(*args, **kwargs) 2025-08-14T21:38:56.4373912Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4374109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4374184Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4374428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4374494Z layer_outputs = layer_module( 2025-08-14T21:38:56.4374694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4374787Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4375005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4375065Z return func(*args, **kwargs) 2025-08-14T21:38:56.4375302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4375361Z return func(*args, **kwargs) 2025-08-14T21:38:56.4375582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4375648Z return func(*args, **kwargs) 2025-08-14T21:38:56.4375892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4375975Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4376215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4376299Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4376582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4376693Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4376945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4377049Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4377243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4377313Z return self.act(input) 2025-08-14T21:38:56.4377317Z 2025-08-14T21:38:56.4377413Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4377607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4377665Z return mod(**inputs) 2025-08-14T21:38:56.4377864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4377937Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4378179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4378242Z outputs = self.layoutlm( 2025-08-14T21:38:56.4378470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4378531Z return func(*args, **kwargs) 2025-08-14T21:38:56.4378756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4378815Z return func(*args, **kwargs) 2025-08-14T21:38:56.4379015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4379086Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4379330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4379394Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4379622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4379683Z return func(*args, **kwargs) 2025-08-14T21:38:56.4379922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4379984Z return func(*args, **kwargs) 2025-08-14T21:38:56.4380202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4380288Z return func(*args, **kwargs) 2025-08-14T21:38:56.4380357Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4380554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4380629Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4380885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4380959Z layer_outputs = layer_module( 2025-08-14T21:38:56.4381161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4381231Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4381454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4381515Z return func(*args, **kwargs) 2025-08-14T21:38:56.4381731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4381817Z return func(*args, **kwargs) 2025-08-14T21:38:56.4382034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4382100Z return func(*args, **kwargs) 2025-08-14T21:38:56.4382344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4382421Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4382667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4382737Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4383017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4383141Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4383385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4383465Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4383469Z 2025-08-14T21:38:56.4383564Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4383746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4383813Z return mod(**inputs) 2025-08-14T21:38:56.4384013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4384088Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4384331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4384395Z outputs = self.layoutlm( 2025-08-14T21:38:56.4384845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4384913Z return func(*args, **kwargs) 2025-08-14T21:38:56.4385146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4385207Z return func(*args, **kwargs) 2025-08-14T21:38:56.4385416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4385495Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4385782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4385854Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4386132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4386222Z return func(*args, **kwargs) 2025-08-14T21:38:56.4386453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4386516Z return func(*args, **kwargs) 2025-08-14T21:38:56.4386762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4386834Z return func(*args, **kwargs) 2025-08-14T21:38:56.4386909Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4387113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4387192Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4387445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4387520Z layer_outputs = layer_module( 2025-08-14T21:38:56.4387756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4387831Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4388061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4388123Z return func(*args, **kwargs) 2025-08-14T21:38:56.4388346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4388414Z return func(*args, **kwargs) 2025-08-14T21:38:56.4388642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4388709Z return func(*args, **kwargs) 2025-08-14T21:38:56.4388965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4389043Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4389272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4389333Z return func(*args, **kwargs) 2025-08-14T21:38:56.4389564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4389624Z return func(*args, **kwargs) 2025-08-14T21:38:56.4389846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4389914Z return func(*args, **kwargs) 2025-08-14T21:38:56.4390167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4390234Z self_outputs = self.self( 2025-08-14T21:38:56.4390463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4390527Z return func(*args, **kwargs) 2025-08-14T21:38:56.4390759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4390820Z return func(*args, **kwargs) 2025-08-14T21:38:56.4391043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4391112Z return func(*args, **kwargs) 2025-08-14T21:38:56.4391363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4391517Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4391528Z 2025-08-14T21:38:56.4391630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4391818Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4391904Z return mod(**inputs) 2025-08-14T21:38:56.4392114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4392183Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4392456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4392523Z outputs = self.layoutlm( 2025-08-14T21:38:56.4392751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4392813Z return func(*args, **kwargs) 2025-08-14T21:38:56.4393032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4393099Z return func(*args, **kwargs) 2025-08-14T21:38:56.4393301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4393389Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4393643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4393709Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4393939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4394001Z return func(*args, **kwargs) 2025-08-14T21:38:56.4394222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4394291Z return func(*args, **kwargs) 2025-08-14T21:38:56.4394509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4394569Z return func(*args, **kwargs) 2025-08-14T21:38:56.4394646Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4394848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4394921Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4395170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4395238Z layer_outputs = layer_module( 2025-08-14T21:38:56.4395449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4395521Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4395742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4395811Z return func(*args, **kwargs) 2025-08-14T21:38:56.4396031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4396101Z return func(*args, **kwargs) 2025-08-14T21:38:56.4396317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4396378Z return func(*args, **kwargs) 2025-08-14T21:38:56.4396636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4396713Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4396939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4397017Z return func(*args, **kwargs) 2025-08-14T21:38:56.4397242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4397310Z return func(*args, **kwargs) 2025-08-14T21:38:56.4397534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4397612Z return func(*args, **kwargs) 2025-08-14T21:38:56.4397884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4397948Z self_outputs = self.self( 2025-08-14T21:38:56.4398187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4398248Z return func(*args, **kwargs) 2025-08-14T21:38:56.4398467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4398533Z return func(*args, **kwargs) 2025-08-14T21:38:56.4398746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4398804Z return func(*args, **kwargs) 2025-08-14T21:38:56.4399052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4399201Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4399205Z 2025-08-14T21:38:56.4399306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4399488Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4399547Z return mod(**inputs) 2025-08-14T21:38:56.4399748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4399815Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4400066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4400135Z outputs = self.layoutlm( 2025-08-14T21:38:56.4400355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4400426Z return func(*args, **kwargs) 2025-08-14T21:38:56.4400645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4400705Z return func(*args, **kwargs) 2025-08-14T21:38:56.4400913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4400981Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4401233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4401299Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4401517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4401585Z return func(*args, **kwargs) 2025-08-14T21:38:56.4401800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4401862Z return func(*args, **kwargs) 2025-08-14T21:38:56.4402084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4402144Z return func(*args, **kwargs) 2025-08-14T21:38:56.4402220Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4402419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4402482Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4402747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4402816Z layer_outputs = layer_module( 2025-08-14T21:38:56.4403018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4403115Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4403332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4403399Z return func(*args, **kwargs) 2025-08-14T21:38:56.4403640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4403703Z return func(*args, **kwargs) 2025-08-14T21:38:56.4403932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4403994Z return func(*args, **kwargs) 2025-08-14T21:38:56.4404246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4404328Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4404543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4404628Z return func(*args, **kwargs) 2025-08-14T21:38:56.4404846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4404905Z return func(*args, **kwargs) 2025-08-14T21:38:56.4405131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4405192Z return func(*args, **kwargs) 2025-08-14T21:38:56.4405448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4405511Z self_outputs = self.self( 2025-08-14T21:38:56.4405730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4405796Z return func(*args, **kwargs) 2025-08-14T21:38:56.4406014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4406075Z return func(*args, **kwargs) 2025-08-14T21:38:56.4406298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4406358Z return func(*args, **kwargs) 2025-08-14T21:38:56.4406611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4406742Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4406747Z 2025-08-14T21:38:56.4406819Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4406895Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4406990Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4407174Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4407242Z return mod(**inputs) 2025-08-14T21:38:56.4407444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4407517Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4407762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4407825Z outputs = self.layoutlm( 2025-08-14T21:38:56.4408050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4408125Z return func(*args, **kwargs) 2025-08-14T21:38:56.4408352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4408413Z return func(*args, **kwargs) 2025-08-14T21:38:56.4408609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4408703Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4408945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4409011Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4409259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4409321Z return func(*args, **kwargs) 2025-08-14T21:38:56.4409548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4409608Z return func(*args, **kwargs) 2025-08-14T21:38:56.4409826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4409893Z return func(*args, **kwargs) 2025-08-14T21:38:56.4409961Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4410174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4410250Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4410491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4410564Z layer_outputs = layer_module( 2025-08-14T21:38:56.4410765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4410837Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4411060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4411120Z return func(*args, **kwargs) 2025-08-14T21:38:56.4411338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4411408Z return func(*args, **kwargs) 2025-08-14T21:38:56.4411622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4411690Z return func(*args, **kwargs) 2025-08-14T21:38:56.4411934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4412011Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4412235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4412297Z return func(*args, **kwargs) 2025-08-14T21:38:56.4412512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4412581Z return func(*args, **kwargs) 2025-08-14T21:38:56.4412796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4412868Z return func(*args, **kwargs) 2025-08-14T21:38:56.4413113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4413232Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4413485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4413562Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4413566Z 2025-08-14T21:38:56.4413682Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4413867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4413926Z return mod(**inputs) 2025-08-14T21:38:56.4414132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4414217Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4414462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4414534Z outputs = self.layoutlm( 2025-08-14T21:38:56.4414770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4414840Z return func(*args, **kwargs) 2025-08-14T21:38:56.4415058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4415121Z return func(*args, **kwargs) 2025-08-14T21:38:56.4415325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4415391Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4415644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4415727Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4415944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4416011Z return func(*args, **kwargs) 2025-08-14T21:38:56.4416228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4416288Z return func(*args, **kwargs) 2025-08-14T21:38:56.4416514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4416574Z return func(*args, **kwargs) 2025-08-14T21:38:56.4416650Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4416849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4416914Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4417171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4417236Z layer_outputs = layer_module( 2025-08-14T21:38:56.4417438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4417518Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4417739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4417809Z return func(*args, **kwargs) 2025-08-14T21:38:56.4418027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4418089Z return func(*args, **kwargs) 2025-08-14T21:38:56.4418312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4418375Z return func(*args, **kwargs) 2025-08-14T21:38:56.4418619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4418704Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4418942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4419018Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4419305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4419418Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4419670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4419743Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4419762Z 2025-08-14T21:38:56.4419865Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4420049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4420109Z return mod(**inputs) 2025-08-14T21:38:56.4420332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4420401Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4420646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4420718Z outputs = self.layoutlm( 2025-08-14T21:38:56.4420933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4421000Z return func(*args, **kwargs) 2025-08-14T21:38:56.4421215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4421293Z return func(*args, **kwargs) 2025-08-14T21:38:56.4421499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4421566Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4421811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4421885Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4422103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4422170Z return func(*args, **kwargs) 2025-08-14T21:38:56.4422389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4422448Z return func(*args, **kwargs) 2025-08-14T21:38:56.4422675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4422736Z return func(*args, **kwargs) 2025-08-14T21:38:56.4422812Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4423012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4423079Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4423333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4423400Z layer_outputs = layer_module( 2025-08-14T21:38:56.4423605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4423684Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4423903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4423974Z return func(*args, **kwargs) 2025-08-14T21:38:56.4424191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4424252Z return func(*args, **kwargs) 2025-08-14T21:38:56.4424479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4424539Z return func(*args, **kwargs) 2025-08-14T21:38:56.4424872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4424967Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4425210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4425290Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4425593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4425706Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4425983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4426093Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4426299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4426364Z return self.act(input) 2025-08-14T21:38:56.4426369Z 2025-08-14T21:38:56.4426466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4426660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4426721Z return mod(**inputs) 2025-08-14T21:38:56.4426922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4427021Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4427266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4427339Z outputs = self.layoutlm( 2025-08-14T21:38:56.4427560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4427622Z return func(*args, **kwargs) 2025-08-14T21:38:56.4427854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4427917Z return func(*args, **kwargs) 2025-08-14T21:38:56.4428117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4428196Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4428441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4428514Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4428732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4428794Z return func(*args, **kwargs) 2025-08-14T21:38:56.4429020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4429081Z return func(*args, **kwargs) 2025-08-14T21:38:56.4429304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4429364Z return func(*args, **kwargs) 2025-08-14T21:38:56.4429435Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4429639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4429708Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4429957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4430031Z layer_outputs = layer_module( 2025-08-14T21:38:56.4430234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4430314Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4430549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4430612Z return func(*args, **kwargs) 2025-08-14T21:38:56.4430839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4430900Z return func(*args, **kwargs) 2025-08-14T21:38:56.4431119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4431214Z return func(*args, **kwargs) 2025-08-14T21:38:56.4431460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4431558Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4431795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4431863Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4432144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4432266Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4432513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4432604Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4432607Z 2025-08-14T21:38:56.4432701Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4432889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4432950Z return mod(**inputs) 2025-08-14T21:38:56.4433154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4433230Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4433477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4433550Z outputs = self.layoutlm( 2025-08-14T21:38:56.4433774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4433837Z return func(*args, **kwargs) 2025-08-14T21:38:56.4434069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4434132Z return func(*args, **kwargs) 2025-08-14T21:38:56.4434332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4434411Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4434659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4434736Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4434960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4435024Z return func(*args, **kwargs) 2025-08-14T21:38:56.4435249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4435314Z return func(*args, **kwargs) 2025-08-14T21:38:56.4435535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4435605Z return func(*args, **kwargs) 2025-08-14T21:38:56.4435677Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4435888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4435956Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4436219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4436293Z layer_outputs = layer_module( 2025-08-14T21:38:56.4436495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4436575Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4436813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4436874Z return func(*args, **kwargs) 2025-08-14T21:38:56.4437101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4437175Z return func(*args, **kwargs) 2025-08-14T21:38:56.4437394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4437462Z return func(*args, **kwargs) 2025-08-14T21:38:56.4437710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4437794Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4438011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4438073Z return func(*args, **kwargs) 2025-08-14T21:38:56.4438318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4438379Z return func(*args, **kwargs) 2025-08-14T21:38:56.4438597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4438664Z return func(*args, **kwargs) 2025-08-14T21:38:56.4438909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4438979Z self_outputs = self.self( 2025-08-14T21:38:56.4439200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4439260Z return func(*args, **kwargs) 2025-08-14T21:38:56.4439485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4439547Z return func(*args, **kwargs) 2025-08-14T21:38:56.4439768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4439839Z return func(*args, **kwargs) 2025-08-14T21:38:56.4440091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4440234Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4440238Z 2025-08-14T21:38:56.4440335Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4440519Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4440589Z return mod(**inputs) 2025-08-14T21:38:56.4440791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4440870Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4441119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4441182Z outputs = self.layoutlm( 2025-08-14T21:38:56.4441409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4441472Z return func(*args, **kwargs) 2025-08-14T21:38:56.4441691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4441760Z return func(*args, **kwargs) 2025-08-14T21:38:56.4441972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4442048Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4442293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4442405Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4442631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4442692Z return func(*args, **kwargs) 2025-08-14T21:38:56.4442924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4442994Z return func(*args, **kwargs) 2025-08-14T21:38:56.4443214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4443284Z return func(*args, **kwargs) 2025-08-14T21:38:56.4443355Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4443554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4443628Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4443887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4443958Z layer_outputs = layer_module( 2025-08-14T21:38:56.4444160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4444236Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4444461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4444523Z return func(*args, **kwargs) 2025-08-14T21:38:56.4444740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4444810Z return func(*args, **kwargs) 2025-08-14T21:38:56.4445026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4445099Z return func(*args, **kwargs) 2025-08-14T21:38:56.4445341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4445416Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4445642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4445703Z return func(*args, **kwargs) 2025-08-14T21:38:56.4445919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4445989Z return func(*args, **kwargs) 2025-08-14T21:38:56.4446203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4446270Z return func(*args, **kwargs) 2025-08-14T21:38:56.4446514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4446581Z self_outputs = self.self( 2025-08-14T21:38:56.4446805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4446864Z return func(*args, **kwargs) 2025-08-14T21:38:56.4447080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4447146Z return func(*args, **kwargs) 2025-08-14T21:38:56.4447361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4447440Z return func(*args, **kwargs) 2025-08-14T21:38:56.4447687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4447813Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4448124Z 2025-08-14T21:38:56.4448232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4448416Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4448484Z return mod(**inputs) 2025-08-14T21:38:56.4448699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4448770Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4449022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4449087Z outputs = self.layoutlm( 2025-08-14T21:38:56.4449301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4449369Z return func(*args, **kwargs) 2025-08-14T21:38:56.4449586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4449672Z return func(*args, **kwargs) 2025-08-14T21:38:56.4449875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4449942Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4450203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4450271Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4450497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4450567Z return func(*args, **kwargs) 2025-08-14T21:38:56.4450787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4450855Z return func(*args, **kwargs) 2025-08-14T21:38:56.4451075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4451138Z return func(*args, **kwargs) 2025-08-14T21:38:56.4451214Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4451416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4451483Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4451739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4451804Z layer_outputs = layer_module( 2025-08-14T21:38:56.4452018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4452089Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4452310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4452381Z return func(*args, **kwargs) 2025-08-14T21:38:56.4452601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4452667Z return func(*args, **kwargs) 2025-08-14T21:38:56.4452887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4452947Z return func(*args, **kwargs) 2025-08-14T21:38:56.4453203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4453305Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4453529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4453598Z return func(*args, **kwargs) 2025-08-14T21:38:56.4453817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4453903Z return func(*args, **kwargs) 2025-08-14T21:38:56.4454125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4454187Z return func(*args, **kwargs) 2025-08-14T21:38:56.4454459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4454527Z self_outputs = self.self( 2025-08-14T21:38:56.4454748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4454817Z return func(*args, **kwargs) 2025-08-14T21:38:56.4455038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4455109Z return func(*args, **kwargs) 2025-08-14T21:38:56.4455331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4455407Z return func(*args, **kwargs) 2025-08-14T21:38:56.4455661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4455796Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4455799Z 2025-08-14T21:38:56.4455879Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4455949Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4456043Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4456232Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4456292Z return mod(**inputs) 2025-08-14T21:38:56.4456489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4456567Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4456811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4456883Z outputs = self.layoutlm( 2025-08-14T21:38:56.4457104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4457164Z return func(*args, **kwargs) 2025-08-14T21:38:56.4457388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4457449Z return func(*args, **kwargs) 2025-08-14T21:38:56.4457647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4457723Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4457965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4458039Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4458257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4458317Z return func(*args, **kwargs) 2025-08-14T21:38:56.4458541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4458601Z return func(*args, **kwargs) 2025-08-14T21:38:56.4458817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4458908Z return func(*args, **kwargs) 2025-08-14T21:38:56.4458980Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4459187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4459251Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4459514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4459587Z layer_outputs = layer_module( 2025-08-14T21:38:56.4459790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4459877Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4460106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4460167Z return func(*args, **kwargs) 2025-08-14T21:38:56.4460392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4460454Z return func(*args, **kwargs) 2025-08-14T21:38:56.4460669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4460753Z return func(*args, **kwargs) 2025-08-14T21:38:56.4460998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4461079Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4461295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4461355Z return func(*args, **kwargs) 2025-08-14T21:38:56.4461580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4461642Z return func(*args, **kwargs) 2025-08-14T21:38:56.4461859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4461928Z return func(*args, **kwargs) 2025-08-14T21:38:56.4462171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4462300Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4462545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4462621Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4462625Z 2025-08-14T21:38:56.4462725Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4462907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4462972Z return mod(**inputs) 2025-08-14T21:38:56.4463171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4463239Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4463489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4463555Z outputs = self.layoutlm( 2025-08-14T21:38:56.4463773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4463840Z return func(*args, **kwargs) 2025-08-14T21:38:56.4464058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4464125Z return func(*args, **kwargs) 2025-08-14T21:38:56.4464323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4464406Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4464661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4464728Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4465006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4465107Z return func(*args, **kwargs) 2025-08-14T21:38:56.4465325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4465392Z return func(*args, **kwargs) 2025-08-14T21:38:56.4465629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4465693Z return func(*args, **kwargs) 2025-08-14T21:38:56.4465775Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4465981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4466048Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4466306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4466390Z layer_outputs = layer_module( 2025-08-14T21:38:56.4466601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4466674Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4466893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4466963Z return func(*args, **kwargs) 2025-08-14T21:38:56.4467182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4467243Z return func(*args, **kwargs) 2025-08-14T21:38:56.4467475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4467539Z return func(*args, **kwargs) 2025-08-14T21:38:56.4467793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4467874Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4468112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4468189Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4468464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4468582Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4468828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4468902Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4468905Z 2025-08-14T21:38:56.4469007Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4469189Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4469260Z return mod(**inputs) 2025-08-14T21:38:56.4469458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4469524Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4469778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4469842Z outputs = self.layoutlm( 2025-08-14T21:38:56.4470061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4470145Z return func(*args, **kwargs) 2025-08-14T21:38:56.4470370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4470437Z return func(*args, **kwargs) 2025-08-14T21:38:56.4470636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4470722Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4470971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4471039Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4471275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4471346Z return func(*args, **kwargs) 2025-08-14T21:38:56.4471567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4471635Z return func(*args, **kwargs) 2025-08-14T21:38:56.4471858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4471918Z return func(*args, **kwargs) 2025-08-14T21:38:56.4472019Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4472219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4472283Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4472537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4472601Z layer_outputs = layer_module( 2025-08-14T21:38:56.4472810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4472883Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4473099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4473168Z return func(*args, **kwargs) 2025-08-14T21:38:56.4473384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4473447Z return func(*args, **kwargs) 2025-08-14T21:38:56.4473669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4473731Z return func(*args, **kwargs) 2025-08-14T21:38:56.4473985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4474060Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4474295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4474372Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4474647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4474764Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4475013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4475115Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4475318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4475381Z return self.act(input) 2025-08-14T21:38:56.4475384Z 2025-08-14T21:38:56.4475478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4475684Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4475746Z return mod(**inputs) 2025-08-14T21:38:56.4475949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4476016Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4476257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4476348Z outputs = self.layoutlm( 2025-08-14T21:38:56.4476570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4476640Z return func(*args, **kwargs) 2025-08-14T21:38:56.4476874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4476936Z return func(*args, **kwargs) 2025-08-14T21:38:56.4477144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4477210Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4477454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4477528Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4477762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4477829Z return func(*args, **kwargs) 2025-08-14T21:38:56.4478044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4478105Z return func(*args, **kwargs) 2025-08-14T21:38:56.4478328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4478387Z return func(*args, **kwargs) 2025-08-14T21:38:56.4478458Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4478663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4478728Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4478982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4479049Z layer_outputs = layer_module( 2025-08-14T21:38:56.4479249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4479329Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4479547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4479610Z return func(*args, **kwargs) 2025-08-14T21:38:56.4479835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4479895Z return func(*args, **kwargs) 2025-08-14T21:38:56.4480117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4480177Z return func(*args, **kwargs) 2025-08-14T21:38:56.4480420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4480505Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4480740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4480820Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4481092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4481214Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4481482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4481558Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4481562Z 2025-08-14T21:38:56.4481658Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4481866Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4481926Z return mod(**inputs) 2025-08-14T21:38:56.4482135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4482201Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4482461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4482536Z outputs = self.layoutlm( 2025-08-14T21:38:56.4482756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4482823Z return func(*args, **kwargs) 2025-08-14T21:38:56.4483039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4483099Z return func(*args, **kwargs) 2025-08-14T21:38:56.4483323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4483391Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4483639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4483713Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4483934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4484003Z return func(*args, **kwargs) 2025-08-14T21:38:56.4484225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4484287Z return func(*args, **kwargs) 2025-08-14T21:38:56.4484515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4484746Z return func(*args, **kwargs) 2025-08-14T21:38:56.4484827Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4485042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4485111Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4485374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4485442Z layer_outputs = layer_module( 2025-08-14T21:38:56.4485653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4485739Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4485965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4486029Z return func(*args, **kwargs) 2025-08-14T21:38:56.4486259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4486325Z return func(*args, **kwargs) 2025-08-14T21:38:56.4486566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4486627Z return func(*args, **kwargs) 2025-08-14T21:38:56.4486871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4486957Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4487209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4487272Z return func(*args, **kwargs) 2025-08-14T21:38:56.4487503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4487562Z return func(*args, **kwargs) 2025-08-14T21:38:56.4487813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4487872Z return func(*args, **kwargs) 2025-08-14T21:38:56.4488140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4488215Z self_outputs = self.self( 2025-08-14T21:38:56.4488434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4488503Z return func(*args, **kwargs) 2025-08-14T21:38:56.4488728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4488790Z return func(*args, **kwargs) 2025-08-14T21:38:56.4489014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4489098Z return func(*args, **kwargs) 2025-08-14T21:38:56.4489344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4489483Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4489487Z 2025-08-14T21:38:56.4489584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4489772Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4489832Z return mod(**inputs) 2025-08-14T21:38:56.4490033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4490108Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4490352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4490424Z outputs = self.layoutlm( 2025-08-14T21:38:56.4490644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4490704Z return func(*args, **kwargs) 2025-08-14T21:38:56.4490931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4490991Z return func(*args, **kwargs) 2025-08-14T21:38:56.4491189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4491264Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4491510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4491585Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4491805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4491868Z return func(*args, **kwargs) 2025-08-14T21:38:56.4492091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4492152Z return func(*args, **kwargs) 2025-08-14T21:38:56.4492371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4492439Z return func(*args, **kwargs) 2025-08-14T21:38:56.4492508Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4492725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4492794Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4493037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4493110Z layer_outputs = layer_module( 2025-08-14T21:38:56.4493330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4493404Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4493632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4493716Z return func(*args, **kwargs) 2025-08-14T21:38:56.4493943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4494005Z return func(*args, **kwargs) 2025-08-14T21:38:56.4494225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4494293Z return func(*args, **kwargs) 2025-08-14T21:38:56.4494541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4494634Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4494859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4494921Z return func(*args, **kwargs) 2025-08-14T21:38:56.4495148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4495208Z return func(*args, **kwargs) 2025-08-14T21:38:56.4495421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4495490Z return func(*args, **kwargs) 2025-08-14T21:38:56.4495736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4495809Z self_outputs = self.self( 2025-08-14T21:38:56.4496024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4496086Z return func(*args, **kwargs) 2025-08-14T21:38:56.4496310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4496370Z return func(*args, **kwargs) 2025-08-14T21:38:56.4496587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4496653Z return func(*args, **kwargs) 2025-08-14T21:38:56.4496897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4497033Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4497037Z 2025-08-14T21:38:56.4497131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4497311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4497382Z return mod(**inputs) 2025-08-14T21:38:56.4497579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4497645Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4497896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4497960Z outputs = self.layoutlm( 2025-08-14T21:38:56.4498185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4498261Z return func(*args, **kwargs) 2025-08-14T21:38:56.4498480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4498549Z return func(*args, **kwargs) 2025-08-14T21:38:56.4498747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4498844Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4499089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4499154Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4499396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4499458Z return func(*args, **kwargs) 2025-08-14T21:38:56.4499676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4499744Z return func(*args, **kwargs) 2025-08-14T21:38:56.4499964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4500033Z return func(*args, **kwargs) 2025-08-14T21:38:56.4500104Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4500320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4500392Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4500636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4500702Z layer_outputs = layer_module( 2025-08-14T21:38:56.4500911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4500983Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4501210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4501271Z return func(*args, **kwargs) 2025-08-14T21:38:56.4501485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4501555Z return func(*args, **kwargs) 2025-08-14T21:38:56.4501773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4501832Z return func(*args, **kwargs) 2025-08-14T21:38:56.4502088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4502164Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4502389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4502451Z return func(*args, **kwargs) 2025-08-14T21:38:56.4502668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4502734Z return func(*args, **kwargs) 2025-08-14T21:38:56.4502954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4503024Z return func(*args, **kwargs) 2025-08-14T21:38:56.4503268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4503331Z self_outputs = self.self( 2025-08-14T21:38:56.4503557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4503617Z return func(*args, **kwargs) 2025-08-14T21:38:56.4503847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4503915Z return func(*args, **kwargs) 2025-08-14T21:38:56.4504133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4504201Z return func(*args, **kwargs) 2025-08-14T21:38:56.4504448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4504597Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4504601Z 2025-08-14T21:38:56.4504681Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4504803Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4504925Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4505119Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4505178Z return mod(**inputs) 2025-08-14T21:38:56.4505387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4505457Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4505702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4505792Z outputs = self.layoutlm( 2025-08-14T21:38:56.4506015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4506076Z return func(*args, **kwargs) 2025-08-14T21:38:56.4506306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4506367Z return func(*args, **kwargs) 2025-08-14T21:38:56.4506576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4506642Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4506893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4506999Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4507222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4507294Z return func(*args, **kwargs) 2025-08-14T21:38:56.4507516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4507577Z return func(*args, **kwargs) 2025-08-14T21:38:56.4507811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4507872Z return func(*args, **kwargs) 2025-08-14T21:38:56.4507941Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4508153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4508217Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4508472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4508537Z layer_outputs = layer_module( 2025-08-14T21:38:56.4508745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4508830Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4509053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4509115Z return func(*args, **kwargs) 2025-08-14T21:38:56.4509345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4509407Z return func(*args, **kwargs) 2025-08-14T21:38:56.4509656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4509718Z return func(*args, **kwargs) 2025-08-14T21:38:56.4509964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4510071Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4510288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4510347Z return func(*args, **kwargs) 2025-08-14T21:38:56.4510587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4510647Z return func(*args, **kwargs) 2025-08-14T21:38:56.4510873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4510935Z return func(*args, **kwargs) 2025-08-14T21:38:56.4511180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4511306Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4511550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4511651Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4511655Z 2025-08-14T21:38:56.4511752Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4511937Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4512006Z return mod(**inputs) 2025-08-14T21:38:56.4512211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4512276Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4512531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4512594Z outputs = self.layoutlm( 2025-08-14T21:38:56.4512823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4512886Z return func(*args, **kwargs) 2025-08-14T21:38:56.4513105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4513174Z return func(*args, **kwargs) 2025-08-14T21:38:56.4513375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4513443Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4513698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4513765Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4513992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4514052Z return func(*args, **kwargs) 2025-08-14T21:38:56.4514270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4514340Z return func(*args, **kwargs) 2025-08-14T21:38:56.4514562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4514630Z return func(*args, **kwargs) 2025-08-14T21:38:56.4514700Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4514901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4514976Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4515239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4515304Z layer_outputs = layer_module( 2025-08-14T21:38:56.4515513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4515610Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4515836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4515897Z return func(*args, **kwargs) 2025-08-14T21:38:56.4516127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4516196Z return func(*args, **kwargs) 2025-08-14T21:38:56.4516413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4516472Z return func(*args, **kwargs) 2025-08-14T21:38:56.4516725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4516801Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4517045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4517133Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4517405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4517525Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4517770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4517851Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4517855Z 2025-08-14T21:38:56.4517950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4518132Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4518198Z return mod(**inputs) 2025-08-14T21:38:56.4518396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4518466Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4518719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4518782Z outputs = self.layoutlm( 2025-08-14T21:38:56.4519007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4519068Z return func(*args, **kwargs) 2025-08-14T21:38:56.4519286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4519354Z return func(*args, **kwargs) 2025-08-14T21:38:56.4519552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4519620Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4519867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4519936Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4520162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4520223Z return func(*args, **kwargs) 2025-08-14T21:38:56.4520439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4520507Z return func(*args, **kwargs) 2025-08-14T21:38:56.4520742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4520812Z return func(*args, **kwargs) 2025-08-14T21:38:56.4520883Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4521083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4521178Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4521424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4521489Z layer_outputs = layer_module( 2025-08-14T21:38:56.4521717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4521792Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4522019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4522082Z return func(*args, **kwargs) 2025-08-14T21:38:56.4522299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4522367Z return func(*args, **kwargs) 2025-08-14T21:38:56.4522585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4522663Z return func(*args, **kwargs) 2025-08-14T21:38:56.4522917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4522995Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4523244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4523314Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4523589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4523707Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4523950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4524059Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4524257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4524321Z return self.act(input) 2025-08-14T21:38:56.4524324Z 2025-08-14T21:38:56.4524426Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4524614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4524674Z return mod(**inputs) 2025-08-14T21:38:56.4524883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4524951Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4525202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4525266Z outputs = self.layoutlm( 2025-08-14T21:38:56.4525484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4525555Z return func(*args, **kwargs) 2025-08-14T21:38:56.4525775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4525837Z return func(*args, **kwargs) 2025-08-14T21:38:56.4526044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4526114Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4526382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4526452Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4526670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4526739Z return func(*args, **kwargs) 2025-08-14T21:38:56.4526975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4527037Z return func(*args, **kwargs) 2025-08-14T21:38:56.4527262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4527339Z return func(*args, **kwargs) 2025-08-14T21:38:56.4527418Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4527618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4527686Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4527941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4528005Z layer_outputs = layer_module( 2025-08-14T21:38:56.4528216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4528306Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4528525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4528592Z return func(*args, **kwargs) 2025-08-14T21:38:56.4528813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4528874Z return func(*args, **kwargs) 2025-08-14T21:38:56.4529098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4529161Z return func(*args, **kwargs) 2025-08-14T21:38:56.4529411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4529487Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4529721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4529799Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4530073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4530194Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4530442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4530516Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4530521Z 2025-08-14T21:38:56.4530622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4530800Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4530859Z return mod(**inputs) 2025-08-14T21:38:56.4531064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4531133Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4531382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4531444Z outputs = self.layoutlm( 2025-08-14T21:38:56.4531663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4531731Z return func(*args, **kwargs) 2025-08-14T21:38:56.4531987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4532051Z return func(*args, **kwargs) 2025-08-14T21:38:56.4532258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4532325Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4532592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4532659Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4532876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4532958Z return func(*args, **kwargs) 2025-08-14T21:38:56.4533179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4533240Z return func(*args, **kwargs) 2025-08-14T21:38:56.4533467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4533527Z return func(*args, **kwargs) 2025-08-14T21:38:56.4533605Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4533805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4533889Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4534142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4534208Z layer_outputs = layer_module( 2025-08-14T21:38:56.4534410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4534490Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4534710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4534779Z return func(*args, **kwargs) 2025-08-14T21:38:56.4534994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4535054Z return func(*args, **kwargs) 2025-08-14T21:38:56.4535276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4535339Z return func(*args, **kwargs) 2025-08-14T21:38:56.4535591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4535669Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4535888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4535957Z return func(*args, **kwargs) 2025-08-14T21:38:56.4536175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4536236Z return func(*args, **kwargs) 2025-08-14T21:38:56.4536460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4536520Z return func(*args, **kwargs) 2025-08-14T21:38:56.4536773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4536837Z self_outputs = self.self( 2025-08-14T21:38:56.4537053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4537121Z return func(*args, **kwargs) 2025-08-14T21:38:56.4537335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4537395Z return func(*args, **kwargs) 2025-08-14T21:38:56.4537640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4537702Z return func(*args, **kwargs) 2025-08-14T21:38:56.4537958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4538109Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4538112Z 2025-08-14T21:38:56.4538208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4538398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4538472Z return mod(**inputs) 2025-08-14T21:38:56.4538679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4538747Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4538995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4539066Z outputs = self.layoutlm( 2025-08-14T21:38:56.4539282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4539342Z return func(*args, **kwargs) 2025-08-14T21:38:56.4539583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4539644Z return func(*args, **kwargs) 2025-08-14T21:38:56.4539847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4539915Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4540159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4540235Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4540452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4540514Z return func(*args, **kwargs) 2025-08-14T21:38:56.4540740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4540806Z return func(*args, **kwargs) 2025-08-14T21:38:56.4541032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4541094Z return func(*args, **kwargs) 2025-08-14T21:38:56.4541164Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4541370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4541437Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4541682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4541755Z layer_outputs = layer_module( 2025-08-14T21:38:56.4541957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4542038Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4542257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4542319Z return func(*args, **kwargs) 2025-08-14T21:38:56.4542541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4542605Z return func(*args, **kwargs) 2025-08-14T21:38:56.4542828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4542890Z return func(*args, **kwargs) 2025-08-14T21:38:56.4543147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4543233Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4543450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4543529Z return func(*args, **kwargs) 2025-08-14T21:38:56.4543758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4543818Z return func(*args, **kwargs) 2025-08-14T21:38:56.4544058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4544121Z return func(*args, **kwargs) 2025-08-14T21:38:56.4544369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4544440Z self_outputs = self.self( 2025-08-14T21:38:56.4544659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4544721Z return func(*args, **kwargs) 2025-08-14T21:38:56.4545007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4545091Z return func(*args, **kwargs) 2025-08-14T21:38:56.4545318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4545381Z return func(*args, **kwargs) 2025-08-14T21:38:56.4545628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4545765Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4545770Z 2025-08-14T21:38:56.4545866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4546060Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4546121Z return mod(**inputs) 2025-08-14T21:38:56.4546321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4546399Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4546644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4546707Z outputs = self.layoutlm( 2025-08-14T21:38:56.4546936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4547000Z return func(*args, **kwargs) 2025-08-14T21:38:56.4547226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4547287Z return func(*args, **kwargs) 2025-08-14T21:38:56.4547487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4547564Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4547813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4547886Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4548115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4548176Z return func(*args, **kwargs) 2025-08-14T21:38:56.4548404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4548465Z return func(*args, **kwargs) 2025-08-14T21:38:56.4548682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4548763Z return func(*args, **kwargs) 2025-08-14T21:38:56.4548835Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4549034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4549109Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4549369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4549444Z layer_outputs = layer_module( 2025-08-14T21:38:56.4549648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4549737Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4549964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4550026Z return func(*args, **kwargs) 2025-08-14T21:38:56.4550251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4550313Z return func(*args, **kwargs) 2025-08-14T21:38:56.4550533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4550616Z return func(*args, **kwargs) 2025-08-14T21:38:56.4550865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4550939Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4551169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4551230Z return func(*args, **kwargs) 2025-08-14T21:38:56.4551457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4551519Z return func(*args, **kwargs) 2025-08-14T21:38:56.4551742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4551809Z return func(*args, **kwargs) 2025-08-14T21:38:56.4552057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4552123Z self_outputs = self.self( 2025-08-14T21:38:56.4552350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4552410Z return func(*args, **kwargs) 2025-08-14T21:38:56.4552640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4552700Z return func(*args, **kwargs) 2025-08-14T21:38:56.4552919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4552988Z return func(*args, **kwargs) 2025-08-14T21:38:56.4553237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4553380Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4553385Z 2025-08-14T21:38:56.4553457Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4553529Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4553630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4553816Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4553877Z return mod(**inputs) 2025-08-14T21:38:56.4554087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4554154Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4554422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4554490Z outputs = self.layoutlm( 2025-08-14T21:38:56.4554710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4554799Z return func(*args, **kwargs) 2025-08-14T21:38:56.4555017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4555078Z return func(*args, **kwargs) 2025-08-14T21:38:56.4555297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4555366Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4555622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4555689Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4555908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4555975Z return func(*args, **kwargs) 2025-08-14T21:38:56.4556195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4556283Z return func(*args, **kwargs) 2025-08-14T21:38:56.4556507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4556566Z return func(*args, **kwargs) 2025-08-14T21:38:56.4556641Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4556842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4556905Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4557159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4557224Z layer_outputs = layer_module( 2025-08-14T21:38:56.4557426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4557503Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4557724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4557790Z return func(*args, **kwargs) 2025-08-14T21:38:56.4558008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4558069Z return func(*args, **kwargs) 2025-08-14T21:38:56.4558295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4558354Z return func(*args, **kwargs) 2025-08-14T21:38:56.4558609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4558685Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4558903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4558972Z return func(*args, **kwargs) 2025-08-14T21:38:56.4559190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4559249Z return func(*args, **kwargs) 2025-08-14T21:38:56.4559475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4559535Z return func(*args, **kwargs) 2025-08-14T21:38:56.4559788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4559923Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4560169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4560253Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4560257Z 2025-08-14T21:38:56.4560371Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4560562Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4560622Z return mod(**inputs) 2025-08-14T21:38:56.4560824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4560913Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4561160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4561222Z outputs = self.layoutlm( 2025-08-14T21:38:56.4561450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4561511Z return func(*args, **kwargs) 2025-08-14T21:38:56.4561735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4561810Z return func(*args, **kwargs) 2025-08-14T21:38:56.4562007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4562082Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4562330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4562397Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4562622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4562685Z return func(*args, **kwargs) 2025-08-14T21:38:56.4562912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4562973Z return func(*args, **kwargs) 2025-08-14T21:38:56.4563195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4563268Z return func(*args, **kwargs) 2025-08-14T21:38:56.4563338Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4563536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4563612Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4563856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4563928Z layer_outputs = layer_module( 2025-08-14T21:38:56.4564131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4564203Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4564425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4564487Z return func(*args, **kwargs) 2025-08-14T21:38:56.4564705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4564773Z return func(*args, **kwargs) 2025-08-14T21:38:56.4564991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4565058Z return func(*args, **kwargs) 2025-08-14T21:38:56.4565300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4565391Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4565639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4565710Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4565992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4566124Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4566372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4566471Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4566474Z 2025-08-14T21:38:56.4566571Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4566753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4566821Z return mod(**inputs) 2025-08-14T21:38:56.4567019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4567093Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4567336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4567418Z outputs = self.layoutlm( 2025-08-14T21:38:56.4567649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4567710Z return func(*args, **kwargs) 2025-08-14T21:38:56.4567938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4567997Z return func(*args, **kwargs) 2025-08-14T21:38:56.4568197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4568274Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4568521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4568587Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4568815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4568878Z return func(*args, **kwargs) 2025-08-14T21:38:56.4569106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4569167Z return func(*args, **kwargs) 2025-08-14T21:38:56.4569387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4569453Z return func(*args, **kwargs) 2025-08-14T21:38:56.4569523Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4569725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4569796Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4570045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4570119Z layer_outputs = layer_module( 2025-08-14T21:38:56.4570322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4570393Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4570623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4570684Z return func(*args, **kwargs) 2025-08-14T21:38:56.4570904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4570971Z return func(*args, **kwargs) 2025-08-14T21:38:56.4571206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4571275Z return func(*args, **kwargs) 2025-08-14T21:38:56.4571523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4571618Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4571866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4571935Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4572233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4572347Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4572593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4572705Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4572902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4572966Z return self.act(input) 2025-08-14T21:38:56.4572999Z 2025-08-14T21:38:56.4573104Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4573287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4573351Z return mod(**inputs) 2025-08-14T21:38:56.4573551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4573618Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4573867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4573933Z outputs = self.layoutlm( 2025-08-14T21:38:56.4574151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4574220Z return func(*args, **kwargs) 2025-08-14T21:38:56.4574437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4574508Z return func(*args, **kwargs) 2025-08-14T21:38:56.4574706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4574773Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4575027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4575093Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4575318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4575379Z return func(*args, **kwargs) 2025-08-14T21:38:56.4575596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4575664Z return func(*args, **kwargs) 2025-08-14T21:38:56.4575887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4575947Z return func(*args, **kwargs) 2025-08-14T21:38:56.4576027Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4576227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4576304Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4576551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4576634Z layer_outputs = layer_module( 2025-08-14T21:38:56.4576847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4576919Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4577135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4577223Z return func(*args, **kwargs) 2025-08-14T21:38:56.4577438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4577506Z return func(*args, **kwargs) 2025-08-14T21:38:56.4577749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4577811Z return func(*args, **kwargs) 2025-08-14T21:38:56.4578066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4578143Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4578380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4578456Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4578732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4578878Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4579121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4579197Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4579200Z 2025-08-14T21:38:56.4579300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4579482Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4579546Z return mod(**inputs) 2025-08-14T21:38:56.4579744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4579810Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4580059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4580124Z outputs = self.layoutlm( 2025-08-14T21:38:56.4580343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4580411Z return func(*args, **kwargs) 2025-08-14T21:38:56.4580628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4580695Z return func(*args, **kwargs) 2025-08-14T21:38:56.4580892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4580959Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4581211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4581277Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4581506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4581567Z return func(*args, **kwargs) 2025-08-14T21:38:56.4581782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4581850Z return func(*args, **kwargs) 2025-08-14T21:38:56.4582066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4582126Z return func(*args, **kwargs) 2025-08-14T21:38:56.4582218Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4582419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4582492Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4582736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4582818Z layer_outputs = layer_module( 2025-08-14T21:38:56.4583026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4583098Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4583332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4583402Z return func(*args, **kwargs) 2025-08-14T21:38:56.4583623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4583690Z return func(*args, **kwargs) 2025-08-14T21:38:56.4583909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4583969Z return func(*args, **kwargs) 2025-08-14T21:38:56.4584224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4584315Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4584534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4584795Z return func(*args, **kwargs) 2025-08-14T21:38:56.4585027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4585096Z return func(*args, **kwargs) 2025-08-14T21:38:56.4585369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4585433Z return func(*args, **kwargs) 2025-08-14T21:38:56.4585696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4585764Z self_outputs = self.self( 2025-08-14T21:38:56.4586001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4586077Z return func(*args, **kwargs) 2025-08-14T21:38:56.4586301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4586374Z return func(*args, **kwargs) 2025-08-14T21:38:56.4586596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4586659Z return func(*args, **kwargs) 2025-08-14T21:38:56.4586916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 191, in forward 2025-08-14T21:38:56.4587055Z query_states = self.query(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4587059Z 2025-08-14T21:38:56.4587164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4587353Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4587414Z return mod(**inputs) 2025-08-14T21:38:56.4587622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4587691Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4587934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4588007Z outputs = self.layoutlm( 2025-08-14T21:38:56.4588262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4588333Z return func(*args, **kwargs) 2025-08-14T21:38:56.4588558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4588621Z return func(*args, **kwargs) 2025-08-14T21:38:56.4588859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4588927Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4589185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4589276Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4589506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4589577Z return func(*args, **kwargs) 2025-08-14T21:38:56.4589806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4589871Z return func(*args, **kwargs) 2025-08-14T21:38:56.4590106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4590193Z return func(*args, **kwargs) 2025-08-14T21:38:56.4590276Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4590479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4590547Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4590810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4590878Z layer_outputs = layer_module( 2025-08-14T21:38:56.4591089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4591172Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4591399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4591468Z return func(*args, **kwargs) 2025-08-14T21:38:56.4591694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4591757Z return func(*args, **kwargs) 2025-08-14T21:38:56.4591987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4592051Z return func(*args, **kwargs) 2025-08-14T21:38:56.4592303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4592390Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4592615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4592682Z return func(*args, **kwargs) 2025-08-14T21:38:56.4592907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4592972Z return func(*args, **kwargs) 2025-08-14T21:38:56.4593206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4593269Z return func(*args, **kwargs) 2025-08-14T21:38:56.4593531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4593596Z self_outputs = self.self( 2025-08-14T21:38:56.4593820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4593889Z return func(*args, **kwargs) 2025-08-14T21:38:56.4594127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4594193Z return func(*args, **kwargs) 2025-08-14T21:38:56.4594425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4594504Z return func(*args, **kwargs) 2025-08-14T21:38:56.4594763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 192, in forward 2025-08-14T21:38:56.4594895Z key_states = self.key(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4594899Z 2025-08-14T21:38:56.4595013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4595209Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4595269Z return mod(**inputs) 2025-08-14T21:38:56.4595478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4595555Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4595808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4595898Z outputs = self.layoutlm( 2025-08-14T21:38:56.4596128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4596189Z return func(*args, **kwargs) 2025-08-14T21:38:56.4596427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4596489Z return func(*args, **kwargs) 2025-08-14T21:38:56.4596700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4596768Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4597021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4597095Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4597322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4597387Z return func(*args, **kwargs) 2025-08-14T21:38:56.4597622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4597683Z return func(*args, **kwargs) 2025-08-14T21:38:56.4597918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4597980Z return func(*args, **kwargs) 2025-08-14T21:38:56.4598050Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4598265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4598333Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4598586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4598659Z layer_outputs = layer_module( 2025-08-14T21:38:56.4598872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4598954Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4599188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4599249Z return func(*args, **kwargs) 2025-08-14T21:38:56.4599475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4599535Z return func(*args, **kwargs) 2025-08-14T21:38:56.4599766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4599836Z return func(*args, **kwargs) 2025-08-14T21:38:56.4600082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4600194Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4600413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4600471Z return func(*args, **kwargs) 2025-08-14T21:38:56.4600708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4600770Z return func(*args, **kwargs) 2025-08-14T21:38:56.4600987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4601056Z return func(*args, **kwargs) 2025-08-14T21:38:56.4601301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 271, in forward 2025-08-14T21:38:56.4601372Z self_outputs = self.self( 2025-08-14T21:38:56.4601589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4601666Z return func(*args, **kwargs) 2025-08-14T21:38:56.4601892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4601953Z return func(*args, **kwargs) 2025-08-14T21:38:56.4602181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4602242Z return func(*args, **kwargs) 2025-08-14T21:38:56.4602489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 193, in forward 2025-08-14T21:38:56.4602634Z value_states = self.value(hidden_states).view(hidden_shape).transpose(1, 2) 2025-08-14T21:38:56.4602638Z 2025-08-14T21:38:56.4602710Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4602781Z cudagraph partition due to non gpu ops 2025-08-14T21:38:56.4602884Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4603073Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4603139Z return mod(**inputs) 2025-08-14T21:38:56.4603343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4603413Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4603671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4603736Z outputs = self.layoutlm( 2025-08-14T21:38:56.4603961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4604031Z return func(*args, **kwargs) 2025-08-14T21:38:56.4604254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4604325Z return func(*args, **kwargs) 2025-08-14T21:38:56.4604529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4604597Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4604852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4604919Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4605146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4605206Z return func(*args, **kwargs) 2025-08-14T21:38:56.4605440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4605510Z return func(*args, **kwargs) 2025-08-14T21:38:56.4605729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4605806Z return func(*args, **kwargs) 2025-08-14T21:38:56.4605884Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4606085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4606158Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4606422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4606488Z layer_outputs = layer_module( 2025-08-14T21:38:56.4606698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4606770Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4606984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4607053Z return func(*args, **kwargs) 2025-08-14T21:38:56.4607288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4607357Z return func(*args, **kwargs) 2025-08-14T21:38:56.4607575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4607637Z return func(*args, **kwargs) 2025-08-14T21:38:56.4607895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 338, in forward 2025-08-14T21:38:56.4607970Z self_attention_outputs = self.attention( 2025-08-14T21:38:56.4608191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4608258Z return func(*args, **kwargs) 2025-08-14T21:38:56.4608479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4608549Z return func(*args, **kwargs) 2025-08-14T21:38:56.4608767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4608828Z return func(*args, **kwargs) 2025-08-14T21:38:56.4609083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 278, in forward 2025-08-14T21:38:56.4609202Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:38:56.4609453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 225, in forward 2025-08-14T21:38:56.4609529Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4609532Z 2025-08-14T21:38:56.4609630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4609817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4609879Z return mod(**inputs) 2025-08-14T21:38:56.4610081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4610156Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4610402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4610473Z outputs = self.layoutlm( 2025-08-14T21:38:56.4610696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4610756Z return func(*args, **kwargs) 2025-08-14T21:38:56.4611002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4611066Z return func(*args, **kwargs) 2025-08-14T21:38:56.4611261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4611354Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4611598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4611672Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4611903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4611964Z return func(*args, **kwargs) 2025-08-14T21:38:56.4612187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4612249Z return func(*args, **kwargs) 2025-08-14T21:38:56.4612464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4612530Z return func(*args, **kwargs) 2025-08-14T21:38:56.4612600Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4612806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4612888Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4613133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4613208Z layer_outputs = layer_module( 2025-08-14T21:38:56.4613408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4613488Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4613707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4613766Z return func(*args, **kwargs) 2025-08-14T21:38:56.4613991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4614051Z return func(*args, **kwargs) 2025-08-14T21:38:56.4614271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4614340Z return func(*args, **kwargs) 2025-08-14T21:38:56.4614586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4614670Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4614907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4614976Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4615259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4615371Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4615617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 294, in forward 2025-08-14T21:38:56.4615701Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4615704Z 2025-08-14T21:38:56.4615799Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4615990Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4616050Z return mod(**inputs) 2025-08-14T21:38:56.4616251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4616326Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4616585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4616660Z outputs = self.layoutlm( 2025-08-14T21:38:56.4616880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4616959Z return func(*args, **kwargs) 2025-08-14T21:38:56.4617186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4617248Z return func(*args, **kwargs) 2025-08-14T21:38:56.4617461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4617542Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4617788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4617870Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4618090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4618151Z return func(*args, **kwargs) 2025-08-14T21:38:56.4618375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4618481Z return func(*args, **kwargs) 2025-08-14T21:38:56.4618701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4618769Z return func(*args, **kwargs) 2025-08-14T21:38:56.4618840Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4619050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4619115Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4619364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4619436Z layer_outputs = layer_module( 2025-08-14T21:38:56.4619640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4619721Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4619949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4620010Z return func(*args, **kwargs) 2025-08-14T21:38:56.4620240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4620300Z return func(*args, **kwargs) 2025-08-14T21:38:56.4620518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4620586Z return func(*args, **kwargs) 2025-08-14T21:38:56.4620835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4620918Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4621158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4621232Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4621515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 356, in feed_forward_chunk 2025-08-14T21:38:56.4621625Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:38:56.4621871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 295, in forward 2025-08-14T21:38:56.4621983Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:38:56.4622204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:38:56.4622276Z return self.act(input) 2025-08-14T21:38:56.4622280Z 2025-08-14T21:38:56.4622376Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4622559Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4622646Z return mod(**inputs) 2025-08-14T21:38:56.4622848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4622927Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4623190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4623258Z outputs = self.layoutlm( 2025-08-14T21:38:56.4623487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4623550Z return func(*args, **kwargs) 2025-08-14T21:38:56.4623769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4623838Z return func(*args, **kwargs) 2025-08-14T21:38:56.4624036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4624127Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4624370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 645, in forward 2025-08-14T21:38:56.4624437Z encoder_outputs = self.encoder( 2025-08-14T21:38:56.4624667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4624729Z return func(*args, **kwargs) 2025-08-14T21:38:56.4625020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4625093Z return func(*args, **kwargs) 2025-08-14T21:38:56.4625313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4625382Z return func(*args, **kwargs) 2025-08-14T21:38:56.4625454Z [Previous line repeated 1 more time] 2025-08-14T21:38:56.4625655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4625730Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4625976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 397, in forward 2025-08-14T21:38:56.4626042Z layer_outputs = layer_module( 2025-08-14T21:38:56.4626254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:38:56.4626330Z return super().__call__(*args, **kwargs) 2025-08-14T21:38:56.4626561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4626624Z return func(*args, **kwargs) 2025-08-14T21:38:56.4626841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4626913Z return func(*args, **kwargs) 2025-08-14T21:38:56.4627129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4627197Z return func(*args, **kwargs) 2025-08-14T21:38:56.4627444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 348, in forward 2025-08-14T21:38:56.4627521Z layer_output = apply_chunking_to_forward( 2025-08-14T21:38:56.4627786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:38:56.4627859Z return forward_fn(*input_tensors) 2025-08-14T21:38:56.4628138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 357, in feed_forward_chunk 2025-08-14T21:38:56.4628267Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:38:56.4628542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 308, in forward 2025-08-14T21:38:56.4628626Z hidden_states = self.dense(hidden_states) 2025-08-14T21:38:56.4628630Z 2025-08-14T21:38:56.4628723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4628919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4628990Z return mod(**inputs) 2025-08-14T21:38:56.4629191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4629268Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4629513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4629577Z outputs = self.layoutlm( 2025-08-14T21:38:56.4629808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4629888Z return func(*args, **kwargs) 2025-08-14T21:38:56.4630105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4630175Z return func(*args, **kwargs) 2025-08-14T21:38:56.4630378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4630454Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4630701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 654, in forward 2025-08-14T21:38:56.4630787Z pooled_output = self.pooler(sequence_output) 2025-08-14T21:38:56.4631040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 430, in forward 2025-08-14T21:38:56.4631128Z pooled_output = self.dense(first_token_tensor) 2025-08-14T21:38:56.4631134Z 2025-08-14T21:38:56.4631240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4631421Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4631483Z return mod(**inputs) 2025-08-14T21:38:56.4631692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4631759Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4632005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 875, in forward 2025-08-14T21:38:56.4632076Z outputs = self.layoutlm( 2025-08-14T21:38:56.4632294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4632363Z return func(*args, **kwargs) 2025-08-14T21:38:56.4632581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:38:56.4632644Z return func(*args, **kwargs) 2025-08-14T21:38:56.4632848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4632914Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4633162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 654, in forward 2025-08-14T21:38:56.4633254Z pooled_output = self.pooler(sequence_output) 2025-08-14T21:38:56.4633509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 431, in forward 2025-08-14T21:38:56.4633608Z pooled_output = self.activation(pooled_output) 2025-08-14T21:38:56.4633612Z 2025-08-14T21:38:56.4633706Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4633886Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4633972Z return mod(**inputs) 2025-08-14T21:38:56.4634178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4634252Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4634517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 891, in forward 2025-08-14T21:38:56.4634596Z logits = self.classifier(pooled_output) 2025-08-14T21:38:56.4634600Z 2025-08-14T21:38:56.4634702Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4634885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4634944Z return mod(**inputs) 2025-08-14T21:38:56.4635152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4635222Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4635492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 911, in forward 2025-08-14T21:38:56.4635614Z loss = loss_fct(logits.view(-1, self.num_labels), labels.view(-1)) 2025-08-14T21:38:56.4635618Z 2025-08-14T21:38:56.4635711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4635900Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4635959Z return mod(**inputs) 2025-08-14T21:38:56.4636158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4636233Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4636476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 911, in forward 2025-08-14T21:38:56.4636598Z loss = loss_fct(logits.view(-1, self.num_labels), labels.view(-1)) 2025-08-14T21:38:56.4636604Z 2025-08-14T21:38:56.4636696Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:38:56.4636874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:38:56.4636941Z return mod(**inputs) 2025-08-14T21:38:56.4637143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:38:56.4637217Z output = func(self, *args, **kwargs) 2025-08-14T21:38:56.4637460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/layoutlm/modeling_layoutlm.py", line 911, in forward 2025-08-14T21:38:56.4637570Z loss = loss_fct(logits.view(-1, self.num_labels), labels.view(-1)) 2025-08-14T21:38:56.4637573Z 2025-08-14T21:39:07.0785292Z Compilation time (from dynamo_timed): 17.119200818 2025-08-14T21:39:07.0787893Z pass 2025-08-14T21:39:07.0788319Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:39:07.0789246Z TIMING: _recursive_pre_grad_passes:0.01072 _recursive_joint_graph_passes:0.40902 _recursive_post_grad_passes:0.0732 async_compile.wait:0.66615 code_gen:6.73306 inductor_compile:7.80719 backend_compile:11.77561 gc:0.00106 entire_frame_compile:17.1192 total_wall_time:17.1192 2025-08-14T21:39:07.0790251Z STATS: call_* op count: 860 | FakeTensorMode.__torch_dispatch__:16781 | FakeTensor.__torch_dispatch__:4682 | ProxyTorchDispatchMode.__torch_dispatch__:5774 2025-08-14T21:39:07.0790776Z Dynamo produced 2 graphs covering 860 ops with 0 graph breaks (0 unique) 2025-08-14T21:39:11.4215289Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:39:11.4216227Z from pkg_resources import resource_filename 2025-08-14T21:39:11.9751133Z 2025-08-14T21:39:18.2584239Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:39:18.2585896Z loading model: 0it [00:06, ?it/s] 2025-08-14T21:39:18.2610520Z cpu eval M2M100ForConditionalGeneration 2025-08-14T21:39:18.9852879Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:39:19.2894831Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:39:19.5893629Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:39:34.8108907Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8111696Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8116212Z return mod(**inputs) 2025-08-14T21:39:34.8123465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8124157Z outputs = self.model( 2025-08-14T21:39:34.8128881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8130263Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8130688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 844, in forward 2025-08-14T21:39:34.8131226Z embed_pos = self.embed_positions(input_ids, inputs_embeds) 2025-08-14T21:39:34.8131615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context 2025-08-14T21:39:34.8131958Z return func(*args, **kwargs) 2025-08-14T21:39:34.8132318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 148, in forward 2025-08-14T21:39:34.8132808Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length).to( 2025-08-14T21:39:34.8133371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 80, in create_position_ids_from_input_ids 2025-08-14T21:39:34.8133788Z mask = input_ids.ne(padding_idx).int() 2025-08-14T21:39:34.8133927Z 2025-08-14T21:39:34.8134030Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8134375Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8134677Z return mod(**inputs) 2025-08-14T21:39:34.8135023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8135383Z outputs = self.model( 2025-08-14T21:39:34.8135724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8136087Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8136444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1095, in forward 2025-08-14T21:39:34.8136890Z positions = self.embed_positions(input_ids, inputs_embeds, past_key_values_length) 2025-08-14T21:39:34.8137319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context 2025-08-14T21:39:34.8137651Z return func(*args, **kwargs) 2025-08-14T21:39:34.8138278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 148, in forward 2025-08-14T21:39:34.8138771Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length).to( 2025-08-14T21:39:34.8139305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 80, in create_position_ids_from_input_ids 2025-08-14T21:39:34.8139784Z mask = input_ids.ne(padding_idx).int() 2025-08-14T21:39:34.8139919Z 2025-08-14T21:39:34.8139995Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8140191Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8140376Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8140566Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8140808Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8140989Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8141177Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8141368Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8141554Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8141746Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8141937Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8142128Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8142344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8142692Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8143028Z return mod(**inputs) 2025-08-14T21:39:34.8143360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8143742Z outputs = self.model( 2025-08-14T21:39:34.8144079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8144425Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8144782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 844, in forward 2025-08-14T21:39:34.8145295Z embed_pos = self.embed_positions(input_ids, inputs_embeds) 2025-08-14T21:39:34.8145676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context 2025-08-14T21:39:34.8146007Z return func(*args, **kwargs) 2025-08-14T21:39:34.8146361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 148, in forward 2025-08-14T21:39:34.8146845Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length).to( 2025-08-14T21:39:34.8147386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 81, in create_position_ids_from_input_ids 2025-08-14T21:39:34.8147903Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:39:34.8148137Z 2025-08-14T21:39:34.8148238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8148580Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8148883Z return mod(**inputs) 2025-08-14T21:39:34.8149216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8149576Z outputs = self.model( 2025-08-14T21:39:34.8149914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8150277Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8150638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 844, in forward 2025-08-14T21:39:34.8151036Z embed_pos = self.embed_positions(input_ids, inputs_embeds) 2025-08-14T21:39:34.8151438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context 2025-08-14T21:39:34.8151764Z return func(*args, **kwargs) 2025-08-14T21:39:34.8152105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 148, in forward 2025-08-14T21:39:34.8152576Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length).to( 2025-08-14T21:39:34.8153121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 81, in create_position_ids_from_input_ids 2025-08-14T21:39:34.8153637Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:39:34.8153869Z 2025-08-14T21:39:34.8153969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8154307Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8154610Z return mod(**inputs) 2025-08-14T21:39:34.8154944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8155301Z outputs = self.model( 2025-08-14T21:39:34.8155636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8156005Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8156362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8156726Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8157059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8157394Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8157765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8158142Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8158509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8158936Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8159139Z 2025-08-14T21:39:34.8159236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8159573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8159876Z return mod(**inputs) 2025-08-14T21:39:34.8160223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8160584Z outputs = self.model( 2025-08-14T21:39:34.8160926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8161282Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8161637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8161995Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8162316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8162661Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8163026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8163404Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8163773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8164136Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8164264Z 2025-08-14T21:39:34.8164388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8164722Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8165018Z return mod(**inputs) 2025-08-14T21:39:34.8165353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8165729Z outputs = self.model( 2025-08-14T21:39:34.8166056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8166414Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8166774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8167129Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8167446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8167775Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8168131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8168495Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8168875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8169241Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8169372Z 2025-08-14T21:39:34.8169453Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8169650Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8169842Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8170033Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8170238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8170572Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8170870Z return mod(**inputs) 2025-08-14T21:39:34.8171207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8171558Z outputs = self.model( 2025-08-14T21:39:34.8171895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8172249Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8172591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8172946Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8173265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8173598Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8173951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8174323Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8174691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8175071Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8175476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8175920Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8176089Z 2025-08-14T21:39:34.8176191Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8176516Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8176816Z return mod(**inputs) 2025-08-14T21:39:34.8177171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8177525Z outputs = self.model( 2025-08-14T21:39:34.8177854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8178257Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8178604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8178957Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8179283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8179626Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8179994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8180371Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8180752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8181136Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8181565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8181983Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8182141Z 2025-08-14T21:39:34.8182237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8182574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8182872Z return mod(**inputs) 2025-08-14T21:39:34.8183203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8183572Z outputs = self.model( 2025-08-14T21:39:34.8183913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8184267Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8184961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8185369Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8185698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8186028Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8186389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8186762Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8187128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8187497Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8187633Z 2025-08-14T21:39:34.8187729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8188066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8188364Z return mod(**inputs) 2025-08-14T21:39:34.8188698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8189047Z outputs = self.model( 2025-08-14T21:39:34.8189384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8189736Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8190135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8190492Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8190806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8191141Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8191526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8191923Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8192084Z 2025-08-14T21:39:34.8192178Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8193105Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8193409Z return mod(**inputs) 2025-08-14T21:39:34.8193751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8194098Z outputs = self.model( 2025-08-14T21:39:34.8194432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8194788Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8195130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8195512Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8195829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8196160Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8196506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8196898Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8197059Z 2025-08-14T21:39:34.8197161Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8197495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8197790Z return mod(**inputs) 2025-08-14T21:39:34.8198123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8198478Z outputs = self.model( 2025-08-14T21:39:34.8198805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8199158Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8199507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8199858Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8200171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8200504Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8200864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8201216Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8201350Z 2025-08-14T21:39:34.8201442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8201770Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8202077Z return mod(**inputs) 2025-08-14T21:39:34.8202401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8202749Z outputs = self.model( 2025-08-14T21:39:34.8203078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8203446Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8203797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8204157Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8204487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8204838Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8205200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8205595Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8205965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8206381Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8206581Z 2025-08-14T21:39:34.8206675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8207004Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8207299Z return mod(**inputs) 2025-08-14T21:39:34.8207624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8207993Z outputs = self.model( 2025-08-14T21:39:34.8208326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8208676Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8209025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8209375Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8209698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8210028Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8210386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8210754Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8211117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8211477Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8211607Z 2025-08-14T21:39:34.8211702Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8212032Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8212326Z return mod(**inputs) 2025-08-14T21:39:34.8212664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8213014Z outputs = self.model( 2025-08-14T21:39:34.8213352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8213702Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8214052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8214411Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8214727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8215063Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8215419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8215787Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8216166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8216534Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8216663Z 2025-08-14T21:39:34.8216750Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8216943Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8217152Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8217339Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8217551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8217874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8218188Z return mod(**inputs) 2025-08-14T21:39:34.8218521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8218869Z outputs = self.model( 2025-08-14T21:39:34.8219204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8219561Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8219907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8220271Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8220593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8220926Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8221273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8221640Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8222003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8222376Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8222776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8223213Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8223388Z 2025-08-14T21:39:34.8223482Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8223810Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8224099Z return mod(**inputs) 2025-08-14T21:39:34.8224435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8224795Z outputs = self.model( 2025-08-14T21:39:34.8225222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8225599Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8225962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8226321Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8226636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8226976Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8227335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8227711Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8228076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8228456Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8228899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8229312Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8229468Z 2025-08-14T21:39:34.8229561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8229903Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8230202Z return mod(**inputs) 2025-08-14T21:39:34.8230525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8230877Z outputs = self.model( 2025-08-14T21:39:34.8231223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8231581Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8231924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8232277Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8232601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8232931Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8233309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8233678Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8234048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8234401Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8234532Z 2025-08-14T21:39:34.8234625Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8234956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8235251Z return mod(**inputs) 2025-08-14T21:39:34.8235574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8235922Z outputs = self.model( 2025-08-14T21:39:34.8236255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8236603Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8236950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8237297Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8237614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8237940Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8238302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8238701Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8238857Z 2025-08-14T21:39:34.8238951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8239280Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8239577Z return mod(**inputs) 2025-08-14T21:39:34.8239907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8240251Z outputs = self.model( 2025-08-14T21:39:34.8240583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8240940Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8241304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8241653Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8241973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8242307Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8242680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8243081Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8243245Z 2025-08-14T21:39:34.8243340Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8243685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8243978Z return mod(**inputs) 2025-08-14T21:39:34.8244315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8244663Z outputs = self.model( 2025-08-14T21:39:34.8244988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8245342Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8245691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8246059Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8246371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8246707Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8247064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8247421Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8247547Z 2025-08-14T21:39:34.8247641Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8247969Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8248264Z return mod(**inputs) 2025-08-14T21:39:34.8248585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8248939Z outputs = self.model( 2025-08-14T21:39:34.8249286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8249637Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8249976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8250328Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8250644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8250966Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8251320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 393, in forward 2025-08-14T21:39:34.8251675Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8251802Z 2025-08-14T21:39:34.8251901Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8252218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8252517Z return mod(**inputs) 2025-08-14T21:39:34.8252852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8253202Z outputs = self.model( 2025-08-14T21:39:34.8253528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8253897Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8254248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8254597Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8254916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8255268Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8255626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8255989Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8256373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8256801Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8256992Z 2025-08-14T21:39:34.8257104Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8257435Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8257738Z return mod(**inputs) 2025-08-14T21:39:34.8258075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8258444Z outputs = self.model( 2025-08-14T21:39:34.8258778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8259134Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8259484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8259831Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8260152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8260488Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8260841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8261219Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8261591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8261948Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8262071Z 2025-08-14T21:39:34.8262165Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8262497Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8262795Z return mod(**inputs) 2025-08-14T21:39:34.8263126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8263465Z outputs = self.model( 2025-08-14T21:39:34.8263796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8264148Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8264491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8264915Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8265250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8265589Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8265951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8266340Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8266730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8267102Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8267235Z 2025-08-14T21:39:34.8267313Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8267513Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8267730Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8267917Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8268135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8268476Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8268790Z return mod(**inputs) 2025-08-14T21:39:34.8269137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8269496Z outputs = self.model( 2025-08-14T21:39:34.8269841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8270200Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8270558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8270961Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8271290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8271627Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8271995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8272377Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8272754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8273143Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8273570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8274027Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8274205Z 2025-08-14T21:39:34.8274302Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8274645Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8274949Z return mod(**inputs) 2025-08-14T21:39:34.8275297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8275653Z outputs = self.model( 2025-08-14T21:39:34.8276000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8291229Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8291736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8292123Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8292471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8292818Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8293193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8293577Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8293963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8294340Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8294878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8295313Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8295467Z 2025-08-14T21:39:34.8295576Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8295979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8296294Z return mod(**inputs) 2025-08-14T21:39:34.8296642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8297000Z outputs = self.model( 2025-08-14T21:39:34.8297379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8297746Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8298104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8298452Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8298779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8299117Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8299510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8299874Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8300245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8300610Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8300738Z 2025-08-14T21:39:34.8300835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8301175Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8301478Z return mod(**inputs) 2025-08-14T21:39:34.8301813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8302158Z outputs = self.model( 2025-08-14T21:39:34.8302498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8302858Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8303201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8303554Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8303876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8304211Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8304562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8305046Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8305219Z 2025-08-14T21:39:34.8305318Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8305654Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8305955Z return mod(**inputs) 2025-08-14T21:39:34.8306298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8306655Z outputs = self.model( 2025-08-14T21:39:34.8306989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8307353Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8307730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8308085Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8308398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8308731Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8309110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8309507Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8309663Z 2025-08-14T21:39:34.8309758Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8310106Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8310414Z return mod(**inputs) 2025-08-14T21:39:34.8310744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8311101Z outputs = self.model( 2025-08-14T21:39:34.8311444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8311805Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8312155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8312529Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8312850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8313177Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8313538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8313897Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8314022Z 2025-08-14T21:39:34.8314127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8314451Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8314749Z return mod(**inputs) 2025-08-14T21:39:34.8315083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8315437Z outputs = self.model( 2025-08-14T21:39:34.8315762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8316119Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8316467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8316810Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8317129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8317463Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8317817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8318179Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8318554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8318982Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8319172Z 2025-08-14T21:39:34.8319276Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8319601Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8319898Z return mod(**inputs) 2025-08-14T21:39:34.8320249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8320597Z outputs = self.model( 2025-08-14T21:39:34.8320936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8321291Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8321657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8322002Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8322323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8322672Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8323025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8323400Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8323769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8324132Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8324257Z 2025-08-14T21:39:34.8324350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8324700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8325001Z return mod(**inputs) 2025-08-14T21:39:34.8325333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8326339Z outputs = self.model( 2025-08-14T21:39:34.8326676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8327032Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8327382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8327731Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8328049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8328373Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8328735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8329103Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8329466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8329830Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8329966Z 2025-08-14T21:39:34.8330041Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8330239Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8330423Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8330611Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8330826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8331144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8331441Z return mod(**inputs) 2025-08-14T21:39:34.8331777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8332123Z outputs = self.model( 2025-08-14T21:39:34.8332452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8332805Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8333149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8333509Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8333834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8334167Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8334526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8334908Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8335277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8335652Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8336092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8336528Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8336706Z 2025-08-14T21:39:34.8336803Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8337135Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8337422Z return mod(**inputs) 2025-08-14T21:39:34.8337759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8338131Z outputs = self.model( 2025-08-14T21:39:34.8338467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8338818Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8339170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8339526Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8339837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8340170Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8340525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8340893Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8341254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8341629Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8342037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8342458Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8342603Z 2025-08-14T21:39:34.8342697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8343026Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8343326Z return mod(**inputs) 2025-08-14T21:39:34.8343654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8344005Z outputs = self.model( 2025-08-14T21:39:34.8344344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8344701Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8345129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8345495Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8345820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8346155Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8346527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8346902Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8347266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8347640Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8347772Z 2025-08-14T21:39:34.8347867Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8348195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8348490Z return mod(**inputs) 2025-08-14T21:39:34.8348829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8349184Z outputs = self.model( 2025-08-14T21:39:34.8349516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8349870Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8350213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8350562Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8350901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8351226Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8351583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8351981Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8352140Z 2025-08-14T21:39:34.8352242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8352565Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8352863Z return mod(**inputs) 2025-08-14T21:39:34.8353195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8353540Z outputs = self.model( 2025-08-14T21:39:34.8353877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8354230Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8354575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8354922Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8355237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8355570Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8355926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8356312Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8356475Z 2025-08-14T21:39:34.8356569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8356899Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8357189Z return mod(**inputs) 2025-08-14T21:39:34.8357521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8357869Z outputs = self.model( 2025-08-14T21:39:34.8358204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8358553Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8358917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8359275Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8359590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8359927Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8360303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8360660Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8360784Z 2025-08-14T21:39:34.8360877Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8361220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8361519Z return mod(**inputs) 2025-08-14T21:39:34.8361850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8362191Z outputs = self.model( 2025-08-14T21:39:34.8362524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8362874Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8363213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8363580Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8363893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8364221Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8364569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 393, in forward 2025-08-14T21:39:34.8364924Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8365046Z 2025-08-14T21:39:34.8365147Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8365466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8365762Z return mod(**inputs) 2025-08-14T21:39:34.8366091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8366443Z outputs = self.model( 2025-08-14T21:39:34.8366765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8367119Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8367482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8367830Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8368143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8368475Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8368832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8369195Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8369569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8369995Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8370184Z 2025-08-14T21:39:34.8370286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8370608Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8370905Z return mod(**inputs) 2025-08-14T21:39:34.8371255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8371606Z outputs = self.model( 2025-08-14T21:39:34.8371933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8372284Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8372655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8373010Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8373337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8373692Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8374052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8374417Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8374790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8375153Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8375277Z 2025-08-14T21:39:34.8375371Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8375704Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8376021Z return mod(**inputs) 2025-08-14T21:39:34.8376357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8376704Z outputs = self.model( 2025-08-14T21:39:34.8377039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8377395Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8377745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8378091Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8378411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8378744Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8379095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8379462Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8379828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8380190Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8380318Z 2025-08-14T21:39:34.8380390Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8380587Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8380782Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8380966Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8381182Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8381515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8381814Z return mod(**inputs) 2025-08-14T21:39:34.8382144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8382497Z outputs = self.model( 2025-08-14T21:39:34.8382834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8383185Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8383536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8383892Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8384227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8384735Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8385156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8385595Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8385953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8386331Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8386756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8387206Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8387376Z 2025-08-14T21:39:34.8387474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8387812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8388118Z return mod(**inputs) 2025-08-14T21:39:34.8388460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8388832Z outputs = self.model( 2025-08-14T21:39:34.8389164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8389520Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8389869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8390223Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8390537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8390873Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8391229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8391596Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8391959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8392331Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8392736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8393153Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8393300Z 2025-08-14T21:39:34.8393393Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8393726Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8394024Z return mod(**inputs) 2025-08-14T21:39:34.8394350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8394700Z outputs = self.model( 2025-08-14T21:39:34.8395036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8395393Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8395731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8396082Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8396401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8396730Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8397103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8397475Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8397840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8398212Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8398341Z 2025-08-14T21:39:34.8398436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8398764Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8399060Z return mod(**inputs) 2025-08-14T21:39:34.8399396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8399752Z outputs = self.model( 2025-08-14T21:39:34.8400089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8400438Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8400789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8401141Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8401480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8401810Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8402171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8402572Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8402733Z 2025-08-14T21:39:34.8402838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8403168Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8403474Z return mod(**inputs) 2025-08-14T21:39:34.8403814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8404163Z outputs = self.model( 2025-08-14T21:39:34.8404502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8404863Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8405219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8405571Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8405896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8406234Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8406589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8406991Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8407158Z 2025-08-14T21:39:34.8407255Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8407592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8407888Z return mod(**inputs) 2025-08-14T21:39:34.8408231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8408587Z outputs = self.model( 2025-08-14T21:39:34.8408928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8409282Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8409653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8410011Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8410323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8410660Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8411034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8411393Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8411519Z 2025-08-14T21:39:34.8411615Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8411965Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8412266Z return mod(**inputs) 2025-08-14T21:39:34.8412593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8412949Z outputs = self.model( 2025-08-14T21:39:34.8413284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8413640Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8413983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8414351Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8414669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8415002Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8415350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8415719Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8416084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8416496Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8416691Z 2025-08-14T21:39:34.8416785Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8417126Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8417410Z return mod(**inputs) 2025-08-14T21:39:34.8417743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8418093Z outputs = self.model( 2025-08-14T21:39:34.8418427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8418777Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8419123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8419470Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8419785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8420108Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8420465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8420829Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8421189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8421540Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8421667Z 2025-08-14T21:39:34.8421762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8422104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8422399Z return mod(**inputs) 2025-08-14T21:39:34.8422730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8423082Z outputs = self.model( 2025-08-14T21:39:34.8423426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8423787Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8424135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8424518Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8424885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8425217Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8425575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8425948Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8426307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8426691Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8426819Z 2025-08-14T21:39:34.8426901Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8427090Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8427284Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8427473Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8427681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8428014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8428307Z return mod(**inputs) 2025-08-14T21:39:34.8428636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8428980Z outputs = self.model( 2025-08-14T21:39:34.8429312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8429669Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8430018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8430363Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8430682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8431010Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8431356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8431722Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8432083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8432454Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8432850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8433284Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8433454Z 2025-08-14T21:39:34.8433555Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8433883Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8434172Z return mod(**inputs) 2025-08-14T21:39:34.8434503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8434878Z outputs = self.model( 2025-08-14T21:39:34.8435205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8435553Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8435897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8436267Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8436576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8436908Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8437276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8437644Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8438012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8438386Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8438791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8439216Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8439367Z 2025-08-14T21:39:34.8439461Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8439784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8440074Z return mod(**inputs) 2025-08-14T21:39:34.8440399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8440745Z outputs = self.model( 2025-08-14T21:39:34.8441078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8441425Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8441763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8442101Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8442418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8442740Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8443093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8443459Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8443820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8444172Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8444301Z 2025-08-14T21:39:34.8444393Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8444721Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8445002Z return mod(**inputs) 2025-08-14T21:39:34.8445337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8445683Z outputs = self.model( 2025-08-14T21:39:34.8446014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8446364Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8446708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8447055Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8447377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8447708Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8448056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8448468Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8448626Z 2025-08-14T21:39:34.8448718Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8449043Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8449339Z return mod(**inputs) 2025-08-14T21:39:34.8449681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8450023Z outputs = self.model( 2025-08-14T21:39:34.8450344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8450690Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8451025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8451377Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8451714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8452044Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8452392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8452789Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8452943Z 2025-08-14T21:39:34.8453043Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8453369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8453657Z return mod(**inputs) 2025-08-14T21:39:34.8453990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8454337Z outputs = self.model( 2025-08-14T21:39:34.8454659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8455012Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8455358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8455712Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8456020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8456343Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8456688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8457034Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8457164Z 2025-08-14T21:39:34.8457255Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8457581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8457878Z return mod(**inputs) 2025-08-14T21:39:34.8458198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8458544Z outputs = self.model( 2025-08-14T21:39:34.8458875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8459219Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8459583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8459938Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8460250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8460573Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8460936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 393, in forward 2025-08-14T21:39:34.8461296Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8461422Z 2025-08-14T21:39:34.8461525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8461867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8462165Z return mod(**inputs) 2025-08-14T21:39:34.8462495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8462839Z outputs = self.model( 2025-08-14T21:39:34.8463172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8463523Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8463868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8464239Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8464560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8464996Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8465347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8465723Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8466092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8466514Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8466703Z 2025-08-14T21:39:34.8466797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8467127Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8467428Z return mod(**inputs) 2025-08-14T21:39:34.8467762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8468110Z outputs = self.model( 2025-08-14T21:39:34.8468444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8468790Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8469128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8469478Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8469794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8470122Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8470468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8470835Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8471202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8471556Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8471680Z 2025-08-14T21:39:34.8471772Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8472117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8472414Z return mod(**inputs) 2025-08-14T21:39:34.8472736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8473076Z outputs = self.model( 2025-08-14T21:39:34.8473411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8473783Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8474120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8474487Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8474805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8475129Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8475485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8475854Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8476215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8476596Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8476734Z 2025-08-14T21:39:34.8476809Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8476998Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8477188Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8477363Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8477576Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8477898Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8478189Z return mod(**inputs) 2025-08-14T21:39:34.8478525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8478879Z outputs = self.model( 2025-08-14T21:39:34.8479209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8479566Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8479917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8480269Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8480586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8480919Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8481272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8481641Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8482000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8482376Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8482787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8483223Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8483401Z 2025-08-14T21:39:34.8483495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8483825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8484121Z return mod(**inputs) 2025-08-14T21:39:34.8484448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8485016Z outputs = self.model( 2025-08-14T21:39:34.8485347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8485697Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8486030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8486400Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8486711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8487026Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8487394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8487755Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8488110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8488474Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8488877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8489317Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8489464Z 2025-08-14T21:39:34.8489563Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8489883Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8490181Z return mod(**inputs) 2025-08-14T21:39:34.8490512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8490851Z outputs = self.model( 2025-08-14T21:39:34.8491177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8491526Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8491874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8492215Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8492430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8492501Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8492731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8492822Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8493052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8493133Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8493136Z 2025-08-14T21:39:34.8493230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8493412Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8493474Z return mod(**inputs) 2025-08-14T21:39:34.8493705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8493773Z outputs = self.model( 2025-08-14T21:39:34.8494004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8494071Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8494302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8494367Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8494597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8494676Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8494907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8495041Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8495045Z 2025-08-14T21:39:34.8495138Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8495322Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8495388Z return mod(**inputs) 2025-08-14T21:39:34.8495635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8495705Z outputs = self.model( 2025-08-14T21:39:34.8495937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8496002Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8496237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8496296Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8496514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8496590Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8496815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8496927Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8496931Z 2025-08-14T21:39:34.8497025Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8497208Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8497273Z return mod(**inputs) 2025-08-14T21:39:34.8497504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8497563Z outputs = self.model( 2025-08-14T21:39:34.8497800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8497867Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8498101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8498165Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8498365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8498442Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8498668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8498748Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8498752Z 2025-08-14T21:39:34.8498843Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8499021Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8499090Z return mod(**inputs) 2025-08-14T21:39:34.8499321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8499381Z outputs = self.model( 2025-08-14T21:39:34.8499618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8499683Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8499932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8499999Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8500201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8500278Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8500521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8500611Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8500836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8500985Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8500989Z 2025-08-14T21:39:34.8501092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8501274Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8501331Z return mod(**inputs) 2025-08-14T21:39:34.8501568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8501629Z outputs = self.model( 2025-08-14T21:39:34.8501883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8501946Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8502172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8502242Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8502441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8502517Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8502744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8502824Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8503057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8503131Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8503134Z 2025-08-14T21:39:34.8503225Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8503410Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8503467Z return mod(**inputs) 2025-08-14T21:39:34.8503705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8503765Z outputs = self.model( 2025-08-14T21:39:34.8503997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8504069Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8504297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8504362Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8504568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8504637Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8504920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8505006Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8505237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8505333Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8505337Z 2025-08-14T21:39:34.8505411Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8505485Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8505549Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8505617Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8505736Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8505921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8505980Z return mod(**inputs) 2025-08-14T21:39:34.8506241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8506305Z outputs = self.model( 2025-08-14T21:39:34.8506547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8506615Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8506847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8506922Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8507125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8507210Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8507440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8507517Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8507749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8507835Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8508102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8508227Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8508231Z 2025-08-14T21:39:34.8508322Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8508509Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8508570Z return mod(**inputs) 2025-08-14T21:39:34.8508802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8508869Z outputs = self.model( 2025-08-14T21:39:34.8509100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8509164Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8509401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8509466Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8509674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8509744Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8509977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8510064Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8510291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8510385Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8510651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8510766Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8510769Z 2025-08-14T21:39:34.8510871Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8511052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8511111Z return mod(**inputs) 2025-08-14T21:39:34.8511365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8511425Z outputs = self.model( 2025-08-14T21:39:34.8511662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8511739Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8511965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8512037Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8512238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8512317Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8512544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8512640Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8512875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8512948Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8512951Z 2025-08-14T21:39:34.8513044Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8513236Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8513294Z return mod(**inputs) 2025-08-14T21:39:34.8513530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8513590Z outputs = self.model( 2025-08-14T21:39:34.8513823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8513896Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8514124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8514187Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8514392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8514463Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8514694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8514798Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8514802Z 2025-08-14T21:39:34.8514893Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8515078Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8515137Z return mod(**inputs) 2025-08-14T21:39:34.8515376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8515434Z outputs = self.model( 2025-08-14T21:39:34.8515666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8515737Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8515965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8516029Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8516261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8516332Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8516564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8516688Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8516691Z 2025-08-14T21:39:34.8516784Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8516972Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8517030Z return mod(**inputs) 2025-08-14T21:39:34.8517283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8517347Z outputs = self.model( 2025-08-14T21:39:34.8517584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8517656Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8517887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8517952Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8518173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8518241Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8518474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8518547Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8518551Z 2025-08-14T21:39:34.8518640Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8518826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8518884Z return mod(**inputs) 2025-08-14T21:39:34.8519117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8519178Z outputs = self.model( 2025-08-14T21:39:34.8519408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8519484Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8519713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8519777Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8519981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8520051Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8520285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 393, in forward 2025-08-14T21:39:34.8520355Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8520358Z 2025-08-14T21:39:34.8520449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8520636Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8520696Z return mod(**inputs) 2025-08-14T21:39:34.8520934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8520993Z outputs = self.model( 2025-08-14T21:39:34.8521225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8521296Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8521541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8521606Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8521813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8521882Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8522137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8522218Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8522443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8522597Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8522601Z 2025-08-14T21:39:34.8522697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8522886Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8522943Z return mod(**inputs) 2025-08-14T21:39:34.8523174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8523239Z outputs = self.model( 2025-08-14T21:39:34.8523474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8523557Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8523793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8523858Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8524066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8524135Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8524365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8524454Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8524686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8524759Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8524770Z 2025-08-14T21:39:34.8524863Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8525043Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8525106Z return mod(**inputs) 2025-08-14T21:39:34.8525339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8525400Z outputs = self.model( 2025-08-14T21:39:34.8525640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8525704Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8525941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8526003Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8526207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8526279Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8526505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8526586Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8526818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8526903Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8526907Z 2025-08-14T21:39:34.8526978Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8527046Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8527113Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8527188Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8527306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8527484Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8527549Z return mod(**inputs) 2025-08-14T21:39:34.8527796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8527864Z outputs = self.model( 2025-08-14T21:39:34.8528100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8528164Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8528400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8528463Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8528663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8528756Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8528988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8529075Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8529302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8529390Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8529664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8529784Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8529788Z 2025-08-14T21:39:34.8529885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8530062Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8530120Z return mod(**inputs) 2025-08-14T21:39:34.8530356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8530417Z outputs = self.model( 2025-08-14T21:39:34.8530646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8530718Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8530947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8531017Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8531218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8531287Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8531524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8531604Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8531838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8531926Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8532190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8532308Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8532313Z 2025-08-14T21:39:34.8532405Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8532592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8532651Z return mod(**inputs) 2025-08-14T21:39:34.8532901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8532969Z outputs = self.model( 2025-08-14T21:39:34.8533198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8533276Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8533509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8533574Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8533781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8533853Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8534080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8534184Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8534409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8534482Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8534493Z 2025-08-14T21:39:34.8534590Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8534772Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8534836Z return mod(**inputs) 2025-08-14T21:39:34.8535069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8535130Z outputs = self.model( 2025-08-14T21:39:34.8535369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8535433Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8535672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8535735Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8535935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8536013Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8536240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8536350Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8536360Z 2025-08-14T21:39:34.8536455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8536636Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8536703Z return mod(**inputs) 2025-08-14T21:39:34.8536939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8537000Z outputs = self.model( 2025-08-14T21:39:34.8537237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8537303Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8537535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8537600Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8537815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8537896Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8538128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8538255Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8538259Z 2025-08-14T21:39:34.8538358Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8538540Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8538604Z return mod(**inputs) 2025-08-14T21:39:34.8538850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8538914Z outputs = self.model( 2025-08-14T21:39:34.8539158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8539223Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8539461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8539524Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8539742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8539820Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8540048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8540121Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8540125Z 2025-08-14T21:39:34.8540223Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8540406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8540469Z return mod(**inputs) 2025-08-14T21:39:34.8540701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8540761Z outputs = self.model( 2025-08-14T21:39:34.8540995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8541061Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8541286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8541357Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8541555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8541631Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8541860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8541941Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8542174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8542310Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8542313Z 2025-08-14T21:39:34.8542412Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8542590Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8542648Z return mod(**inputs) 2025-08-14T21:39:34.8542884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8542944Z outputs = self.model( 2025-08-14T21:39:34.8543184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8543258Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8543489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8543560Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8543777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8543846Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8544098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8544182Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8544417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8544491Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8544495Z 2025-08-14T21:39:34.8544588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8544777Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8544900Z return mod(**inputs) 2025-08-14T21:39:34.8545157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8545227Z outputs = self.model( 2025-08-14T21:39:34.8545458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8545533Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8545762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8545828Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8546039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8546111Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8546351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8546438Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8546666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8546752Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8546756Z 2025-08-14T21:39:34.8546831Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8546905Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8546985Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8547054Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8547158Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8547337Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8547394Z return mod(**inputs) 2025-08-14T21:39:34.8547635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8547699Z outputs = self.model( 2025-08-14T21:39:34.8547931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8548003Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8548231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8548302Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8548502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8548597Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8548833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8548913Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8549140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8549253Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8549517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8549657Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8549661Z 2025-08-14T21:39:34.8549755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8549939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8550005Z return mod(**inputs) 2025-08-14T21:39:34.8550240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8550309Z outputs = self.model( 2025-08-14T21:39:34.8550541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8550624Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8550870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8550933Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8551142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8551220Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8551460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8551547Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8551782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8551873Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8552154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8552251Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8552255Z 2025-08-14T21:39:34.8552354Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8552539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8552597Z return mod(**inputs) 2025-08-14T21:39:34.8552846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8552906Z outputs = self.model( 2025-08-14T21:39:34.8553148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8553221Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8553461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8553530Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8553739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8553809Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8554051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8554146Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8554386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8554459Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8554462Z 2025-08-14T21:39:34.8554554Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8554762Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8554820Z return mod(**inputs) 2025-08-14T21:39:34.8555050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8555132Z outputs = self.model( 2025-08-14T21:39:34.8555364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8555437Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8555666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8555730Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8555936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8556023Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8556260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8556367Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8556370Z 2025-08-14T21:39:34.8556462Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8556650Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8556708Z return mod(**inputs) 2025-08-14T21:39:34.8556944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8557012Z outputs = self.model( 2025-08-14T21:39:34.8557245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8557316Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8557549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8557612Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8557819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8557891Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8558120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8558236Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8558240Z 2025-08-14T21:39:34.8558331Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8558519Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8558577Z return mod(**inputs) 2025-08-14T21:39:34.8558811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8558880Z outputs = self.model( 2025-08-14T21:39:34.8559111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8559184Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8559416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8559480Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8559703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8559777Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8560007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8560105Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8560108Z 2025-08-14T21:39:34.8560200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8560387Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8560446Z return mod(**inputs) 2025-08-14T21:39:34.8560689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8560759Z outputs = self.model( 2025-08-14T21:39:34.8560993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8561067Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8561301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8561365Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8561590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8561660Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8561887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 393, in forward 2025-08-14T21:39:34.8561968Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8561971Z 2025-08-14T21:39:34.8562062Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8562251Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8562308Z return mod(**inputs) 2025-08-14T21:39:34.8562538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8562605Z outputs = self.model( 2025-08-14T21:39:34.8562834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8562908Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8563136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8563199Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8563407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8563477Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8563706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8563794Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8564020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8564164Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8564168Z 2025-08-14T21:39:34.8564259Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8564438Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8564503Z return mod(**inputs) 2025-08-14T21:39:34.8564733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8564798Z outputs = self.model( 2025-08-14T21:39:34.8565043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8565110Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8565343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8565408Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8565628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8565707Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8565949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8566040Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8566269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8566342Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8566346Z 2025-08-14T21:39:34.8566445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8566624Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8566688Z return mod(**inputs) 2025-08-14T21:39:34.8566942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8567003Z outputs = self.model( 2025-08-14T21:39:34.8567239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8567306Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8567531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8567601Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8567802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8567879Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8568104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8568186Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8568417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8568493Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8568496Z 2025-08-14T21:39:34.8568568Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8568646Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8568714Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8568786Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8568878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8569057Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8569120Z return mod(**inputs) 2025-08-14T21:39:34.8569351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8569415Z outputs = self.model( 2025-08-14T21:39:34.8569653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8569717Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8569951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8570016Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8570216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8570308Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8570543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8570632Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8570861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8570967Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8571238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8571369Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8571373Z 2025-08-14T21:39:34.8571467Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8571655Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8571715Z return mod(**inputs) 2025-08-14T21:39:34.8571957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8572015Z outputs = self.model( 2025-08-14T21:39:34.8572246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8572330Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8572558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8572625Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8572825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8572892Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8573125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8573203Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8573428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8573520Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8573781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8573882Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8573886Z 2025-08-14T21:39:34.8573977Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8574152Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8574216Z return mod(**inputs) 2025-08-14T21:39:34.8574447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8574511Z outputs = self.model( 2025-08-14T21:39:34.8574739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8574802Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8575030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8575090Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8575286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8575356Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8575579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8575673Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8575901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8575971Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8575974Z 2025-08-14T21:39:34.8576068Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8576264Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8576322Z return mod(**inputs) 2025-08-14T21:39:34.8576557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8576631Z outputs = self.model( 2025-08-14T21:39:34.8576867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8576932Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8577162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8577233Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8577434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8577545Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8577776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8577883Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8577886Z 2025-08-14T21:39:34.8577986Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8578167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8578225Z return mod(**inputs) 2025-08-14T21:39:34.8578465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8578525Z outputs = self.model( 2025-08-14T21:39:34.8578763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8578826Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8579057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8579129Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8579328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8579404Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8579635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8579740Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8579744Z 2025-08-14T21:39:34.8579842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8580022Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8580078Z return mod(**inputs) 2025-08-14T21:39:34.8580321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8580380Z outputs = self.model( 2025-08-14T21:39:34.8580618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8580682Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8580910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8580979Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8581191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8581262Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8581485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8582037Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8582041Z 2025-08-14T21:39:34.8582139Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8582320Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8582378Z return mod(**inputs) 2025-08-14T21:39:34.8582631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8582692Z outputs = self.model( 2025-08-14T21:39:34.8582931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8582995Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8583221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8583291Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8583508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8583578Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8583811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8583892Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8584128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8584264Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8584268Z 2025-08-14T21:39:34.8584359Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8584546Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8584761Z return mod(**inputs) 2025-08-14T21:39:34.8585057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8585124Z outputs = self.model( 2025-08-14T21:39:34.8585357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8585434Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8585663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8585729Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8585939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8586012Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8586249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8586332Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8586561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8586640Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8586644Z 2025-08-14T21:39:34.8586739Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8586929Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8586989Z return mod(**inputs) 2025-08-14T21:39:34.8587255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8587327Z outputs = self.model( 2025-08-14T21:39:34.8587559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8587624Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8587881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8587944Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8588181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8588253Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8588487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8588576Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8588807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8588890Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8588894Z 2025-08-14T21:39:34.8588964Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8589056Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8589132Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8589198Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8589290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8589478Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8589535Z return mod(**inputs) 2025-08-14T21:39:34.8589774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8589836Z outputs = self.model( 2025-08-14T21:39:34.8590068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8590138Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8590365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8590433Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8590641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8590711Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8590946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8591026Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8591255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8591349Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8591613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8591738Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8591744Z 2025-08-14T21:39:34.8591837Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8592018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8592083Z return mod(**inputs) 2025-08-14T21:39:34.8592317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8592379Z outputs = self.model( 2025-08-14T21:39:34.8592629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8592697Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8592933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8592996Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8593212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8593290Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8593519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8593613Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8593848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8593935Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8594207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8594306Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8594310Z 2025-08-14T21:39:34.8594402Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8594602Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8594660Z return mod(**inputs) 2025-08-14T21:39:34.8594899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8594960Z outputs = self.model( 2025-08-14T21:39:34.8595194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8595265Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8595496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8595559Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8595766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8595838Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8596072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 378, in forward 2025-08-14T21:39:34.8596151Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:39:34.8596380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8596459Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8596463Z 2025-08-14T21:39:34.8596553Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8596739Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8596797Z return mod(**inputs) 2025-08-14T21:39:34.8597030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8597099Z outputs = self.model( 2025-08-14T21:39:34.8597331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8597396Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8597632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8597696Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8597903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8597988Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8598219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8598335Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8598338Z 2025-08-14T21:39:34.8598432Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8598639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8598698Z return mod(**inputs) 2025-08-14T21:39:34.8598931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8599014Z outputs = self.model( 2025-08-14T21:39:34.8599247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8599314Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8599548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8599612Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8599818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8599906Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8600135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 389, in forward 2025-08-14T21:39:34.8600248Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8600252Z 2025-08-14T21:39:34.8600344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8600530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8600591Z return mod(**inputs) 2025-08-14T21:39:34.8600826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8600894Z outputs = self.model( 2025-08-14T21:39:34.8601126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8601192Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8601430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8601496Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8601706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8601778Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8602005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 391, in forward 2025-08-14T21:39:34.8602086Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8602090Z 2025-08-14T21:39:34.8602181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8602367Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8602424Z return mod(**inputs) 2025-08-14T21:39:34.8602660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8602728Z outputs = self.model( 2025-08-14T21:39:34.8602960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1261, in forward 2025-08-14T21:39:34.8603024Z encoder_outputs = self.encoder( 2025-08-14T21:39:34.8603259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 878, in forward 2025-08-14T21:39:34.8603324Z layer_outputs = encoder_layer( 2025-08-14T21:39:34.8603544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8603615Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8603844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 393, in forward 2025-08-14T21:39:34.8603940Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8603943Z 2025-08-14T21:39:34.8604034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8604214Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8604280Z return mod(**inputs) 2025-08-14T21:39:34.8604528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8604598Z outputs = self.model( 2025-08-14T21:39:34.8604832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8604898Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8605137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1095, in forward 2025-08-14T21:39:34.8605290Z positions = self.embed_positions(input_ids, inputs_embeds, past_key_values_length) 2025-08-14T21:39:34.8605526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context 2025-08-14T21:39:34.8605592Z return func(*args, **kwargs) 2025-08-14T21:39:34.8605820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 148, in forward 2025-08-14T21:39:34.8606024Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length).to( 2025-08-14T21:39:34.8606314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 81, in create_position_ids_from_input_ids 2025-08-14T21:39:34.8606493Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:39:34.8606498Z 2025-08-14T21:39:34.8606591Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8606769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8606860Z return mod(**inputs) 2025-08-14T21:39:34.8607090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8607149Z outputs = self.model( 2025-08-14T21:39:34.8607386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8607450Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8607686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1095, in forward 2025-08-14T21:39:34.8607836Z positions = self.embed_positions(input_ids, inputs_embeds, past_key_values_length) 2025-08-14T21:39:34.8608043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/torch/utils/_contextlib.py", line 120, in decorate_context 2025-08-14T21:39:34.8608117Z return func(*args, **kwargs) 2025-08-14T21:39:34.8608347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 148, in forward 2025-08-14T21:39:34.8608545Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length).to( 2025-08-14T21:39:34.8608831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 81, in create_position_ids_from_input_ids 2025-08-14T21:39:34.8609000Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:39:34.8609004Z 2025-08-14T21:39:34.8609122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8609305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8609371Z return mod(**inputs) 2025-08-14T21:39:34.8609604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8609689Z outputs = self.model( 2025-08-14T21:39:34.8609934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8609998Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8610251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8610325Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8610526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8610605Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8610835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8610928Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8611183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8611320Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8611323Z 2025-08-14T21:39:34.8611422Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8611601Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8611660Z return mod(**inputs) 2025-08-14T21:39:34.8611898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8611958Z outputs = self.model( 2025-08-14T21:39:34.8612187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8612259Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8612494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8612567Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8612768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8612840Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8613077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8613166Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8613405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8613478Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8613482Z 2025-08-14T21:39:34.8613574Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8613765Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8613825Z return mod(**inputs) 2025-08-14T21:39:34.8614054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8614124Z outputs = self.model( 2025-08-14T21:39:34.8614352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8614424Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8614668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8614735Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8614942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8615015Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8615261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8615359Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8615603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8615690Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8615694Z 2025-08-14T21:39:34.8615766Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8615835Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8615913Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8615980Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8616072Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8616262Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8616337Z return mod(**inputs) 2025-08-14T21:39:34.8616577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8616637Z outputs = self.model( 2025-08-14T21:39:34.8616869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8616941Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8617174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8617246Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8617446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8617515Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8617749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8617840Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8618069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8618166Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8618433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8618560Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8618564Z 2025-08-14T21:39:34.8618656Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8618837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8618904Z return mod(**inputs) 2025-08-14T21:39:34.8619137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8619208Z outputs = self.model( 2025-08-14T21:39:34.8619439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8619504Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8619744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8619807Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8620025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8620107Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8620338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8620429Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8620676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8620764Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8621047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8621148Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8621152Z 2025-08-14T21:39:34.8621249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8621428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8621486Z return mod(**inputs) 2025-08-14T21:39:34.8621723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8621782Z outputs = self.model( 2025-08-14T21:39:34.8622030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8622102Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8622333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8622403Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8622602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8622673Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8622909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8622996Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8623231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8623307Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8623311Z 2025-08-14T21:39:34.8623403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8623591Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8623651Z return mod(**inputs) 2025-08-14T21:39:34.8623880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8623946Z outputs = self.model( 2025-08-14T21:39:34.8624175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8624245Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8624474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8624539Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8624745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8624877Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8625121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8625228Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8625478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8625625Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8625628Z 2025-08-14T21:39:34.8625720Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8625900Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8625986Z return mod(**inputs) 2025-08-14T21:39:34.8626221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8626289Z outputs = self.model( 2025-08-14T21:39:34.8626535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8626604Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8626843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8626909Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8627110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8627188Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8627418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8627558Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8627788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8627864Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8627867Z 2025-08-14T21:39:34.8627969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8628150Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8628216Z return mod(**inputs) 2025-08-14T21:39:34.8628448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8628508Z outputs = self.model( 2025-08-14T21:39:34.8628747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8628817Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8629047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8629121Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8629322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8629398Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8629631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8629727Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8629964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8630040Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8630046Z 2025-08-14T21:39:34.8630124Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8630195Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8630263Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8630339Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8630432Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8630613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8630678Z return mod(**inputs) 2025-08-14T21:39:34.8630934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8631003Z outputs = self.model( 2025-08-14T21:39:34.8631235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8631301Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8631556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8631623Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8631823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8631916Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8632152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8632256Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8632486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8632574Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8632843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8632978Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8632982Z 2025-08-14T21:39:34.8633080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8633260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8633319Z return mod(**inputs) 2025-08-14T21:39:34.8633557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8633617Z outputs = self.model( 2025-08-14T21:39:34.8633848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8633920Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8634150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8634223Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8634422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8634492Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8634726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8634821Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8635058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8635143Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8635404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8635506Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8635512Z 2025-08-14T21:39:34.8635602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8635783Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8635848Z return mod(**inputs) 2025-08-14T21:39:34.8636082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8636147Z outputs = self.model( 2025-08-14T21:39:34.8636391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8636458Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8636696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8636761Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8636982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8637056Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8637288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8637404Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8637636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8637709Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8637714Z 2025-08-14T21:39:34.8637814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8637993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8638058Z return mod(**inputs) 2025-08-14T21:39:34.8638289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8638365Z outputs = self.model( 2025-08-14T21:39:34.8638604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8638668Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8638899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8638969Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8639173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8639251Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8639484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8639594Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8639600Z 2025-08-14T21:39:34.8639702Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8639884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8639950Z return mod(**inputs) 2025-08-14T21:39:34.8640187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8640249Z outputs = self.model( 2025-08-14T21:39:34.8640492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8640558Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8640788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8640861Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8641065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8641145Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8641374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8641482Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8641486Z 2025-08-14T21:39:34.8641588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8641769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8641853Z return mod(**inputs) 2025-08-14T21:39:34.8642088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8642152Z outputs = self.model( 2025-08-14T21:39:34.8642391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8642474Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8642703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8642775Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8642988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8643074Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8643304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8643376Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8643380Z 2025-08-14T21:39:34.8643478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8643659Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8643744Z return mod(**inputs) 2025-08-14T21:39:34.8643979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8644040Z outputs = self.model( 2025-08-14T21:39:34.8644282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8644347Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8644580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8644652Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8644857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8644935Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8645168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8645262Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8645501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8645640Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8645643Z 2025-08-14T21:39:34.8645742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8645928Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8645985Z return mod(**inputs) 2025-08-14T21:39:34.8646230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8646290Z outputs = self.model( 2025-08-14T21:39:34.8646523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8646599Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8646834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8646906Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8647107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8647178Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8647431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8647523Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8647758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8647846Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8647850Z 2025-08-14T21:39:34.8647942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8648129Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8648187Z return mod(**inputs) 2025-08-14T21:39:34.8648433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8648503Z outputs = self.model( 2025-08-14T21:39:34.8648741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8648813Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8649041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8649107Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8649341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8649413Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8649643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8649741Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8649969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8650056Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8650059Z 2025-08-14T21:39:34.8650132Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8650200Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8650275Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8650344Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8650439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8650629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8650687Z return mod(**inputs) 2025-08-14T21:39:34.8650930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8650992Z outputs = self.model( 2025-08-14T21:39:34.8651223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8651298Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8651532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8651604Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8651805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8651880Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8652117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8652205Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8652435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8652531Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8652813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8652943Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8652947Z 2025-08-14T21:39:34.8653040Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8653222Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8653305Z return mod(**inputs) 2025-08-14T21:39:34.8653539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8653607Z outputs = self.model( 2025-08-14T21:39:34.8653852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8653919Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8654167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8654232Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8654432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8654510Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8654742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8654855Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8655090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8655179Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8655454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8655553Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8655557Z 2025-08-14T21:39:34.8655656Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8655839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8655898Z return mod(**inputs) 2025-08-14T21:39:34.8656141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8656200Z outputs = self.model( 2025-08-14T21:39:34.8656433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8656508Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8656741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8656812Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8657014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8657084Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8657324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8657412Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8657651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8657724Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8657728Z 2025-08-14T21:39:34.8657820Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8658011Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8658069Z return mod(**inputs) 2025-08-14T21:39:34.8658324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8658395Z outputs = self.model( 2025-08-14T21:39:34.8658629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8658700Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8658947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8659010Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8659228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8659304Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8659537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 482, in forward 2025-08-14T21:39:34.8659621Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8659624Z 2025-08-14T21:39:34.8659719Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8659907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8659969Z return mod(**inputs) 2025-08-14T21:39:34.8660204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8660289Z outputs = self.model( 2025-08-14T21:39:34.8660522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8660596Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8660832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8660895Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8661105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8661174Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8661405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8661514Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8661746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8661889Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8661892Z 2025-08-14T21:39:34.8661987Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8662167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8662231Z return mod(**inputs) 2025-08-14T21:39:34.8662467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8662535Z outputs = self.model( 2025-08-14T21:39:34.8662767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8662834Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8663071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8663135Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8663336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8663416Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8663646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8663765Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8663997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8664070Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8664073Z 2025-08-14T21:39:34.8664189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8664369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8664435Z return mod(**inputs) 2025-08-14T21:39:34.8664699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8664761Z outputs = self.model( 2025-08-14T21:39:34.8665068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8665140Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8665375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8665447Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8665648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8665747Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8665981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8666079Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8666322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8666399Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8666403Z 2025-08-14T21:39:34.8666482Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8666555Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8666625Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8666700Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8666792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8666973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8667045Z return mod(**inputs) 2025-08-14T21:39:34.8667279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8667347Z outputs = self.model( 2025-08-14T21:39:34.8667581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8667647Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8667888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8667952Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8668153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8668234Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8668468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8668571Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8668801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8668890Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8669160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8669294Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8669298Z 2025-08-14T21:39:34.8669401Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8669583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8669642Z return mod(**inputs) 2025-08-14T21:39:34.8669900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8669960Z outputs = self.model( 2025-08-14T21:39:34.8670190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8670284Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8670517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8670588Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8670790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8670861Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8671099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8671214Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8671442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8671536Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8671800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8671902Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8671905Z 2025-08-14T21:39:34.8671997Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8672177Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8672243Z return mod(**inputs) 2025-08-14T21:39:34.8672477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8672545Z outputs = self.model( 2025-08-14T21:39:34.8672777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8672842Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8673080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8673142Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8673343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8673422Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8673653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8673757Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8673991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8674065Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8674068Z 2025-08-14T21:39:34.8674165Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8674345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8674409Z return mod(**inputs) 2025-08-14T21:39:34.8674643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8674717Z outputs = self.model( 2025-08-14T21:39:34.8674956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8675021Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8675249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8675337Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8675538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8675618Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8675864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8675976Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8675979Z 2025-08-14T21:39:34.8676077Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8676256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8676321Z return mod(**inputs) 2025-08-14T21:39:34.8676553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8676630Z outputs = self.model( 2025-08-14T21:39:34.8676868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8676933Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8677162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8677233Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8677435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8677513Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8677743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8677851Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8677857Z 2025-08-14T21:39:34.8677957Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8678135Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8678199Z return mod(**inputs) 2025-08-14T21:39:34.8678435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8678497Z outputs = self.model( 2025-08-14T21:39:34.8678736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8678802Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8679033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8679105Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8679306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8679389Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8679618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8679692Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8679696Z 2025-08-14T21:39:34.8679796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8679976Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8680035Z return mod(**inputs) 2025-08-14T21:39:34.8680289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8680352Z outputs = self.model( 2025-08-14T21:39:34.8680591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8680719Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8680950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8681023Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8681237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8681318Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8681550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8681640Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8681876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8682013Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8682032Z 2025-08-14T21:39:34.8682135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8682317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8682374Z return mod(**inputs) 2025-08-14T21:39:34.8682614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8682676Z outputs = self.model( 2025-08-14T21:39:34.8682907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8682981Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8683214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8683286Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8683489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8683563Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8683802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8683894Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8684124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8684207Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8684210Z 2025-08-14T21:39:34.8684305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8684493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8684551Z return mod(**inputs) 2025-08-14T21:39:34.8684943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8685022Z outputs = self.model( 2025-08-14T21:39:34.8685255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8685329Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8685565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8685629Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8685874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8685947Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8686175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8686270Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8686528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8686615Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8686618Z 2025-08-14T21:39:34.8686711Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8686803Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8686884Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8686953Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8687047Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8687242Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8687301Z return mod(**inputs) 2025-08-14T21:39:34.8687543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8687605Z outputs = self.model( 2025-08-14T21:39:34.8687862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8687936Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8688167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8688231Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8688438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8688510Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8688749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8688836Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8689063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8689161Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8689422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8689549Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8689552Z 2025-08-14T21:39:34.8689644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8689823Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8689890Z return mod(**inputs) 2025-08-14T21:39:34.8690123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8690190Z outputs = self.model( 2025-08-14T21:39:34.8690419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8690488Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8690722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8690785Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8690986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8691064Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8691305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8691400Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8691628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8691712Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8692002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8692101Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8692105Z 2025-08-14T21:39:34.8692200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8692394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8692453Z return mod(**inputs) 2025-08-14T21:39:34.8692693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8692754Z outputs = self.model( 2025-08-14T21:39:34.8692986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8693058Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8693290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8693379Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8693580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8693651Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8693888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8693974Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8694205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8694287Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8694290Z 2025-08-14T21:39:34.8694381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8694571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8694632Z return mod(**inputs) 2025-08-14T21:39:34.8694861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8694929Z outputs = self.model( 2025-08-14T21:39:34.8695160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8695234Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8695465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8695529Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8695737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8695807Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8696038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8696144Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8696373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8696515Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8696519Z 2025-08-14T21:39:34.8696609Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8696808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8696877Z return mod(**inputs) 2025-08-14T21:39:34.8697111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8697183Z outputs = self.model( 2025-08-14T21:39:34.8697435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8697501Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8697753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8697819Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8698019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8698096Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8698326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8698431Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8698659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8698747Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8698750Z 2025-08-14T21:39:34.8698848Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8699026Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8699090Z return mod(**inputs) 2025-08-14T21:39:34.8699323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8699384Z outputs = self.model( 2025-08-14T21:39:34.8699623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8699689Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8699918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8699994Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8700194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8700271Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8700502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8700598Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8700835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8700914Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8700917Z 2025-08-14T21:39:34.8700996Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8701066Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8701133Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8701213Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8701304Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8701484Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8701548Z return mod(**inputs) 2025-08-14T21:39:34.8701780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8701839Z outputs = self.model( 2025-08-14T21:39:34.8702077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8702158Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8702397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8704576Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8704789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8704936Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8705178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8705296Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8705540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8705627Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8705901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8706021Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8706055Z 2025-08-14T21:39:34.8706149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8706359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8706419Z return mod(**inputs) 2025-08-14T21:39:34.8706651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8706722Z outputs = self.model( 2025-08-14T21:39:34.8706952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8707019Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8707261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8707326Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8707534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8707606Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8707836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8707937Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8708168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8708263Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8708527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8708623Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8708626Z 2025-08-14T21:39:34.8708725Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8708905Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8708966Z return mod(**inputs) 2025-08-14T21:39:34.8709204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8709264Z outputs = self.model( 2025-08-14T21:39:34.8709503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8709569Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8709797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8709884Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8710084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8710163Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8710445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8710542Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8710778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8710866Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8710869Z 2025-08-14T21:39:34.8710962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8711147Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8711206Z return mod(**inputs) 2025-08-14T21:39:34.8711448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8711510Z outputs = self.model( 2025-08-14T21:39:34.8711742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8711836Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8712073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8712144Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8712352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8712423Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8712667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 499, in forward 2025-08-14T21:39:34.8712739Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8712743Z 2025-08-14T21:39:34.8712835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8713025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8713085Z return mod(**inputs) 2025-08-14T21:39:34.8713330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8713390Z outputs = self.model( 2025-08-14T21:39:34.8713625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8713697Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8713934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8713998Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8714210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8714280Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8714524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8714635Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8714638Z 2025-08-14T21:39:34.8714729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8714922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8714980Z return mod(**inputs) 2025-08-14T21:39:34.8715223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8715283Z outputs = self.model( 2025-08-14T21:39:34.8715544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8715619Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8715852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8715947Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8716157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8716228Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8716484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8716595Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8716598Z 2025-08-14T21:39:34.8716690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8716878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8716936Z return mod(**inputs) 2025-08-14T21:39:34.8717173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8717253Z outputs = self.model( 2025-08-14T21:39:34.8717484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8717556Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8717788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8717852Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8718059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8718132Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8718368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8718440Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8718444Z 2025-08-14T21:39:34.8718537Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8718723Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8718780Z return mod(**inputs) 2025-08-14T21:39:34.8719017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8719078Z outputs = self.model( 2025-08-14T21:39:34.8719307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8719381Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8719610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8719675Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8719883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8719956Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8720193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8720282Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8720512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8720656Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8720660Z 2025-08-14T21:39:34.8720768Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8720957Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8721015Z return mod(**inputs) 2025-08-14T21:39:34.8721247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8721334Z outputs = self.model( 2025-08-14T21:39:34.8721565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8721629Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8721880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8721946Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8722150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8722225Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8722458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8722553Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8722786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8722875Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8722886Z 2025-08-14T21:39:34.8722980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8723164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8723230Z return mod(**inputs) 2025-08-14T21:39:34.8723469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8723532Z outputs = self.model( 2025-08-14T21:39:34.8723778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8723843Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8724089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8724153Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8724357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8724437Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8724674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8724762Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8725006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8725084Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8725087Z 2025-08-14T21:39:34.8725164Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8725237Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8725307Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8725382Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8725474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8725657Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8725724Z return mod(**inputs) 2025-08-14T21:39:34.8725961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8726027Z outputs = self.model( 2025-08-14T21:39:34.8726279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8726346Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8726583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8726664Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8726876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8726947Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8727179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8727288Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8727518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8727608Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8727885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8728004Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8728009Z 2025-08-14T21:39:34.8728108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8728306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8728365Z return mod(**inputs) 2025-08-14T21:39:34.8728604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8728668Z outputs = self.model( 2025-08-14T21:39:34.8728905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8728972Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8729204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8729277Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8729477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8729551Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8729790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8729878Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8730116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8730203Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8730467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8730573Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8730576Z 2025-08-14T21:39:34.8730669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8730857Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8730917Z return mod(**inputs) 2025-08-14T21:39:34.8731150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8731219Z outputs = self.model( 2025-08-14T21:39:34.8731450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8731516Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8731770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8731835Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8732046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8732116Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8732363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8732457Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8732685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8732779Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8732783Z 2025-08-14T21:39:34.8732878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8733058Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8733123Z return mod(**inputs) 2025-08-14T21:39:34.8733358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8733417Z outputs = self.model( 2025-08-14T21:39:34.8733659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8733754Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8733990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8734053Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8734254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8734330Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8734561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8734658Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8734893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8735028Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8735034Z 2025-08-14T21:39:34.8735131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8735306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8735365Z return mod(**inputs) 2025-08-14T21:39:34.8735606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8735665Z outputs = self.model( 2025-08-14T21:39:34.8735903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8735969Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8736198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8736270Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8736470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8736540Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8736776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8736872Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8737105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8737191Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8737194Z 2025-08-14T21:39:34.8737288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8737475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8737554Z return mod(**inputs) 2025-08-14T21:39:34.8737799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8737862Z outputs = self.model( 2025-08-14T21:39:34.8738099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8738187Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8738419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8738482Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8738691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8738762Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8738998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8739111Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8739340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8739425Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8739428Z 2025-08-14T21:39:34.8739499Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8739577Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8739644Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8739710Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8739809Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8739988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8740045Z return mod(**inputs) 2025-08-14T21:39:34.8740284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8740347Z outputs = self.model( 2025-08-14T21:39:34.8740583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8740647Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8740875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8740945Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8741145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8741218Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8741455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8741550Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8741786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8741876Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8742139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8742266Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8742269Z 2025-08-14T21:39:34.8742361Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8742561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8742623Z return mod(**inputs) 2025-08-14T21:39:34.8742859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8742945Z outputs = self.model( 2025-08-14T21:39:34.8743182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8743249Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8743494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8743572Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8743782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8743851Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8744081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8744185Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8744415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8744525Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8744791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8744953Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8744958Z 2025-08-14T21:39:34.8745067Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8745248Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8745307Z return mod(**inputs) 2025-08-14T21:39:34.8745551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8745612Z outputs = self.model( 2025-08-14T21:39:34.8745853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8745924Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8746157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8746230Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8746433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8746506Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8746747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8746847Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8747082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8747157Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8747161Z 2025-08-14T21:39:34.8747256Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8747447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8747505Z return mod(**inputs) 2025-08-14T21:39:34.8747748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8747809Z outputs = self.model( 2025-08-14T21:39:34.8748041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8748119Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8748369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8748434Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8748640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8748728Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8748967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8749076Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8749080Z 2025-08-14T21:39:34.8749187Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8749377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8749436Z return mod(**inputs) 2025-08-14T21:39:34.8749675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8749736Z outputs = self.model( 2025-08-14T21:39:34.8749968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8750058Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8750291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8750356Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8750567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8750637Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8750876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8750984Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8750987Z 2025-08-14T21:39:34.8751080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8751270Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8751328Z return mod(**inputs) 2025-08-14T21:39:34.8751570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8751629Z outputs = self.model( 2025-08-14T21:39:34.8751861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8751935Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8752167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8752231Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8752443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8752514Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8752752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8752830Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8752834Z 2025-08-14T21:39:34.8752924Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8753119Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8753178Z return mod(**inputs) 2025-08-14T21:39:34.8753418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8753478Z outputs = self.model( 2025-08-14T21:39:34.8753724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8753799Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8754033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8754135Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8754343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8754413Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8754671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 508, in forward 2025-08-14T21:39:34.8754744Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8754747Z 2025-08-14T21:39:34.8754839Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8755028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8755088Z return mod(**inputs) 2025-08-14T21:39:34.8755321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8755390Z outputs = self.model( 2025-08-14T21:39:34.8755622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8755709Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8755940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8756005Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8756211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8756280Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8756519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8756609Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8756838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8756983Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8756986Z 2025-08-14T21:39:34.8757076Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8757263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8757323Z return mod(**inputs) 2025-08-14T21:39:34.8757554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8757621Z outputs = self.model( 2025-08-14T21:39:34.8757852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8757918Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8758155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8758221Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8758427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8758497Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8758727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8758825Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8759054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8759139Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8759151Z 2025-08-14T21:39:34.8759244Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8759425Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8759507Z return mod(**inputs) 2025-08-14T21:39:34.8759745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8759805Z outputs = self.model( 2025-08-14T21:39:34.8760061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8760127Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8760370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8760433Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8760633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8760711Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8760938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8761043Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8761280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8761357Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8761360Z 2025-08-14T21:39:34.8761439Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8761509Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8761577Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8761650Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8761741Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8761919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8761985Z return mod(**inputs) 2025-08-14T21:39:34.8762219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8762288Z outputs = self.model( 2025-08-14T21:39:34.8762516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8762579Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8762816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8762880Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8763082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8763158Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8763387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8763483Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8763713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8763802Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8764076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8764196Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8764199Z 2025-08-14T21:39:34.8764299Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8764493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8764554Z return mod(**inputs) 2025-08-14T21:39:34.8764794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8764870Z outputs = self.model( 2025-08-14T21:39:34.8765102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8765175Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8765406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8765491Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8765692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8765763Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8766001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8766089Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8766325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8766429Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8766700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8766806Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8766810Z 2025-08-14T21:39:34.8766903Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8767091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8767150Z return mod(**inputs) 2025-08-14T21:39:34.8767388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8767457Z outputs = self.model( 2025-08-14T21:39:34.8767690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8767759Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8767999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8768063Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8768273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8768346Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8768580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8768678Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8768911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8768986Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8768998Z 2025-08-14T21:39:34.8769093Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8769273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8769339Z return mod(**inputs) 2025-08-14T21:39:34.8769575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8769637Z outputs = self.model( 2025-08-14T21:39:34.8769879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8769962Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8770199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8770263Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8770480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8770561Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8770790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8770889Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8771138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8771276Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8771279Z 2025-08-14T21:39:34.8771381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8771564Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8771622Z return mod(**inputs) 2025-08-14T21:39:34.8771866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8771940Z outputs = self.model( 2025-08-14T21:39:34.8772183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8772248Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8772478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8772549Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8772751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8772821Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8773057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8773156Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8773394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8773465Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8773468Z 2025-08-14T21:39:34.8773559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8773750Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8773807Z return mod(**inputs) 2025-08-14T21:39:34.8774046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8774105Z outputs = self.model( 2025-08-14T21:39:34.8774338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8774412Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8774643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8774709Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8774916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8774987Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8775223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8775319Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8775563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8775650Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8775653Z 2025-08-14T21:39:34.8775725Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8775826Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8775898Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8775968Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8776070Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8776252Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8776325Z return mod(**inputs) 2025-08-14T21:39:34.8776565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8776627Z outputs = self.model( 2025-08-14T21:39:34.8776857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8776931Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8777160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8777249Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8777450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8777521Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8777759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8777857Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8778091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8778181Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8778448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8778574Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8778579Z 2025-08-14T21:39:34.8778670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8778856Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8778916Z return mod(**inputs) 2025-08-14T21:39:34.8779149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8779216Z outputs = self.model( 2025-08-14T21:39:34.8779445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8779513Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8779753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8779816Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8780025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8780098Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8780324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8780429Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8780658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8780744Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8781027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8781125Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8781128Z 2025-08-14T21:39:34.8781243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8781426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8781483Z return mod(**inputs) 2025-08-14T21:39:34.8781725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8781786Z outputs = self.model( 2025-08-14T21:39:34.8782038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8782106Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8782342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8782413Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8782615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8782686Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8782942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8783037Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8783272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8783346Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8783349Z 2025-08-14T21:39:34.8783441Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8783631Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8783688Z return mod(**inputs) 2025-08-14T21:39:34.8783925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8783987Z outputs = self.model( 2025-08-14T21:39:34.8784218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8784290Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8784519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8784844Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8785065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8785139Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8785378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8785488Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8785492Z 2025-08-14T21:39:34.8785586Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8785777Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8785836Z return mod(**inputs) 2025-08-14T21:39:34.8786073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8786134Z outputs = self.model( 2025-08-14T21:39:34.8786365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8786437Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8786703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8786767Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8786974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8787071Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8787309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8787414Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8787418Z 2025-08-14T21:39:34.8787533Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8787724Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8787782Z return mod(**inputs) 2025-08-14T21:39:34.8788026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8788087Z outputs = self.model( 2025-08-14T21:39:34.8788319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8788393Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8788642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8788706Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8788911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8788983Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8789220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8789292Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8789297Z 2025-08-14T21:39:34.8789389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8789575Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8789632Z return mod(**inputs) 2025-08-14T21:39:34.8789863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8789932Z outputs = self.model( 2025-08-14T21:39:34.8790161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8790232Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8790461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8790525Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8790733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8790803Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8791036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8791127Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8791358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8791502Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8791505Z 2025-08-14T21:39:34.8791598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8791781Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8791847Z return mod(**inputs) 2025-08-14T21:39:34.8792096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8792166Z outputs = self.model( 2025-08-14T21:39:34.8792399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8792483Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8792723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8792787Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8792996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8793080Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8793313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8793407Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8793637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8793708Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8793720Z 2025-08-14T21:39:34.8793812Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8794005Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8794072Z return mod(**inputs) 2025-08-14T21:39:34.8794301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8794363Z outputs = self.model( 2025-08-14T21:39:34.8794602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8794667Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8794904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8794968Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8795167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8795248Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8795478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8795569Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8795809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8795886Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8795890Z 2025-08-14T21:39:34.8795969Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8796042Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8796111Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8796186Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8796279Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8796462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8796530Z return mod(**inputs) 2025-08-14T21:39:34.8796764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8796831Z outputs = self.model( 2025-08-14T21:39:34.8797064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8797131Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8797370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8797579Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8797784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8797863Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8798129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8798226Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8798456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8798559Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8798834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8798952Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8798957Z 2025-08-14T21:39:34.8799057Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8799240Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8799300Z return mod(**inputs) 2025-08-14T21:39:34.8799545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8799622Z outputs = self.model( 2025-08-14T21:39:34.8799855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8799931Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8800163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8800234Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8800436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8800505Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8800743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8800833Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8801072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8801158Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8801423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8801528Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8801532Z 2025-08-14T21:39:34.8801623Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8801812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8801872Z return mod(**inputs) 2025-08-14T21:39:34.8802104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8802172Z outputs = self.model( 2025-08-14T21:39:34.8802405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8802470Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8802710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8802774Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8802982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8803069Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8803298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8803392Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8803639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8803713Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8803724Z 2025-08-14T21:39:34.8803818Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8804012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8804081Z return mod(**inputs) 2025-08-14T21:39:34.8804313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8804373Z outputs = self.model( 2025-08-14T21:39:34.8804611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8804676Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8804914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8804997Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8805196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8805275Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8805504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 482, in forward 2025-08-14T21:39:34.8805577Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8805580Z 2025-08-14T21:39:34.8805680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8805860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8805925Z return mod(**inputs) 2025-08-14T21:39:34.8806157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8806220Z outputs = self.model( 2025-08-14T21:39:34.8806458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8806523Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8806753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8806824Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8807025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8807104Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8807340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8807438Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8807676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8807813Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8807816Z 2025-08-14T21:39:34.8807914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8808094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8808152Z return mod(**inputs) 2025-08-14T21:39:34.8808391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8808468Z outputs = self.model( 2025-08-14T21:39:34.8808706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8808772Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8809022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8809095Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8809295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8809364Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8809615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8809716Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8809957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8810030Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8810034Z 2025-08-14T21:39:34.8810124Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8810317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8810392Z return mod(**inputs) 2025-08-14T21:39:34.8810624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8810692Z outputs = self.model( 2025-08-14T21:39:34.8810924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8810997Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8811227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8811294Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8811503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8811574Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8811813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8811911Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8812138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8812223Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8812227Z 2025-08-14T21:39:34.8812298Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8812369Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8812443Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8812510Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8812607Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8812786Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8812846Z return mod(**inputs) 2025-08-14T21:39:34.8813084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8813145Z outputs = self.model( 2025-08-14T21:39:34.8813374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8813447Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8813678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8813750Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8813962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8814037Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8814272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8814386Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8814620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8814707Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8814994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8815121Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8815124Z 2025-08-14T21:39:34.8815215Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8815398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8815465Z return mod(**inputs) 2025-08-14T21:39:34.8815697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8815789Z outputs = self.model( 2025-08-14T21:39:34.8816020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8816086Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8816324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8816389Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8816596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8816668Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8816899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8817001Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8817229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8817319Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8817587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8817684Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8817687Z 2025-08-14T21:39:34.8817786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8817964Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8818024Z return mod(**inputs) 2025-08-14T21:39:34.8818266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8818327Z outputs = self.model( 2025-08-14T21:39:34.8818567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8818633Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8818863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8818933Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8819134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8819206Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8819455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8819552Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8819791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8819878Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8819883Z 2025-08-14T21:39:34.8819976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8820164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8820224Z return mod(**inputs) 2025-08-14T21:39:34.8820478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8820542Z outputs = self.model( 2025-08-14T21:39:34.8820777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8820849Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8821079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8821144Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8821368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8821438Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8821674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8821786Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8821789Z 2025-08-14T21:39:34.8821879Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8822066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8822124Z return mod(**inputs) 2025-08-14T21:39:34.8822356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8822424Z outputs = self.model( 2025-08-14T21:39:34.8822655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8822729Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8822959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8823023Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8823231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8823302Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8823540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8823647Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8823651Z 2025-08-14T21:39:34.8823742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8823933Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8823992Z return mod(**inputs) 2025-08-14T21:39:34.8824222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8824290Z outputs = self.model( 2025-08-14T21:39:34.8824521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8824594Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8824896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8824973Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8825187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8825275Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8825518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8825592Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8825596Z 2025-08-14T21:39:34.8825691Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8825892Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8825954Z return mod(**inputs) 2025-08-14T21:39:34.8826190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8826260Z outputs = self.model( 2025-08-14T21:39:34.8826492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8826565Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8826799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8826880Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8827089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8827160Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8827390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8827491Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8827720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8827865Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8827869Z 2025-08-14T21:39:34.8827961Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8828140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8828207Z return mod(**inputs) 2025-08-14T21:39:34.8828439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8828505Z outputs = self.model( 2025-08-14T21:39:34.8828737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8828802Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8829040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8829106Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8829313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8829386Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8829616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8829710Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8829940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8830011Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8830014Z 2025-08-14T21:39:34.8830114Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8830306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8830374Z return mod(**inputs) 2025-08-14T21:39:34.8830610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8830686Z outputs = self.model( 2025-08-14T21:39:34.8830924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8830991Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8831221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8831308Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8831509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8831586Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8831816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8831904Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8832139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8832231Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8832235Z 2025-08-14T21:39:34.8832315Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8832385Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8832455Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8832529Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8832622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8832800Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8832866Z return mod(**inputs) 2025-08-14T21:39:34.8833098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8833166Z outputs = self.model( 2025-08-14T21:39:34.8833394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8833462Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8833698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8833760Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8833961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8834039Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8834268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8834362Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8834592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8834677Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8834949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8835069Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8835072Z 2025-08-14T21:39:34.8835169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8835347Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8835405Z return mod(**inputs) 2025-08-14T21:39:34.8835641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8835714Z outputs = self.model( 2025-08-14T21:39:34.8835948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8836020Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8836265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8836336Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8836534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8836606Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8836862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8836950Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8837190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8837277Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8837540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8837660Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8837664Z 2025-08-14T21:39:34.8837755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8837935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8837999Z return mod(**inputs) 2025-08-14T21:39:34.8838233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8838298Z outputs = self.model( 2025-08-14T21:39:34.8838532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8838596Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8838833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8838898Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8839108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8839179Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8839409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8839502Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8839729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8839802Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8839805Z 2025-08-14T21:39:34.8839903Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8840082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8840147Z return mod(**inputs) 2025-08-14T21:39:34.8840380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8840439Z outputs = self.model( 2025-08-14T21:39:34.8840674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8840738Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8840967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8841037Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8841266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8841344Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8841574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8841688Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8841927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8842061Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8842079Z 2025-08-14T21:39:34.8842180Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8842359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8842417Z return mod(**inputs) 2025-08-14T21:39:34.8842659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8842719Z outputs = self.model( 2025-08-14T21:39:34.8842948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8843038Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8843269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8843339Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8843541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8843613Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8843852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8843951Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8844187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8844259Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8844263Z 2025-08-14T21:39:34.8844357Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8844545Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8844604Z return mod(**inputs) 2025-08-14T21:39:34.8844837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8844904Z outputs = self.model( 2025-08-14T21:39:34.8845137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8845210Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8845443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8845507Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8845715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8845787Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8846026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8846124Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8846353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8846437Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8846440Z 2025-08-14T21:39:34.8846525Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8846598Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8846674Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8846743Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8846843Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8847035Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8847094Z return mod(**inputs) 2025-08-14T21:39:34.8847331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8847392Z outputs = self.model( 2025-08-14T21:39:34.8847635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8847709Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8847940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8848011Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8848210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8848282Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8848538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8848635Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8848873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8848960Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8849226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8849355Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8849358Z 2025-08-14T21:39:34.8849454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8849635Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8849701Z return mod(**inputs) 2025-08-14T21:39:34.8849936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8850004Z outputs = self.model( 2025-08-14T21:39:34.8850237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8850303Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8850542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8850608Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8850817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8850888Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8851120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8851227Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8851457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8851544Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8851815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8851913Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8851916Z 2025-08-14T21:39:34.8852031Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8852216Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8852274Z return mod(**inputs) 2025-08-14T21:39:34.8852529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8852591Z outputs = self.model( 2025-08-14T21:39:34.8852827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8852893Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8853134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8853207Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8853411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8853481Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8853719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8853816Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8854068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8854141Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8854144Z 2025-08-14T21:39:34.8854236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8854425Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8854483Z return mod(**inputs) 2025-08-14T21:39:34.8854720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8854782Z outputs = self.model( 2025-08-14T21:39:34.8855011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8855082Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8855312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8855377Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8855585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8855655Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8855891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 499, in forward 2025-08-14T21:39:34.8855962Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8855966Z 2025-08-14T21:39:34.8856059Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8856245Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8856303Z return mod(**inputs) 2025-08-14T21:39:34.8856535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8856603Z outputs = self.model( 2025-08-14T21:39:34.8856834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8856905Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8857135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8857201Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8857422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8857495Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8857740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8857873Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8857878Z 2025-08-14T21:39:34.8857971Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8858159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8858216Z return mod(**inputs) 2025-08-14T21:39:34.8858464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8858533Z outputs = self.model( 2025-08-14T21:39:34.8858765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8858839Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8859072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8859136Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8859344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8859432Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8859668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8859776Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8859780Z 2025-08-14T21:39:34.8859871Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8860056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8860116Z return mod(**inputs) 2025-08-14T21:39:34.8860348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8860415Z outputs = self.model( 2025-08-14T21:39:34.8860645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8860719Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8860948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8861017Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8861223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8861294Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8861524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8861606Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8861609Z 2025-08-14T21:39:34.8861701Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8861888Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8861949Z return mod(**inputs) 2025-08-14T21:39:34.8862181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8862247Z outputs = self.model( 2025-08-14T21:39:34.8862476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8862547Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8862777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8862855Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8863064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8863135Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8863378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8863476Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8863703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8863865Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8863868Z 2025-08-14T21:39:34.8863963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8864144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8864210Z return mod(**inputs) 2025-08-14T21:39:34.8864440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8864508Z outputs = self.model( 2025-08-14T21:39:34.8864740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8864880Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8865129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8865193Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8865397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8865477Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8865710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8865807Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8866040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8866115Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8866120Z 2025-08-14T21:39:34.8866222Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8866401Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8866466Z return mod(**inputs) 2025-08-14T21:39:34.8866699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8866760Z outputs = self.model( 2025-08-14T21:39:34.8866999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8867066Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8867297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8867371Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8867571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8867651Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8867881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8867970Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8868203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8868282Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8868285Z 2025-08-14T21:39:34.8868389Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8868460Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8868529Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8868603Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8868712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8868893Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8868959Z return mod(**inputs) 2025-08-14T21:39:34.8869192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8869272Z outputs = self.model( 2025-08-14T21:39:34.8869502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8869567Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8869804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8869867Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8870068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8870148Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8870398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8870494Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8870725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8870813Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8871087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8871206Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8871210Z 2025-08-14T21:39:34.8871306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8871488Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8871547Z return mod(**inputs) 2025-08-14T21:39:34.8871787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8871848Z outputs = self.model( 2025-08-14T21:39:34.8872079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8872154Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8872384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8872457Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8872655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8872726Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8872961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8873050Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8873278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8873372Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8873636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8873742Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8873759Z 2025-08-14T21:39:34.8873852Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8874033Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8874100Z return mod(**inputs) 2025-08-14T21:39:34.8874349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8874418Z outputs = self.model( 2025-08-14T21:39:34.8874648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8874712Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8874963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8875028Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8875228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8875306Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8875533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8875627Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8875873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8875946Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8875949Z 2025-08-14T21:39:34.8876047Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8876229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8876296Z return mod(**inputs) 2025-08-14T21:39:34.8876529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8876590Z outputs = self.model( 2025-08-14T21:39:34.8876828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8876893Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8877124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8877197Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8877397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8877474Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8877705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8877804Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8878040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8878174Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8878179Z 2025-08-14T21:39:34.8878280Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8878462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8878519Z return mod(**inputs) 2025-08-14T21:39:34.8878755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8878814Z outputs = self.model( 2025-08-14T21:39:34.8879048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8879120Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8879370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8879442Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8879642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8879732Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8879973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8880071Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8880324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8880397Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8880401Z 2025-08-14T21:39:34.8880492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8880681Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8880739Z return mod(**inputs) 2025-08-14T21:39:34.8880972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8881041Z outputs = self.model( 2025-08-14T21:39:34.8881286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8881357Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8881588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8881651Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8881860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8881931Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8882170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8882266Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8882495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8882580Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8882584Z 2025-08-14T21:39:34.8882653Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8882721Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8882798Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8882866Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8882964Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8883144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8883203Z return mod(**inputs) 2025-08-14T21:39:34.8883441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8883502Z outputs = self.model( 2025-08-14T21:39:34.8883734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8883809Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8884038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8884109Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8884310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8884381Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8884773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8884879Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8885108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8885235Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8885504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8885632Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8885635Z 2025-08-14T21:39:34.8885749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8885931Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8885998Z return mod(**inputs) 2025-08-14T21:39:34.8886233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8886303Z outputs = self.model( 2025-08-14T21:39:34.8886537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8886605Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8886864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8886931Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8887130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8887212Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8887440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8887545Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8887776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8887862Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8888134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8888230Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8888234Z 2025-08-14T21:39:34.8888333Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8888514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8888572Z return mod(**inputs) 2025-08-14T21:39:34.8888812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8888871Z outputs = self.model( 2025-08-14T21:39:34.8889102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8889175Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8889405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8889479Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8889679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8889751Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8889987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8890082Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8890332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8890408Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8890411Z 2025-08-14T21:39:34.8890505Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8890711Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8890772Z return mod(**inputs) 2025-08-14T21:39:34.8891005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8891073Z outputs = self.model( 2025-08-14T21:39:34.8891318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8891394Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8891627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8891693Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8891903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8891973Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8892214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8892341Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8892345Z 2025-08-14T21:39:34.8892437Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8892625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8892683Z return mod(**inputs) 2025-08-14T21:39:34.8892915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8892983Z outputs = self.model( 2025-08-14T21:39:34.8893214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8893285Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8893515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8893579Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8893786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8893855Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8894090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8894198Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8894201Z 2025-08-14T21:39:34.8894294Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8894476Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8894534Z return mod(**inputs) 2025-08-14T21:39:34.8894761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8894831Z outputs = self.model( 2025-08-14T21:39:34.8895057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8895128Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8895358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8895420Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8895623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8895706Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8895940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8896018Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8896036Z 2025-08-14T21:39:34.8896129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8896317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8896374Z return mod(**inputs) 2025-08-14T21:39:34.8896620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8896690Z outputs = self.model( 2025-08-14T21:39:34.8896922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8896991Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8897221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8897286Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8897494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8897591Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8897819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 508, in forward 2025-08-14T21:39:34.8897897Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8897901Z 2025-08-14T21:39:34.8897993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8898179Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8898237Z return mod(**inputs) 2025-08-14T21:39:34.8898468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8898537Z outputs = self.model( 2025-08-14T21:39:34.8898766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8898839Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8899072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8899135Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8899345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8899415Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8899644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8899742Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8899971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8900112Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8900117Z 2025-08-14T21:39:34.8900211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8900390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8900457Z return mod(**inputs) 2025-08-14T21:39:34.8900690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8900757Z outputs = self.model( 2025-08-14T21:39:34.8900986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8901065Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8901303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8901366Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8901584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8901665Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8901892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8901987Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8902227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8902302Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8902306Z 2025-08-14T21:39:34.8902408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8902588Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8902653Z return mod(**inputs) 2025-08-14T21:39:34.8902886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8902963Z outputs = self.model( 2025-08-14T21:39:34.8903199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8903263Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8903495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8903568Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8903767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8903846Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8904076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8904163Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8904401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8904479Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8904482Z 2025-08-14T21:39:34.8904559Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8904630Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8904699Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8904774Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8904920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8905107Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8905173Z return mod(**inputs) 2025-08-14T21:39:34.8905408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8905470Z outputs = self.model( 2025-08-14T21:39:34.8905710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8905775Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8906017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8906084Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8906287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8906365Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8906614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8906713Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8906943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8907050Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8907320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8907439Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8907442Z 2025-08-14T21:39:34.8907553Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8907743Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8907801Z return mod(**inputs) 2025-08-14T21:39:34.8908039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8908099Z outputs = self.model( 2025-08-14T21:39:34.8908329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8908416Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8908645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8908715Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8908913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8908983Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8909219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8909307Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8909535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8909628Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8909894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8909999Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8910002Z 2025-08-14T21:39:34.8910094Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8910277Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8910344Z return mod(**inputs) 2025-08-14T21:39:34.8910576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8910644Z outputs = self.model( 2025-08-14T21:39:34.8910875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8910939Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8911176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8911240Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8911440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8911519Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8911752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8911844Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8912092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8912167Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8912170Z 2025-08-14T21:39:34.8912271Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8912470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8912537Z return mod(**inputs) 2025-08-14T21:39:34.8912770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8912830Z outputs = self.model( 2025-08-14T21:39:34.8913083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8913151Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8913382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8913452Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8913652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8913733Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8913967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8914083Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8914322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8914459Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8914463Z 2025-08-14T21:39:34.8914564Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8914746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8914803Z return mod(**inputs) 2025-08-14T21:39:34.8915046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8915108Z outputs = self.model( 2025-08-14T21:39:34.8915338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8915412Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8915641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8915712Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8915913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8915983Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8916222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8916319Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8916557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8916631Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8916634Z 2025-08-14T21:39:34.8916727Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8916913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8916970Z return mod(**inputs) 2025-08-14T21:39:34.8917203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8917270Z outputs = self.model( 2025-08-14T21:39:34.8917515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8917588Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8917817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8917899Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8918112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8918182Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8918438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8918545Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8918773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8918860Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8918863Z 2025-08-14T21:39:34.8918933Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8919004Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8919078Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8919148Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8919257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8919443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8919502Z return mod(**inputs) 2025-08-14T21:39:34.8919745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8919806Z outputs = self.model( 2025-08-14T21:39:34.8920036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8920110Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8920340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8920412Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8920611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8920684Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8920922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8921017Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8921246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8921341Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8921608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8921732Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8921736Z 2025-08-14T21:39:34.8921826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8922008Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8922075Z return mod(**inputs) 2025-08-14T21:39:34.8922308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8922375Z outputs = self.model( 2025-08-14T21:39:34.8922606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8922671Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8922923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8922989Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8923189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8923283Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8923519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8923621Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8923869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8923957Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8924226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8924321Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8924324Z 2025-08-14T21:39:34.8924422Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8924603Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8924663Z return mod(**inputs) 2025-08-14T21:39:34.8924936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8924996Z outputs = self.model( 2025-08-14T21:39:34.8925224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8925298Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8925528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8925599Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8925802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8925872Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8926109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8926208Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8926445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8926518Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8926521Z 2025-08-14T21:39:34.8926616Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8926803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8926860Z return mod(**inputs) 2025-08-14T21:39:34.8927094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8927162Z outputs = self.model( 2025-08-14T21:39:34.8927392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8927467Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8927697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8927760Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8927968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8928038Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8928265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8928397Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8928401Z 2025-08-14T21:39:34.8928495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8928679Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8928753Z return mod(**inputs) 2025-08-14T21:39:34.8928987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8929053Z outputs = self.model( 2025-08-14T21:39:34.8929283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8929369Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8929602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8929666Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8929874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8929946Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8930174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8930304Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8930307Z 2025-08-14T21:39:34.8930399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8930588Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8930647Z return mod(**inputs) 2025-08-14T21:39:34.8930882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8930950Z outputs = self.model( 2025-08-14T21:39:34.8931186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8931257Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8931491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8931558Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8931768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8931838Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8932071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8932152Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8932155Z 2025-08-14T21:39:34.8932248Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8932438Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8932496Z return mod(**inputs) 2025-08-14T21:39:34.8932730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8932798Z outputs = self.model( 2025-08-14T21:39:34.8933033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8933104Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8933334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8933399Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8933606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8933675Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8933922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8934021Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8934253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8934413Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8934416Z 2025-08-14T21:39:34.8934507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8934690Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8934770Z return mod(**inputs) 2025-08-14T21:39:34.8935003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8935068Z outputs = self.model( 2025-08-14T21:39:34.8935300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8935363Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8935601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8935680Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8935882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8935959Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8936190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8936287Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8936520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8936592Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8936596Z 2025-08-14T21:39:34.8936694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8936874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8936940Z return mod(**inputs) 2025-08-14T21:39:34.8937172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8937232Z outputs = self.model( 2025-08-14T21:39:34.8937471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8937534Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8937768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8937843Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8938044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8938122Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8938356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8938446Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8938684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8938760Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8938765Z 2025-08-14T21:39:34.8938846Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8938919Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8938988Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8939064Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8939172Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8939355Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8939422Z return mod(**inputs) 2025-08-14T21:39:34.8939678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8939741Z outputs = self.model( 2025-08-14T21:39:34.8939977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8940040Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8940289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8940356Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8940561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8940640Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8940871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8940966Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8941210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8941296Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8941567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8941685Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8941689Z 2025-08-14T21:39:34.8941780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8941970Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8942028Z return mod(**inputs) 2025-08-14T21:39:34.8942267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8942328Z outputs = self.model( 2025-08-14T21:39:34.8942559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8942631Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8942861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8942932Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8943132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8943203Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8943438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8943526Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8943756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8943850Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8944111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8944214Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8944219Z 2025-08-14T21:39:34.8944310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8944492Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8944558Z return mod(**inputs) 2025-08-14T21:39:34.8944804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8944948Z outputs = self.model( 2025-08-14T21:39:34.8945182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8945270Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8945511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8945576Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8945789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8945872Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8946107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8946203Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8946435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8946510Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8946527Z 2025-08-14T21:39:34.8946628Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8946809Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8946874Z return mod(**inputs) 2025-08-14T21:39:34.8947108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8947169Z outputs = self.model( 2025-08-14T21:39:34.8947408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8947475Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8947708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8947779Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8947982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8948058Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8948289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 482, in forward 2025-08-14T21:39:34.8948359Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8948364Z 2025-08-14T21:39:34.8948461Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8948643Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8948708Z return mod(**inputs) 2025-08-14T21:39:34.8948941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8949001Z outputs = self.model( 2025-08-14T21:39:34.8949237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8949306Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8949537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8949608Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8949809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8949884Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8950116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8950227Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8950465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8950616Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8950621Z 2025-08-14T21:39:34.8950720Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8950901Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8950960Z return mod(**inputs) 2025-08-14T21:39:34.8951221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8951284Z outputs = self.model( 2025-08-14T21:39:34.8951515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8951589Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8951820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8951893Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8952092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8952179Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8952414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8952512Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8952740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8952819Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8952822Z 2025-08-14T21:39:34.8952915Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8953101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8953158Z return mod(**inputs) 2025-08-14T21:39:34.8953389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8953457Z outputs = self.model( 2025-08-14T21:39:34.8953686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8953760Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8953995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8954060Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8954267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8954336Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8954564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8954668Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8954896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8954979Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8954983Z 2025-08-14T21:39:34.8955053Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8955124Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8955199Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8955265Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8955356Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8955557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8955616Z return mod(**inputs) 2025-08-14T21:39:34.8955855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8955931Z outputs = self.model( 2025-08-14T21:39:34.8956163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8956234Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8956477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8956550Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8956752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8956823Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8957065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8957161Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8957395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8957511Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8957776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8957903Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8957907Z 2025-08-14T21:39:34.8958000Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8958179Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8958247Z return mod(**inputs) 2025-08-14T21:39:34.8958480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8958547Z outputs = self.model( 2025-08-14T21:39:34.8958781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8958848Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8959084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8959151Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8959350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8959426Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8959659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8959762Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8959990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8960078Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8960349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8960445Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8960448Z 2025-08-14T21:39:34.8960548Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8960727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8960784Z return mod(**inputs) 2025-08-14T21:39:34.8961045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8961108Z outputs = self.model( 2025-08-14T21:39:34.8961339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8961427Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8961661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8961729Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8961930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8962013Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8962255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8962350Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8962576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8962657Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8962662Z 2025-08-14T21:39:34.8962752Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8962956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8963016Z return mod(**inputs) 2025-08-14T21:39:34.8963247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8963318Z outputs = self.model( 2025-08-14T21:39:34.8963552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8963627Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8963858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8963923Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8964130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8964203Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8964434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8964552Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8964555Z 2025-08-14T21:39:34.8964649Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8964835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8964895Z return mod(**inputs) 2025-08-14T21:39:34.8965127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8965197Z outputs = self.model( 2025-08-14T21:39:34.8965426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8965501Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8965733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8965798Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8966004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8966076Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8966304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.8966433Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.8966437Z 2025-08-14T21:39:34.8966529Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8966720Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8966795Z return mod(**inputs) 2025-08-14T21:39:34.8967036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8967105Z outputs = self.model( 2025-08-14T21:39:34.8967346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8967431Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8967662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8967725Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8967934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8968004Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8968232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.8968315Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.8968333Z 2025-08-14T21:39:34.8968427Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8968619Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8968675Z return mod(**inputs) 2025-08-14T21:39:34.8968907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8968976Z outputs = self.model( 2025-08-14T21:39:34.8969208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8969271Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8969511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8969575Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8969784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8969853Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8970085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8970181Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8970413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8970556Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8970560Z 2025-08-14T21:39:34.8970652Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8970833Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8970901Z return mod(**inputs) 2025-08-14T21:39:34.8971135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8971203Z outputs = self.model( 2025-08-14T21:39:34.8971434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8971501Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8971738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8971802Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8972016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8972097Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8972326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8972445Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8972679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8972751Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8972754Z 2025-08-14T21:39:34.8972869Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8973051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8973110Z return mod(**inputs) 2025-08-14T21:39:34.8973349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8973408Z outputs = self.model( 2025-08-14T21:39:34.8973646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8973710Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8973958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8974030Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8974229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8974309Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8974539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8974626Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8974861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8974938Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8974942Z 2025-08-14T21:39:34.8975013Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8975093Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8975162Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8975237Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8975329Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8975510Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8975574Z return mod(**inputs) 2025-08-14T21:39:34.8975805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8975866Z outputs = self.model( 2025-08-14T21:39:34.8976104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8976169Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8976409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8976473Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8976672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8976749Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8976979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8977073Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8977317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8977406Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8977678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8977816Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8977821Z 2025-08-14T21:39:34.8977913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8978104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8978162Z return mod(**inputs) 2025-08-14T21:39:34.8978420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8978483Z outputs = self.model( 2025-08-14T21:39:34.8978719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8978792Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8979022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8979096Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8979328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8979399Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8979637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8979726Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8979957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8980052Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8980317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8980422Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8980427Z 2025-08-14T21:39:34.8980518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8980701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8980766Z return mod(**inputs) 2025-08-14T21:39:34.8980998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8981066Z outputs = self.model( 2025-08-14T21:39:34.8981300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8981362Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8981602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8981666Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8981865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8981948Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8982178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.8982276Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.8982509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8982584Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8982587Z 2025-08-14T21:39:34.8982684Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8982881Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8982943Z return mod(**inputs) 2025-08-14T21:39:34.8983186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8983267Z outputs = self.model( 2025-08-14T21:39:34.8983508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8983571Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8983818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8983892Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8984091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8984169Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8984397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8984494Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8984929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.8985103Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.8985107Z 2025-08-14T21:39:34.8985208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8985391Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8985450Z return mod(**inputs) 2025-08-14T21:39:34.8985690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8985753Z outputs = self.model( 2025-08-14T21:39:34.8985982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8986054Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8986284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8986355Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8986554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8986624Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8986860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8986958Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8987188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.8987272Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.8987275Z 2025-08-14T21:39:34.8987366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8987555Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8987616Z return mod(**inputs) 2025-08-14T21:39:34.8987845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8987915Z outputs = self.model( 2025-08-14T21:39:34.8988142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8988213Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8988463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8988528Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8988738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8988832Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8989061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8989163Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8989390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.8989491Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.8989495Z 2025-08-14T21:39:34.8989569Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8989639Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8989715Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8989784Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.8989877Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8990067Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8990126Z return mod(**inputs) 2025-08-14T21:39:34.8990383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8990442Z outputs = self.model( 2025-08-14T21:39:34.8990671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8990744Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8990977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8991040Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8991249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8991319Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8991556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8991653Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8991885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8991977Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8992245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.8992371Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.8992374Z 2025-08-14T21:39:34.8992466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8992647Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8992714Z return mod(**inputs) 2025-08-14T21:39:34.8992948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8993019Z outputs = self.model( 2025-08-14T21:39:34.8993251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8993317Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8993557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8993622Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8993822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8993914Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8994144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8994247Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8994493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.8994581Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.8994849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.8994957Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.8994961Z 2025-08-14T21:39:34.8995063Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8995244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8995304Z return mod(**inputs) 2025-08-14T21:39:34.8995544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8995607Z outputs = self.model( 2025-08-14T21:39:34.8995838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8995926Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8996155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8996227Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8996426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8996497Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8996735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.8996832Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.8997060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.8997143Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.8997147Z 2025-08-14T21:39:34.8997237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8997422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8997480Z return mod(**inputs) 2025-08-14T21:39:34.8997713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8997781Z outputs = self.model( 2025-08-14T21:39:34.8998013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8998084Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.8998313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.8998377Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.8998587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.8998656Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.8998885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 499, in forward 2025-08-14T21:39:34.8998966Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.8998969Z 2025-08-14T21:39:34.8999060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.8999259Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.8999320Z return mod(**inputs) 2025-08-14T21:39:34.8999551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.8999619Z outputs = self.model( 2025-08-14T21:39:34.8999865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.8999938Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9000166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9000228Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9000456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9000529Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9000759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.9000876Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.9000879Z 2025-08-14T21:39:34.9000970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9001159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9001234Z return mod(**inputs) 2025-08-14T21:39:34.9001471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9001539Z outputs = self.model( 2025-08-14T21:39:34.9001769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9001841Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9002069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9002133Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9002339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9002410Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9002640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.9002753Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.9002756Z 2025-08-14T21:39:34.9002847Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9003033Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9003090Z return mod(**inputs) 2025-08-14T21:39:34.9003321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9003389Z outputs = self.model( 2025-08-14T21:39:34.9003617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9003680Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9003918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9003982Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9004186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9004255Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9004483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.9004562Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.9004566Z 2025-08-14T21:39:34.9004671Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9004862Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9004920Z return mod(**inputs) 2025-08-14T21:39:34.9005153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9005238Z outputs = self.model( 2025-08-14T21:39:34.9005471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9005534Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9005786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9005853Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9006062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9006132Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9006362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.9006461Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.9006711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.9006853Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.9006857Z 2025-08-14T21:39:34.9006948Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9007129Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9007197Z return mod(**inputs) 2025-08-14T21:39:34.9007429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9007489Z outputs = self.model( 2025-08-14T21:39:34.9007724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9007790Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9008023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9008089Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9008289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9008368Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9008595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.9008693Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.9008920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.9008992Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.9008995Z 2025-08-14T21:39:34.9009096Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9009273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9009331Z return mod(**inputs) 2025-08-14T21:39:34.9009567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9009628Z outputs = self.model( 2025-08-14T21:39:34.9009864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9009927Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9010167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9010241Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9010446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9010538Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9010770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.9010858Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.9011127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.9011206Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.9011209Z 2025-08-14T21:39:34.9011280Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.9011359Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.9011430Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.9011504Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.9011595Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9011774Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9011842Z return mod(**inputs) 2025-08-14T21:39:34.9012093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9012153Z outputs = self.model( 2025-08-14T21:39:34.9012397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9012461Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9012703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9012768Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9012971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9013048Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9013279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.9013369Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.9013607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.9013693Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.9013966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.9014086Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.9014089Z 2025-08-14T21:39:34.9014182Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9014370Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9014429Z return mod(**inputs) 2025-08-14T21:39:34.9014671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9014732Z outputs = self.model( 2025-08-14T21:39:34.9014963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9015038Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9015270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9015333Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9015552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9015624Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9015862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.9015966Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.9016197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.9016290Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.9016569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.9016674Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.9016677Z 2025-08-14T21:39:34.9016768Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9016951Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9017017Z return mod(**inputs) 2025-08-14T21:39:34.9017249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9017312Z outputs = self.model( 2025-08-14T21:39:34.9017564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9017630Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9017867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9017932Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9018132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9018211Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9018437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 473, in forward 2025-08-14T21:39:34.9018529Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:39:34.9018757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.9018832Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.9018835Z 2025-08-14T21:39:34.9018934Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9019111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9019172Z return mod(**inputs) 2025-08-14T21:39:34.9019409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9019467Z outputs = self.model( 2025-08-14T21:39:34.9019705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9019770Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9019998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9020071Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9020272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9020352Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9020581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.9020679Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.9020913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 281, in forward 2025-08-14T21:39:34.9021063Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:39:34.9021067Z 2025-08-14T21:39:34.9021162Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9021350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9021432Z return mod(**inputs) 2025-08-14T21:39:34.9021673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9021734Z outputs = self.model( 2025-08-14T21:39:34.9021980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9022057Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9022285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9022357Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9022561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9022631Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9022866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.9022977Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.9023204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 300, in forward 2025-08-14T21:39:34.9023281Z key_states = self.k_proj(current_states) 2025-08-14T21:39:34.9023286Z 2025-08-14T21:39:34.9023377Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9023561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9023619Z return mod(**inputs) 2025-08-14T21:39:34.9023848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9023915Z outputs = self.model( 2025-08-14T21:39:34.9024143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9024214Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9024440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9024500Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9024702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9024768Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9025061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.9025164Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.9025388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 301, in forward 2025-08-14T21:39:34.9025465Z value_states = self.v_proj(current_states) 2025-08-14T21:39:34.9025470Z 2025-08-14T21:39:34.9025539Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.9025604Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.9025678Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.9025745Z cudagraph partition due to non gpu ops 2025-08-14T21:39:34.9025837Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9026025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9026083Z return mod(**inputs) 2025-08-14T21:39:34.9026344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9026406Z outputs = self.model( 2025-08-14T21:39:34.9026638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9026727Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9026966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9027030Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9027244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9027327Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9027564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.9027659Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.9027889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.9027984Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.9028245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:39:34.9028385Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:39:34.9028389Z 2025-08-14T21:39:34.9028476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9028655Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9028721Z return mod(**inputs) 2025-08-14T21:39:34.9028951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9029010Z outputs = self.model( 2025-08-14T21:39:34.9036933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9037053Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9037346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9037424Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9037645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9037730Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9037973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.9038083Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.9038320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 319, in forward 2025-08-14T21:39:34.9038414Z attn_output, attn_weights = attention_interface( 2025-08-14T21:39:34.9038693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:39:34.9038795Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:39:34.9038802Z 2025-08-14T21:39:34.9038908Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9039101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9039166Z return mod(**inputs) 2025-08-14T21:39:34.9039411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9039478Z outputs = self.model( 2025-08-14T21:39:34.9039711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9039856Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9040090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9040164Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9040401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9040479Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9040771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 490, in forward 2025-08-14T21:39:34.9040900Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:39:34.9041151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 333, in forward 2025-08-14T21:39:34.9041228Z attn_output = self.out_proj(attn_output) 2025-08-14T21:39:34.9041232Z 2025-08-14T21:39:34.9041335Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9041535Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9041600Z return mod(**inputs) 2025-08-14T21:39:34.9041842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9041932Z outputs = self.model( 2025-08-14T21:39:34.9042172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9042248Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9042485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9042551Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9042770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9042843Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9043084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.9043211Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.9043216Z 2025-08-14T21:39:34.9043314Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9043515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9043576Z return mod(**inputs) 2025-08-14T21:39:34.9043821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9043891Z outputs = self.model( 2025-08-14T21:39:34.9044128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9044205Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9044441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9044507Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9044724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9044797Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9045033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 504, in forward 2025-08-14T21:39:34.9045150Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:39:34.9045154Z 2025-08-14T21:39:34.9045251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9045446Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9045521Z return mod(**inputs) 2025-08-14T21:39:34.9045762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9045831Z outputs = self.model( 2025-08-14T21:39:34.9046086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9046160Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9046400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9046467Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9046696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9046770Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9047010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 506, in forward 2025-08-14T21:39:34.9047093Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:39:34.9047097Z 2025-08-14T21:39:34.9047193Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9047392Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9047468Z return mod(**inputs) 2025-08-14T21:39:34.9047707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1404, in forward 2025-08-14T21:39:34.9047775Z outputs = self.model( 2025-08-14T21:39:34.9048016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1279, in forward 2025-08-14T21:39:34.9048089Z decoder_outputs = self.decoder( 2025-08-14T21:39:34.9048326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1127, in forward 2025-08-14T21:39:34.9048393Z layer_outputs = decoder_layer( 2025-08-14T21:39:34.9048606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:39:34.9048679Z return super().__call__(*args, **kwargs) 2025-08-14T21:39:34.9048919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 508, in forward 2025-08-14T21:39:34.9049005Z hidden_states = residual + hidden_states 2025-08-14T21:39:34.9049008Z 2025-08-14T21:39:34.9049101Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9049295Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9049353Z return mod(**inputs) 2025-08-14T21:39:34.9049590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1422, in forward 2025-08-14T21:39:34.9049671Z lm_logits = self.lm_head(outputs[0]) 2025-08-14T21:39:34.9049676Z 2025-08-14T21:39:34.9049771Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:39:34.9049961Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:39:34.9050022Z return mod(**inputs) 2025-08-14T21:39:34.9050264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/m2m_100/modeling_m2m_100.py", line 1429, in forward 2025-08-14T21:39:34.9050432Z masked_lm_loss = loss_fct(lm_logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:39:34.9050436Z 2025-08-14T21:39:45.6340385Z Compilation time (from dynamo_timed): 24.847219112 2025-08-14T21:39:45.6445234Z pass 2025-08-14T21:39:45.6447158Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:39:45.6448231Z TIMING: _recursive_pre_grad_passes:0.01244 _recursive_joint_graph_passes:0.97311 _recursive_post_grad_passes:0.14279 async_compile.wait:0.72434 code_gen:10.32571 inductor_compile:13.00563 backend_compile:19.61043 gc:0.00016 entire_frame_compile:24.84722 total_wall_time:24.84722 2025-08-14T21:39:45.6449432Z STATS: call_* op count: 1014 | FakeTensorMode.__torch_dispatch__:33764 | FakeTensor.__torch_dispatch__:11261 | ProxyTorchDispatchMode.__torch_dispatch__:12417 2025-08-14T21:39:45.6450036Z Dynamo produced 1 graphs covering 1014 ops with 0 graph breaks (0 unique) 2025-08-14T21:39:50.1109698Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:39:50.1110576Z from pkg_resources import resource_filename 2025-08-14T21:39:50.7592996Z 2025-08-14T21:39:53.2665334Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:39:53.2669583Z loading model: 0it [00:02, ?it/s] 2025-08-14T21:39:53.2684255Z cpu eval MBartForCausalLM 2025-08-14T21:39:54.4390267Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:39:54.8870301Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:39:55.3618124Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:40:02.2701332Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2703067Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2703438Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2703722Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2703957Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2704264Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2704516Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2704980Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2705323Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2705631Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2710335Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2715255Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2717267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2717812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2723330Z return mod(**inputs) 2025-08-14T21:40:02.2726774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2727299Z outputs = self.model.decoder( 2025-08-14T21:40:02.2732484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2736796Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2739132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2739626Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2745231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2751154Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2755776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.2757922Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.2758236Z 2025-08-14T21:40:02.2762902Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2767767Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2772003Z return mod(**inputs) 2025-08-14T21:40:02.2774510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2775069Z outputs = self.model.decoder( 2025-08-14T21:40:02.2780220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2782148Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2782627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2782999Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2783406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2784084Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2784499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.2785260Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.2785408Z 2025-08-14T21:40:02.2785520Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2785879Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2786303Z return mod(**inputs) 2025-08-14T21:40:02.2786655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2787114Z outputs = self.model.decoder( 2025-08-14T21:40:02.2787495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2787862Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2788198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2788547Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2788914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2789296Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2789686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.2790075Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.2790203Z 2025-08-14T21:40:02.2790279Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2790474Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2790660Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2790845Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2791053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2791382Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2791684Z return mod(**inputs) 2025-08-14T21:40:02.2792017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2792598Z outputs = self.model.decoder( 2025-08-14T21:40:02.2792949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2793309Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2793623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2793956Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2794314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2794688Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2795054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2795467Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2795876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.2796313Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.2796526Z 2025-08-14T21:40:02.2796624Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2796952Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2797250Z return mod(**inputs) 2025-08-14T21:40:02.2797609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2797972Z outputs = self.model.decoder( 2025-08-14T21:40:02.2798326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2798688Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2799006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2799345Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2799712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2800110Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2800486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2800868Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2801280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.2801698Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.2801857Z 2025-08-14T21:40:02.2801958Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2802294Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2802597Z return mod(**inputs) 2025-08-14T21:40:02.2802932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2803295Z outputs = self.model.decoder( 2025-08-14T21:40:02.2803650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2804001Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2804327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2804662Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2805022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2805395Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2805772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.2806141Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.2806269Z 2025-08-14T21:40:02.2806364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2806697Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2806996Z return mod(**inputs) 2025-08-14T21:40:02.2807334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2807684Z outputs = self.model.decoder( 2025-08-14T21:40:02.2808039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2808414Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2808732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2809065Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2809498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2809903Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2810067Z 2025-08-14T21:40:02.2810163Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2810506Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2810806Z return mod(**inputs) 2025-08-14T21:40:02.2811139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2811488Z outputs = self.model.decoder( 2025-08-14T21:40:02.2811837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2812191Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2812501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2812855Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2813212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2813608Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2813962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.2814274Z return self.act(input) 2025-08-14T21:40:02.2814376Z 2025-08-14T21:40:02.2814477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2814808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2815099Z return mod(**inputs) 2025-08-14T21:40:02.2815435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2815789Z outputs = self.model.decoder( 2025-08-14T21:40:02.2816130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2816488Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2816806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2817135Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2817484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.2817851Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.2817977Z 2025-08-14T21:40:02.2818078Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2818402Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2818705Z return mod(**inputs) 2025-08-14T21:40:02.2819036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2819393Z outputs = self.model.decoder( 2025-08-14T21:40:02.2819733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2820090Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2820411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2820734Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2821110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2821491Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2821866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.2822301Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.2822496Z 2025-08-14T21:40:02.2822594Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2822930Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2823273Z return mod(**inputs) 2025-08-14T21:40:02.2823610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2823964Z outputs = self.model.decoder( 2025-08-14T21:40:02.2824305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2824657Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2825054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2825397Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2825786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2826174Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2826617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.2826971Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.2827105Z 2025-08-14T21:40:02.2827199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2827529Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2827828Z return mod(**inputs) 2025-08-14T21:40:02.2828204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2828569Z outputs = self.model.decoder( 2025-08-14T21:40:02.2828928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2829283Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2829606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2829945Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2830304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2830680Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2831064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.2831442Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.2831573Z 2025-08-14T21:40:02.2831657Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2831848Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2832043Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2832235Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2832445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2832784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2833092Z return mod(**inputs) 2025-08-14T21:40:02.2833425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2833786Z outputs = self.model.decoder( 2025-08-14T21:40:02.2834159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2834526Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2834847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2835208Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2835569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2835955Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2836346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2836737Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2837158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.2837608Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.2837791Z 2025-08-14T21:40:02.2837886Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2838228Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2838554Z return mod(**inputs) 2025-08-14T21:40:02.2838896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2839254Z outputs = self.model.decoder( 2025-08-14T21:40:02.2839603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2839948Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2840266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2840597Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2840951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2841317Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2841691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2842065Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2842465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.2842876Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.2843029Z 2025-08-14T21:40:02.2843122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2843446Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2843735Z return mod(**inputs) 2025-08-14T21:40:02.2844065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2844419Z outputs = self.model.decoder( 2025-08-14T21:40:02.2844766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2845111Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2845438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2845775Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2846145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2846511Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2846927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.2847293Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.2847417Z 2025-08-14T21:40:02.2847512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2847851Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2848145Z return mod(**inputs) 2025-08-14T21:40:02.2848475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2848820Z outputs = self.model.decoder( 2025-08-14T21:40:02.2849181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2849537Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2849848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2850181Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2850541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2850955Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2851141Z 2025-08-14T21:40:02.2851234Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2851561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2851857Z return mod(**inputs) 2025-08-14T21:40:02.2852183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2852537Z outputs = self.model.decoder( 2025-08-14T21:40:02.2852885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2853242Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2853554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2853889Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2854246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2854635Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2854994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.2855307Z return self.act(input) 2025-08-14T21:40:02.2855408Z 2025-08-14T21:40:02.2855510Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2855831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2856131Z return mod(**inputs) 2025-08-14T21:40:02.2856463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2856808Z outputs = self.model.decoder( 2025-08-14T21:40:02.2857158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2857512Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2857828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2858149Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2858503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.2858864Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.2858988Z 2025-08-14T21:40:02.2859089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2859428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2859728Z return mod(**inputs) 2025-08-14T21:40:02.2860062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2860425Z outputs = self.model.decoder( 2025-08-14T21:40:02.2860775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2861125Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2861442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2861782Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2862141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:02.2862503Z hidden_states = residual + hidden_states 2025-08-14T21:40:02.2862625Z 2025-08-14T21:40:02.2862719Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2863047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2863350Z return mod(**inputs) 2025-08-14T21:40:02.2863688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2864054Z outputs = self.model.decoder( 2025-08-14T21:40:02.2864401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2864762Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2865156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2865506Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2865878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2866275Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2866647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.2867084Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.2867293Z 2025-08-14T21:40:02.2867390Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2867733Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2868030Z return mod(**inputs) 2025-08-14T21:40:02.2868371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2868734Z outputs = self.model.decoder( 2025-08-14T21:40:02.2869085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2869452Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2869779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2870120Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2870482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2870867Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2871253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.2871624Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.2871749Z 2025-08-14T21:40:02.2871843Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2872200Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2872514Z return mod(**inputs) 2025-08-14T21:40:02.2872849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2873213Z outputs = self.model.decoder( 2025-08-14T21:40:02.2873606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2873968Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2874286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2874626Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2875007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2875410Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2875793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.2876167Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.2876301Z 2025-08-14T21:40:02.2876383Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2876579Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2876797Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2876989Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2877200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2877539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2877844Z return mod(**inputs) 2025-08-14T21:40:02.2878185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2878547Z outputs = self.model.decoder( 2025-08-14T21:40:02.2878906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2879268Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2879590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2879935Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2880304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2880689Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2881067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2881453Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2881868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.2882321Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.2882493Z 2025-08-14T21:40:02.2882588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2882920Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2883227Z return mod(**inputs) 2025-08-14T21:40:02.2883563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2883933Z outputs = self.model.decoder( 2025-08-14T21:40:02.2884292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2884786Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2885112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2885460Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2885871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2886255Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2886650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2887049Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2887452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.2887890Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.2888047Z 2025-08-14T21:40:02.2888141Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2888472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2888771Z return mod(**inputs) 2025-08-14T21:40:02.2889101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2889460Z outputs = self.model.decoder( 2025-08-14T21:40:02.2889811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2890194Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2890514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2890846Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2891206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2891579Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2891958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.2892320Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.2892446Z 2025-08-14T21:40:02.2892545Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2892871Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2893176Z return mod(**inputs) 2025-08-14T21:40:02.2893510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2893864Z outputs = self.model.decoder( 2025-08-14T21:40:02.2894216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2894571Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2894892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2895222Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2895580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2895979Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2896141Z 2025-08-14T21:40:02.2896240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2896565Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2896864Z return mod(**inputs) 2025-08-14T21:40:02.2897198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2897556Z outputs = self.model.decoder( 2025-08-14T21:40:02.2897914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2898270Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2898604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2898936Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2899295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2899707Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2900066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.2900381Z return self.act(input) 2025-08-14T21:40:02.2900489Z 2025-08-14T21:40:02.2900601Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2900933Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2901226Z return mod(**inputs) 2025-08-14T21:40:02.2901563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2901922Z outputs = self.model.decoder( 2025-08-14T21:40:02.2902268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2902632Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2902978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2903312Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2903667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.2904030Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.2904155Z 2025-08-14T21:40:02.2904253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2904582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2904935Z return mod(**inputs) 2025-08-14T21:40:02.2905286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2905660Z outputs = self.model.decoder( 2025-08-14T21:40:02.2906015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2906395Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2906716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2907047Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2907406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2907786Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2908161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.2908583Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.2908773Z 2025-08-14T21:40:02.2908869Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2909195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2909490Z return mod(**inputs) 2025-08-14T21:40:02.2909815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2910179Z outputs = self.model.decoder( 2025-08-14T21:40:02.2910529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2910882Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2911218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2911556Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2911918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2912314Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2912692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.2913053Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.2913181Z 2025-08-14T21:40:02.2913283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2913621Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2913923Z return mod(**inputs) 2025-08-14T21:40:02.2914257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2914623Z outputs = self.model.decoder( 2025-08-14T21:40:02.2914971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2915342Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2915663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2916014Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2916380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2916767Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2917149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.2917514Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.2917654Z 2025-08-14T21:40:02.2917728Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2917930Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2918117Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2918313Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2918536Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2918872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2919168Z return mod(**inputs) 2025-08-14T21:40:02.2919511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2919878Z outputs = self.model.decoder( 2025-08-14T21:40:02.2920230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2920590Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2920918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2921255Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2921612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2922001Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2922383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2922760Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2923180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.2923628Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.2923803Z 2025-08-14T21:40:02.2923905Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2924263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2924566Z return mod(**inputs) 2025-08-14T21:40:02.2924907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2925317Z outputs = self.model.decoder( 2025-08-14T21:40:02.2925683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2926049Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2926397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2926732Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2927099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2927495Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2927880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2928259Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2928677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.2929132Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.2929286Z 2025-08-14T21:40:02.2929391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2929722Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2930037Z return mod(**inputs) 2025-08-14T21:40:02.2930370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2930720Z outputs = self.model.decoder( 2025-08-14T21:40:02.2931071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2931421Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2931740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2932067Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2932422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2932796Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2933165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.2933520Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.2933650Z 2025-08-14T21:40:02.2933745Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2934074Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2934361Z return mod(**inputs) 2025-08-14T21:40:02.2934695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2935049Z outputs = self.model.decoder( 2025-08-14T21:40:02.2935394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2935739Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2936056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2936385Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2936728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2937136Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2937302Z 2025-08-14T21:40:02.2937395Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2937720Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2938027Z return mod(**inputs) 2025-08-14T21:40:02.2938359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2938711Z outputs = self.model.decoder( 2025-08-14T21:40:02.2939070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2939420Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2939739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2940072Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2940420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2940814Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2941170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.2941503Z return self.act(input) 2025-08-14T21:40:02.2941605Z 2025-08-14T21:40:02.2941700Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2942030Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2942331Z return mod(**inputs) 2025-08-14T21:40:02.2942659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2943019Z outputs = self.model.decoder( 2025-08-14T21:40:02.2943371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2943729Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2944039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2944374Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2944734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.2945164Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.2945296Z 2025-08-14T21:40:02.2945395Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2945739Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2946043Z return mod(**inputs) 2025-08-14T21:40:02.2946381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2946758Z outputs = self.model.decoder( 2025-08-14T21:40:02.2947107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2947466Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2947779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2948112Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2948468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:02.2948822Z hidden_states = residual + hidden_states 2025-08-14T21:40:02.2948952Z 2025-08-14T21:40:02.2949045Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2949371Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2949691Z return mod(**inputs) 2025-08-14T21:40:02.2950022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2950379Z outputs = self.model.decoder( 2025-08-14T21:40:02.2950745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2951098Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2951419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2951752Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2952128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2952502Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2952878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.2953307Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.2953498Z 2025-08-14T21:40:02.2953598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2953924Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2954238Z return mod(**inputs) 2025-08-14T21:40:02.2954574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2954925Z outputs = self.model.decoder( 2025-08-14T21:40:02.2955281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2955639Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2955962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2956294Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2956654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2957037Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2957418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.2957779Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.2957913Z 2025-08-14T21:40:02.2958009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2958345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2958643Z return mod(**inputs) 2025-08-14T21:40:02.2958981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2959343Z outputs = self.model.decoder( 2025-08-14T21:40:02.2959695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2960049Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2960374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2960713Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2961065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2961444Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2961824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.2962195Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.2962328Z 2025-08-14T21:40:02.2962420Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2962620Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2962809Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2962993Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.2963223Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2963556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2963855Z return mod(**inputs) 2025-08-14T21:40:02.2964180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2964536Z outputs = self.model.decoder( 2025-08-14T21:40:02.2964903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2965254Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2965587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2965976Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2966351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2966739Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2967151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2967539Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2967955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.2968401Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.2968585Z 2025-08-14T21:40:02.2968683Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2969023Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2969321Z return mod(**inputs) 2025-08-14T21:40:02.2969660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2970031Z outputs = self.model.decoder( 2025-08-14T21:40:02.2970391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2970747Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2971076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2971418Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2971785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2972165Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2972548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.2972934Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.2973344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.2973778Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.2973940Z 2025-08-14T21:40:02.2974038Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2974380Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2974681Z return mod(**inputs) 2025-08-14T21:40:02.2975027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2975398Z outputs = self.model.decoder( 2025-08-14T21:40:02.2975765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2976137Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2976466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2976823Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2977178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2977560Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2977958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.2978333Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.2978470Z 2025-08-14T21:40:02.2978567Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2978902Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2979212Z return mod(**inputs) 2025-08-14T21:40:02.2979549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2979925Z outputs = self.model.decoder( 2025-08-14T21:40:02.2980271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2980622Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2980932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2981263Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2981617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2982014Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2982173Z 2025-08-14T21:40:02.2982269Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2982593Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2982890Z return mod(**inputs) 2025-08-14T21:40:02.2983218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2983574Z outputs = self.model.decoder( 2025-08-14T21:40:02.2983919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2984270Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2984686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2985085Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2985459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.2985862Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.2986234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.2986566Z return self.act(input) 2025-08-14T21:40:02.2986670Z 2025-08-14T21:40:02.2986774Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2987112Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2987414Z return mod(**inputs) 2025-08-14T21:40:02.2987760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2988125Z outputs = self.model.decoder( 2025-08-14T21:40:02.2988516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2988876Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2989194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2989543Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2989905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.2990267Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.2990393Z 2025-08-14T21:40:02.2990493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2990841Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2991145Z return mod(**inputs) 2025-08-14T21:40:02.2991482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2991841Z outputs = self.model.decoder( 2025-08-14T21:40:02.2992196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2992554Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2992880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2993236Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2993593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2993973Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2994343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.2994767Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.2994963Z 2025-08-14T21:40:02.2995061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2995387Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.2995678Z return mod(**inputs) 2025-08-14T21:40:02.2996014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.2996371Z outputs = self.model.decoder( 2025-08-14T21:40:02.2996717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.2997074Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.2997403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.2997742Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.2998102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.2998487Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.2998872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.2999245Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.2999372Z 2025-08-14T21:40:02.2999468Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.2999803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3000107Z return mod(**inputs) 2025-08-14T21:40:02.3000444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3000802Z outputs = self.model.decoder( 2025-08-14T21:40:02.3001179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3001548Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3001871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3002215Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3002595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3002976Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3003349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.3003736Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.3003867Z 2025-08-14T21:40:02.3003949Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3004142Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3004338Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3004534Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3004752Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3005081Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3005389Z return mod(**inputs) 2025-08-14T21:40:02.3005733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3006112Z outputs = self.model.decoder( 2025-08-14T21:40:02.3006468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3006831Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3007161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3007495Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3007865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3008252Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3008630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3009018Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3009437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.3009888Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.3010058Z 2025-08-14T21:40:02.3010156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3010492Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3010794Z return mod(**inputs) 2025-08-14T21:40:02.3011138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3011499Z outputs = self.model.decoder( 2025-08-14T21:40:02.3011856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3012220Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3012538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3012876Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3013242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3013629Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3014000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3014404Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3014819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.3015242Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.3015415Z 2025-08-14T21:40:02.3015512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3015840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3016134Z return mod(**inputs) 2025-08-14T21:40:02.3016487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3016847Z outputs = self.model.decoder( 2025-08-14T21:40:02.3017193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3017548Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3017858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3018187Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3018543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3018935Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3019301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.3019658Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.3019784Z 2025-08-14T21:40:02.3019885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3020203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3020499Z return mod(**inputs) 2025-08-14T21:40:02.3020834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3021186Z outputs = self.model.decoder( 2025-08-14T21:40:02.3021527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3021885Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3022200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3022521Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3022879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3023275Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3023429Z 2025-08-14T21:40:02.3023532Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3023852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3024150Z return mod(**inputs) 2025-08-14T21:40:02.3024482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3024895Z outputs = self.model.decoder( 2025-08-14T21:40:02.3025276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3025664Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3026008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3026368Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3026757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3027179Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3027543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.3027852Z return self.act(input) 2025-08-14T21:40:02.3027963Z 2025-08-14T21:40:02.3028073Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3028402Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3028690Z return mod(**inputs) 2025-08-14T21:40:02.3029019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3029371Z outputs = self.model.decoder( 2025-08-14T21:40:02.3029734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3030083Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3030405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3030739Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3031091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.3031455Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.3031605Z 2025-08-14T21:40:02.3031697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3032022Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3032310Z return mod(**inputs) 2025-08-14T21:40:02.3032643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3032998Z outputs = self.model.decoder( 2025-08-14T21:40:02.3033343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3033686Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3034000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3034326Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3034677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:02.3035038Z hidden_states = residual + hidden_states 2025-08-14T21:40:02.3035165Z 2025-08-14T21:40:02.3035257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3035585Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3035873Z return mod(**inputs) 2025-08-14T21:40:02.3036202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3036558Z outputs = self.model.decoder( 2025-08-14T21:40:02.3036895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3037249Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3037567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3037897Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3038244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3038619Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3038995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.3039416Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.3039603Z 2025-08-14T21:40:02.3039713Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3040047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3040348Z return mod(**inputs) 2025-08-14T21:40:02.3040677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3041053Z outputs = self.model.decoder( 2025-08-14T21:40:02.3041398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3041751Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3042080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3042415Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3042772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3043148Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3043516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.3043879Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.3044049Z 2025-08-14T21:40:02.3044149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3044475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3044772Z return mod(**inputs) 2025-08-14T21:40:02.3045102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3045465Z outputs = self.model.decoder( 2025-08-14T21:40:02.3045814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3046182Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3046506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3046839Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3047195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3047575Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3047945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.3048303Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.3048439Z 2025-08-14T21:40:02.3048513Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3048709Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3048899Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3049080Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3049293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3049625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3049913Z return mod(**inputs) 2025-08-14T21:40:02.3050244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3050600Z outputs = self.model.decoder( 2025-08-14T21:40:02.3050939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3051289Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3051606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3051934Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3052298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3052681Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3053060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3053452Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3053859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.3054303Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.3054473Z 2025-08-14T21:40:02.3054601Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3054928Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3055231Z return mod(**inputs) 2025-08-14T21:40:02.3055567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3055925Z outputs = self.model.decoder( 2025-08-14T21:40:02.3056266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3056621Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3056956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3057285Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3057633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3058008Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3058378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3058747Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3059150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.3059565Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.3059714Z 2025-08-14T21:40:02.3059813Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3060133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3060433Z return mod(**inputs) 2025-08-14T21:40:02.3060762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3061118Z outputs = self.model.decoder( 2025-08-14T21:40:02.3061457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3061807Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3062126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3062450Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3062805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3063181Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3063547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.3063902Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.3064033Z 2025-08-14T21:40:02.3064129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3064452Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3064745Z return mod(**inputs) 2025-08-14T21:40:02.3065180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3065559Z outputs = self.model.decoder( 2025-08-14T21:40:02.3065925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3066313Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3066647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3067022Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3067404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3067809Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3067982Z 2025-08-14T21:40:02.3068079Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3068418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3068720Z return mod(**inputs) 2025-08-14T21:40:02.3069060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3069428Z outputs = self.model.decoder( 2025-08-14T21:40:02.3069802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3070159Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3070487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3070827Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3071185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3071590Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3071957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.3072274Z return self.act(input) 2025-08-14T21:40:02.3072378Z 2025-08-14T21:40:02.3072475Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3072812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3073117Z return mod(**inputs) 2025-08-14T21:40:02.3073446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3073810Z outputs = self.model.decoder( 2025-08-14T21:40:02.3074165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3074524Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3074843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3075177Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3075540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.3075910Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.3076038Z 2025-08-14T21:40:02.3076134Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3076470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3076776Z return mod(**inputs) 2025-08-14T21:40:02.3077110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3077480Z outputs = self.model.decoder( 2025-08-14T21:40:02.3077836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3078219Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3078543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3078881Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3079267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3079646Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3080009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.3080452Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.3080638Z 2025-08-14T21:40:02.3080736Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3081054Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3081349Z return mod(**inputs) 2025-08-14T21:40:02.3081679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3082029Z outputs = self.model.decoder( 2025-08-14T21:40:02.3082368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3082744Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3083064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3083391Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3083755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3084136Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3084511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.3084983Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.3085119Z 2025-08-14T21:40:02.3085217Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3085565Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3085874Z return mod(**inputs) 2025-08-14T21:40:02.3086213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3086588Z outputs = self.model.decoder( 2025-08-14T21:40:02.3086940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3087294Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3087621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3087956Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3088316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3088692Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3089068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.3089436Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.3089564Z 2025-08-14T21:40:02.3089645Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3089833Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3090026Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3090213Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3090421Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3090792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3091092Z return mod(**inputs) 2025-08-14T21:40:02.3091416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3091772Z outputs = self.model.decoder( 2025-08-14T21:40:02.3092143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3092501Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3092813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3093147Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3093525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3093903Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3094282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3094661Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3095072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.3095539Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.3095719Z 2025-08-14T21:40:02.3095817Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3096158Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3096465Z return mod(**inputs) 2025-08-14T21:40:02.3096804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3097174Z outputs = self.model.decoder( 2025-08-14T21:40:02.3097536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3097897Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3098227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3098568Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3098940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3099321Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3099707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3100096Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3100513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.3100936Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.3101096Z 2025-08-14T21:40:02.3101193Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3101532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3101834Z return mod(**inputs) 2025-08-14T21:40:02.3102185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3102555Z outputs = self.model.decoder( 2025-08-14T21:40:02.3102917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3103280Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3103615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3103954Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3104331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3104715Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3105186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.3105564Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.3105691Z 2025-08-14T21:40:02.3105789Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3106134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3106473Z return mod(**inputs) 2025-08-14T21:40:02.3106818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3107178Z outputs = self.model.decoder( 2025-08-14T21:40:02.3107542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3107913Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3108234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3108593Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3108958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3109370Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3109536Z 2025-08-14T21:40:02.3109636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3109982Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3110286Z return mod(**inputs) 2025-08-14T21:40:02.3110629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3110994Z outputs = self.model.decoder( 2025-08-14T21:40:02.3111350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3111716Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3112041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3112382Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3112749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3113151Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3113507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.3113828Z return self.act(input) 2025-08-14T21:40:02.3113933Z 2025-08-14T21:40:02.3114039Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3114367Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3114685Z return mod(**inputs) 2025-08-14T21:40:02.3115825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3116596Z outputs = self.model.decoder( 2025-08-14T21:40:02.3117195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3117743Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3118230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3118760Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3119403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.3119796Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.3119945Z 2025-08-14T21:40:02.3120057Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3120430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3120754Z return mod(**inputs) 2025-08-14T21:40:02.3121094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3121458Z outputs = self.model.decoder( 2025-08-14T21:40:02.3121825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3122187Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3122509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3122839Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3123208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:02.3123578Z hidden_states = residual + hidden_states 2025-08-14T21:40:02.3123708Z 2025-08-14T21:40:02.3123841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3124033Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3124095Z return mod(**inputs) 2025-08-14T21:40:02.3124343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3124414Z outputs = self.model.decoder( 2025-08-14T21:40:02.3124664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3124730Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3124933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3125010Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3125241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3125341Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3125581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.3125724Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.3125729Z 2025-08-14T21:40:02.3125833Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3126018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3126078Z return mod(**inputs) 2025-08-14T21:40:02.3126325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3126393Z outputs = self.model.decoder( 2025-08-14T21:40:02.3126635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3126704Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3126912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3126992Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3127228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3127321Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3127587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.3127664Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.3127667Z 2025-08-14T21:40:02.3127769Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3127951Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3128029Z return mod(**inputs) 2025-08-14T21:40:02.3128267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3128332Z outputs = self.model.decoder( 2025-08-14T21:40:02.3128640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3128735Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3129050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3129131Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3129371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3129468Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3129726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.3129833Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.3129837Z 2025-08-14T21:40:02.3129926Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3130002Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3130074Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3130155Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3130255Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3130451Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3130523Z return mod(**inputs) 2025-08-14T21:40:02.3130771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3130846Z outputs = self.model.decoder( 2025-08-14T21:40:02.3131099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3131170Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3131393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3131469Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3131718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3131819Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3132083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3132192Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3132498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.3132642Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.3132648Z 2025-08-14T21:40:02.3132759Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3132964Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3133048Z return mod(**inputs) 2025-08-14T21:40:02.3133288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3133356Z outputs = self.model.decoder( 2025-08-14T21:40:02.3133667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3133733Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3133936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3134032Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3134263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3134358Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3134586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3134690Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3134966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.3135068Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.3135072Z 2025-08-14T21:40:02.3135172Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3135354Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3135413Z return mod(**inputs) 2025-08-14T21:40:02.3135672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3135740Z outputs = self.model.decoder( 2025-08-14T21:40:02.3135970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3136043Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3136244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3136328Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3136566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3136656Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3136905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.3136992Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.3136996Z 2025-08-14T21:40:02.3137108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3137314Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3137380Z return mod(**inputs) 2025-08-14T21:40:02.3137656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3137731Z outputs = self.model.decoder( 2025-08-14T21:40:02.3138005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3138085Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3138311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3138399Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3138664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3138791Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3138795Z 2025-08-14T21:40:02.3138906Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3139111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3139183Z return mod(**inputs) 2025-08-14T21:40:02.3139466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3139543Z outputs = self.model.decoder( 2025-08-14T21:40:02.3139812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3139906Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3140134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3140224Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3140486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3140631Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3140858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.3140932Z return self.act(input) 2025-08-14T21:40:02.3140936Z 2025-08-14T21:40:02.3141050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3141254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3141320Z return mod(**inputs) 2025-08-14T21:40:02.3141593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3141695Z outputs = self.model.decoder( 2025-08-14T21:40:02.3141966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3142039Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3142269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3142357Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3142622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.3142712Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.3142716Z 2025-08-14T21:40:02.3142821Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3143029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3143107Z return mod(**inputs) 2025-08-14T21:40:02.3143370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3143445Z outputs = self.model.decoder( 2025-08-14T21:40:02.3143716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3143789Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3144025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3144107Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3144371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3144477Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3144740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.3145035Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.3145043Z 2025-08-14T21:40:02.3145151Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3145360Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3145437Z return mod(**inputs) 2025-08-14T21:40:02.3145702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3145800Z outputs = self.model.decoder( 2025-08-14T21:40:02.3146073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3146147Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3146403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3146488Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3146750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3146862Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3147141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.3147227Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.3147238Z 2025-08-14T21:40:02.3147346Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3147552Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3147626Z return mod(**inputs) 2025-08-14T21:40:02.3147888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3147982Z outputs = self.model.decoder( 2025-08-14T21:40:02.3148255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3148329Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3148570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3148651Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3148913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3149023Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3149287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.3149374Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.3149386Z 2025-08-14T21:40:02.3149471Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3149552Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3149636Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3149714Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3149818Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3150035Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3150112Z return mod(**inputs) 2025-08-14T21:40:02.3150348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3150426Z outputs = self.model.decoder( 2025-08-14T21:40:02.3150662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3150734Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3150941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3151015Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3151258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3151348Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3151594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3151686Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3152002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.3152138Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.3152142Z 2025-08-14T21:40:02.3152253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3152441Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3152509Z return mod(**inputs) 2025-08-14T21:40:02.3152746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3152822Z outputs = self.model.decoder( 2025-08-14T21:40:02.3153075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3153144Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3153361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3153434Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3153677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3153770Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3154026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3154126Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3154398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.3154502Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.3154515Z 2025-08-14T21:40:02.3154612Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3154801Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3154871Z return mod(**inputs) 2025-08-14T21:40:02.3155111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3155180Z outputs = self.model.decoder( 2025-08-14T21:40:02.3155424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3155491Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3155703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3155778Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3156012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3156107Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3156341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.3156415Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.3156420Z 2025-08-14T21:40:02.3156523Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3156710Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3156775Z return mod(**inputs) 2025-08-14T21:40:02.3157014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3157083Z outputs = self.model.decoder( 2025-08-14T21:40:02.3157326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3157391Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3157619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3157701Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3157935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3158071Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3158076Z 2025-08-14T21:40:02.3158170Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3158354Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3158421Z return mod(**inputs) 2025-08-14T21:40:02.3158673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3158749Z outputs = self.model.decoder( 2025-08-14T21:40:02.3158988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3159054Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3159267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3159340Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3159593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3159708Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3159905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.3159976Z return self.act(input) 2025-08-14T21:40:02.3159979Z 2025-08-14T21:40:02.3160072Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3160256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3160325Z return mod(**inputs) 2025-08-14T21:40:02.3160558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3160629Z outputs = self.model.decoder( 2025-08-14T21:40:02.3160864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3160931Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3161138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3161208Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3161443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.3161526Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.3161529Z 2025-08-14T21:40:02.3161623Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3161814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3161872Z return mod(**inputs) 2025-08-14T21:40:02.3162108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3162185Z outputs = self.model.decoder( 2025-08-14T21:40:02.3162418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3162483Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3162697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3162770Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3163013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:02.3163101Z hidden_states = residual + hidden_states 2025-08-14T21:40:02.3163105Z 2025-08-14T21:40:02.3163201Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3163390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3163467Z return mod(**inputs) 2025-08-14T21:40:02.3163713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3163779Z outputs = self.model.decoder( 2025-08-14T21:40:02.3164015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3164113Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3164321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3164395Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3164639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3164730Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3164972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.3165132Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.3165136Z 2025-08-14T21:40:02.3165230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3165422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3165483Z return mod(**inputs) 2025-08-14T21:40:02.3165731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3165797Z outputs = self.model.decoder( 2025-08-14T21:40:02.3166035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3166110Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3166317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3166391Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3166637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3166727Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3166971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.3167045Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.3167049Z 2025-08-14T21:40:02.3167156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3167350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3167410Z return mod(**inputs) 2025-08-14T21:40:02.3167655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3167723Z outputs = self.model.decoder( 2025-08-14T21:40:02.3167964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3168037Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3168244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3168318Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3168566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3168670Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3168937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.3169037Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.3169062Z 2025-08-14T21:40:02.3169144Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3169238Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3169309Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3169379Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3169481Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3169679Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3169750Z return mod(**inputs) 2025-08-14T21:40:02.3169994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3170063Z outputs = self.model.decoder( 2025-08-14T21:40:02.3170308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3170374Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3170586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3170684Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3170920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3171019Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3171257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3171347Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3171630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.3171756Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.3171760Z 2025-08-14T21:40:02.3171861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3172049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3172110Z return mod(**inputs) 2025-08-14T21:40:02.3172355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3172423Z outputs = self.model.decoder( 2025-08-14T21:40:02.3172661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3172735Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3172944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3173025Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3173258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3173348Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3173591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3173680Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3173960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.3174064Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.3174067Z 2025-08-14T21:40:02.3174161Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3174369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3174432Z return mod(**inputs) 2025-08-14T21:40:02.3174678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3174768Z outputs = self.model.decoder( 2025-08-14T21:40:02.3175005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3175079Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3175285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3176387Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3176633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3176720Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3176958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.3177031Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.3177035Z 2025-08-14T21:40:02.3177126Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3177314Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3177392Z return mod(**inputs) 2025-08-14T21:40:02.3177625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3177700Z outputs = self.model.decoder( 2025-08-14T21:40:02.3177933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3178007Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3178212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3178285Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3178523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3178631Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3178636Z 2025-08-14T21:40:02.3178735Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3178918Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3178976Z return mod(**inputs) 2025-08-14T21:40:02.3179215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3179281Z outputs = self.model.decoder( 2025-08-14T21:40:02.3179512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3179583Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3179782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3179859Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3180091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3180200Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3180402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.3180467Z return self.act(input) 2025-08-14T21:40:02.3180471Z 2025-08-14T21:40:02.3180569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3180748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3180807Z return mod(**inputs) 2025-08-14T21:40:02.3181063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3181132Z outputs = self.model.decoder( 2025-08-14T21:40:02.3181364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3181456Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3181660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3181739Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3182018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.3182095Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.3182099Z 2025-08-14T21:40:02.3182198Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3182379Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3182437Z return mod(**inputs) 2025-08-14T21:40:02.3182677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3182743Z outputs = self.model.decoder( 2025-08-14T21:40:02.3182997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3183060Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3183261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3183341Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3183567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3183663Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3183892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:02.3184027Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:02.3184031Z 2025-08-14T21:40:02.3184134Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3184312Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3184369Z return mod(**inputs) 2025-08-14T21:40:02.3184819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3184984Z outputs = self.model.decoder( 2025-08-14T21:40:02.3185270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3185345Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3185589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3185680Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3185945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3186061Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3186322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:02.3186411Z key_states = self.k_proj(current_states) 2025-08-14T21:40:02.3186414Z 2025-08-14T21:40:02.3186515Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3186695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3186756Z return mod(**inputs) 2025-08-14T21:40:02.3187065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3187133Z outputs = self.model.decoder( 2025-08-14T21:40:02.3187372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3187483Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3187684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3187762Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3188024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3188123Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3188352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:02.3188430Z value_states = self.v_proj(current_states) 2025-08-14T21:40:02.3188434Z 2025-08-14T21:40:02.3188511Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3188581Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3188649Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3188725Z cudagraph partition due to non gpu ops 2025-08-14T21:40:02.3188840Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3189028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3189086Z return mod(**inputs) 2025-08-14T21:40:02.3189319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3189392Z outputs = self.model.decoder( 2025-08-14T21:40:02.3189622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3189687Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3189894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3189965Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3190207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3190296Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3190528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3190622Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3190890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:02.3191011Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:02.3191023Z 2025-08-14T21:40:02.3191115Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3191297Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3191363Z return mod(**inputs) 2025-08-14T21:40:02.3191597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3191663Z outputs = self.model.decoder( 2025-08-14T21:40:02.3191902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3191968Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3192180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3192251Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3192496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3192601Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3192836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:02.3192943Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:02.3193217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:02.3193315Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:02.3193319Z 2025-08-14T21:40:02.3193430Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3193613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3193672Z return mod(**inputs) 2025-08-14T21:40:02.3193915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3193982Z outputs = self.model.decoder( 2025-08-14T21:40:02.3194220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3194288Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3194504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3194581Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3194809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:02.3194897Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:02.3195134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:02.3195208Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:02.3195213Z 2025-08-14T21:40:02.3195312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3195490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3195551Z return mod(**inputs) 2025-08-14T21:40:02.3195789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3195856Z outputs = self.model.decoder( 2025-08-14T21:40:02.3196095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3196160Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3196360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3196437Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3196666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3196773Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3196784Z 2025-08-14T21:40:02.3196876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3197056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3197123Z return mod(**inputs) 2025-08-14T21:40:02.3197354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3197420Z outputs = self.model.decoder( 2025-08-14T21:40:02.3197659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3197723Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3197965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3198040Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3198275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:02.3198409Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:02.3198608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:02.3198673Z return self.act(input) 2025-08-14T21:40:02.3198676Z 2025-08-14T21:40:02.3198778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3198976Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3199045Z return mod(**inputs) 2025-08-14T21:40:02.3199287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3199357Z outputs = self.model.decoder( 2025-08-14T21:40:02.3199607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3199672Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3199886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3199985Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3200220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:02.3200301Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:02.3200304Z 2025-08-14T21:40:02.3200399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3200585Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3200652Z return mod(**inputs) 2025-08-14T21:40:02.3200891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1864, in forward 2025-08-14T21:40:02.3200964Z outputs = self.model.decoder( 2025-08-14T21:40:02.3201201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:02.3201267Z layer_outputs = decoder_layer( 2025-08-14T21:40:02.3201481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:02.3201552Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:02.3201797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:02.3201877Z hidden_states = residual + hidden_states 2025-08-14T21:40:02.3201880Z 2025-08-14T21:40:02.3201971Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3202156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3202214Z return mod(**inputs) 2025-08-14T21:40:02.3202445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1880, in forward 2025-08-14T21:40:02.3202523Z logits = self.lm_head(outputs[0]) 2025-08-14T21:40:02.3202527Z 2025-08-14T21:40:02.3202618Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:02.3202802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:02.3202861Z return mod(**inputs) 2025-08-14T21:40:02.3203094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1886, in forward 2025-08-14T21:40:02.3203234Z loss = loss_fct(logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:40:02.3203237Z 2025-08-14T21:40:10.9338031Z Compilation time (from dynamo_timed): 14.0141678 2025-08-14T21:40:10.9523732Z pass 2025-08-14T21:40:10.9524696Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:40:10.9526048Z TIMING: _recursive_pre_grad_passes:0.00648 _recursive_joint_graph_passes:0.55661 _recursive_post_grad_passes:0.07503 async_compile.wait:0.72436 code_gen:7.52873 inductor_compile:8.69171 backend_compile:11.65539 gc:0.00105 entire_frame_compile:14.01417 total_wall_time:14.01417 2025-08-14T21:40:10.9527714Z STATS: call_* op count: 373 | FakeTensorMode.__torch_dispatch__:13266 | FakeTensor.__torch_dispatch__:4931 | ProxyTorchDispatchMode.__torch_dispatch__:4844 2025-08-14T21:40:10.9529909Z Dynamo produced 1 graphs covering 373 ops with 0 graph breaks (0 unique) 2025-08-14T21:40:14.9572829Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:40:14.9573878Z from pkg_resources import resource_filename 2025-08-14T21:40:15.4977285Z 2025-08-14T21:40:20.1056927Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:40:20.1059105Z loading model: 0it [00:04, ?it/s] 2025-08-14T21:40:20.1083243Z cpu eval MBartForConditionalGeneration 2025-08-14T21:40:22.4908560Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:40:23.4131150Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:40:24.3437548Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:40:39.6613173Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6614751Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6615208Z return mod(**inputs) 2025-08-14T21:40:39.6620271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1436, in forward 2025-08-14T21:40:39.6621878Z decoder_input_ids = shift_tokens_right(labels, self.config.pad_token_id) 2025-08-14T21:40:39.6622522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 76, in shift_tokens_right 2025-08-14T21:40:39.6626765Z index_of_eos = (prev_output_tokens.ne(pad_token_id).sum(dim=1) - 1).unsqueeze(-1) 2025-08-14T21:40:39.6628450Z 2025-08-14T21:40:39.6628815Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6629126Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6633946Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6638257Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6642605Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6646737Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6648410Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6648710Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6648949Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6649142Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6649333Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6649523Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6649825Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6653969Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6655830Z return mod(**inputs) 2025-08-14T21:40:39.6656369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6660891Z outputs = self.model( 2025-08-14T21:40:39.6662955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6663783Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6667873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6669390Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6670115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6673162Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6673705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6678028Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6682756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.6683904Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.6684151Z 2025-08-14T21:40:39.6684268Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6684814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6685131Z return mod(**inputs) 2025-08-14T21:40:39.6685497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6685938Z outputs = self.model( 2025-08-14T21:40:39.6686301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6686672Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6687046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6687434Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6687768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6688139Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6688523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6688925Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6689310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.6689697Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.6689831Z 2025-08-14T21:40:39.6689951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6690310Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6690628Z return mod(**inputs) 2025-08-14T21:40:39.6690986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6691364Z outputs = self.model( 2025-08-14T21:40:39.6691707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6692079Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6692444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6692811Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6693139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6693484Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6693852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6694233Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6694645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.6695020Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.6695151Z 2025-08-14T21:40:39.6695233Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6695422Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6695667Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6695877Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6696098Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6696445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6696758Z return mod(**inputs) 2025-08-14T21:40:39.6697129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6697477Z outputs = self.model( 2025-08-14T21:40:39.6697811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6698165Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6698514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6698861Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6699184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6699560Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6699910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6700287Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6700680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6701056Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6701479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.6702011Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.6702815Z 2025-08-14T21:40:39.6702915Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6703249Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6703543Z return mod(**inputs) 2025-08-14T21:40:39.6703882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6704241Z outputs = self.model( 2025-08-14T21:40:39.6704568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6705041Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6705398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6705753Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6706071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6706408Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6706766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6707137Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6707498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6707873Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6708286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.6708729Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.6708897Z 2025-08-14T21:40:39.6708996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6709326Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6709644Z return mod(**inputs) 2025-08-14T21:40:39.6709966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6710316Z outputs = self.model( 2025-08-14T21:40:39.6710664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6711024Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6711364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6711714Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6712038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6712361Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6712717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6713107Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6713474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.6713831Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.6713965Z 2025-08-14T21:40:39.6714062Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6714395Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6714692Z return mod(**inputs) 2025-08-14T21:40:39.6715025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6715380Z outputs = self.model( 2025-08-14T21:40:39.6715717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6716069Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6716420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6716790Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6717107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6717445Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6717804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6718207Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6718370Z 2025-08-14T21:40:39.6718465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6718793Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6719097Z return mod(**inputs) 2025-08-14T21:40:39.6719435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6719780Z outputs = self.model( 2025-08-14T21:40:39.6720115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6720477Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6720820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6721177Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6721514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6721851Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6722200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6722616Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6722973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.6723275Z return self.act(input) 2025-08-14T21:40:39.6723385Z 2025-08-14T21:40:39.6723512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6723843Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6724140Z return mod(**inputs) 2025-08-14T21:40:39.6724466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6724815Z outputs = self.model( 2025-08-14T21:40:39.6725148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6725496Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6725864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6726221Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6726547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6726880Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6727245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.6727612Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.6727741Z 2025-08-14T21:40:39.6727848Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6728180Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6728486Z return mod(**inputs) 2025-08-14T21:40:39.6728824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6729175Z outputs = self.model( 2025-08-14T21:40:39.6729514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6729875Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6730232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6730583Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6730911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6731246Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6731597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6731977Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6732352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.6732783Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.6732975Z 2025-08-14T21:40:39.6733073Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6733410Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6733712Z return mod(**inputs) 2025-08-14T21:40:39.6734068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6734413Z outputs = self.model( 2025-08-14T21:40:39.6734741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6735116Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6735457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6735813Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6736133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6736480Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6736828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6737195Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6737561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.6737916Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.6738039Z 2025-08-14T21:40:39.6738132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6738459Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6738773Z return mod(**inputs) 2025-08-14T21:40:39.6739096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6739445Z outputs = self.model( 2025-08-14T21:40:39.6739778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6740132Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6740471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6740824Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6741143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6741466Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6741819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6742189Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6742556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.6742915Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.6743050Z 2025-08-14T21:40:39.6743123Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6743319Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6743503Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6743695Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6743909Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6744238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6744538Z return mod(**inputs) 2025-08-14T21:40:39.6744962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6745325Z outputs = self.model( 2025-08-14T21:40:39.6745652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6746016Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6746370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6746730Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6747067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6747406Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6747767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6748150Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6748517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6748889Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6749310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.6749744Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.6749920Z 2025-08-14T21:40:39.6750013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6750344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6750640Z return mod(**inputs) 2025-08-14T21:40:39.6750962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6751340Z outputs = self.model( 2025-08-14T21:40:39.6751682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6752041Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6752403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6752764Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6753087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6753416Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6753778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6754151Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6754521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6754896Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6755309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.6755740Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.6755890Z 2025-08-14T21:40:39.6755987Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6756321Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6756621Z return mod(**inputs) 2025-08-14T21:40:39.6756959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6757311Z outputs = self.model( 2025-08-14T21:40:39.6757648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6758013Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6758358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6758716Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6759040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6759377Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6759748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6760119Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6760486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.6760863Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.6760992Z 2025-08-14T21:40:39.6761086Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6761418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6761716Z return mod(**inputs) 2025-08-14T21:40:39.6762058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6762412Z outputs = self.model( 2025-08-14T21:40:39.6762746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6763104Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6763447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6763801Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6764122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6764461Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6764816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6765213Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6765373Z 2025-08-14T21:40:39.6765475Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6765795Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6766091Z return mod(**inputs) 2025-08-14T21:40:39.6766425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6766774Z outputs = self.model( 2025-08-14T21:40:39.6767098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6767457Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6767807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6768151Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6768471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6768802Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6769160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6769551Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6769907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.6770220Z return self.act(input) 2025-08-14T21:40:39.6770320Z 2025-08-14T21:40:39.6770418Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6770742Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6771039Z return mod(**inputs) 2025-08-14T21:40:39.6771368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6771711Z outputs = self.model( 2025-08-14T21:40:39.6772041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6772394Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6772753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6773103Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6773419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6773784Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6774136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.6774499Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.6774633Z 2025-08-14T21:40:39.6774743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6775074Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6775363Z return mod(**inputs) 2025-08-14T21:40:39.6775696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6776048Z outputs = self.model( 2025-08-14T21:40:39.6776377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6776736Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6777107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6777462Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6777775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6778107Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6778464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 336, in forward 2025-08-14T21:40:39.6778824Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.6778949Z 2025-08-14T21:40:39.6779044Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6779377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6779674Z return mod(**inputs) 2025-08-14T21:40:39.6780010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6780362Z outputs = self.model( 2025-08-14T21:40:39.6780696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6781050Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6781392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6781745Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6782067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6782390Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6782747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6783120Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6783489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.6783905Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.6784100Z 2025-08-14T21:40:39.6784193Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6784525Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6785030Z return mod(**inputs) 2025-08-14T21:40:39.6785402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6785765Z outputs = self.model( 2025-08-14T21:40:39.6786099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6786474Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6786828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6787181Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6787501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6787854Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6788219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6788594Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6788962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.6789331Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.6789464Z 2025-08-14T21:40:39.6789560Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6789897Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6790213Z return mod(**inputs) 2025-08-14T21:40:39.6790545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6790894Z outputs = self.model( 2025-08-14T21:40:39.6791227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6791576Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6791924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6792275Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6792585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6792919Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6793274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6793642Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6794003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.6794372Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.6794500Z 2025-08-14T21:40:39.6794581Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6794770Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6794962Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6795153Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6795366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6795689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6795988Z return mod(**inputs) 2025-08-14T21:40:39.6796321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6796666Z outputs = self.model( 2025-08-14T21:40:39.6797001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6797355Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6797703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6798050Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6798385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6798722Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6799068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6799484Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6799850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6800225Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6800642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.6801082Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.6801259Z 2025-08-14T21:40:39.6801354Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6801685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6801974Z return mod(**inputs) 2025-08-14T21:40:39.6802303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6802677Z outputs = self.model( 2025-08-14T21:40:39.6803003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6803359Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6803707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6804062Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6804376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6804706Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6805063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6805423Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6805791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6806167Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6806571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.6806981Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.6807134Z 2025-08-14T21:40:39.6807229Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6807556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6807858Z return mod(**inputs) 2025-08-14T21:40:39.6808185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6808537Z outputs = self.model( 2025-08-14T21:40:39.6808871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6809224Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6809575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6809928Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6810248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6810574Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6810944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6811318Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6811688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.6812060Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.6812193Z 2025-08-14T21:40:39.6812287Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6812614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6812904Z return mod(**inputs) 2025-08-14T21:40:39.6813255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6813605Z outputs = self.model( 2025-08-14T21:40:39.6813940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6814292Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6814640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6814994Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6815307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6815674Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6816028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6816422Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6816581Z 2025-08-14T21:40:39.6816675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6817009Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6817305Z return mod(**inputs) 2025-08-14T21:40:39.6817639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6817985Z outputs = self.model( 2025-08-14T21:40:39.6818320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6818679Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6819019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6819370Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6819687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6820016Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6820362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6820751Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6821106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.6821411Z return self.act(input) 2025-08-14T21:40:39.6821520Z 2025-08-14T21:40:39.6821614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6821943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6822240Z return mod(**inputs) 2025-08-14T21:40:39.6822566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6822915Z outputs = self.model( 2025-08-14T21:40:39.6823249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6823597Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6823969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6824324Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6824642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6825061Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6825433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.6825801Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.6825927Z 2025-08-14T21:40:39.6826047Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6826373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6826670Z return mod(**inputs) 2025-08-14T21:40:39.6827003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6827348Z outputs = self.model( 2025-08-14T21:40:39.6827682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6828037Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6828402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6828745Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6829059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6829389Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6829735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6830103Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6830474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.6830897Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.6831086Z 2025-08-14T21:40:39.6831181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6831510Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6831807Z return mod(**inputs) 2025-08-14T21:40:39.6832138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6832483Z outputs = self.model( 2025-08-14T21:40:39.6832816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6833170Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6833513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6833866Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6834181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6834510Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6834857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6835224Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6835591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.6835949Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.6836070Z 2025-08-14T21:40:39.6836162Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6836504Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6836802Z return mod(**inputs) 2025-08-14T21:40:39.6837125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6837493Z outputs = self.model( 2025-08-14T21:40:39.6837827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6838183Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6838526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6838895Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6839216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6839538Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6839894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6840261Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6840628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.6841013Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.6841154Z 2025-08-14T21:40:39.6841230Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6841434Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6841629Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6841827Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6842050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6842391Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6842699Z return mod(**inputs) 2025-08-14T21:40:39.6843042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6843402Z outputs = self.model( 2025-08-14T21:40:39.6843741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6844112Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6844472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6844834Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6845157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6845498Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6845863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6846235Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6846615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6847000Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6847422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.6847869Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.6848051Z 2025-08-14T21:40:39.6848148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6848485Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6848792Z return mod(**inputs) 2025-08-14T21:40:39.6849126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6849507Z outputs = self.model( 2025-08-14T21:40:39.6849844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6850195Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6850558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6850909Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6851223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6851545Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6851912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6852281Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6852648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6853014Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6853416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.6853834Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.6853999Z 2025-08-14T21:40:39.6854093Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6854423Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6854724Z return mod(**inputs) 2025-08-14T21:40:39.6855059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6855405Z outputs = self.model( 2025-08-14T21:40:39.6855739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6856095Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6856439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6856796Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6857120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6857452Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6857804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6858175Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6858541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.6858904Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.6859029Z 2025-08-14T21:40:39.6859125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6859454Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6859750Z return mod(**inputs) 2025-08-14T21:40:39.6860078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6860432Z outputs = self.model( 2025-08-14T21:40:39.6860765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6861118Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6861461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6861817Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6862154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6862485Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6862843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6863256Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6863415Z 2025-08-14T21:40:39.6863516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6863837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6864133Z return mod(**inputs) 2025-08-14T21:40:39.6864484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6864903Z outputs = self.model( 2025-08-14T21:40:39.6865244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6865611Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6865966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6866318Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6866643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6867006Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6867363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6867749Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6868103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.6868413Z return self.act(input) 2025-08-14T21:40:39.6868512Z 2025-08-14T21:40:39.6868614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6868936Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6869236Z return mod(**inputs) 2025-08-14T21:40:39.6869568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6869916Z outputs = self.model( 2025-08-14T21:40:39.6870245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6870602Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6870949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6871299Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6871616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6871948Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6872296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.6872657Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.6872789Z 2025-08-14T21:40:39.6872883Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6873213Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6873505Z return mod(**inputs) 2025-08-14T21:40:39.6873840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6874189Z outputs = self.model( 2025-08-14T21:40:39.6874515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6874869Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6875231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6875587Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6875901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6876248Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6876602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 336, in forward 2025-08-14T21:40:39.6876962Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.6877085Z 2025-08-14T21:40:39.6877195Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6877531Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6877829Z return mod(**inputs) 2025-08-14T21:40:39.6878157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6878511Z outputs = self.model( 2025-08-14T21:40:39.6878846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6879203Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6879566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6879921Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6880240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6880563Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6880919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6881290Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6881659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.6882078Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.6882274Z 2025-08-14T21:40:39.6882369Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6882700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6882995Z return mod(**inputs) 2025-08-14T21:40:39.6883318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6883670Z outputs = self.model( 2025-08-14T21:40:39.6884003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6884351Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6884854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6885215Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6885535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6885862Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6886222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6886594Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6886957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.6887320Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.6887452Z 2025-08-14T21:40:39.6887548Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6887914Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6888208Z return mod(**inputs) 2025-08-14T21:40:39.6888542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6888919Z outputs = self.model( 2025-08-14T21:40:39.6889258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6889608Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6889963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6890346Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6890660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6890990Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6891346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6891713Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6892075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.6892467Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.6892594Z 2025-08-14T21:40:39.6892675Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6892862Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6893055Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6893243Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6893457Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6893782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6894080Z return mod(**inputs) 2025-08-14T21:40:39.6894413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6894758Z outputs = self.model( 2025-08-14T21:40:39.6895093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6895451Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6895798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6896143Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6896461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6896794Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6897142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6897513Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6897879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6898252Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6898649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.6899090Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.6899264Z 2025-08-14T21:40:39.6899359Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6899689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6899979Z return mod(**inputs) 2025-08-14T21:40:39.6900311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6900676Z outputs = self.model( 2025-08-14T21:40:39.6901012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6901377Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6901766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6902120Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6902429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6902761Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6903128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6903495Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6903871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6904255Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6904665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.6905138Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.6905318Z 2025-08-14T21:40:39.6905413Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6905742Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6906042Z return mod(**inputs) 2025-08-14T21:40:39.6906371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6906737Z outputs = self.model( 2025-08-14T21:40:39.6907081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6907431Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6907781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6908136Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6908454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6908780Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6909139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6909512Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6909878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.6910230Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.6910360Z 2025-08-14T21:40:39.6910456Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6910785Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6911077Z return mod(**inputs) 2025-08-14T21:40:39.6911411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6911765Z outputs = self.model( 2025-08-14T21:40:39.6912097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6912445Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6912797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6913149Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6913476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6913809Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6914165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6914577Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6914738Z 2025-08-14T21:40:39.6914831Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6915156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6915453Z return mod(**inputs) 2025-08-14T21:40:39.6915811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6916160Z outputs = self.model( 2025-08-14T21:40:39.6916492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6916848Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6917190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6917546Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6917868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6918221Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6918570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6918967Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6919322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.6919628Z return self.act(input) 2025-08-14T21:40:39.6919735Z 2025-08-14T21:40:39.6919831Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6920159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6920457Z return mod(**inputs) 2025-08-14T21:40:39.6920781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6921136Z outputs = self.model( 2025-08-14T21:40:39.6921469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6921823Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6922164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6922517Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6922838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6923159Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6923516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.6923875Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.6923998Z 2025-08-14T21:40:39.6924098Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6924420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6924717Z return mod(**inputs) 2025-08-14T21:40:39.6925047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6925390Z outputs = self.model( 2025-08-14T21:40:39.6925724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6926079Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6926443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6926791Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6927110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6927457Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6927805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6928173Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6928554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.6928980Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.6929166Z 2025-08-14T21:40:39.6929261Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6929596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6929895Z return mod(**inputs) 2025-08-14T21:40:39.6930226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6930585Z outputs = self.model( 2025-08-14T21:40:39.6930916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6931268Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6931605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6931957Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6932276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6932607Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6932955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6933321Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6933689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.6934051Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.6934172Z 2025-08-14T21:40:39.6934266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6934595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6934893Z return mod(**inputs) 2025-08-14T21:40:39.6935217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6935565Z outputs = self.model( 2025-08-14T21:40:39.6935898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6936250Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6936589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6936945Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6937259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6937581Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6937941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6938309Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6938672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.6939044Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.6939183Z 2025-08-14T21:40:39.6939255Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6939451Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6939632Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6939839Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.6940056Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6940390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6940678Z return mod(**inputs) 2025-08-14T21:40:39.6941024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6941378Z outputs = self.model( 2025-08-14T21:40:39.6941703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6942062Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6942412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6942766Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6943081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6943429Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6943789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6944159Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6944520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6944960Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6945378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.6945809Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.6945987Z 2025-08-14T21:40:39.6946080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6946413Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6946716Z return mod(**inputs) 2025-08-14T21:40:39.6947042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6947397Z outputs = self.model( 2025-08-14T21:40:39.6947731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6948083Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6948433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6948786Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6949103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6949428Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6949785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6950158Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6950524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.6950894Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.6951299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.6951742Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.6951892Z 2025-08-14T21:40:39.6951986Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6952319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6952634Z return mod(**inputs) 2025-08-14T21:40:39.6952965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6953309Z outputs = self.model( 2025-08-14T21:40:39.6953638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6954003Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6954352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6954698Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6955015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6955345Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6955694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6956067Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6956457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.6956820Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.6956944Z 2025-08-14T21:40:39.6957041Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6957372Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6957670Z return mod(**inputs) 2025-08-14T21:40:39.6957992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6958339Z outputs = self.model( 2025-08-14T21:40:39.6958672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6959031Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6959374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6959725Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6960045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6960372Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6960726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6961122Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6961281Z 2025-08-14T21:40:39.6961383Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6961702Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6961999Z return mod(**inputs) 2025-08-14T21:40:39.6962332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6962680Z outputs = self.model( 2025-08-14T21:40:39.6963002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6963353Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6963698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6964045Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6964376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6964713Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6965073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.6965486Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.6965847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.6966162Z return self.act(input) 2025-08-14T21:40:39.6966264Z 2025-08-14T21:40:39.6966366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6966706Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6967006Z return mod(**inputs) 2025-08-14T21:40:39.6967347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6967694Z outputs = self.model( 2025-08-14T21:40:39.6968029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6968389Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6968744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6969108Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6969433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6969764Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6970113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.6970473Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.6970607Z 2025-08-14T21:40:39.6970703Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6971033Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6971323Z return mod(**inputs) 2025-08-14T21:40:39.6971653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6972004Z outputs = self.model( 2025-08-14T21:40:39.6972326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6972681Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6973029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6973382Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6973692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6974026Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6974380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 336, in forward 2025-08-14T21:40:39.6974739Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.6974865Z 2025-08-14T21:40:39.6974959Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6975287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6975583Z return mod(**inputs) 2025-08-14T21:40:39.6975906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6976254Z outputs = self.model( 2025-08-14T21:40:39.6976586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6976940Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.6994656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.6995063Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.6995408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.6995784Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.6996155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.6996542Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.6996956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.6997387Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.6997588Z 2025-08-14T21:40:39.6997692Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.6998040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.6998351Z return mod(**inputs) 2025-08-14T21:40:39.6998690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.6999089Z outputs = self.model( 2025-08-14T21:40:39.6999434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.6999789Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7000150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7000511Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7000838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7001167Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7001530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7001907Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7002272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7002638Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7002768Z 2025-08-14T21:40:39.7002867Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7003201Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7003495Z return mod(**inputs) 2025-08-14T21:40:39.7003829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7004180Z outputs = self.model( 2025-08-14T21:40:39.7004515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7004869Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7005223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7005582Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7005896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7006230Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7006591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7006963Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7007320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7007707Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7007841Z 2025-08-14T21:40:39.7007926Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7008116Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7008304Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7008512Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7008731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7009058Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7009364Z return mod(**inputs) 2025-08-14T21:40:39.7009717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7010065Z outputs = self.model( 2025-08-14T21:40:39.7010402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7010762Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7011116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7011467Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7011793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7012145Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7012493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7012863Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7013233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7013613Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7014015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7014458Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7014628Z 2025-08-14T21:40:39.7014734Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7015071Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7015366Z return mod(**inputs) 2025-08-14T21:40:39.7015701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7016057Z outputs = self.model( 2025-08-14T21:40:39.7016386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7016745Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7017096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7017448Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7017764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7018096Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7018457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7018818Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7019184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7019560Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7019968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7020392Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7020552Z 2025-08-14T21:40:39.7020647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7020979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7021324Z return mod(**inputs) 2025-08-14T21:40:39.7021651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7022005Z outputs = self.model( 2025-08-14T21:40:39.7022339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7022702Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7023056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7023413Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7023740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7024070Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7024435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7024884Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7025291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7025659Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7025797Z 2025-08-14T21:40:39.7025898Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7026245Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7026550Z return mod(**inputs) 2025-08-14T21:40:39.7026901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7027266Z outputs = self.model( 2025-08-14T21:40:39.7027616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7027975Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7028340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7028705Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7029028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7029375Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7029740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7030145Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7030313Z 2025-08-14T21:40:39.7030413Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7030753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7031062Z return mod(**inputs) 2025-08-14T21:40:39.7031402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7031763Z outputs = self.model( 2025-08-14T21:40:39.7032104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7032456Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7032813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7033174Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7033518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7033842Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7034201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7034617Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7034971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7035283Z return self.act(input) 2025-08-14T21:40:39.7035388Z 2025-08-14T21:40:39.7035483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7035833Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7036126Z return mod(**inputs) 2025-08-14T21:40:39.7036458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7036809Z outputs = self.model( 2025-08-14T21:40:39.7037134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7037487Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7037724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7037805Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7038009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7038088Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7038320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.7038395Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7038405Z 2025-08-14T21:40:39.7038500Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7038682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7038748Z return mod(**inputs) 2025-08-14T21:40:39.7038978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7039043Z outputs = self.model( 2025-08-14T21:40:39.7039284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7039351Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7039590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7039654Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7039857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7039937Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7040169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7040254Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7040491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7040631Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7040634Z 2025-08-14T21:40:39.7040734Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7040915Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7040974Z return mod(**inputs) 2025-08-14T21:40:39.7041215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7041292Z outputs = self.model( 2025-08-14T21:40:39.7041534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7041602Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7041847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7041919Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7042118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7042188Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7042437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7042522Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7042759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7042831Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7042834Z 2025-08-14T21:40:39.7042928Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7043120Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7043195Z return mod(**inputs) 2025-08-14T21:40:39.7043433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7043495Z outputs = self.model( 2025-08-14T21:40:39.7043726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7043800Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7044030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7044096Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7044303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7044373Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7044609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7044692Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7044920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7045004Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7045008Z 2025-08-14T21:40:39.7045081Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7045152Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7045228Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7045297Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7045399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7045578Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7045636Z return mod(**inputs) 2025-08-14T21:40:39.7045876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7045939Z outputs = self.model( 2025-08-14T21:40:39.7046171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7046243Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7046473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7046545Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7046758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7046833Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7047073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7047171Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7047406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7047493Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7047773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7047905Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7047908Z 2025-08-14T21:40:39.7048002Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7048186Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7048253Z return mod(**inputs) 2025-08-14T21:40:39.7048487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7048556Z outputs = self.model( 2025-08-14T21:40:39.7048803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7048868Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7049105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7049171Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7049374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7049449Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7049679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7049767Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7049996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7050085Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7050357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7050457Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7050461Z 2025-08-14T21:40:39.7050560Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7050740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7050797Z return mod(**inputs) 2025-08-14T21:40:39.7051038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7051100Z outputs = self.model( 2025-08-14T21:40:39.7051331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7051404Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7051631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7051702Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7051903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7051973Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7052210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7052306Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7052542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7052615Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7052633Z 2025-08-14T21:40:39.7052725Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7052914Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7052972Z return mod(**inputs) 2025-08-14T21:40:39.7053204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7053290Z outputs = self.model( 2025-08-14T21:40:39.7053521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7053593Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7053822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7053886Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7054095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7054185Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7054422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7054531Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7054534Z 2025-08-14T21:40:39.7054626Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7054815Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7054874Z return mod(**inputs) 2025-08-14T21:40:39.7055105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7055174Z outputs = self.model( 2025-08-14T21:40:39.7055403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7055477Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7055709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7055773Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7055978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7056050Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7056278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7056392Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7056587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7056655Z return self.act(input) 2025-08-14T21:40:39.7056659Z 2025-08-14T21:40:39.7056751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7056931Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7056999Z return mod(**inputs) 2025-08-14T21:40:39.7057228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7057295Z outputs = self.model( 2025-08-14T21:40:39.7057525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7057589Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7057840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7057908Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7058108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7058210Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7058441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.7058520Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7058523Z 2025-08-14T21:40:39.7058613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7058807Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7058877Z return mod(**inputs) 2025-08-14T21:40:39.7059107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7059176Z outputs = self.model( 2025-08-14T21:40:39.7059406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7059471Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7059705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7059787Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7059987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7060065Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7060296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 336, in forward 2025-08-14T21:40:39.7060376Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7060379Z 2025-08-14T21:40:39.7060471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7060651Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7060715Z return mod(**inputs) 2025-08-14T21:40:39.7060944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7061009Z outputs = self.model( 2025-08-14T21:40:39.7061245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7061310Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7061545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7061608Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7061806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7061882Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7062109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7062199Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7062427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7062564Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7062568Z 2025-08-14T21:40:39.7062664Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7062844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7062901Z return mod(**inputs) 2025-08-14T21:40:39.7063139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7063218Z outputs = self.model( 2025-08-14T21:40:39.7063458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7063521Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7063765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7063838Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7064037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7064112Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7064356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7064441Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7064677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7064751Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7064754Z 2025-08-14T21:40:39.7064921Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7065117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7065197Z return mod(**inputs) 2025-08-14T21:40:39.7065439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7065502Z outputs = self.model( 2025-08-14T21:40:39.7065739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7065814Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7066046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7066114Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7066324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7066395Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7066635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7066719Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7066950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7067036Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7067042Z 2025-08-14T21:40:39.7067114Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7067192Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7067261Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7067329Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7067431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7067610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7067669Z return mod(**inputs) 2025-08-14T21:40:39.7067909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7067971Z outputs = self.model( 2025-08-14T21:40:39.7068209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7068273Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7068504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7068575Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7068792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7068865Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7069098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7069195Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7069430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7069519Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7069803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7069934Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7069937Z 2025-08-14T21:40:39.7070029Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7070219Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7070280Z return mod(**inputs) 2025-08-14T21:40:39.7070512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7070582Z outputs = self.model( 2025-08-14T21:40:39.7070829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7070893Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7071128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7071193Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7071400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7071470Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7071699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7071787Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7072014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7072104Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7072378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7072475Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7072478Z 2025-08-14T21:40:39.7072578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7072759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7072817Z return mod(**inputs) 2025-08-14T21:40:39.7073057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7073119Z outputs = self.model( 2025-08-14T21:40:39.7073358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7073425Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7073652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7073723Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7073924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7073994Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7074227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7074349Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7074586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7074659Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7074676Z 2025-08-14T21:40:39.7074770Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7074961Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7075018Z return mod(**inputs) 2025-08-14T21:40:39.7075254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7075328Z outputs = self.model( 2025-08-14T21:40:39.7075559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7075632Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7075862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7075927Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7076135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7076221Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7076458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7076563Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7076566Z 2025-08-14T21:40:39.7076660Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7076849Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7076908Z return mod(**inputs) 2025-08-14T21:40:39.7077146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7077207Z outputs = self.model( 2025-08-14T21:40:39.7077438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7077511Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7077740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7077803Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7078009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7078078Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7078310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7078413Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7078607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7078677Z return self.act(input) 2025-08-14T21:40:39.7078681Z 2025-08-14T21:40:39.7078775Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7078958Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7079017Z return mod(**inputs) 2025-08-14T21:40:39.7079247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7079314Z outputs = self.model( 2025-08-14T21:40:39.7079544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7079608Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7079866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7079932Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7080139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7080224Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7080457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.7080536Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7080540Z 2025-08-14T21:40:39.7080632Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7080828Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7080898Z return mod(**inputs) 2025-08-14T21:40:39.7081127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7081195Z outputs = self.model( 2025-08-14T21:40:39.7081427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7081492Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7081728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7081808Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7082008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7082086Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7082316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7082405Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7082633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7082770Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7082773Z 2025-08-14T21:40:39.7082872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7083052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7083119Z return mod(**inputs) 2025-08-14T21:40:39.7083349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7083408Z outputs = self.model( 2025-08-14T21:40:39.7083645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7083709Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7083940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7084012Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7084209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7084287Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7084514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7084755Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7085000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7085075Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7085079Z 2025-08-14T21:40:39.7085179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7085356Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7085451Z return mod(**inputs) 2025-08-14T21:40:39.7085694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7085762Z outputs = self.model( 2025-08-14T21:40:39.7086015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7086082Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7086317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7086381Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7086620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7086692Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7086924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7087010Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7087238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7087315Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7088021Z 2025-08-14T21:40:39.7088099Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7088170Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7088244Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7088312Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7088405Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7088593Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7088652Z return mod(**inputs) 2025-08-14T21:40:39.7088886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7088955Z outputs = self.model( 2025-08-14T21:40:39.7089186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7089261Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7089493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7089559Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7089766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7089838Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7090073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7090155Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7090386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7090483Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7090749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7090873Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7090882Z 2025-08-14T21:40:39.7090975Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7091153Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7091219Z return mod(**inputs) 2025-08-14T21:40:39.7091449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7091511Z outputs = self.model( 2025-08-14T21:40:39.7091764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7091832Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7092071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7092154Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7092355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7092434Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7092675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7092758Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7092996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7093084Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7093355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7093454Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7093471Z 2025-08-14T21:40:39.7093562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7093751Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7093807Z return mod(**inputs) 2025-08-14T21:40:39.7094046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7094106Z outputs = self.model( 2025-08-14T21:40:39.7094336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7094408Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7094636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7094698Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7094908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7094980Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7095213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7095292Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7095520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7095598Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7095602Z 2025-08-14T21:40:39.7095693Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7095872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7095938Z return mod(**inputs) 2025-08-14T21:40:39.7096170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7096240Z outputs = self.model( 2025-08-14T21:40:39.7096468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7096533Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7096770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7096834Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7097042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7097127Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7097357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7097470Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7097487Z 2025-08-14T21:40:39.7097580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7097760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7097827Z return mod(**inputs) 2025-08-14T21:40:39.7098055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7098141Z outputs = self.model( 2025-08-14T21:40:39.7098374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7098441Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7098677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7098742Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7098949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7099048Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7099277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7099390Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7099583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7099646Z return self.act(input) 2025-08-14T21:40:39.7099649Z 2025-08-14T21:40:39.7099749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7099931Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7099998Z return mod(**inputs) 2025-08-14T21:40:39.7100228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7100289Z outputs = self.model( 2025-08-14T21:40:39.7100528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7100592Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7100821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7100894Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7101093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7101171Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7101399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.7101473Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7101477Z 2025-08-14T21:40:39.7101578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7101757Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7101823Z return mod(**inputs) 2025-08-14T21:40:39.7102054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7102114Z outputs = self.model( 2025-08-14T21:40:39.7102354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7102419Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7102660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7102734Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7102936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7103031Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7103262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 336, in forward 2025-08-14T21:40:39.7103334Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7103337Z 2025-08-14T21:40:39.7103435Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7103628Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7103690Z return mod(**inputs) 2025-08-14T21:40:39.7103929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7103990Z outputs = self.model( 2025-08-14T21:40:39.7104230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7104295Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7104527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7104617Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7104872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7104958Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7105192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7105273Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7105509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7105645Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7105649Z 2025-08-14T21:40:39.7105748Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7105928Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7105988Z return mod(**inputs) 2025-08-14T21:40:39.7106224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7106285Z outputs = self.model( 2025-08-14T21:40:39.7106514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7106589Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7106816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7106888Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7107086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7107158Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7107391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7107472Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7107696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7107777Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7107781Z 2025-08-14T21:40:39.7107870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7108053Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7108127Z return mod(**inputs) 2025-08-14T21:40:39.7108362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7108428Z outputs = self.model( 2025-08-14T21:40:39.7108673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7108745Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7108972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7109036Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7109254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7109335Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7109568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7109656Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7109887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7109965Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7109983Z 2025-08-14T21:40:39.7110064Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7110136Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7110204Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7110280Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7110375Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7110561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7110618Z return mod(**inputs) 2025-08-14T21:40:39.7110852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7110920Z outputs = self.model( 2025-08-14T21:40:39.7111151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7111215Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7111453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7111518Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7111724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7111795Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7112024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7112109Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7112342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7112439Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7112708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7112832Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7112835Z 2025-08-14T21:40:39.7112935Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7113115Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7113174Z return mod(**inputs) 2025-08-14T21:40:39.7113415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7113476Z outputs = self.model( 2025-08-14T21:40:39.7113727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7113795Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7114023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7114109Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7114310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7114386Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7114625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7114708Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7114940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7115026Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7115289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7115393Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7115410Z 2025-08-14T21:40:39.7115501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7115688Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7115745Z return mod(**inputs) 2025-08-14T21:40:39.7115975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7116039Z outputs = self.model( 2025-08-14T21:40:39.7116266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7116337Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7116565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7116628Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7116833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7116903Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7117127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7117214Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7117442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7117520Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7117523Z 2025-08-14T21:40:39.7117614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7117793Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7117858Z return mod(**inputs) 2025-08-14T21:40:39.7118088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7118158Z outputs = self.model( 2025-08-14T21:40:39.7118385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7118449Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7118683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7118747Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7118944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7119035Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7119263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7119377Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7119394Z 2025-08-14T21:40:39.7119487Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7119668Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7119733Z return mod(**inputs) 2025-08-14T21:40:39.7119962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7120043Z outputs = self.model( 2025-08-14T21:40:39.7120280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7120345Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7120580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7120644Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7120843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7120942Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7121170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7121281Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7121475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7121538Z return self.act(input) 2025-08-14T21:40:39.7121541Z 2025-08-14T21:40:39.7121638Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7121820Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7121878Z return mod(**inputs) 2025-08-14T21:40:39.7122113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7122174Z outputs = self.model( 2025-08-14T21:40:39.7122409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7122474Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7122702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7122775Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7122974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7123044Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7123280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.7123353Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7123356Z 2025-08-14T21:40:39.7123454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7123634Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7123694Z return mod(**inputs) 2025-08-14T21:40:39.7123931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7123991Z outputs = self.model( 2025-08-14T21:40:39.7124229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7124293Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7124534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7124617Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7124818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7124904Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7125142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7125224Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7125459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7125611Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7125615Z 2025-08-14T21:40:39.7125709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7125901Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7125960Z return mod(**inputs) 2025-08-14T21:40:39.7126197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7126260Z outputs = self.model( 2025-08-14T21:40:39.7126506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7126579Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7126806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7126873Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7127078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7127149Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7127382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7127465Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7127690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7127769Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7127773Z 2025-08-14T21:40:39.7127863Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7128048Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7128106Z return mod(**inputs) 2025-08-14T21:40:39.7128334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7128400Z outputs = self.model( 2025-08-14T21:40:39.7128629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7128691Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7128924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7128989Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7129195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7129264Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7129489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7129579Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7129803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7129878Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7129904Z 2025-08-14T21:40:39.7129977Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7130046Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7130120Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7130203Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7130297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7130484Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7130542Z return mod(**inputs) 2025-08-14T21:40:39.7130774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7130856Z outputs = self.model( 2025-08-14T21:40:39.7131088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7131160Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7131388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7131452Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7131660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7131747Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7131979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7132058Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7132287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7132382Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7132648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7132767Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7132777Z 2025-08-14T21:40:39.7132869Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7133047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7133114Z return mod(**inputs) 2025-08-14T21:40:39.7133345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7133405Z outputs = self.model( 2025-08-14T21:40:39.7133645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7133710Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7133944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7134010Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7134209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7134284Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7134512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7134594Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7134828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7134916Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7135185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7135285Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7135288Z 2025-08-14T21:40:39.7135391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7135580Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7135638Z return mod(**inputs) 2025-08-14T21:40:39.7135891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7135953Z outputs = self.model( 2025-08-14T21:40:39.7136183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7136253Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7136509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7136575Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7136784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7136853Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7137088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 321, in forward 2025-08-14T21:40:39.7137170Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:40:39.7137411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7137488Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7137491Z 2025-08-14T21:40:39.7137580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7137763Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7137827Z return mod(**inputs) 2025-08-14T21:40:39.7138057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7138123Z outputs = self.model( 2025-08-14T21:40:39.7138352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7138416Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7138652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7138719Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7138924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7138993Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7139221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7139332Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7139336Z 2025-08-14T21:40:39.7139426Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7139604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7139668Z return mod(**inputs) 2025-08-14T21:40:39.7139896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7139964Z outputs = self.model( 2025-08-14T21:40:39.7140192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7140254Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7140487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7140550Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7140753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7140836Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7141072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 332, in forward 2025-08-14T21:40:39.7141183Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7141401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7141463Z return self.act(input) 2025-08-14T21:40:39.7141467Z 2025-08-14T21:40:39.7141565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7141761Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7141827Z return mod(**inputs) 2025-08-14T21:40:39.7142059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7142121Z outputs = self.model( 2025-08-14T21:40:39.7142356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7142422Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7142650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7142744Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7142946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7143022Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7143252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 334, in forward 2025-08-14T21:40:39.7143325Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7143329Z 2025-08-14T21:40:39.7143426Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7143606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7143668Z return mod(**inputs) 2025-08-14T21:40:39.7143898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7143958Z outputs = self.model( 2025-08-14T21:40:39.7144197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1248, in forward 2025-08-14T21:40:39.7144260Z encoder_outputs = self.encoder( 2025-08-14T21:40:39.7144490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 861, in forward 2025-08-14T21:40:39.7144563Z layer_outputs = encoder_layer( 2025-08-14T21:40:39.7144764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7144914Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7145153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 336, in forward 2025-08-14T21:40:39.7145226Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7145230Z 2025-08-14T21:40:39.7145333Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7145513Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7145571Z return mod(**inputs) 2025-08-14T21:40:39.7145806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7145866Z outputs = self.model( 2025-08-14T21:40:39.7146100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7146165Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7146410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7146486Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7146687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7146781Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7147017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7147109Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7147362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7147500Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7147504Z 2025-08-14T21:40:39.7147604Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7147784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7147842Z return mod(**inputs) 2025-08-14T21:40:39.7148079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7148141Z outputs = self.model( 2025-08-14T21:40:39.7148392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7148465Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7148694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7148764Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7148963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7149032Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7149270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7149360Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7149588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7149668Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7149671Z 2025-08-14T21:40:39.7149761Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7149946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7150003Z return mod(**inputs) 2025-08-14T21:40:39.7150236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7150301Z outputs = self.model( 2025-08-14T21:40:39.7150532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7150601Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7150831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7150896Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7151107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7151175Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7151404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7151497Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7151726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7151824Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7151828Z 2025-08-14T21:40:39.7151900Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7151970Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7152044Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7152128Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7152221Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7152408Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7152468Z return mod(**inputs) 2025-08-14T21:40:39.7152719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7152782Z outputs = self.model( 2025-08-14T21:40:39.7153016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7153089Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7153323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7153388Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7153599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7153687Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7153924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7154011Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7154239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7154331Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7154597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7154723Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7154726Z 2025-08-14T21:40:39.7154816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7154997Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7155064Z return mod(**inputs) 2025-08-14T21:40:39.7155296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7155355Z outputs = self.model( 2025-08-14T21:40:39.7155591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7155655Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7155892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7155955Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7156156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7156234Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7156465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7156560Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7156789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7156874Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7157144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7157241Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7157260Z 2025-08-14T21:40:39.7157352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7157534Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7157610Z return mod(**inputs) 2025-08-14T21:40:39.7157846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7157908Z outputs = self.model( 2025-08-14T21:40:39.7158136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7158208Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7158450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7158524Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7158730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7158798Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7159032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7159121Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7159389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7159467Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7159471Z 2025-08-14T21:40:39.7159562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7159750Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7159806Z return mod(**inputs) 2025-08-14T21:40:39.7160039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7160105Z outputs = self.model( 2025-08-14T21:40:39.7160333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7160399Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7160633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7160699Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7160905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7160976Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7161205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7161308Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7161538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7161678Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7161682Z 2025-08-14T21:40:39.7161774Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7161956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7162021Z return mod(**inputs) 2025-08-14T21:40:39.7162250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7162310Z outputs = self.model( 2025-08-14T21:40:39.7162550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7162614Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7162872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7162939Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7163141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7163232Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7163462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7163565Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7163808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7163882Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7163885Z 2025-08-14T21:40:39.7163985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7164163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7164221Z return mod(**inputs) 2025-08-14T21:40:39.7164457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7164518Z outputs = self.model( 2025-08-14T21:40:39.7164751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7164833Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7165065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7165137Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7165337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7165413Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7165646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7165742Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7165980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7166059Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7166062Z 2025-08-14T21:40:39.7166131Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7166209Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7166277Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7166350Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7166443Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7166622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7166686Z return mod(**inputs) 2025-08-14T21:40:39.7166919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7166978Z outputs = self.model( 2025-08-14T21:40:39.7167216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7167283Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7167520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7167583Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7167783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7167860Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7168089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7168197Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7168432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7168517Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7168804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7168925Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7168928Z 2025-08-14T21:40:39.7169019Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7169220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7169279Z return mod(**inputs) 2025-08-14T21:40:39.7169517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7169580Z outputs = self.model( 2025-08-14T21:40:39.7169811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7169882Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7170112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7170190Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7170398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7170467Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7170711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7170809Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7171043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7171135Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7171404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7171506Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7171511Z 2025-08-14T21:40:39.7171602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7171787Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7171853Z return mod(**inputs) 2025-08-14T21:40:39.7172091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7172153Z outputs = self.model( 2025-08-14T21:40:39.7172397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7172462Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7172704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7172769Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7172974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7173051Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7173286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7173390Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7173625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7173697Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7173713Z 2025-08-14T21:40:39.7173814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7173994Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7174051Z return mod(**inputs) 2025-08-14T21:40:39.7174305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7174367Z outputs = self.model( 2025-08-14T21:40:39.7174604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7174668Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7174911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7174982Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7175186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7175254Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7175492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7175600Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7175618Z 2025-08-14T21:40:39.7175716Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7175898Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7175955Z return mod(**inputs) 2025-08-14T21:40:39.7176195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7176255Z outputs = self.model( 2025-08-14T21:40:39.7176493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7176558Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7176788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7176859Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7177060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7177131Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7177369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7177476Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7177677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7177739Z return self.act(input) 2025-08-14T21:40:39.7177742Z 2025-08-14T21:40:39.7177835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7178020Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7178077Z return mod(**inputs) 2025-08-14T21:40:39.7178315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7178379Z outputs = self.model( 2025-08-14T21:40:39.7178607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7178678Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7178910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7178973Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7179178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7179262Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7179499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7179571Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7179591Z 2025-08-14T21:40:39.7179687Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7179876Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7179934Z return mod(**inputs) 2025-08-14T21:40:39.7180178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7180247Z outputs = self.model( 2025-08-14T21:40:39.7180475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7180544Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7180773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7180837Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7181044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7181139Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7181375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7181463Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7181693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7181837Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7181840Z 2025-08-14T21:40:39.7181932Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7182112Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7182174Z return mod(**inputs) 2025-08-14T21:40:39.7182405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7182474Z outputs = self.model( 2025-08-14T21:40:39.7182704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7182769Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7183007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7183071Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7183277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7183350Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7183581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7183675Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7183907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7183980Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7183983Z 2025-08-14T21:40:39.7184080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7184259Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7184324Z return mod(**inputs) 2025-08-14T21:40:39.7184557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7184820Z outputs = self.model( 2025-08-14T21:40:39.7185106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7185172Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7185402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7185498Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7185699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7185776Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7186031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7186123Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7186361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7186439Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7186443Z 2025-08-14T21:40:39.7186519Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7186589Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7186659Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7186760Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7186851Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7187031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7187097Z return mod(**inputs) 2025-08-14T21:40:39.7187330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7187397Z outputs = self.model( 2025-08-14T21:40:39.7187628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7187694Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7187929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7187995Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7188192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7188271Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7188498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7188592Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7188819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7188906Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7189180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7189297Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7189302Z 2025-08-14T21:40:39.7189399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7189577Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7189635Z return mod(**inputs) 2025-08-14T21:40:39.7189872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7189933Z outputs = self.model( 2025-08-14T21:40:39.7190165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7190236Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7190485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7190559Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7190759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7190846Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7191082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7191169Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7191417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7191506Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7191772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7191880Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7191883Z 2025-08-14T21:40:39.7191974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7192157Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7192239Z return mod(**inputs) 2025-08-14T21:40:39.7192473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7192538Z outputs = self.model( 2025-08-14T21:40:39.7192769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7192834Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7193074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7193136Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7193339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7193417Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7193648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7193744Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7193974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7194045Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7194048Z 2025-08-14T21:40:39.7194149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7194330Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7194393Z return mod(**inputs) 2025-08-14T21:40:39.7194627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7194686Z outputs = self.model( 2025-08-14T21:40:39.7194923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7194990Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7195222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7195293Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7195496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7195569Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7195797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 424, in forward 2025-08-14T21:40:39.7195885Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7195889Z 2025-08-14T21:40:39.7195989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7196169Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7196249Z return mod(**inputs) 2025-08-14T21:40:39.7196481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7196540Z outputs = self.model( 2025-08-14T21:40:39.7196774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7196851Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7197082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7197151Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7197350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7197426Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7197653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7197768Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7198005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7198140Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7198144Z 2025-08-14T21:40:39.7198243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7198421Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7198481Z return mod(**inputs) 2025-08-14T21:40:39.7198720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7198779Z outputs = self.model( 2025-08-14T21:40:39.7199008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7199080Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7199311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7199383Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7199582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7199651Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7199887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7199985Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7200213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7200293Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7200297Z 2025-08-14T21:40:39.7200389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7200576Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7200634Z return mod(**inputs) 2025-08-14T21:40:39.7200864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7200932Z outputs = self.model( 2025-08-14T21:40:39.7201161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7201231Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7201476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7201542Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7201747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7201834Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7202063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7202166Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7202415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7202501Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7202504Z 2025-08-14T21:40:39.7202574Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7202647Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7202721Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7202789Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7202881Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7203069Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7203144Z return mod(**inputs) 2025-08-14T21:40:39.7203388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7203448Z outputs = self.model( 2025-08-14T21:40:39.7203688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7203763Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7204001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7204066Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7204277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7204347Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7204591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7204689Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7204925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7205022Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7205294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7205418Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7205422Z 2025-08-14T21:40:39.7205515Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7205698Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7205765Z return mod(**inputs) 2025-08-14T21:40:39.7206006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7206073Z outputs = self.model( 2025-08-14T21:40:39.7206312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7206375Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7206619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7206681Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7206897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7206976Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7207208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7207326Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7207556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7207643Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7207926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7208024Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7208027Z 2025-08-14T21:40:39.7208123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7208304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7208363Z return mod(**inputs) 2025-08-14T21:40:39.7208600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7208661Z outputs = self.model( 2025-08-14T21:40:39.7208906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7208976Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7209206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7209276Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7209477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7209547Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7209786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7209881Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7210113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7210193Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7210196Z 2025-08-14T21:40:39.7210286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7210475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7210534Z return mod(**inputs) 2025-08-14T21:40:39.7210767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7210834Z outputs = self.model( 2025-08-14T21:40:39.7211066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7211137Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7211370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7211433Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7211643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7211713Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7211947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7212060Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7212064Z 2025-08-14T21:40:39.7212155Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7212356Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7212417Z return mod(**inputs) 2025-08-14T21:40:39.7212648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7212734Z outputs = self.model( 2025-08-14T21:40:39.7212965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7213037Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7213269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7213347Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7213558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7213629Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7213859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7213973Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7214166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7214252Z return self.act(input) 2025-08-14T21:40:39.7214256Z 2025-08-14T21:40:39.7214347Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7214528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7214593Z return mod(**inputs) 2025-08-14T21:40:39.7214825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7214886Z outputs = self.model( 2025-08-14T21:40:39.7215121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7215186Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7215421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7215487Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7215686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7215766Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7215996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7216077Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7216080Z 2025-08-14T21:40:39.7216177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7216357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7216422Z return mod(**inputs) 2025-08-14T21:40:39.7216652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7216711Z outputs = self.model( 2025-08-14T21:40:39.7216949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7217013Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7217247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7217309Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7217508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7217587Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7217831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7217922Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7218157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7218308Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7218313Z 2025-08-14T21:40:39.7218410Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7218590Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7218650Z return mod(**inputs) 2025-08-14T21:40:39.7218902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7218966Z outputs = self.model( 2025-08-14T21:40:39.7219200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7219264Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7219492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7219567Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7219763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7219850Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7220083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7220173Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7220409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7220480Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7220483Z 2025-08-14T21:40:39.7220574Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7220759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7220816Z return mod(**inputs) 2025-08-14T21:40:39.7221052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7221115Z outputs = self.model( 2025-08-14T21:40:39.7221343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7221413Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7221639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7221702Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7221913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7221982Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7222215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7222304Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7222532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7222614Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7222618Z 2025-08-14T21:40:39.7222687Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7222762Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7222831Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7222898Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7222994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7223187Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7223248Z return mod(**inputs) 2025-08-14T21:40:39.7223482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7223566Z outputs = self.model( 2025-08-14T21:40:39.7223796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7223871Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7224100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7224182Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7224384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7224454Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7224693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7224778Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7225089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7225202Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7225471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7225598Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7225602Z 2025-08-14T21:40:39.7225694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7225880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7225938Z return mod(**inputs) 2025-08-14T21:40:39.7226171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7226241Z outputs = self.model( 2025-08-14T21:40:39.7226477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7226544Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7226782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7226846Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7227057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7227127Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7227356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7227451Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7227682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7227770Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7228045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7228145Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7228148Z 2025-08-14T21:40:39.7228247Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7228428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7228485Z return mod(**inputs) 2025-08-14T21:40:39.7228725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7228799Z outputs = self.model( 2025-08-14T21:40:39.7229034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7229100Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7229342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7229411Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7229613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7229682Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7229937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7230026Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7230261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7230333Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7230337Z 2025-08-14T21:40:39.7230426Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7230614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7230689Z return mod(**inputs) 2025-08-14T21:40:39.7230927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7230987Z outputs = self.model( 2025-08-14T21:40:39.7231219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7231290Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7231518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7231583Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7231790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7231859Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7232098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7232197Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7232424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7232569Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7232572Z 2025-08-14T21:40:39.7232663Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7232848Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7232906Z return mod(**inputs) 2025-08-14T21:40:39.7233137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7233204Z outputs = self.model( 2025-08-14T21:40:39.7233436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7233501Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7233737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7233799Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7234006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7234076Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7234318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7234421Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7234647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7234733Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7234744Z 2025-08-14T21:40:39.7234834Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7235014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7235076Z return mod(**inputs) 2025-08-14T21:40:39.7235326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7235387Z outputs = self.model( 2025-08-14T21:40:39.7235624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7235690Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7235925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7235988Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7236190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7236281Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7236513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7236610Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7236849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7236924Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7236928Z 2025-08-14T21:40:39.7237005Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7237076Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7237143Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7237218Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7237312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7237497Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7237560Z return mod(**inputs) 2025-08-14T21:40:39.7237797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7237862Z outputs = self.model( 2025-08-14T21:40:39.7238097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7238162Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7238405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7238469Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7238672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7238748Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7238984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7239087Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7239321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7239409Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7239686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7239823Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7239827Z 2025-08-14T21:40:39.7239926Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7240107Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7240182Z return mod(**inputs) 2025-08-14T21:40:39.7240421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7240483Z outputs = self.model( 2025-08-14T21:40:39.7240728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7240803Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7241035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7241106Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7241310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7241379Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7241616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7241728Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7241964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7242050Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7242313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7242416Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7242420Z 2025-08-14T21:40:39.7242509Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7242695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7242752Z return mod(**inputs) 2025-08-14T21:40:39.7242984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7243055Z outputs = self.model( 2025-08-14T21:40:39.7243285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7243348Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7243586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7243649Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7243856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7243925Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7244154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7244255Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7244486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7244559Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7244568Z 2025-08-14T21:40:39.7244658Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7244837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7244902Z return mod(**inputs) 2025-08-14T21:40:39.7245132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7245216Z outputs = self.model( 2025-08-14T21:40:39.7245455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7245520Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7245768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7245834Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7246034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7246109Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7246368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 441, in forward 2025-08-14T21:40:39.7246464Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7246469Z 2025-08-14T21:40:39.7246566Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7246746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7246809Z return mod(**inputs) 2025-08-14T21:40:39.7247039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7247114Z outputs = self.model( 2025-08-14T21:40:39.7247347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7247412Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7247640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7247711Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7247909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7247984Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7248211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7248318Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7248322Z 2025-08-14T21:40:39.7248417Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7248596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7248659Z return mod(**inputs) 2025-08-14T21:40:39.7248887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7248946Z outputs = self.model( 2025-08-14T21:40:39.7249180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7249244Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7249473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7249543Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7249739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7249815Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7250042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7250146Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7250345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7250408Z return self.act(input) 2025-08-14T21:40:39.7250411Z 2025-08-14T21:40:39.7250508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7250699Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7250759Z return mod(**inputs) 2025-08-14T21:40:39.7250997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7251072Z outputs = self.model( 2025-08-14T21:40:39.7251301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7251372Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7251600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7251684Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7251884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7251954Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7252191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7252264Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7252267Z 2025-08-14T21:40:39.7252358Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7252545Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7252621Z return mod(**inputs) 2025-08-14T21:40:39.7252857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7252916Z outputs = self.model( 2025-08-14T21:40:39.7253148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7253219Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7253450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7253519Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7253719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7253791Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7254032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7254120Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7254347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7254492Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7254495Z 2025-08-14T21:40:39.7254587Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7254774Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7254833Z return mod(**inputs) 2025-08-14T21:40:39.7255062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7255131Z outputs = self.model( 2025-08-14T21:40:39.7255361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7255433Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7255662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7255727Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7255933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7256001Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7256244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7256343Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7256573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7256669Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7256672Z 2025-08-14T21:40:39.7256762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7256938Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7257004Z return mod(**inputs) 2025-08-14T21:40:39.7257243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7257306Z outputs = self.model( 2025-08-14T21:40:39.7257540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7257606Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7257842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7257906Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7258122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7258200Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7258431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7258525Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7258755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7258832Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7258836Z 2025-08-14T21:40:39.7258915Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7258985Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7259053Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7259129Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7259220Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7259407Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7259462Z return mod(**inputs) 2025-08-14T21:40:39.7259691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7259758Z outputs = self.model( 2025-08-14T21:40:39.7259987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7260051Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7260289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7260350Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7260558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7260631Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7260859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7260953Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7261183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7261270Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7261555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7261678Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7261681Z 2025-08-14T21:40:39.7261779Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7261969Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7262028Z return mod(**inputs) 2025-08-14T21:40:39.7262266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7262326Z outputs = self.model( 2025-08-14T21:40:39.7262575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7262642Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7262872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7262941Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7263142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7263210Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7263445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7263554Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7263788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7263876Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7264137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7264240Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7264243Z 2025-08-14T21:40:39.7264334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7264520Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7264578Z return mod(**inputs) 2025-08-14T21:40:39.7264866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7264949Z outputs = self.model( 2025-08-14T21:40:39.7265182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7265248Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7265488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7265552Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7265764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7265835Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7266064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7266161Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7266392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7266472Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7266476Z 2025-08-14T21:40:39.7266566Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7266750Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7266816Z return mod(**inputs) 2025-08-14T21:40:39.7267048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7267127Z outputs = self.model( 2025-08-14T21:40:39.7267367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7267432Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7267684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7267748Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7267946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7268023Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7268266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7268372Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7268600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7268737Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7268741Z 2025-08-14T21:40:39.7268841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7269018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7269091Z return mod(**inputs) 2025-08-14T21:40:39.7269331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7269391Z outputs = self.model( 2025-08-14T21:40:39.7269627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7269696Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7269926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7269995Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7270194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7270270Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7270501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7270597Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7270835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7270906Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7270909Z 2025-08-14T21:40:39.7271001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7271192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7271248Z return mod(**inputs) 2025-08-14T21:40:39.7271483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7271544Z outputs = self.model( 2025-08-14T21:40:39.7271774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7271846Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7272077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7272139Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7272346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7272416Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7272668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7272768Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7272998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7273099Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7273103Z 2025-08-14T21:40:39.7273174Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7273251Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7273318Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7273388Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7273501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7273680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7273738Z return mod(**inputs) 2025-08-14T21:40:39.7273975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7274036Z outputs = self.model( 2025-08-14T21:40:39.7274271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7274336Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7274583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7274653Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7274853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7274923Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7275155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7275250Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7275482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7275567Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7275830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7275953Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7275957Z 2025-08-14T21:40:39.7276047Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7276233Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7276291Z return mod(**inputs) 2025-08-14T21:40:39.7276520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7276587Z outputs = self.model( 2025-08-14T21:40:39.7276815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7276879Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7277116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7277180Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7277384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7277451Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7277681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7277782Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7278021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7278110Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7278378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7278490Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7278495Z 2025-08-14T21:40:39.7278590Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7278767Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7278824Z return mod(**inputs) 2025-08-14T21:40:39.7279074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7279136Z outputs = self.model( 2025-08-14T21:40:39.7279375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7279442Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7279671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7279741Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7279940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7280028Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7280264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7280360Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7280595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7280667Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7280671Z 2025-08-14T21:40:39.7280763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7280950Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7281008Z return mod(**inputs) 2025-08-14T21:40:39.7281244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7281305Z outputs = self.model( 2025-08-14T21:40:39.7281533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7281602Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7281832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7281895Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7282104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7282173Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7282408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7282514Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7282519Z 2025-08-14T21:40:39.7282610Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7282796Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7282854Z return mod(**inputs) 2025-08-14T21:40:39.7283089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7283148Z outputs = self.model( 2025-08-14T21:40:39.7283374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7283458Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7283689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7283754Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7283977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7284048Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7284281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7284407Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7284770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7284851Z return self.act(input) 2025-08-14T21:40:39.7284855Z 2025-08-14T21:40:39.7284950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7285137Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7285196Z return mod(**inputs) 2025-08-14T21:40:39.7285429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7285532Z outputs = self.model( 2025-08-14T21:40:39.7285769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7285833Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7286079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7286142Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7286359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7286430Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7286669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7286746Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7286750Z 2025-08-14T21:40:39.7286842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7287024Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7287089Z return mod(**inputs) 2025-08-14T21:40:39.7287326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7287392Z outputs = self.model( 2025-08-14T21:40:39.7287628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7287691Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7287938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7288001Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7288213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7288285Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7288520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:39.7288596Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7288599Z 2025-08-14T21:40:39.7288689Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7288870Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7288935Z return mod(**inputs) 2025-08-14T21:40:39.7289196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7289263Z outputs = self.model( 2025-08-14T21:40:39.7289492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7289578Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7289812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7289874Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7290072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7290172Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7290400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7290492Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7290721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7290856Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7290860Z 2025-08-14T21:40:39.7290957Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7291151Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7291215Z return mod(**inputs) 2025-08-14T21:40:39.7291444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7291504Z outputs = self.model( 2025-08-14T21:40:39.7291736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7291800Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7292030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7292100Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7292298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7292378Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7292606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7292694Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7292930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7293001Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7293004Z 2025-08-14T21:40:39.7293099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7293280Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7293337Z return mod(**inputs) 2025-08-14T21:40:39.7293572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7293633Z outputs = self.model( 2025-08-14T21:40:39.7293864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7293935Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7294165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7294237Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7294438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7294506Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7294753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7294843Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7295073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7295173Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7295177Z 2025-08-14T21:40:39.7295247Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7295324Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7295392Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7295475Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7295575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7295755Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7295813Z return mod(**inputs) 2025-08-14T21:40:39.7296051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7296111Z outputs = self.model( 2025-08-14T21:40:39.7296349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7296430Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7296661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7296731Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7296932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7297008Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7297237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7297325Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7297560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7297645Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7297909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7298037Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7298040Z 2025-08-14T21:40:39.7298130Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7298318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7298376Z return mod(**inputs) 2025-08-14T21:40:39.7298607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7298674Z outputs = self.model( 2025-08-14T21:40:39.7298905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7298975Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7299206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7299270Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7299477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7299546Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7299776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7299870Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7300113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7300206Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7300466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7300578Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7300581Z 2025-08-14T21:40:39.7300680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7300859Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7300923Z return mod(**inputs) 2025-08-14T21:40:39.7301173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7301234Z outputs = self.model( 2025-08-14T21:40:39.7301475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7301540Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7301771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7301843Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7302071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7302146Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7302374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7302459Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7302695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7302769Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7302772Z 2025-08-14T21:40:39.7302870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7303051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7303110Z return mod(**inputs) 2025-08-14T21:40:39.7303347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7303406Z outputs = self.model( 2025-08-14T21:40:39.7303639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7303712Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7303943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7304011Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7304216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7304284Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7304522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7304620Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7304906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7305054Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7305058Z 2025-08-14T21:40:39.7305153Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7305340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7305400Z return mod(**inputs) 2025-08-14T21:40:39.7305648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7305719Z outputs = self.model( 2025-08-14T21:40:39.7305950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7306047Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7306279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7306344Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7306552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7306639Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7306871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7306977Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7307206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7307285Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7307289Z 2025-08-14T21:40:39.7307381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7307574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7307639Z return mod(**inputs) 2025-08-14T21:40:39.7307870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7307939Z outputs = self.model( 2025-08-14T21:40:39.7308169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7308235Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7308473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7308536Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7308734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7308813Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7309046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7309147Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7309376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7309453Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7309456Z 2025-08-14T21:40:39.7309532Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7309601Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7309672Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7309746Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7309838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7310024Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7310084Z return mod(**inputs) 2025-08-14T21:40:39.7310315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7310383Z outputs = self.model( 2025-08-14T21:40:39.7310614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7310680Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7310917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7310997Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7311205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7311275Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7311523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7311628Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7311858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7311950Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7312228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7312350Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7312353Z 2025-08-14T21:40:39.7312451Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7312629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7312687Z return mod(**inputs) 2025-08-14T21:40:39.7312924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7312998Z outputs = self.model( 2025-08-14T21:40:39.7313236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7313301Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7313533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7313605Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7313806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7313885Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7314114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7314210Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7314446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7314533Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7314796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7314896Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7314900Z 2025-08-14T21:40:39.7314989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7315178Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7315235Z return mod(**inputs) 2025-08-14T21:40:39.7315463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7315532Z outputs = self.model( 2025-08-14T21:40:39.7315762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7315830Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7316058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7316122Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7316330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7316400Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7316639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7316741Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7316970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7317068Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7317071Z 2025-08-14T21:40:39.7317161Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7317337Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7317401Z return mod(**inputs) 2025-08-14T21:40:39.7317643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7317712Z outputs = self.model( 2025-08-14T21:40:39.7317942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7318008Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7318244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7318311Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7318525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7318605Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7318833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7318948Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7318952Z 2025-08-14T21:40:39.7319046Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7319228Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7319294Z return mod(**inputs) 2025-08-14T21:40:39.7319522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7319584Z outputs = self.model( 2025-08-14T21:40:39.7319822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7319889Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7320124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7320189Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7320389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7320467Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7320696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7320809Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7321002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7321067Z return self.act(input) 2025-08-14T21:40:39.7321071Z 2025-08-14T21:40:39.7321168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7321349Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7321406Z return mod(**inputs) 2025-08-14T21:40:39.7321644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7321704Z outputs = self.model( 2025-08-14T21:40:39.7321939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7322018Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7322250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7322322Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7322538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7322609Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7322844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7322917Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7322934Z 2025-08-14T21:40:39.7323033Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7323213Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7323269Z return mod(**inputs) 2025-08-14T21:40:39.7323506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7323563Z outputs = self.model( 2025-08-14T21:40:39.7323796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7323877Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7324103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7324170Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7324369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7324439Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7324673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7324762Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7324992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7325126Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7325131Z 2025-08-14T21:40:39.7325220Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7325400Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7325454Z return mod(**inputs) 2025-08-14T21:40:39.7325685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7325742Z outputs = self.model( 2025-08-14T21:40:39.7325968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7326039Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7326267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7326329Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7326535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7326607Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7326841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7326927Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7327154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7327231Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7327234Z 2025-08-14T21:40:39.7327350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7327536Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7327594Z return mod(**inputs) 2025-08-14T21:40:39.7327838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7327903Z outputs = self.model( 2025-08-14T21:40:39.7328129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7328192Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7328438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7328501Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7328700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7328770Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7328998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7329091Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7329315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7329406Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7329413Z 2025-08-14T21:40:39.7329479Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7329545Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7329616Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7329682Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7329771Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7329953Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7330009Z return mod(**inputs) 2025-08-14T21:40:39.7330237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7330303Z outputs = self.model( 2025-08-14T21:40:39.7330531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7330602Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7330830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7330894Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7331098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7331166Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7331404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7331487Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7331713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7331806Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7332072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7332192Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7332203Z 2025-08-14T21:40:39.7332296Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7332477Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7332541Z return mod(**inputs) 2025-08-14T21:40:39.7332787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7332849Z outputs = self.model( 2025-08-14T21:40:39.7333088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7333196Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7333433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7333497Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7333711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7333791Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7334020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7334108Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7334342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7334429Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7334701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7334815Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7334819Z 2025-08-14T21:40:39.7334909Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7335092Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7335149Z return mod(**inputs) 2025-08-14T21:40:39.7335386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7335445Z outputs = self.model( 2025-08-14T21:40:39.7335673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7335741Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7335967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7336031Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7336233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7336300Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7336535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7336619Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7336845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7336921Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7336924Z 2025-08-14T21:40:39.7337012Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7337194Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7337252Z return mod(**inputs) 2025-08-14T21:40:39.7337478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7337539Z outputs = self.model( 2025-08-14T21:40:39.7337765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7337826Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7338057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7338131Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7338334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7338401Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7338640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 424, in forward 2025-08-14T21:40:39.7338717Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7338721Z 2025-08-14T21:40:39.7338807Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7338981Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7339053Z return mod(**inputs) 2025-08-14T21:40:39.7339280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7339341Z outputs = self.model( 2025-08-14T21:40:39.7339568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7339630Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7339863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7339941Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7340145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7340211Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7340439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7340539Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7340765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7340898Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7340901Z 2025-08-14T21:40:39.7340996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7341174Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7341234Z return mod(**inputs) 2025-08-14T21:40:39.7341461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7341518Z outputs = self.model( 2025-08-14T21:40:39.7341749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7341810Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7342043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7342104Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7342305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7342374Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7342602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7342695Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7342924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7342993Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7342996Z 2025-08-14T21:40:39.7343090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7343266Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7343324Z return mod(**inputs) 2025-08-14T21:40:39.7343566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7343628Z outputs = self.model( 2025-08-14T21:40:39.7343859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7343942Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7344169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7344231Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7344442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7344509Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7344739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7344895Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7345134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7345214Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7345219Z 2025-08-14T21:40:39.7345312Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7345383Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7345449Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7345513Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7345605Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7345782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7345844Z return mod(**inputs) 2025-08-14T21:40:39.7346073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7346135Z outputs = self.model( 2025-08-14T21:40:39.7346366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7346426Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7346653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7346722Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7346919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7346989Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7347215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7347310Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7347538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7347624Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7347883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7348006Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7348010Z 2025-08-14T21:40:39.7348100Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7348285Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7348343Z return mod(**inputs) 2025-08-14T21:40:39.7348573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7348640Z outputs = self.model( 2025-08-14T21:40:39.7348885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7348958Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7349187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7349267Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7349473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7349542Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7349767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7349884Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7350113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7350208Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7350469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7350564Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7350568Z 2025-08-14T21:40:39.7350664Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7350860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7350925Z return mod(**inputs) 2025-08-14T21:40:39.7351157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7351219Z outputs = self.model( 2025-08-14T21:40:39.7351451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7351514Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7351744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7351812Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7352008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7352086Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7352311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7352404Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7352634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7352707Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7352711Z 2025-08-14T21:40:39.7352801Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7352980Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7353038Z return mod(**inputs) 2025-08-14T21:40:39.7353272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7353329Z outputs = self.model( 2025-08-14T21:40:39.7353555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7353619Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7353847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7353910Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7354107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7354174Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7354419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7354526Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7354543Z 2025-08-14T21:40:39.7354636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7354814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7354869Z return mod(**inputs) 2025-08-14T21:40:39.7355100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7355175Z outputs = self.model( 2025-08-14T21:40:39.7355401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7355470Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7355695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7355762Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7355957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7356027Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7356271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7356371Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7356560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7356627Z return self.act(input) 2025-08-14T21:40:39.7356630Z 2025-08-14T21:40:39.7356718Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7356901Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7356958Z return mod(**inputs) 2025-08-14T21:40:39.7357185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7357252Z outputs = self.model( 2025-08-14T21:40:39.7357480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7357552Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7357779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7357844Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7358050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7358119Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7358346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7358423Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7358426Z 2025-08-14T21:40:39.7358516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7358700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7358760Z return mod(**inputs) 2025-08-14T21:40:39.7358989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7359047Z outputs = self.model( 2025-08-14T21:40:39.7359276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7359342Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7359585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7359651Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7359859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7359947Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7360184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7360281Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7360509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7360669Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7360673Z 2025-08-14T21:40:39.7360768Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7360953Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7361021Z return mod(**inputs) 2025-08-14T21:40:39.7361259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7361323Z outputs = self.model( 2025-08-14T21:40:39.7361562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7361645Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7361879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7361941Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7362139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7362211Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7362439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7362530Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7362757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7362829Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7362833Z 2025-08-14T21:40:39.7362929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7363107Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7363164Z return mod(**inputs) 2025-08-14T21:40:39.7363402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7363460Z outputs = self.model( 2025-08-14T21:40:39.7363698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7363762Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7363992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7364064Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7364263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7364339Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7364568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7364655Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7364892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7364971Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7364974Z 2025-08-14T21:40:39.7365058Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7365140Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7365207Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7365280Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7365389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7365571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7365636Z return mod(**inputs) 2025-08-14T21:40:39.7365868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7365948Z outputs = self.model( 2025-08-14T21:40:39.7366189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7366255Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7366493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7366557Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7366756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7366830Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7367071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7367156Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7367390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7367474Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7367739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7367859Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7367863Z 2025-08-14T21:40:39.7367950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7368130Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7368188Z return mod(**inputs) 2025-08-14T21:40:39.7368418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7368475Z outputs = self.model( 2025-08-14T21:40:39.7368705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7368776Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7369003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7369069Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7369272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7369342Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7369577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7369665Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7369892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7369984Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7370245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7370340Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7370343Z 2025-08-14T21:40:39.7370443Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7370623Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7370688Z return mod(**inputs) 2025-08-14T21:40:39.7370935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7370997Z outputs = self.model( 2025-08-14T21:40:39.7371231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7371296Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7371546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7371612Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7371816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7371892Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7372121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7372217Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7372464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7372538Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7372541Z 2025-08-14T21:40:39.7372641Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7372824Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7372882Z return mod(**inputs) 2025-08-14T21:40:39.7373123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7373184Z outputs = self.model( 2025-08-14T21:40:39.7373424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7373488Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7373723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7373797Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7373998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7374068Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7374310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7374408Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7374641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7374775Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7374779Z 2025-08-14T21:40:39.7374869Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7375061Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7375121Z return mod(**inputs) 2025-08-14T21:40:39.7375358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7375416Z outputs = self.model( 2025-08-14T21:40:39.7375648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7375718Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7375968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7376034Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7376240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7376325Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7376564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7376660Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7376888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7376982Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7376986Z 2025-08-14T21:40:39.7377080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7377268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7377327Z return mod(**inputs) 2025-08-14T21:40:39.7377561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7377628Z outputs = self.model( 2025-08-14T21:40:39.7377858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7377940Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7378179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7378244Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7378450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7378521Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7378752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7378855Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7379084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7379169Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7379174Z 2025-08-14T21:40:39.7379245Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7379315Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7379391Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7379459Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7379553Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7379740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7379798Z return mod(**inputs) 2025-08-14T21:40:39.7380040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7380101Z outputs = self.model( 2025-08-14T21:40:39.7380334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7380408Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7380639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7380704Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7380912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7380983Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7381218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7381327Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7381556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7381650Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7381934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7382052Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7382063Z 2025-08-14T21:40:39.7382154Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7382346Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7382413Z return mod(**inputs) 2025-08-14T21:40:39.7382644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7382705Z outputs = self.model( 2025-08-14T21:40:39.7382942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7383007Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7383240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7383322Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7383521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7383599Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7383827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7383923Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7384160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7384247Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7384515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7384742Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7384750Z 2025-08-14T21:40:39.7384891Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7385090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7385151Z return mod(**inputs) 2025-08-14T21:40:39.7385398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7385461Z outputs = self.model( 2025-08-14T21:40:39.7385693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7385771Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7386004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7386070Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7386289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7386362Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7386606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7386704Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7386936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7387022Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7387025Z 2025-08-14T21:40:39.7387150Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7387343Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7387402Z return mod(**inputs) 2025-08-14T21:40:39.7387665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7387734Z outputs = self.model( 2025-08-14T21:40:39.7387962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7388027Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7388286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7388355Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7388566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7388638Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7388865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 441, in forward 2025-08-14T21:40:39.7388948Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7388952Z 2025-08-14T21:40:39.7389065Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7389254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7389314Z return mod(**inputs) 2025-08-14T21:40:39.7389548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7389617Z outputs = self.model( 2025-08-14T21:40:39.7389849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7389917Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7390151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7390214Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7390421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7390493Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7390722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7390836Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7390839Z 2025-08-14T21:40:39.7390933Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7391113Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7391178Z return mod(**inputs) 2025-08-14T21:40:39.7391409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7391476Z outputs = self.model( 2025-08-14T21:40:39.7391704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7391770Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7392006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7392069Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7392278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7392348Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7392577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7392706Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7392899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7392963Z return self.act(input) 2025-08-14T21:40:39.7392981Z 2025-08-14T21:40:39.7393082Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7393260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7393327Z return mod(**inputs) 2025-08-14T21:40:39.7393558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7393633Z outputs = self.model( 2025-08-14T21:40:39.7393870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7393933Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7394164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7394235Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7394433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7394525Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7394755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7394827Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7394831Z 2025-08-14T21:40:39.7394930Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7395109Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7395172Z return mod(**inputs) 2025-08-14T21:40:39.7395403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7395462Z outputs = self.model( 2025-08-14T21:40:39.7395697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7395762Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7395992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7396062Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7396262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7396338Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7396566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7396653Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7396888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7397024Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7397029Z 2025-08-14T21:40:39.7397126Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7397306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7397364Z return mod(**inputs) 2025-08-14T21:40:39.7397599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7397659Z outputs = self.model( 2025-08-14T21:40:39.7397886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7397957Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7398202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7398272Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7398471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7398559Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7398799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7398886Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7399137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7399213Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7399216Z 2025-08-14T21:40:39.7399310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7399499Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7399557Z return mod(**inputs) 2025-08-14T21:40:39.7399787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7399856Z outputs = self.model( 2025-08-14T21:40:39.7400106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7400179Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7400414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7400478Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7400691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7400761Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7400996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7401093Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7401327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7401415Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7401418Z 2025-08-14T21:40:39.7401488Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7401558Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7401634Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7401701Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7401794Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7401983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7402041Z return mod(**inputs) 2025-08-14T21:40:39.7402284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7402345Z outputs = self.model( 2025-08-14T21:40:39.7402578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7402654Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7402889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7402960Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7403167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7403238Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7403479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7403579Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7403810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7403904Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7404183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7404311Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7404314Z 2025-08-14T21:40:39.7404404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7404597Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7404665Z return mod(**inputs) 2025-08-14T21:40:39.7404895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7404964Z outputs = self.model( 2025-08-14T21:40:39.7405194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7405259Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7405495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7405577Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7405777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7405856Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7406088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7406184Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7406417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7406505Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7406774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7406874Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7406878Z 2025-08-14T21:40:39.7406974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7407154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7407214Z return mod(**inputs) 2025-08-14T21:40:39.7407456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7407516Z outputs = self.model( 2025-08-14T21:40:39.7407750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7407822Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7408052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7408124Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7408326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7408395Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7408631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7408720Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7408950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7409030Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7409054Z 2025-08-14T21:40:39.7409148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7409335Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7409413Z return mod(**inputs) 2025-08-14T21:40:39.7409648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7409716Z outputs = self.model( 2025-08-14T21:40:39.7409950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7410022Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7410269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7410334Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7410543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7410615Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7410844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7410954Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7411204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7411346Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7411350Z 2025-08-14T21:40:39.7411441Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7411621Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7411687Z return mod(**inputs) 2025-08-14T21:40:39.7411917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7411985Z outputs = self.model( 2025-08-14T21:40:39.7412215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7412280Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7412518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7412583Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7412783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7412861Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7413090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7413194Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7413425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7413498Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7413502Z 2025-08-14T21:40:39.7413603Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7413783Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7413850Z return mod(**inputs) 2025-08-14T21:40:39.7414078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7414138Z outputs = self.model( 2025-08-14T21:40:39.7414377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7414441Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7414686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7414760Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7414958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7415056Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7415290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7415387Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7415640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7415719Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7415723Z 2025-08-14T21:40:39.7415791Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7415867Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7415937Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7416012Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7416102Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7416279Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7416363Z return mod(**inputs) 2025-08-14T21:40:39.7416596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7416657Z outputs = self.model( 2025-08-14T21:40:39.7416900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7416965Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7417204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7417268Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7417468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7417546Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7417778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7417883Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7418114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7418203Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7418475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7418593Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7418597Z 2025-08-14T21:40:39.7418688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7418878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7418935Z return mod(**inputs) 2025-08-14T21:40:39.7419177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7419238Z outputs = self.model( 2025-08-14T21:40:39.7419468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7419541Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7419773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7419844Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7420085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7420158Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7420394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7420506Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7420736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7420829Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7421089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7421204Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7421208Z 2025-08-14T21:40:39.7421302Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7421485Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7421552Z return mod(**inputs) 2025-08-14T21:40:39.7421784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7421852Z outputs = self.model( 2025-08-14T21:40:39.7422083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7422165Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7422402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7422467Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7422669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7422748Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7422978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7423080Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7423308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7423386Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7423389Z 2025-08-14T21:40:39.7423487Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7423665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7423723Z return mod(**inputs) 2025-08-14T21:40:39.7423959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7424020Z outputs = self.model( 2025-08-14T21:40:39.7424256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7424320Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7424548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7424621Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7424881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7424965Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7425196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7425306Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7425310Z 2025-08-14T21:40:39.7425411Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7425592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7425668Z return mod(**inputs) 2025-08-14T21:40:39.7425910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7425972Z outputs = self.model( 2025-08-14T21:40:39.7426228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7426296Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7426525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7426598Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7426812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7426894Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7427132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7427240Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7427445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7427510Z return self.act(input) 2025-08-14T21:40:39.7427535Z 2025-08-14T21:40:39.7427630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7427820Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7427879Z return mod(**inputs) 2025-08-14T21:40:39.7428116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7428176Z outputs = self.model( 2025-08-14T21:40:39.7428405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7428476Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7428701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7428762Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7428965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7429035Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7429269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7429341Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7429346Z 2025-08-14T21:40:39.7429436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7429617Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7429681Z return mod(**inputs) 2025-08-14T21:40:39.7429918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7429980Z outputs = self.model( 2025-08-14T21:40:39.7430209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7430285Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7430512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7430575Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7430784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7430855Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7431089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:39.7431178Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7431182Z 2025-08-14T21:40:39.7431275Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7431462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7431537Z return mod(**inputs) 2025-08-14T21:40:39.7431780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7431840Z outputs = self.model( 2025-08-14T21:40:39.7432075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7432161Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7432393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7432459Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7432667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7432737Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7432973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7433077Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7433304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7433445Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7433449Z 2025-08-14T21:40:39.7433543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7433729Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7433787Z return mod(**inputs) 2025-08-14T21:40:39.7434018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7434085Z outputs = self.model( 2025-08-14T21:40:39.7434310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7434377Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7434614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7434676Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7434883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7434952Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7435181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7435276Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7435502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7435574Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7435585Z 2025-08-14T21:40:39.7435676Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7435858Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7435923Z return mod(**inputs) 2025-08-14T21:40:39.7436152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7436212Z outputs = self.model( 2025-08-14T21:40:39.7436446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7436509Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7436760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7436826Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7437024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7437119Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7437347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7437433Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7437685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7437764Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7437767Z 2025-08-14T21:40:39.7437843Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7437914Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7437985Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7438058Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7438150Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7438331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7438411Z return mod(**inputs) 2025-08-14T21:40:39.7438645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7438713Z outputs = self.model( 2025-08-14T21:40:39.7438946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7439012Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7439248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7439314Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7439515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7439593Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7439826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7439924Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7440153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7440242Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7440515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7440634Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7440638Z 2025-08-14T21:40:39.7440740Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7440922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7440982Z return mod(**inputs) 2025-08-14T21:40:39.7441220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7441283Z outputs = self.model( 2025-08-14T21:40:39.7441512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7441587Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7441817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7441888Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7442101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7442172Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7442408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7442515Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7442753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7442840Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7443123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7443229Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7443233Z 2025-08-14T21:40:39.7443326Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7443506Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7443571Z return mod(**inputs) 2025-08-14T21:40:39.7443800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7443869Z outputs = self.model( 2025-08-14T21:40:39.7444112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7444179Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7444413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7444477Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7444683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7444753Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7444981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7445075Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7445300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7445375Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7445379Z 2025-08-14T21:40:39.7445478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7445658Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7445725Z return mod(**inputs) 2025-08-14T21:40:39.7445953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7446012Z outputs = self.model( 2025-08-14T21:40:39.7446248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7446312Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7446539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7446613Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7446813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7446892Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7447119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7447217Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7447451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7447599Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7447602Z 2025-08-14T21:40:39.7447703Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7447885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7447960Z return mod(**inputs) 2025-08-14T21:40:39.7448205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7448264Z outputs = self.model( 2025-08-14T21:40:39.7448498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7448595Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7448825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7448894Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7449091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7449161Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7449395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7449511Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7449750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7449822Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7449826Z 2025-08-14T21:40:39.7449917Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7450103Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7450160Z return mod(**inputs) 2025-08-14T21:40:39.7450389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7450455Z outputs = self.model( 2025-08-14T21:40:39.7450685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7450758Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7450988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7451050Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7451258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7451328Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7451563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7451661Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7451888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7451973Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7451978Z 2025-08-14T21:40:39.7452047Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7452120Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7452196Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7452263Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7452362Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7452542Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7452600Z return mod(**inputs) 2025-08-14T21:40:39.7452838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7452915Z outputs = self.model( 2025-08-14T21:40:39.7453148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7453222Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7453466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7453538Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7453736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7453808Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7454054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7454153Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7454382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7454478Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7454740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7454868Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7454887Z 2025-08-14T21:40:39.7454981Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7455163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7455230Z return mod(**inputs) 2025-08-14T21:40:39.7455465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7455533Z outputs = self.model( 2025-08-14T21:40:39.7455764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7455830Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7456068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7456134Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7456338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7456418Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7456649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7456755Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7456987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7457074Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7457350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7457446Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7457451Z 2025-08-14T21:40:39.7457551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7457737Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7457796Z return mod(**inputs) 2025-08-14T21:40:39.7458036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7458100Z outputs = self.model( 2025-08-14T21:40:39.7458333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7458408Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7458655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7458732Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7458934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7459024Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7459264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7459360Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7459609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7459684Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7459687Z 2025-08-14T21:40:39.7459778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7459968Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7460026Z return mod(**inputs) 2025-08-14T21:40:39.7460257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7460327Z outputs = self.model( 2025-08-14T21:40:39.7460571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7460644Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7460871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7460936Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7461142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7461211Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7461439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7461553Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7461556Z 2025-08-14T21:40:39.7461649Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7461837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7461897Z return mod(**inputs) 2025-08-14T21:40:39.7462124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7462191Z outputs = self.model( 2025-08-14T21:40:39.7462419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7462492Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7462721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7462785Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7462989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7463060Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7463287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7463401Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7463593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7463661Z return self.act(input) 2025-08-14T21:40:39.7463665Z 2025-08-14T21:40:39.7463754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7463950Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7464018Z return mod(**inputs) 2025-08-14T21:40:39.7464249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7464334Z outputs = self.model( 2025-08-14T21:40:39.7464562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7464627Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7464931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7465001Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7465224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7465306Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7465539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7465620Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7465624Z 2025-08-14T21:40:39.7465717Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7465899Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7465983Z return mod(**inputs) 2025-08-14T21:40:39.7466216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7466278Z outputs = self.model( 2025-08-14T21:40:39.7466516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7466581Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7466821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7466886Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7467087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7467167Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7467397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7467494Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7467723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7467860Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7467864Z 2025-08-14T21:40:39.7467964Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7468144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7468203Z return mod(**inputs) 2025-08-14T21:40:39.7468441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7468502Z outputs = self.model( 2025-08-14T21:40:39.7468741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7468809Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7469040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7469112Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7469312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7469391Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7469637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7469729Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7469964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7470063Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7470068Z 2025-08-14T21:40:39.7470160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7470348Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7470407Z return mod(**inputs) 2025-08-14T21:40:39.7470658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7470721Z outputs = self.model( 2025-08-14T21:40:39.7470951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7471026Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7471256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7471327Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7471528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7471616Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7471855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7471944Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7472176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7472260Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7472263Z 2025-08-14T21:40:39.7472338Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7472418Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7472489Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7472558Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7472659Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7472844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7472903Z return mod(**inputs) 2025-08-14T21:40:39.7473145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7473205Z outputs = self.model( 2025-08-14T21:40:39.7473447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7473513Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7473746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7473817Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7474021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7474094Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7474336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7474425Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7474666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7474755Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7475025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7475164Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7475169Z 2025-08-14T21:40:39.7475261Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7475450Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7475527Z return mod(**inputs) 2025-08-14T21:40:39.7475757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7475824Z outputs = self.model( 2025-08-14T21:40:39.7476070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7476137Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7476375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7476439Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7476647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7476717Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7476944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7477054Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7477281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7477374Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7477638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7477736Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7477740Z 2025-08-14T21:40:39.7477838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7478021Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7478079Z return mod(**inputs) 2025-08-14T21:40:39.7478317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7478379Z outputs = self.model( 2025-08-14T21:40:39.7478617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7478683Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7478913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7478988Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7479189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7479262Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7479499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7479587Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7479823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7479899Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7479902Z 2025-08-14T21:40:39.7479994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7480180Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7480237Z return mod(**inputs) 2025-08-14T21:40:39.7480474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7480547Z outputs = self.model( 2025-08-14T21:40:39.7480781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7480854Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7481098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7481166Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7481374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7481444Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7481692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 424, in forward 2025-08-14T21:40:39.7481766Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7481770Z 2025-08-14T21:40:39.7481862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7482052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7482111Z return mod(**inputs) 2025-08-14T21:40:39.7482349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7482427Z outputs = self.model( 2025-08-14T21:40:39.7482657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7482729Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7482959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7483023Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7483235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7483309Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7483553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7483652Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7483883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7484027Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7484031Z 2025-08-14T21:40:39.7484122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7484310Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7484367Z return mod(**inputs) 2025-08-14T21:40:39.7484742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7484822Z outputs = self.model( 2025-08-14T21:40:39.7485053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7485120Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7485360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7485427Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7485635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7485705Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7485938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7486043Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7486304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7486379Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7486389Z 2025-08-14T21:40:39.7486481Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7486663Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7486754Z return mod(**inputs) 2025-08-14T21:40:39.7486986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7487047Z outputs = self.model( 2025-08-14T21:40:39.7487316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7487384Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7487623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7487689Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7487889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7487968Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7488197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7488316Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7488554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7488634Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7488638Z 2025-08-14T21:40:39.7488714Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7488784Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7488851Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7488926Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7489020Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7489202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7489270Z return mod(**inputs) 2025-08-14T21:40:39.7489507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7489575Z outputs = self.model( 2025-08-14T21:40:39.7489807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7489871Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7490110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7490174Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7490377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7490452Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7490684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7490787Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7491020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7491106Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7491382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7491500Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7491503Z 2025-08-14T21:40:39.7491601Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7491808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7491869Z return mod(**inputs) 2025-08-14T21:40:39.7492111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7492187Z outputs = self.model( 2025-08-14T21:40:39.7492430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7492497Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7492728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7492816Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7493018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7493088Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7493326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7493423Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7493657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7493766Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7494032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7494136Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7494141Z 2025-08-14T21:40:39.7494232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7494421Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7494478Z return mod(**inputs) 2025-08-14T21:40:39.7494708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7494775Z outputs = self.model( 2025-08-14T21:40:39.7495005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7495072Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7495309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7495373Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7495581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7495651Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7495882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7495988Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7496217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7496290Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7496302Z 2025-08-14T21:40:39.7496504Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7496683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7496746Z return mod(**inputs) 2025-08-14T21:40:39.7496977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7497036Z outputs = self.model( 2025-08-14T21:40:39.7497271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7497336Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7497584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7497650Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7497853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7497951Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7498182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7498293Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7498305Z 2025-08-14T21:40:39.7498414Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7498596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7498662Z return mod(**inputs) 2025-08-14T21:40:39.7498894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7498954Z outputs = self.model( 2025-08-14T21:40:39.7499191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7499258Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7499511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7499575Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7499777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7499855Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7500085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7500190Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7500390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7500451Z return self.act(input) 2025-08-14T21:40:39.7500456Z 2025-08-14T21:40:39.7500553Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7500733Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7500791Z return mod(**inputs) 2025-08-14T21:40:39.7501026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7501088Z outputs = self.model( 2025-08-14T21:40:39.7501323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7501388Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7501616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7501686Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7501884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7501956Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7502194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7502267Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7502271Z 2025-08-14T21:40:39.7502369Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7502548Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7502604Z return mod(**inputs) 2025-08-14T21:40:39.7502855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7502918Z outputs = self.model( 2025-08-14T21:40:39.7503147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7503234Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7503467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7503540Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7503741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7503826Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7504064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7504155Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7504392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7504530Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7504533Z 2025-08-14T21:40:39.7504628Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7504922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7504987Z return mod(**inputs) 2025-08-14T21:40:39.7505221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7505292Z outputs = self.model( 2025-08-14T21:40:39.7505526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7505600Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7505833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7505898Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7506107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7506180Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7506419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7506509Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7506739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7506820Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7506824Z 2025-08-14T21:40:39.7506916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7507098Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7507165Z return mod(**inputs) 2025-08-14T21:40:39.7507396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7507468Z outputs = self.model( 2025-08-14T21:40:39.7507700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7507766Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7508006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7508073Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7508275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7508354Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7508601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7508700Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7508925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7509019Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7509023Z 2025-08-14T21:40:39.7509104Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7509179Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7509259Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7509332Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7509442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7509630Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7509689Z return mod(**inputs) 2025-08-14T21:40:39.7509919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7509987Z outputs = self.model( 2025-08-14T21:40:39.7510219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7510315Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7510545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7510607Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7510815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7510887Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7511116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7511213Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7511442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7511535Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7511801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7511919Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7511923Z 2025-08-14T21:40:39.7512022Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7512206Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7512273Z return mod(**inputs) 2025-08-14T21:40:39.7512509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7512573Z outputs = self.model( 2025-08-14T21:40:39.7512818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7512884Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7513123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7513199Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7513405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7513482Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7513720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7513810Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7514062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7514155Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7514431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7515290Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7515296Z 2025-08-14T21:40:39.7515389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7515582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7515644Z return mod(**inputs) 2025-08-14T21:40:39.7515901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7515972Z outputs = self.model( 2025-08-14T21:40:39.7516211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7516287Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7516524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7516590Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7516800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7516893Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7517128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7517226Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7517464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7517546Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7517550Z 2025-08-14T21:40:39.7517645Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7517832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7517897Z return mod(**inputs) 2025-08-14T21:40:39.7518142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7518211Z outputs = self.model( 2025-08-14T21:40:39.7518451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7518517Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7518766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7518831Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7519042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7519122Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7519360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7519469Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7519710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7519851Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7519854Z 2025-08-14T21:40:39.7519956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7520146Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7520212Z return mod(**inputs) 2025-08-14T21:40:39.7520469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7520533Z outputs = self.model( 2025-08-14T21:40:39.7520777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7520861Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7521099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7521172Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7521376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7521469Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7521706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7521805Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7522052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7522127Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7522130Z 2025-08-14T21:40:39.7522232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7522416Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7522493Z return mod(**inputs) 2025-08-14T21:40:39.7522736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7522798Z outputs = self.model( 2025-08-14T21:40:39.7523036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7523111Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7523347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7523419Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7523623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7523696Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7523933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7524033Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7524267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7524353Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7524356Z 2025-08-14T21:40:39.7524429Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7524507Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7524579Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7524648Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7524751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7524936Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7524996Z return mod(**inputs) 2025-08-14T21:40:39.7525241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7525303Z outputs = self.model( 2025-08-14T21:40:39.7525546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7525614Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7525848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7525919Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7526138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7526219Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7526458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7526575Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7526818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7526906Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7527194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7527326Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7527329Z 2025-08-14T21:40:39.7527424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7527616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7527674Z return mod(**inputs) 2025-08-14T21:40:39.7527910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7527998Z outputs = self.model( 2025-08-14T21:40:39.7528237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7528312Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7528550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7528617Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7528830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7528903Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7529142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7529248Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7529484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7529582Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7529863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7529960Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7529963Z 2025-08-14T21:40:39.7530063Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7530246Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7530312Z return mod(**inputs) 2025-08-14T21:40:39.7530543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7530603Z outputs = self.model( 2025-08-14T21:40:39.7530840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7530906Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7531135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7531206Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7531405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7531480Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7531722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7531819Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7532056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7532146Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7532151Z 2025-08-14T21:40:39.7532249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7532429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7532486Z return mod(**inputs) 2025-08-14T21:40:39.7532747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7532809Z outputs = self.model( 2025-08-14T21:40:39.7533042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7533114Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7533342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7533417Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7533617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7533702Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7533940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 441, in forward 2025-08-14T21:40:39.7534014Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7534017Z 2025-08-14T21:40:39.7534108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7534295Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7534355Z return mod(**inputs) 2025-08-14T21:40:39.7534592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7534653Z outputs = self.model( 2025-08-14T21:40:39.7534882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7534957Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7535187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7535256Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7535459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7535529Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7535769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7535881Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7535885Z 2025-08-14T21:40:39.7535975Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7536164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7536228Z return mod(**inputs) 2025-08-14T21:40:39.7536470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7536531Z outputs = self.model( 2025-08-14T21:40:39.7536764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7536837Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7537069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7537149Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7537357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7537428Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7537686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7537795Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7537990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7538060Z return self.act(input) 2025-08-14T21:40:39.7538063Z 2025-08-14T21:40:39.7538173Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7538368Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7538427Z return mod(**inputs) 2025-08-14T21:40:39.7538659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7538727Z outputs = self.model( 2025-08-14T21:40:39.7538959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7539043Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7539287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7539352Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7539564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7539635Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7539866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7539946Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7539949Z 2025-08-14T21:40:39.7540040Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7540229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7540288Z return mod(**inputs) 2025-08-14T21:40:39.7540523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7540588Z outputs = self.model( 2025-08-14T21:40:39.7540819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7540883Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7541121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7541185Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7541392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7541462Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7541695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7541796Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7542030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7542167Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7542178Z 2025-08-14T21:40:39.7542271Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7542452Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7542517Z return mod(**inputs) 2025-08-14T21:40:39.7542770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7542834Z outputs = self.model( 2025-08-14T21:40:39.7543070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7543149Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7543387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7543451Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7543663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7543741Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7543969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7544059Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7544292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7544364Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7544369Z 2025-08-14T21:40:39.7544465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7544667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7544725Z return mod(**inputs) 2025-08-14T21:40:39.7545037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7545108Z outputs = self.model( 2025-08-14T21:40:39.7545355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7545422Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7545666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7545750Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7545955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7546028Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7546273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7546362Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7546605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7546682Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7546686Z 2025-08-14T21:40:39.7546758Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7546840Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7546910Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7546979Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7547079Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7547264Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7547333Z return mod(**inputs) 2025-08-14T21:40:39.7547571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7547632Z outputs = self.model( 2025-08-14T21:40:39.7547878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7547943Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7548179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7548267Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7548468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7548545Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7548795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7548885Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7549122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7549209Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7549495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7549617Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7549620Z 2025-08-14T21:40:39.7549712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7549902Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7549960Z return mod(**inputs) 2025-08-14T21:40:39.7550192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7550278Z outputs = self.model( 2025-08-14T21:40:39.7550508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7550581Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7550810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7550875Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7551083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7551152Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7551386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7551476Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7551704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7551798Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7552063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7552160Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7552172Z 2025-08-14T21:40:39.7552262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7552442Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7552508Z return mod(**inputs) 2025-08-14T21:40:39.7552739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7552801Z outputs = self.model( 2025-08-14T21:40:39.7553038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7553103Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7553341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7553406Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7553605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7553683Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7553924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 415, in forward 2025-08-14T21:40:39.7554015Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:40:39.7554248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7554346Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7554349Z 2025-08-14T21:40:39.7554446Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7554627Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7554685Z return mod(**inputs) 2025-08-14T21:40:39.7554934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7554997Z outputs = self.model( 2025-08-14T21:40:39.7555239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7555305Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7555539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7555610Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7555825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7555894Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7556130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7556229Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7556468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 225, in forward 2025-08-14T21:40:39.7556605Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:40:39.7556608Z 2025-08-14T21:40:39.7556697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7556882Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7556942Z return mod(**inputs) 2025-08-14T21:40:39.7557182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7557241Z outputs = self.model( 2025-08-14T21:40:39.7557470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7557540Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7557771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7557834Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7558043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7558113Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7558344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7558442Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7558668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 244, in forward 2025-08-14T21:40:39.7558745Z key_states = self.k_proj(current_states) 2025-08-14T21:40:39.7558749Z 2025-08-14T21:40:39.7558840Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7559025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7559083Z return mod(**inputs) 2025-08-14T21:40:39.7559327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7559395Z outputs = self.model( 2025-08-14T21:40:39.7559624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7559705Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7559947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7560011Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7560219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7560305Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7560535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7560639Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7560869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 245, in forward 2025-08-14T21:40:39.7560946Z value_states = self.v_proj(current_states) 2025-08-14T21:40:39.7560957Z 2025-08-14T21:40:39.7561026Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7561115Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7561190Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7561259Z cudagraph partition due to non gpu ops 2025-08-14T21:40:39.7561350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7561540Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7561599Z return mod(**inputs) 2025-08-14T21:40:39.7561832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7561899Z outputs = self.model( 2025-08-14T21:40:39.7562133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7562205Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7562437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7562504Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7562714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7562785Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7563014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7563118Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7563349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7563444Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7563709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:40:39.7563829Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:40:39.7563833Z 2025-08-14T21:40:39.7563932Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7564111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7564178Z return mod(**inputs) 2025-08-14T21:40:39.7564411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7564472Z outputs = self.model( 2025-08-14T21:40:39.7566964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7567043Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7567289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7567386Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7567591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7567666Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7567906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7568023Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7568264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 263, in forward 2025-08-14T21:40:39.7568353Z attn_output, attn_weights = attention_interface( 2025-08-14T21:40:39.7568643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:40:39.7568740Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:40:39.7568745Z 2025-08-14T21:40:39.7568844Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7569045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7569105Z return mod(**inputs) 2025-08-14T21:40:39.7569344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7569408Z outputs = self.model( 2025-08-14T21:40:39.7569645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7569711Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7569943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7570017Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7570217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7570290Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7570525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 432, in forward 2025-08-14T21:40:39.7570619Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:40:39.7570857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 277, in forward 2025-08-14T21:40:39.7570933Z attn_output = self.out_proj(attn_output) 2025-08-14T21:40:39.7570937Z 2025-08-14T21:40:39.7571028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7571220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7571279Z return mod(**inputs) 2025-08-14T21:40:39.7571516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7571577Z outputs = self.model( 2025-08-14T21:40:39.7571807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7571880Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7572108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7572174Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7572384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7572454Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7572739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7572851Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7572855Z 2025-08-14T21:40:39.7572965Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7573155Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7573216Z return mod(**inputs) 2025-08-14T21:40:39.7573457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7573519Z outputs = self.model( 2025-08-14T21:40:39.7573768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7573843Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7574078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7574142Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7574355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7574428Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7574693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 446, in forward 2025-08-14T21:40:39.7574799Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:40:39.7574991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:40:39.7575063Z return self.act(input) 2025-08-14T21:40:39.7575066Z 2025-08-14T21:40:39.7575158Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7575338Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7575406Z return mod(**inputs) 2025-08-14T21:40:39.7575637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7575703Z outputs = self.model( 2025-08-14T21:40:39.7575934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7576000Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7576235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7576300Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7576506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7576577Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7576806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 448, in forward 2025-08-14T21:40:39.7576885Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:40:39.7576889Z 2025-08-14T21:40:39.7576980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7577163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7577229Z return mod(**inputs) 2025-08-14T21:40:39.7577456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1438, in forward 2025-08-14T21:40:39.7577521Z outputs = self.model( 2025-08-14T21:40:39.7577752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1266, in forward 2025-08-14T21:40:39.7577815Z decoder_outputs = self.decoder( 2025-08-14T21:40:39.7578051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1109, in forward 2025-08-14T21:40:39.7578139Z layer_outputs = decoder_layer( 2025-08-14T21:40:39.7578345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:40:39.7578423Z return super().__call__(*args, **kwargs) 2025-08-14T21:40:39.7578666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 450, in forward 2025-08-14T21:40:39.7578745Z hidden_states = residual + hidden_states 2025-08-14T21:40:39.7578748Z 2025-08-14T21:40:39.7578838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7579017Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7579096Z return mod(**inputs) 2025-08-14T21:40:39.7579327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1456, in forward 2025-08-14T21:40:39.7579445Z lm_logits = self.lm_head(outputs[0]) + self.final_logits_bias 2025-08-14T21:40:39.7579450Z 2025-08-14T21:40:39.7579540Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:40:39.7579717Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:40:39.7579782Z return mod(**inputs) 2025-08-14T21:40:39.7580013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mbart/modeling_mbart.py", line 1461, in forward 2025-08-14T21:40:39.7580188Z masked_lm_loss = loss_fct(lm_logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:40:39.7580192Z 2025-08-14T21:40:50.9834284Z Compilation time (from dynamo_timed): 24.734012081 2025-08-14T21:40:50.9919521Z pass 2025-08-14T21:40:50.9923393Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:40:50.9927884Z TIMING: _recursive_pre_grad_passes:0.01197 _recursive_joint_graph_passes:0.98198 _recursive_post_grad_passes:0.15454 async_compile.wait:0.73415 code_gen:10.12249 inductor_compile:12.91645 backend_compile:19.54852 gc:0.00011 entire_frame_compile:24.73401 total_wall_time:24.73401 2025-08-14T21:40:50.9929385Z STATS: call_* op count: 986 | FakeTensorMode.__torch_dispatch__:33703 | FakeTensor.__torch_dispatch__:12062 | ProxyTorchDispatchMode.__torch_dispatch__:12456 2025-08-14T21:40:50.9929928Z Dynamo produced 1 graphs covering 986 ops with 0 graph breaks (0 unique) 2025-08-14T21:40:55.5079019Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:40:55.5080030Z from pkg_resources import resource_filename 2025-08-14T21:40:56.0723018Z 2025-08-14T21:40:58.4019335Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:40:58.4023247Z loading model: 0it [00:02, ?it/s] 2025-08-14T21:40:58.4034419Z cpu eval MT5ForConditionalGeneration 2025-08-14T21:40:58.9741285Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:40:59.2235760Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:40:59.4653207Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:41:10.8402903Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8406549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8410433Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8412738Z return mod(**inputs) 2025-08-14T21:41:10.8413274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8417341Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8421873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8422453Z layer_outputs = layer_module( 2025-08-14T21:41:10.8423453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8424172Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8424562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8425051Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8425471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8425834Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8426187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 421, in forward 2025-08-14T21:41:10.8426550Z position_bias = position_bias + causal_mask 2025-08-14T21:41:10.8426687Z 2025-08-14T21:41:10.8426790Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8427136Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8427453Z return mod(**inputs) 2025-08-14T21:41:10.8427869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8428226Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8428575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8428930Z layer_outputs = layer_module( 2025-08-14T21:41:10.8429249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8429604Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8429958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8430304Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8430656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8431037Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8431408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8431750Z return self.weight * hidden_states 2025-08-14T21:41:10.8431882Z 2025-08-14T21:41:10.8431984Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8432324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8432631Z return mod(**inputs) 2025-08-14T21:41:10.8432955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8433307Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8433651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8435124Z layer_outputs = layer_module( 2025-08-14T21:41:10.8435596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8435972Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8436349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8436732Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8437103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8437468Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8437943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8438304Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8438434Z 2025-08-14T21:41:10.8438592Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8438934Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8439245Z return mod(**inputs) 2025-08-14T21:41:10.8439582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8439941Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8440323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8440696Z layer_outputs = layer_module( 2025-08-14T21:41:10.8441032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8441371Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8441728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8442091Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8442480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8442831Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8443186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8443540Z key_states = self.k(current_states) 2025-08-14T21:41:10.8443669Z 2025-08-14T21:41:10.8443776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8444114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8444419Z return mod(**inputs) 2025-08-14T21:41:10.8444749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8445094Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8445442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8445794Z layer_outputs = layer_module( 2025-08-14T21:41:10.8446115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8446445Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8446797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8447148Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8447494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8447852Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8448201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8448601Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8448974Z 2025-08-14T21:41:10.8449075Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8449412Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8449713Z return mod(**inputs) 2025-08-14T21:41:10.8450043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8450388Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8450785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8451135Z layer_outputs = layer_module( 2025-08-14T21:41:10.8451448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8451802Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8452146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8452494Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8452829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8453194Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8453551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8453980Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8454183Z 2025-08-14T21:41:10.8454282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8454620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8454923Z return mod(**inputs) 2025-08-14T21:41:10.8455238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8455610Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8455951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8456295Z layer_outputs = layer_module( 2025-08-14T21:41:10.8456606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8456942Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8457291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8457637Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8457986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8458343Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8458695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8459042Z value_states = self.v(current_states) 2025-08-14T21:41:10.8459176Z 2025-08-14T21:41:10.8459274Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8459613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8459913Z return mod(**inputs) 2025-08-14T21:41:10.8460231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8460581Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8460925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8461263Z layer_outputs = layer_module( 2025-08-14T21:41:10.8461583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8461917Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8462274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8462622Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8462980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8463333Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8463699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8464077Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8464234Z 2025-08-14T21:41:10.8464334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8464687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8465141Z return mod(**inputs) 2025-08-14T21:41:10.8465473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8465827Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8466217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8466564Z layer_outputs = layer_module( 2025-08-14T21:41:10.8466880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8467210Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8467544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8467895Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8468238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8468603Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8468955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8469336Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8469486Z 2025-08-14T21:41:10.8469589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8469913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8470213Z return mod(**inputs) 2025-08-14T21:41:10.8470539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8470878Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8471221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8471568Z layer_outputs = layer_module( 2025-08-14T21:41:10.8471885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8472215Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8472563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8472921Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8473277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8473623Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8473971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8474349Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8474499Z 2025-08-14T21:41:10.8474593Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8474922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8475220Z return mod(**inputs) 2025-08-14T21:41:10.8475545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8475885Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8476222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8476584Z layer_outputs = layer_module( 2025-08-14T21:41:10.8476900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8477243Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8477608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8477959Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8478299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8478652Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8479016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8479369Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8479487Z 2025-08-14T21:41:10.8479582Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8479919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8480222Z return mod(**inputs) 2025-08-14T21:41:10.8480540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8480907Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8481247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8481591Z layer_outputs = layer_module( 2025-08-14T21:41:10.8481900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8482232Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8482577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8482921Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8483266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8483624Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8483972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8484313Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8484444Z 2025-08-14T21:41:10.8484539Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8485167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8485471Z return mod(**inputs) 2025-08-14T21:41:10.8485790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8486142Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8486484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8486825Z layer_outputs = layer_module( 2025-08-14T21:41:10.8487148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8487484Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8487831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8488174Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8488523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8488878Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8489218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8489646Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8489778Z 2025-08-14T21:41:10.8489874Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8490203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8490527Z return mod(**inputs) 2025-08-14T21:41:10.8490852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8491198Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8491540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8491900Z layer_outputs = layer_module( 2025-08-14T21:41:10.8492221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8492554Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8492888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8493237Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8493582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8493973Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8494313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8494661Z key_states = self.k(current_states) 2025-08-14T21:41:10.8494780Z 2025-08-14T21:41:10.8494886Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8495208Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8495506Z return mod(**inputs) 2025-08-14T21:41:10.8495831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8496224Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8496566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8496924Z layer_outputs = layer_module( 2025-08-14T21:41:10.8497253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8497654Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8498009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8498363Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8498713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8499059Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8499406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8499806Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8499976Z 2025-08-14T21:41:10.8500080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8500408Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8500709Z return mod(**inputs) 2025-08-14T21:41:10.8501027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8501370Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8501708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8502050Z layer_outputs = layer_module( 2025-08-14T21:41:10.8502398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8502735Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8503078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8503444Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8503792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8504133Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8504493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8505005Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8505201Z 2025-08-14T21:41:10.8505299Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8505637Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8505939Z return mod(**inputs) 2025-08-14T21:41:10.8506266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8506611Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8506976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8507323Z layer_outputs = layer_module( 2025-08-14T21:41:10.8507635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8507973Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8508321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8508670Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8509009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8509362Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8509711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8510133Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8510328Z 2025-08-14T21:41:10.8510421Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8510756Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8511059Z return mod(**inputs) 2025-08-14T21:41:10.8511371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8511719Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8512060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8512408Z layer_outputs = layer_module( 2025-08-14T21:41:10.8512716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8513050Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8513398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8513760Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8514112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8514464Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8514814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8515237Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8515439Z 2025-08-14T21:41:10.8515533Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8515864Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8516182Z return mod(**inputs) 2025-08-14T21:41:10.8516500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8516847Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8517193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8517556Z layer_outputs = layer_module( 2025-08-14T21:41:10.8517867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8518199Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8518550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8518897Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8519252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8519621Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8519968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8520307Z value_states = self.v(current_states) 2025-08-14T21:41:10.8520440Z 2025-08-14T21:41:10.8520538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8520869Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8521158Z return mod(**inputs) 2025-08-14T21:41:10.8521480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8521827Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8522164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8522500Z layer_outputs = layer_module( 2025-08-14T21:41:10.8522827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8523161Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8523499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8523853Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8524200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8524547Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8524885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8525258Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8525413Z 2025-08-14T21:41:10.8525510Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8525842Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8526134Z return mod(**inputs) 2025-08-14T21:41:10.8526458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8526805Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8527136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8527479Z layer_outputs = layer_module( 2025-08-14T21:41:10.8527813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8528148Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8528485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8528849Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8529200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8529544Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8547731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8548172Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8548337Z 2025-08-14T21:41:10.8548445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8548811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8549136Z return mod(**inputs) 2025-08-14T21:41:10.8549488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8549877Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8550242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8550625Z layer_outputs = layer_module( 2025-08-14T21:41:10.8550946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8551293Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8551647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8551992Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8552344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8552701Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8553055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8553439Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8553601Z 2025-08-14T21:41:10.8553699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8554037Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8554343Z return mod(**inputs) 2025-08-14T21:41:10.8554664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8555014Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8555361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8555702Z layer_outputs = layer_module( 2025-08-14T21:41:10.8556023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8556361Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8556710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8557055Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8557405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8557762Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8558101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8558451Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8558576Z 2025-08-14T21:41:10.8559466Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8559700Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8560030Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8560366Z return mod(**inputs) 2025-08-14T21:41:10.8560697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8561042Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8561386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8561749Z layer_outputs = layer_module( 2025-08-14T21:41:10.8562074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8562400Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8562754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8563117Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8563477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8563857Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8564222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8564570Z return self.weight * hidden_states 2025-08-14T21:41:10.8564692Z 2025-08-14T21:41:10.8564791Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8565124Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8565425Z return mod(**inputs) 2025-08-14T21:41:10.8565751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8566092Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8566434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8566781Z layer_outputs = layer_module( 2025-08-14T21:41:10.8567093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8567431Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8567776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8568139Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8568489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8568875Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8569259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8569625Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8569766Z 2025-08-14T21:41:10.8569860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8570192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8570496Z return mod(**inputs) 2025-08-14T21:41:10.8570812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8571160Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8571501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8571845Z layer_outputs = layer_module( 2025-08-14T21:41:10.8572173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8572511Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8572867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8573235Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8573592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8573978Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8574375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8574719Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8574850Z 2025-08-14T21:41:10.8574944Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8575275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8575576Z return mod(**inputs) 2025-08-14T21:41:10.8575892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8576240Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8576602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8576943Z layer_outputs = layer_module( 2025-08-14T21:41:10.8577267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8577609Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8577960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8578317Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8578681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8579070Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8579449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8579819Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8579961Z 2025-08-14T21:41:10.8580062Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8580402Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8580703Z return mod(**inputs) 2025-08-14T21:41:10.8581038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8581389Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8581732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8582085Z layer_outputs = layer_module( 2025-08-14T21:41:10.8582408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8582740Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8583095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8583457Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8583818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8584198Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8584822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8585278Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8585409Z 2025-08-14T21:41:10.8585491Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8585707Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8586038Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8586382Z return mod(**inputs) 2025-08-14T21:41:10.8586701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8587049Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8587416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8587763Z layer_outputs = layer_module( 2025-08-14T21:41:10.8588072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8588402Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8588747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8589094Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8589442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8589841Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8590209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8590548Z return self.weight * hidden_states 2025-08-14T21:41:10.8590677Z 2025-08-14T21:41:10.8590774Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8591104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8591395Z return mod(**inputs) 2025-08-14T21:41:10.8591717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8592063Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8592402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8592739Z layer_outputs = layer_module( 2025-08-14T21:41:10.8593056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8593384Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8593720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8594071Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8594417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8594771Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8595115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8595462Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8595593Z 2025-08-14T21:41:10.8595688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8596022Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8596311Z return mod(**inputs) 2025-08-14T21:41:10.8596632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8596980Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8597310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8597661Z layer_outputs = layer_module( 2025-08-14T21:41:10.8598016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8598360Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8598704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8599085Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8599444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8599799Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8600178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8600543Z key_states = self.k(current_states) 2025-08-14T21:41:10.8600670Z 2025-08-14T21:41:10.8600777Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8601121Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8601422Z return mod(**inputs) 2025-08-14T21:41:10.8601749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8602099Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8602445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8602783Z layer_outputs = layer_module( 2025-08-14T21:41:10.8603098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8603423Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8603765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8604110Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8604453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8604800Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8605146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8605542Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8605711Z 2025-08-14T21:41:10.8605813Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8606137Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8606435Z return mod(**inputs) 2025-08-14T21:41:10.8606755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8607093Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8607432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8607777Z layer_outputs = layer_module( 2025-08-14T21:41:10.8608092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8608413Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8608758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8609105Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8609442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8609796Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8610143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8610579Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8610777Z 2025-08-14T21:41:10.8610870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8611202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8611515Z return mod(**inputs) 2025-08-14T21:41:10.8611832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8612180Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8612522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8612881Z layer_outputs = layer_module( 2025-08-14T21:41:10.8613188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8613518Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8613862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8614207Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8614544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8614927Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8615274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8615677Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8615874Z 2025-08-14T21:41:10.8615969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8616299Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8616592Z return mod(**inputs) 2025-08-14T21:41:10.8616906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8617252Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8617593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8617939Z layer_outputs = layer_module( 2025-08-14T21:41:10.8618249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8618576Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8618920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8619259Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8619605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8619952Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8620301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8620706Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8620903Z 2025-08-14T21:41:10.8620997Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8621325Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8621610Z return mod(**inputs) 2025-08-14T21:41:10.8621931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8622275Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8622613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8622944Z layer_outputs = layer_module( 2025-08-14T21:41:10.8623294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8623634Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8623985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8624343Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8624687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8625155Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8625522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8625875Z value_states = self.v(current_states) 2025-08-14T21:41:10.8626008Z 2025-08-14T21:41:10.8626104Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8626444Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8626737Z return mod(**inputs) 2025-08-14T21:41:10.8627061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8627414Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8627770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8628121Z layer_outputs = layer_module( 2025-08-14T21:41:10.8628448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8628789Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8629136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8629495Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8629853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8630214Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8630563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8630951Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8631104Z 2025-08-14T21:41:10.8631211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8631544Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8631848Z return mod(**inputs) 2025-08-14T21:41:10.8632181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8632533Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8632873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8633229Z layer_outputs = layer_module( 2025-08-14T21:41:10.8633553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8633887Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8634240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8634597Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8634951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8635304Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8635657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8636041Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8636211Z 2025-08-14T21:41:10.8636314Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8636637Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8636953Z return mod(**inputs) 2025-08-14T21:41:10.8637272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8637610Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8637950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8638304Z layer_outputs = layer_module( 2025-08-14T21:41:10.8638624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8638947Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8639291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8639638Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8639977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8640333Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8640704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8641081Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8641231Z 2025-08-14T21:41:10.8641325Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8641659Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8641957Z return mod(**inputs) 2025-08-14T21:41:10.8642279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8642617Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8642955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8643300Z layer_outputs = layer_module( 2025-08-14T21:41:10.8643608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8643941Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8644288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8644641Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8644990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8645343Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8645686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8646035Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8646153Z 2025-08-14T21:41:10.8646258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8646583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8646884Z return mod(**inputs) 2025-08-14T21:41:10.8647205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8647550Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8647885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8648229Z layer_outputs = layer_module( 2025-08-14T21:41:10.8648550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8648894Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8649233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8649610Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8649970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8650330Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8650694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8651081Z return self.weight * hidden_states 2025-08-14T21:41:10.8651208Z 2025-08-14T21:41:10.8651308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8651628Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8651931Z return mod(**inputs) 2025-08-14T21:41:10.8652255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8652589Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8652928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8653290Z layer_outputs = layer_module( 2025-08-14T21:41:10.8653604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8653926Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8654272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8654632Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8654986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8655362Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8655742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8656110Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8656250Z 2025-08-14T21:41:10.8656343Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8656669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8656966Z return mod(**inputs) 2025-08-14T21:41:10.8657288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8657629Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8657967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8658315Z layer_outputs = layer_module( 2025-08-14T21:41:10.8658622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8658953Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8659298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8659658Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8660004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8660388Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8660764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8661114Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8661240Z 2025-08-14T21:41:10.8661349Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8661680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8661978Z return mod(**inputs) 2025-08-14T21:41:10.8662312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8662659Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8662999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8663339Z layer_outputs = layer_module( 2025-08-14T21:41:10.8663661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8663999Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8664348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8664703Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8665159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8665550Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8665951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8666302Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8666442Z 2025-08-14T21:41:10.8666535Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8666865Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8667168Z return mod(**inputs) 2025-08-14T21:41:10.8667483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8667830Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8668168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8668503Z layer_outputs = layer_module( 2025-08-14T21:41:10.8668820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8669154Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8669495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8669842Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8670192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8670570Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8670937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8671291Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8671412Z 2025-08-14T21:41:10.8671493Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8671702Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8672028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8672325Z return mod(**inputs) 2025-08-14T21:41:10.8672645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8672985Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8673321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8673665Z layer_outputs = layer_module( 2025-08-14T21:41:10.8673985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8674318Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8674660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8675031Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8675374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8675746Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8676126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8676476Z return self.weight * hidden_states 2025-08-14T21:41:10.8676595Z 2025-08-14T21:41:10.8676689Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8677020Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8677317Z return mod(**inputs) 2025-08-14T21:41:10.8677629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8677969Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8678315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8678643Z layer_outputs = layer_module( 2025-08-14T21:41:10.8678944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8679270Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8679612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8679952Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8680297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8680643Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8680988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8681330Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8681457Z 2025-08-14T21:41:10.8681549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8681876Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8682172Z return mod(**inputs) 2025-08-14T21:41:10.8682481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8682815Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8683146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8683479Z layer_outputs = layer_module( 2025-08-14T21:41:10.8683791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8684125Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8684466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8684958Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8685305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8685662Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8686005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8686355Z key_states = self.k(current_states) 2025-08-14T21:41:10.8686480Z 2025-08-14T21:41:10.8686623Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8686956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8687247Z return mod(**inputs) 2025-08-14T21:41:10.8687599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8687956Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8688305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8688646Z layer_outputs = layer_module( 2025-08-14T21:41:10.8688998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8689335Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8689678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8690028Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8690383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8690739Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8691104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8691500Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8691667Z 2025-08-14T21:41:10.8691770Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8692094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8692394Z return mod(**inputs) 2025-08-14T21:41:10.8692717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8693064Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8693395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8693734Z layer_outputs = layer_module( 2025-08-14T21:41:10.8694049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8694378Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8694715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8695058Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8695401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8695741Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8696083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8696497Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8696687Z 2025-08-14T21:41:10.8696786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8697106Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8697401Z return mod(**inputs) 2025-08-14T21:41:10.8697721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8698055Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8698394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8698734Z layer_outputs = layer_module( 2025-08-14T21:41:10.8699039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8699375Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8699722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8700088Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8700433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8700776Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8701118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8701543Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8701735Z 2025-08-14T21:41:10.8701829Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8702159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8702460Z return mod(**inputs) 2025-08-14T21:41:10.8702783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8703123Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8703461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8703812Z layer_outputs = layer_module( 2025-08-14T21:41:10.8704108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8704435Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8704849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8705215Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8705556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8705908Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8706255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8706673Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8706864Z 2025-08-14T21:41:10.8706958Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8707288Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8707582Z return mod(**inputs) 2025-08-14T21:41:10.8707903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8708252Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8708591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8708934Z layer_outputs = layer_module( 2025-08-14T21:41:10.8709242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8709575Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8709919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8710270Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8710607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8710958Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8711300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8711640Z value_states = self.v(current_states) 2025-08-14T21:41:10.8711767Z 2025-08-14T21:41:10.8711880Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8712209Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8712506Z return mod(**inputs) 2025-08-14T21:41:10.8712835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8713179Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8713519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8713855Z layer_outputs = layer_module( 2025-08-14T21:41:10.8714183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8714517Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8714862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8715211Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8715561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8715919Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8716273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8716645Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8716800Z 2025-08-14T21:41:10.8716894Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8717222Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8717513Z return mod(**inputs) 2025-08-14T21:41:10.8717834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8718177Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8718511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8718835Z layer_outputs = layer_module( 2025-08-14T21:41:10.8719139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8719469Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8719801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8720146Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8720488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8720838Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8721173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8721541Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8721688Z 2025-08-14T21:41:10.8721787Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8722116Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8722408Z return mod(**inputs) 2025-08-14T21:41:10.8722715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8723050Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8723376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8723711Z layer_outputs = layer_module( 2025-08-14T21:41:10.8724016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8724361Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8724701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8725048Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8725407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8725747Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8726087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8726469Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8726618Z 2025-08-14T21:41:10.8726716Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8727036Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8727329Z return mod(**inputs) 2025-08-14T21:41:10.8727643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8727981Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8728313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8728669Z layer_outputs = layer_module( 2025-08-14T21:41:10.8728982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8729304Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8729647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8729993Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8730341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8730682Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8731021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8731364Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8731482Z 2025-08-14T21:41:10.8731557Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8731771Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8732094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8732386Z return mod(**inputs) 2025-08-14T21:41:10.8732697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8733041Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8733384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8733722Z layer_outputs = layer_module( 2025-08-14T21:41:10.8734040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8734369Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8734713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8735063Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8735416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8735781Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8736140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8736479Z return self.weight * hidden_states 2025-08-14T21:41:10.8736605Z 2025-08-14T21:41:10.8736714Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8737049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8737339Z return mod(**inputs) 2025-08-14T21:41:10.8737694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8738041Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8738378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8738713Z layer_outputs = layer_module( 2025-08-14T21:41:10.8739041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8739374Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8739710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8740066Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8740418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8740799Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8741187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8741551Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8741686Z 2025-08-14T21:41:10.8741786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8742114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8742401Z return mod(**inputs) 2025-08-14T21:41:10.8742721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8743066Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8743395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8743742Z layer_outputs = layer_module( 2025-08-14T21:41:10.8744061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8744382Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8744713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8745156Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8745506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8745874Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8746241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8746577Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8746697Z 2025-08-14T21:41:10.8746792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8747111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8747406Z return mod(**inputs) 2025-08-14T21:41:10.8747722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8748059Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8748384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8748721Z layer_outputs = layer_module( 2025-08-14T21:41:10.8749030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8749372Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8749714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8750084Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8750430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8750799Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8751168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8751525Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8751653Z 2025-08-14T21:41:10.8751746Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8752057Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8752347Z return mod(**inputs) 2025-08-14T21:41:10.8752661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8752991Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8753324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8753674Z layer_outputs = layer_module( 2025-08-14T21:41:10.8753975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8754289Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8754623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8754970Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8755309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8755676Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8756036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8756374Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8756495Z 2025-08-14T21:41:10.8756565Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8756774Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8757097Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8757384Z return mod(**inputs) 2025-08-14T21:41:10.8757698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8758030Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8758357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8758685Z layer_outputs = layer_module( 2025-08-14T21:41:10.8758991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8759314Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8759645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8759985Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8760320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8760681Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8761029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8761368Z return self.weight * hidden_states 2025-08-14T21:41:10.8761510Z 2025-08-14T21:41:10.8761608Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8761932Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8762235Z return mod(**inputs) 2025-08-14T21:41:10.8762556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8762901Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8763234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8763589Z layer_outputs = layer_module( 2025-08-14T21:41:10.8763909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8764239Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8764570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8764908Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8765241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8765581Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8765938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8766277Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8766395Z 2025-08-14T21:41:10.8766495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8766816Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8767113Z return mod(**inputs) 2025-08-14T21:41:10.8767434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8767775Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8768104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8768445Z layer_outputs = layer_module( 2025-08-14T21:41:10.8768756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8769076Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8769417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8769758Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8770098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8770439Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8770784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8771126Z key_states = self.k(current_states) 2025-08-14T21:41:10.8771244Z 2025-08-14T21:41:10.8771343Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8771665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8771960Z return mod(**inputs) 2025-08-14T21:41:10.8772278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8772611Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8772948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8773287Z layer_outputs = layer_module( 2025-08-14T21:41:10.8773601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8773935Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8774282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8774642Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8774973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8775325Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8775669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8776072Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8776244Z 2025-08-14T21:41:10.8776337Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8776669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8776973Z return mod(**inputs) 2025-08-14T21:41:10.8777289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8777637Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8777979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8778335Z layer_outputs = layer_module( 2025-08-14T21:41:10.8778640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8778969Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8779314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8779659Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8779993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8780333Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8780664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8781065Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8781261Z 2025-08-14T21:41:10.8781354Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8781682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8781977Z return mod(**inputs) 2025-08-14T21:41:10.8782290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8782635Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8782969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8783304Z layer_outputs = layer_module( 2025-08-14T21:41:10.8783615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8783945Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8784290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8784822Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8785192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8785548Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8785890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8786026Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8786030Z 2025-08-14T21:41:10.8786170Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8786356Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8786412Z return mod(**inputs) 2025-08-14T21:41:10.8786659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8786729Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8786948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8787017Z layer_outputs = layer_module( 2025-08-14T21:41:10.8787236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8787308Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8787534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8787604Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8787828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8787902Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8788140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8788282Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8788286Z 2025-08-14T21:41:10.8788380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8788561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8788627Z return mod(**inputs) 2025-08-14T21:41:10.8788849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8788921Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8789140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8789206Z layer_outputs = layer_module( 2025-08-14T21:41:10.8789412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8789483Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8789701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8789769Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8789981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8790057Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8790270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8790339Z value_states = self.v(current_states) 2025-08-14T21:41:10.8790343Z 2025-08-14T21:41:10.8790439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8790620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8790686Z return mod(**inputs) 2025-08-14T21:41:10.8790904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8790966Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8791189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8791251Z layer_outputs = layer_module( 2025-08-14T21:41:10.8791446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8791540Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8791759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8791872Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8792088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8792165Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8792389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8792502Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8792505Z 2025-08-14T21:41:10.8792606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8792790Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8792851Z return mod(**inputs) 2025-08-14T21:41:10.8793080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8793145Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8793364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8793451Z layer_outputs = layer_module( 2025-08-14T21:41:10.8793652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8793744Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8793965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8794041Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8794268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8794344Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8794569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8794669Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8794674Z 2025-08-14T21:41:10.8794767Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8794958Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8795016Z return mod(**inputs) 2025-08-14T21:41:10.8795240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8795316Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8795537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8795608Z layer_outputs = layer_module( 2025-08-14T21:41:10.8795812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8795882Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8796109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8796184Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8796400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8796483Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8796704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8796808Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8796811Z 2025-08-14T21:41:10.8796918Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8797100Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8797168Z return mod(**inputs) 2025-08-14T21:41:10.8797388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8797479Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8797701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8797763Z layer_outputs = layer_module( 2025-08-14T21:41:10.8797986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8798059Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8798276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8798358Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8798574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8798655Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8798874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8798959Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8798963Z 2025-08-14T21:41:10.8799064Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8799248Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8799312Z return mod(**inputs) 2025-08-14T21:41:10.8799535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8799599Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8799828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8799891Z layer_outputs = layer_module( 2025-08-14T21:41:10.8800091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8800171Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8800387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8800463Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8800679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 485, in forward 2025-08-14T21:41:10.8800798Z hidden_states = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:41:10.8800802Z 2025-08-14T21:41:10.8800882Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8800976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8801156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8801221Z return mod(**inputs) 2025-08-14T21:41:10.8801442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8801514Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8801732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8801795Z layer_outputs = layer_module( 2025-08-14T21:41:10.8802002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8802072Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8802310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8802397Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8802616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8802724Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8802948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8803017Z return self.weight * hidden_states 2025-08-14T21:41:10.8803021Z 2025-08-14T21:41:10.8803120Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8803321Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8803388Z return mod(**inputs) 2025-08-14T21:41:10.8803610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8803678Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8803904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8803967Z layer_outputs = layer_module( 2025-08-14T21:41:10.8804167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8804266Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8804483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8804573Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8804791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8804897Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8805124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8805215Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8805218Z 2025-08-14T21:41:10.8805316Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8805499Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8805559Z return mod(**inputs) 2025-08-14T21:41:10.8805787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8805852Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8806071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8806143Z layer_outputs = layer_module( 2025-08-14T21:41:10.8806340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8806417Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8806631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8806712Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8806938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8807045Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8807269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8807341Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8807344Z 2025-08-14T21:41:10.8807435Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8807626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8807696Z return mod(**inputs) 2025-08-14T21:41:10.8807919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8807993Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8808229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8808303Z layer_outputs = layer_module( 2025-08-14T21:41:10.8808504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8808573Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8808809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8808892Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8809119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8809222Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8809440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8809531Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8809548Z 2025-08-14T21:41:10.8809642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8809826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8809893Z return mod(**inputs) 2025-08-14T21:41:10.8810115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8810188Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8810407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8810473Z layer_outputs = layer_module( 2025-08-14T21:41:10.8810682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8810754Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8810971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8811061Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8811279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8811391Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8811609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8811681Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8811684Z 2025-08-14T21:41:10.8811766Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8811860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8812048Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8812110Z return mod(**inputs) 2025-08-14T21:41:10.8812331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8812407Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8812624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8812689Z layer_outputs = layer_module( 2025-08-14T21:41:10.8812897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8812969Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8813204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8813279Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8813492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8813629Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8813846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8813919Z return self.weight * hidden_states 2025-08-14T21:41:10.8813923Z 2025-08-14T21:41:10.8814014Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8814208Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8814276Z return mod(**inputs) 2025-08-14T21:41:10.8814503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8814569Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8814802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8814869Z layer_outputs = layer_module( 2025-08-14T21:41:10.8815082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8815168Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8815383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8815465Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8815680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8815756Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8815980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8816049Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8816053Z 2025-08-14T21:41:10.8816151Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8816335Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8816395Z return mod(**inputs) 2025-08-14T21:41:10.8816622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8816685Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8816911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8816975Z layer_outputs = layer_module( 2025-08-14T21:41:10.8817176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8817256Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8817468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8817542Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8817764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8817839Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8818062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8818131Z key_states = self.k(current_states) 2025-08-14T21:41:10.8818134Z 2025-08-14T21:41:10.8818227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8818415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8818488Z return mod(**inputs) 2025-08-14T21:41:10.8818710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8818781Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8819018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8819090Z layer_outputs = layer_module( 2025-08-14T21:41:10.8819289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8819360Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8819597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8819673Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8819904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8819982Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8820203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8820333Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8820349Z 2025-08-14T21:41:10.8820443Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8820623Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8820689Z return mod(**inputs) 2025-08-14T21:41:10.8820912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8820983Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8821200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8821266Z layer_outputs = layer_module( 2025-08-14T21:41:10.8821473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8821542Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8821769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8821842Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8822061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8822142Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8822360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8822501Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8822511Z 2025-08-14T21:41:10.8822604Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8822786Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8822852Z return mod(**inputs) 2025-08-14T21:41:10.8823077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8823144Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8823372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8823435Z layer_outputs = layer_module( 2025-08-14T21:41:10.8823645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8823715Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8823945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8824026Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8824240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8824327Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8824554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8824694Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8824697Z 2025-08-14T21:41:10.8824884Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8825108Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8825171Z return mod(**inputs) 2025-08-14T21:41:10.8825403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8825472Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8825693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8825766Z layer_outputs = layer_module( 2025-08-14T21:41:10.8825971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8826066Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8826285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8826356Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8826581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8826654Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8826878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8827013Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8827017Z 2025-08-14T21:41:10.8827106Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8827293Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8827352Z return mod(**inputs) 2025-08-14T21:41:10.8827569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8827643Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8827861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8827932Z layer_outputs = layer_module( 2025-08-14T21:41:10.8828130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8828200Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8828420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8828491Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8828716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8828789Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8829004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8829081Z value_states = self.v(current_states) 2025-08-14T21:41:10.8829085Z 2025-08-14T21:41:10.8829176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8829357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8829439Z return mod(**inputs) 2025-08-14T21:41:10.8829658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8829730Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8829966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8830032Z layer_outputs = layer_module( 2025-08-14T21:41:10.8830238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8830311Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8830542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8830622Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8830840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8830923Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8831138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8831239Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8831257Z 2025-08-14T21:41:10.8831359Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8832166Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8832232Z return mod(**inputs) 2025-08-14T21:41:10.8832453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8832519Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8832746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8832812Z layer_outputs = layer_module( 2025-08-14T21:41:10.8833020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8833092Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8833311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8833393Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8833609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8833683Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8833909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8834005Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8834009Z 2025-08-14T21:41:10.8834108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8834288Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8834346Z return mod(**inputs) 2025-08-14T21:41:10.8834573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8834640Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8834857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8834928Z layer_outputs = layer_module( 2025-08-14T21:41:10.8835130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8835210Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8835425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8835508Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8835733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8835807Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8836048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8836146Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8836150Z 2025-08-14T21:41:10.8836240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8836440Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8836500Z return mod(**inputs) 2025-08-14T21:41:10.8836719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8836791Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8837010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8837080Z layer_outputs = layer_module( 2025-08-14T21:41:10.8837280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8837368Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8837590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8837662Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8837882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8837956Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8838171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8838247Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8838251Z 2025-08-14T21:41:10.8838323Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8838416Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8838603Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8838662Z return mod(**inputs) 2025-08-14T21:41:10.8838888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8838952Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8839172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8839244Z layer_outputs = layer_module( 2025-08-14T21:41:10.8839441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8839512Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8839729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8839810Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8840032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8840118Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8840331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8840408Z return self.weight * hidden_states 2025-08-14T21:41:10.8840412Z 2025-08-14T21:41:10.8840501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8840686Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8840758Z return mod(**inputs) 2025-08-14T21:41:10.8840975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8841043Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8841275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8841340Z layer_outputs = layer_module( 2025-08-14T21:41:10.8841547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8841617Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8841855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8841938Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8842155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8842271Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8842488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8842588Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8842605Z 2025-08-14T21:41:10.8842698Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8842880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8842947Z return mod(**inputs) 2025-08-14T21:41:10.8843167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8843235Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8843462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8843528Z layer_outputs = layer_module( 2025-08-14T21:41:10.8843737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8843807Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8844028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8844118Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8844335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8844440Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8844665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8844736Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8844739Z 2025-08-14T21:41:10.8844841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8845023Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8845082Z return mod(**inputs) 2025-08-14T21:41:10.8845311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8845385Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8845613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8845679Z layer_outputs = layer_module( 2025-08-14T21:41:10.8845881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8845959Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8846175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8846273Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8846495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8846621Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8846846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8846926Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8846929Z 2025-08-14T21:41:10.8847023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8847225Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8847286Z return mod(**inputs) 2025-08-14T21:41:10.8847507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8847583Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8847801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8847870Z layer_outputs = layer_module( 2025-08-14T21:41:10.8848072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8848161Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8848388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8848467Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8848690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8848792Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8849009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8849089Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8849092Z 2025-08-14T21:41:10.8849164Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8849258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8849449Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8849507Z return mod(**inputs) 2025-08-14T21:41:10.8849733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8849798Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8850016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8850089Z layer_outputs = layer_module( 2025-08-14T21:41:10.8850291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8850363Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8850587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8850663Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8850888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8850982Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8851198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8851273Z return self.weight * hidden_states 2025-08-14T21:41:10.8851277Z 2025-08-14T21:41:10.8851368Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8851568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8851630Z return mod(**inputs) 2025-08-14T21:41:10.8851849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8851940Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8852160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8852225Z layer_outputs = layer_module( 2025-08-14T21:41:10.8852431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8852516Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8852741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8852814Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8853029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8853111Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8853327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8853404Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8853424Z 2025-08-14T21:41:10.8853518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8853701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8853766Z return mod(**inputs) 2025-08-14T21:41:10.8853985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8854052Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8854278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8854341Z layer_outputs = layer_module( 2025-08-14T21:41:10.8854549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8854619Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8854838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8854920Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8855136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8855212Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8855436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8855506Z key_states = self.k(current_states) 2025-08-14T21:41:10.8855509Z 2025-08-14T21:41:10.8855611Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8855794Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8855853Z return mod(**inputs) 2025-08-14T21:41:10.8856082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8856147Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8856373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8856438Z layer_outputs = layer_module( 2025-08-14T21:41:10.8856641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8856719Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8856934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8857023Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8857247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8857340Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8857564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8857685Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8857689Z 2025-08-14T21:41:10.8857781Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8857984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8858045Z return mod(**inputs) 2025-08-14T21:41:10.8858273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8858341Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8858560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8858632Z layer_outputs = layer_module( 2025-08-14T21:41:10.8858834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8858921Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8859147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8859217Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8859443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8859517Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8859735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8859884Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8859887Z 2025-08-14T21:41:10.8859979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8860169Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8860229Z return mod(**inputs) 2025-08-14T21:41:10.8860454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8860527Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8860749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8860814Z layer_outputs = layer_module( 2025-08-14T21:41:10.8861023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8861096Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8861322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8861393Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8861612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8861695Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8861913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8862054Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8862065Z 2025-08-14T21:41:10.8862156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8862337Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8862419Z return mod(**inputs) 2025-08-14T21:41:10.8862641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8862707Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8862950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8863015Z layer_outputs = layer_module( 2025-08-14T21:41:10.8863222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8863295Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8863524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8863613Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8863844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8863925Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8864154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8864297Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8864314Z 2025-08-14T21:41:10.8864415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8864594Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8864651Z return mod(**inputs) 2025-08-14T21:41:10.8864976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8865051Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8865283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8865349Z layer_outputs = layer_module( 2025-08-14T21:41:10.8865553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8865632Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8865852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8865928Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8866154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8866232Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8866459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8866529Z value_states = self.v(current_states) 2025-08-14T21:41:10.8866532Z 2025-08-14T21:41:10.8866628Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8866821Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8866882Z return mod(**inputs) 2025-08-14T21:41:10.8867106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8867182Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8867403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8867475Z layer_outputs = layer_module( 2025-08-14T21:41:10.8867677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8867747Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8867971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8868070Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8868294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8868384Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8868598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8868705Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8868708Z 2025-08-14T21:41:10.8868799Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8868995Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8869064Z return mod(**inputs) 2025-08-14T21:41:10.8869282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8869354Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8869577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8869643Z layer_outputs = layer_module( 2025-08-14T21:41:10.8869852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8869940Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8870166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8870238Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8870457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8870537Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8870756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8870851Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8870855Z 2025-08-14T21:41:10.8870956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8871140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8871206Z return mod(**inputs) 2025-08-14T21:41:10.8871426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8871490Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8871717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8871783Z layer_outputs = layer_module( 2025-08-14T21:41:10.8871984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8872064Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8872284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8872362Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8872581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8872655Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8872880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8872976Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8872980Z 2025-08-14T21:41:10.8873080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8873263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8873322Z return mod(**inputs) 2025-08-14T21:41:10.8873565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8873630Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8873851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8873940Z layer_outputs = layer_module( 2025-08-14T21:41:10.8874140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8874220Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8874449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8874523Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8874745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8874820Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8875044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8875113Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8875118Z 2025-08-14T21:41:10.8875227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8875415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8875475Z return mod(**inputs) 2025-08-14T21:41:10.8875702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8875778Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8876001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8876076Z layer_outputs = layer_module( 2025-08-14T21:41:10.8876279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8876353Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8876580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8876657Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8876880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 485, in forward 2025-08-14T21:41:10.8877011Z hidden_states = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:41:10.8877014Z 2025-08-14T21:41:10.8877089Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8877193Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8877380Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8877445Z return mod(**inputs) 2025-08-14T21:41:10.8877676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8877743Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8877976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8878044Z layer_outputs = layer_module( 2025-08-14T21:41:10.8878247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8878329Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8878551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8878636Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8878878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8878967Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8879191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8879278Z return self.weight * hidden_states 2025-08-14T21:41:10.8879283Z 2025-08-14T21:41:10.8879373Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8879562Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8879619Z return mod(**inputs) 2025-08-14T21:41:10.8879865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8879931Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8880150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8880223Z layer_outputs = layer_module( 2025-08-14T21:41:10.8880422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8880493Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8880716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8880812Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8881036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8881140Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8881355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8881451Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8881454Z 2025-08-14T21:41:10.8881545Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8881732Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8881790Z return mod(**inputs) 2025-08-14T21:41:10.8882010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8882086Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8882306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8882370Z layer_outputs = layer_module( 2025-08-14T21:41:10.8882580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8882651Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8882872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8882953Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8883169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8883281Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8883497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8883570Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8883581Z 2025-08-14T21:41:10.8883672Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8883854Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8883922Z return mod(**inputs) 2025-08-14T21:41:10.8884142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8884220Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8884448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8884513Z layer_outputs = layer_module( 2025-08-14T21:41:10.8884870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8884951Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8885170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8885258Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8885513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8885621Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8885852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8885931Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8885935Z 2025-08-14T21:41:10.8886037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8886218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8886312Z return mod(**inputs) 2025-08-14T21:41:10.8886541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8886606Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8886837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8886901Z layer_outputs = layer_module( 2025-08-14T21:41:10.8887105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8887186Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8887405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8887483Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8887714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8887818Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8888046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8888118Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8888122Z 2025-08-14T21:41:10.8888195Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8888296Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8888481Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8888540Z return mod(**inputs) 2025-08-14T21:41:10.8888768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8888834Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8889061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8889129Z layer_outputs = layer_module( 2025-08-14T21:41:10.8889330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8889408Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8889628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8889708Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8889947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8890044Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8890267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8890363Z return self.weight * hidden_states 2025-08-14T21:41:10.8890366Z 2025-08-14T21:41:10.8890458Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8890648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8890707Z return mod(**inputs) 2025-08-14T21:41:10.8890947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8891013Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8891235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8891307Z layer_outputs = layer_module( 2025-08-14T21:41:10.8891508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8891579Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8891798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8891886Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8892113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8892187Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8892405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8892482Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8892486Z 2025-08-14T21:41:10.8892580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8892769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8892825Z return mod(**inputs) 2025-08-14T21:41:10.8893049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8893123Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8893343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8893406Z layer_outputs = layer_module( 2025-08-14T21:41:10.8893615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8893684Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8893910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8893981Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8894197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8894280Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8894499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8894574Z key_states = self.k(current_states) 2025-08-14T21:41:10.8894578Z 2025-08-14T21:41:10.8894670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8894852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8894917Z return mod(**inputs) 2025-08-14T21:41:10.8895137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8895215Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8895441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8895505Z layer_outputs = layer_module( 2025-08-14T21:41:10.8895730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8895804Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8896019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8896097Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8896334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8896410Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8896638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8896758Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8896761Z 2025-08-14T21:41:10.8896860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8897044Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8897124Z return mod(**inputs) 2025-08-14T21:41:10.8897352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8897415Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8897643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8897708Z layer_outputs = layer_module( 2025-08-14T21:41:10.8897910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8897991Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8898209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8898280Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8898508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8898591Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8898818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8898959Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8898962Z 2025-08-14T21:41:10.8899055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8899247Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8899309Z return mod(**inputs) 2025-08-14T21:41:10.8899538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8899603Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8899824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8899895Z layer_outputs = layer_module( 2025-08-14T21:41:10.8900095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8900169Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8900394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8900466Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8900728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8900803Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8901019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8901179Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8901184Z 2025-08-14T21:41:10.8901275Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8901462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8901521Z return mod(**inputs) 2025-08-14T21:41:10.8901756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8901832Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8902053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8902120Z layer_outputs = layer_module( 2025-08-14T21:41:10.8902327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8902400Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8902625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8902713Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8902931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8903012Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8903228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8903363Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8903374Z 2025-08-14T21:41:10.8903468Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8903648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8903714Z return mod(**inputs) 2025-08-14T21:41:10.8903938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8904003Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8904231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8904295Z layer_outputs = layer_module( 2025-08-14T21:41:10.8904503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8904574Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8904892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8904984Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8905200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8905276Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8905505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8905577Z value_states = self.v(current_states) 2025-08-14T21:41:10.8905581Z 2025-08-14T21:41:10.8905685Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8905866Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8905927Z return mod(**inputs) 2025-08-14T21:41:10.8906153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8906238Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8906467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8906533Z layer_outputs = layer_module( 2025-08-14T21:41:10.8906751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8906832Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8907050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8907121Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8907368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8907444Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8907672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8907771Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8907774Z 2025-08-14T21:41:10.8907865Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8908056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8908131Z return mod(**inputs) 2025-08-14T21:41:10.8908357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8908429Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8908655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8908726Z layer_outputs = layer_module( 2025-08-14T21:41:10.8908931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8909004Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8909234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8909304Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8909536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8909614Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8909834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8909939Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8909944Z 2025-08-14T21:41:10.8910037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8910223Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8910290Z return mod(**inputs) 2025-08-14T21:41:10.8910516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8910587Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8910811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8910877Z layer_outputs = layer_module( 2025-08-14T21:41:10.8911088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8911158Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8911385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8911458Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8911678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8911771Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8911989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8912086Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8912102Z 2025-08-14T21:41:10.8912207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8912389Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8912454Z return mod(**inputs) 2025-08-14T21:41:10.8912686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8912753Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8912981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8913047Z layer_outputs = layer_module( 2025-08-14T21:41:10.8913248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8913325Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8913542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8913637Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8913852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8913925Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8914153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8914222Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8914226Z 2025-08-14T21:41:10.8914305Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8914398Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8914581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8914646Z return mod(**inputs) 2025-08-14T21:41:10.8914867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8914935Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8915164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8915228Z layer_outputs = layer_module( 2025-08-14T21:41:10.8915436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8915507Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8915723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8915812Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8916030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8916119Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8916344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8916415Z return self.weight * hidden_states 2025-08-14T21:41:10.8916418Z 2025-08-14T21:41:10.8916517Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8916700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8916758Z return mod(**inputs) 2025-08-14T21:41:10.8916987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8917066Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8917296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8917360Z layer_outputs = layer_module( 2025-08-14T21:41:10.8917579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8917659Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8917876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8917959Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8918195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8918302Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8918529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8918620Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8918624Z 2025-08-14T21:41:10.8918715Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8918908Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8918982Z return mod(**inputs) 2025-08-14T21:41:10.8919212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8919278Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8919499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8919571Z layer_outputs = layer_module( 2025-08-14T21:41:10.8919770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8919844Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8920070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8920149Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8920377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8920483Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8920700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8920781Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8920784Z 2025-08-14T21:41:10.8920877Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8921065Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8921124Z return mod(**inputs) 2025-08-14T21:41:10.8921345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8921419Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8921639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8921704Z layer_outputs = layer_module( 2025-08-14T21:41:10.8921915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8921986Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8922213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8922293Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8922523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8922638Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8922853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8922948Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8922960Z 2025-08-14T21:41:10.8923053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8923233Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8923299Z return mod(**inputs) 2025-08-14T21:41:10.8923536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8923605Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8923834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8923898Z layer_outputs = layer_module( 2025-08-14T21:41:10.8924106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8924176Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8924394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8924497Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8924713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8924815Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8925037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8925107Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8925110Z 2025-08-14T21:41:10.8925190Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8925283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8925466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8925535Z return mod(**inputs) 2025-08-14T21:41:10.8925754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8925826Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8926047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8926110Z layer_outputs = layer_module( 2025-08-14T21:41:10.8926320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8926391Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8926609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8926691Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8926905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8927009Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8927224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8927292Z return self.weight * hidden_states 2025-08-14T21:41:10.8927296Z 2025-08-14T21:41:10.8927395Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8927577Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8927635Z return mod(**inputs) 2025-08-14T21:41:10.8927864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8927945Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8928174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8928262Z layer_outputs = layer_module( 2025-08-14T21:41:10.8928463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8928544Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8928757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8928849Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8929072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8929146Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8929374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8929444Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8929447Z 2025-08-14T21:41:10.8929538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8929731Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8929804Z return mod(**inputs) 2025-08-14T21:41:10.8930032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8930099Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8930317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8930389Z layer_outputs = layer_module( 2025-08-14T21:41:10.8930591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8930661Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8930885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8930957Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8931181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8931257Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8931471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8931547Z key_states = self.k(current_states) 2025-08-14T21:41:10.8931551Z 2025-08-14T21:41:10.8931641Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8931825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8931883Z return mod(**inputs) 2025-08-14T21:41:10.8932101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8932173Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8932391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8932456Z layer_outputs = layer_module( 2025-08-14T21:41:10.8932664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8932735Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8932958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8933030Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8933244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8933337Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8933555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8933696Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8933701Z 2025-08-14T21:41:10.8933793Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8933973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8934040Z return mod(**inputs) 2025-08-14T21:41:10.8934273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8934341Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8934569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8934634Z layer_outputs = layer_module( 2025-08-14T21:41:10.8934841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8934911Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8935129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8935224Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8935442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8935515Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8935740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8935880Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8935884Z 2025-08-14T21:41:10.8935985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8936166Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8936223Z return mod(**inputs) 2025-08-14T21:41:10.8936451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8936519Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8936745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8936808Z layer_outputs = layer_module( 2025-08-14T21:41:10.8937008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8937086Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8937304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8937377Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8937601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8937675Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8937898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8938036Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8938040Z 2025-08-14T21:41:10.8938131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8938323Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8938382Z return mod(**inputs) 2025-08-14T21:41:10.8938610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8938690Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8938909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8938981Z layer_outputs = layer_module( 2025-08-14T21:41:10.8939202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8939274Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8939500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8939573Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8939813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8939889Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8940108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8940254Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8940257Z 2025-08-14T21:41:10.8940348Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8940536Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8940613Z return mod(**inputs) 2025-08-14T21:41:10.8940834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8940908Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8941129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8941193Z layer_outputs = layer_module( 2025-08-14T21:41:10.8941400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8941472Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8941697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8941771Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8941986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8942069Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8942286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8942356Z value_states = self.v(current_states) 2025-08-14T21:41:10.8942367Z 2025-08-14T21:41:10.8942459Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8942641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8942707Z return mod(**inputs) 2025-08-14T21:41:10.8942928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8942993Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8943221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8943287Z layer_outputs = layer_module( 2025-08-14T21:41:10.8943495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8943564Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8943781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8943859Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8944075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8944162Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8944388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8944504Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8944509Z 2025-08-14T21:41:10.8944606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8944876Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8944947Z return mod(**inputs) 2025-08-14T21:41:10.8945194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8945262Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8945494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8945561Z layer_outputs = layer_module( 2025-08-14T21:41:10.8945761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8945841Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8946060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8946149Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8946377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8946449Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8946676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8946773Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8946777Z 2025-08-14T21:41:10.8946870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8947057Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8947114Z return mod(**inputs) 2025-08-14T21:41:10.8947341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8947408Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8947628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8947699Z layer_outputs = layer_module( 2025-08-14T21:41:10.8947900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8947971Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8948195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8948267Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8948490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8948563Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8948781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8948886Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8948890Z 2025-08-14T21:41:10.8948981Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8949163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8949230Z return mod(**inputs) 2025-08-14T21:41:10.8949450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8949521Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8949766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8949832Z layer_outputs = layer_module( 2025-08-14T21:41:10.8950040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8950132Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8950355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8950427Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8950654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8950736Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8950950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8951020Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8951023Z 2025-08-14T21:41:10.8951123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8951300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8951367Z return mod(**inputs) 2025-08-14T21:41:10.8951602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8951673Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8951902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8951968Z layer_outputs = layer_module( 2025-08-14T21:41:10.8952170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8952250Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8952467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8952545Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8952757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 485, in forward 2025-08-14T21:41:10.8952880Z hidden_states = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:41:10.8952883Z 2025-08-14T21:41:10.8952962Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8953053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8953242Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8953300Z return mod(**inputs) 2025-08-14T21:41:10.8953518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8953592Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8953810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8953876Z layer_outputs = layer_module( 2025-08-14T21:41:10.8954082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8954154Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8954375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8954457Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8954674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8954767Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8954995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8955074Z return self.weight * hidden_states 2025-08-14T21:41:10.8955078Z 2025-08-14T21:41:10.8955168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8955346Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8955430Z return mod(**inputs) 2025-08-14T21:41:10.8955651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8955715Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8955956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8956023Z layer_outputs = layer_module( 2025-08-14T21:41:10.8956230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8956303Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8956521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8956610Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8956827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8956946Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8957169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8957258Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8957263Z 2025-08-14T21:41:10.8957360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8957540Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8957597Z return mod(**inputs) 2025-08-14T21:41:10.8957823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8957889Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8958114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8958180Z layer_outputs = layer_module( 2025-08-14T21:41:10.8958379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8958458Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8958675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8958757Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8958981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8959086Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8959308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8959380Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8959385Z 2025-08-14T21:41:10.8959476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8959663Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8959722Z return mod(**inputs) 2025-08-14T21:41:10.8959949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8960016Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8960239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8960322Z layer_outputs = layer_module( 2025-08-14T21:41:10.8960523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8960592Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8960836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8960918Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8961142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8961246Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8961483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8961574Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8961577Z 2025-08-14T21:41:10.8961671Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8961861Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8961920Z return mod(**inputs) 2025-08-14T21:41:10.8962139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8962229Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8962452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8962517Z layer_outputs = layer_module( 2025-08-14T21:41:10.8962731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8962800Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8963028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8963108Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8963327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8963435Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8963656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8963727Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8963737Z 2025-08-14T21:41:10.8963809Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8963900Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8964091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8964151Z return mod(**inputs) 2025-08-14T21:41:10.8964376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1750, in forward 2025-08-14T21:41:10.8964451Z encoder_outputs = self.encoder( 2025-08-14T21:41:10.8964676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1115, in forward 2025-08-14T21:41:10.8964781Z hidden_states = self.final_layer_norm(hidden_states) 2025-08-14T21:41:10.8965000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8965071Z return self.weight * hidden_states 2025-08-14T21:41:10.8965075Z 2025-08-14T21:41:10.8965176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8965359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8965417Z return mod(**inputs) 2025-08-14T21:41:10.8965651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8965730Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8965957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8966021Z layer_outputs = layer_module( 2025-08-14T21:41:10.8966237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8966317Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8966532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8966605Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8966840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8966919Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8967145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.8967214Z key_states = self.k(current_states) 2025-08-14T21:41:10.8967218Z 2025-08-14T21:41:10.8967309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8967496Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8967582Z return mod(**inputs) 2025-08-14T21:41:10.8967811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8967876Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8968097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8968168Z layer_outputs = layer_module( 2025-08-14T21:41:10.8968365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8968437Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8968662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8968734Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8968958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8969036Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8969251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.8969377Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.8969382Z 2025-08-14T21:41:10.8969474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8969661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8969720Z return mod(**inputs) 2025-08-14T21:41:10.8969944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8970016Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8970236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8970302Z layer_outputs = layer_module( 2025-08-14T21:41:10.8970508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8970576Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8970799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8970870Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8971084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8971180Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8971398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.8971543Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.8971562Z 2025-08-14T21:41:10.8971658Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8971838Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8971904Z return mod(**inputs) 2025-08-14T21:41:10.8972137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8972204Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8972430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8972495Z layer_outputs = layer_module( 2025-08-14T21:41:10.8972703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8972772Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8972988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8973083Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8973298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8973372Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8973597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.8973665Z value_states = self.v(current_states) 2025-08-14T21:41:10.8973669Z 2025-08-14T21:41:10.8973764Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8973943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8974002Z return mod(**inputs) 2025-08-14T21:41:10.8974226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8974294Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8974518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8974584Z layer_outputs = layer_module( 2025-08-14T21:41:10.8974783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8974861Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8975092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8975167Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8975401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8975478Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8975714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8975818Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8975821Z 2025-08-14T21:41:10.8975917Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8976112Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8976172Z return mod(**inputs) 2025-08-14T21:41:10.8976419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8976484Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8976747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8976826Z layer_outputs = layer_module( 2025-08-14T21:41:10.8977035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8977127Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8977366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8977442Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8977693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8977772Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8977999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.8978109Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.8978113Z 2025-08-14T21:41:10.8978210Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8978420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8978488Z return mod(**inputs) 2025-08-14T21:41:10.8978737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8978813Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8979052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8979121Z layer_outputs = layer_module( 2025-08-14T21:41:10.8979345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8979420Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8979658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8979734Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8979963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8980052Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8980283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.8980382Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.8980391Z 2025-08-14T21:41:10.8980489Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8980680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8980747Z return mod(**inputs) 2025-08-14T21:41:10.8980982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8981050Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8981289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8981361Z layer_outputs = layer_module( 2025-08-14T21:41:10.8981580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8981652Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8981883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.8981966Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.8982197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.8982291Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.8982527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.8982598Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.8982618Z 2025-08-14T21:41:10.8982703Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.8982803Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8982996Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8983065Z return mod(**inputs) 2025-08-14T21:41:10.8983314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8983386Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8983626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8983693Z layer_outputs = layer_module( 2025-08-14T21:41:10.8983915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8983989Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8984222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8984353Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8984712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.8984902Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8985143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8985217Z return self.weight * hidden_states 2025-08-14T21:41:10.8985220Z 2025-08-14T21:41:10.8985324Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8985520Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8985582Z return mod(**inputs) 2025-08-14T21:41:10.8985820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8985893Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8986131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8986197Z layer_outputs = layer_module( 2025-08-14T21:41:10.8986410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8986492Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8986723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8986809Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8987074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8987181Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8987420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.8987516Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.8987519Z 2025-08-14T21:41:10.8987614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8987816Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8987876Z return mod(**inputs) 2025-08-14T21:41:10.8988113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8988182Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8988461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8988539Z layer_outputs = layer_module( 2025-08-14T21:41:10.8988750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8988858Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8989083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8989163Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8989416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8989523Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8989739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.8989821Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.8989824Z 2025-08-14T21:41:10.8989915Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8990104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8990185Z return mod(**inputs) 2025-08-14T21:41:10.8990413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8990488Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8990718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8990781Z layer_outputs = layer_module( 2025-08-14T21:41:10.8990995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8991066Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8991294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8991375Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8991597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8991708Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8991929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.8992013Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.8992018Z 2025-08-14T21:41:10.8992111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8992295Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8992365Z return mod(**inputs) 2025-08-14T21:41:10.8992592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8992657Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8992892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8992958Z layer_outputs = layer_module( 2025-08-14T21:41:10.8993171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8993241Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8993463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.8993551Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.8993770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.8993887Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.8994110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.8994200Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.8994203Z 2025-08-14T21:41:10.8994305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8994486Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8994546Z return mod(**inputs) 2025-08-14T21:41:10.8994792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8994859Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8995089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8995153Z layer_outputs = layer_module( 2025-08-14T21:41:10.8995355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8995432Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8995648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8995739Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8995964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.8996059Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.8996284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.8996351Z return self.weight * hidden_states 2025-08-14T21:41:10.8996355Z 2025-08-14T21:41:10.8996446Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8996637Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8996696Z return mod(**inputs) 2025-08-14T21:41:10.8996923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8996991Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8997208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8997278Z layer_outputs = layer_module( 2025-08-14T21:41:10.8997478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8997548Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8997771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8997843Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.8998065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.8998139Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.8998355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.8998432Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.8998435Z 2025-08-14T21:41:10.8998526Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.8998712Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.8998771Z return mod(**inputs) 2025-08-14T21:41:10.8998991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.8999064Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.8999297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.8999363Z layer_outputs = layer_module( 2025-08-14T21:41:10.8999570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.8999656Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.8999881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.8999955Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9000182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9000267Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9000485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9000554Z key_states = self.k(current_states) 2025-08-14T21:41:10.9000557Z 2025-08-14T21:41:10.9000656Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9000834Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9000900Z return mod(**inputs) 2025-08-14T21:41:10.9001134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9001199Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9001425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9001488Z layer_outputs = layer_module( 2025-08-14T21:41:10.9001688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9001767Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9001982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9002062Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9002277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9002355Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9002581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9002698Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9002702Z 2025-08-14T21:41:10.9002801Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9002982Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9003040Z return mod(**inputs) 2025-08-14T21:41:10.9003268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9003333Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9003549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9003623Z layer_outputs = layer_module( 2025-08-14T21:41:10.9003822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9003899Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9004114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9004185Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9004407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9004481Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9004718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9004865Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9004885Z 2025-08-14T21:41:10.9004980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9005170Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9005228Z return mod(**inputs) 2025-08-14T21:41:10.9005455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9005543Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9005764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9005836Z layer_outputs = layer_module( 2025-08-14T21:41:10.9006035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9006108Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9006330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9006402Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9006645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9006718Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9006935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9007015Z value_states = self.v(current_states) 2025-08-14T21:41:10.9007018Z 2025-08-14T21:41:10.9007111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9007292Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9007358Z return mod(**inputs) 2025-08-14T21:41:10.9007575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9007649Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9007868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9007933Z layer_outputs = layer_module( 2025-08-14T21:41:10.9008140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9008213Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9008428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9008508Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9008725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9008805Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9009021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9009122Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9009125Z 2025-08-14T21:41:10.9009224Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9009404Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9009470Z return mod(**inputs) 2025-08-14T21:41:10.9009690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9009756Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9009994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9010062Z layer_outputs = layer_module( 2025-08-14T21:41:10.9010264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9010365Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9010583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9010664Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9010879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9010965Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9011192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9011288Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9011293Z 2025-08-14T21:41:10.9011392Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9011571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9011630Z return mod(**inputs) 2025-08-14T21:41:10.9011858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9011939Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9012157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9012227Z layer_outputs = layer_module( 2025-08-14T21:41:10.9012429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9012508Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9012727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9012799Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9013025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9013099Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9013318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9013421Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9013424Z 2025-08-14T21:41:10.9013516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9013703Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9013761Z return mod(**inputs) 2025-08-14T21:41:10.9013978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9014051Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9014269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9014341Z layer_outputs = layer_module( 2025-08-14T21:41:10.9014541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9014613Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9014836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9014908Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9015127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9015208Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9015441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9015522Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9015525Z 2025-08-14T21:41:10.9015596Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9015704Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9015898Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9015956Z return mod(**inputs) 2025-08-14T21:41:10.9016183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9016248Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9016482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9016557Z layer_outputs = layer_module( 2025-08-14T21:41:10.9016760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9016831Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9017056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9017130Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9017375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 511, in forward 2025-08-14T21:41:10.9017470Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9017689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9017765Z return self.weight * hidden_states 2025-08-14T21:41:10.9017769Z 2025-08-14T21:41:10.9017860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9018041Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9018108Z return mod(**inputs) 2025-08-14T21:41:10.9018328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9018402Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9018622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9018687Z layer_outputs = layer_module( 2025-08-14T21:41:10.9018896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9018968Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9019192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9019265Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9019483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9019567Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9019784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9019856Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9019860Z 2025-08-14T21:41:10.9019959Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9020139Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9020205Z return mod(**inputs) 2025-08-14T21:41:10.9020424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9020489Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9020730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9020796Z layer_outputs = layer_module( 2025-08-14T21:41:10.9020996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9021093Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9021310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9021392Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9021609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9021700Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9021930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9021998Z key_states = self.k(current_states) 2025-08-14T21:41:10.9022001Z 2025-08-14T21:41:10.9022103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9022282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9022341Z return mod(**inputs) 2025-08-14T21:41:10.9022571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9022653Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9022871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9022942Z layer_outputs = layer_module( 2025-08-14T21:41:10.9023142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9023221Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9023436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9023509Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9023735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9023811Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9024034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9024152Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9024155Z 2025-08-14T21:41:10.9024247Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9024433Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9024492Z return mod(**inputs) 2025-08-14T21:41:10.9024709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9024879Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9025113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9025186Z layer_outputs = layer_module( 2025-08-14T21:41:10.9025387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9025458Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9025683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9025756Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9025975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9026059Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9026299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9026451Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9026455Z 2025-08-14T21:41:10.9026549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9026748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9026816Z return mod(**inputs) 2025-08-14T21:41:10.9027037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9027111Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9027347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9027414Z layer_outputs = layer_module( 2025-08-14T21:41:10.9027621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9027691Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9027905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9027984Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9028199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9028299Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9028515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9028586Z value_states = self.v(current_states) 2025-08-14T21:41:10.9028590Z 2025-08-14T21:41:10.9028690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9028871Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9028939Z return mod(**inputs) 2025-08-14T21:41:10.9029157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9029222Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9029448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9029513Z layer_outputs = layer_module( 2025-08-14T21:41:10.9029710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9029787Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9030003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9030082Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9030298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9030373Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9030593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9030691Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9030695Z 2025-08-14T21:41:10.9030793Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9030974Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9031032Z return mod(**inputs) 2025-08-14T21:41:10.9031262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9031327Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9031546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9031642Z layer_outputs = layer_module( 2025-08-14T21:41:10.9031843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9031920Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9032154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9032227Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9032452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9032525Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9032756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9032861Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9032864Z 2025-08-14T21:41:10.9032959Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9033145Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9033204Z return mod(**inputs) 2025-08-14T21:41:10.9033423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9033514Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9033734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9033808Z layer_outputs = layer_module( 2025-08-14T21:41:10.9034010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9034080Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9034307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9034380Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9034598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9034681Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9034901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9035008Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9035011Z 2025-08-14T21:41:10.9035103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9035287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9035355Z return mod(**inputs) 2025-08-14T21:41:10.9035578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9035650Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9035874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9035939Z layer_outputs = layer_module( 2025-08-14T21:41:10.9036147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9036219Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9036435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9036516Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9036736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9036819Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9037055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9037128Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9037131Z 2025-08-14T21:41:10.9037211Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9037303Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9037501Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9037570Z return mod(**inputs) 2025-08-14T21:41:10.9037790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9037861Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9038094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9038160Z layer_outputs = layer_module( 2025-08-14T21:41:10.9038365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9038433Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9038648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9038738Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9038952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.9039062Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9039281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9039351Z return self.weight * hidden_states 2025-08-14T21:41:10.9039354Z 2025-08-14T21:41:10.9039454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9039633Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9039700Z return mod(**inputs) 2025-08-14T21:41:10.9039923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9039987Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9040222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9040288Z layer_outputs = layer_module( 2025-08-14T21:41:10.9040489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9040568Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9040788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9040877Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9041097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9041201Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9041432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.9041521Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.9041526Z 2025-08-14T21:41:10.9041622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9041803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9041861Z return mod(**inputs) 2025-08-14T21:41:10.9042093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9042159Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9042383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9042469Z layer_outputs = layer_module( 2025-08-14T21:41:10.9042673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9042751Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9042984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9043067Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9043293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9043410Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9043635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.9043708Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.9043711Z 2025-08-14T21:41:10.9043805Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9043996Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9044054Z return mod(**inputs) 2025-08-14T21:41:10.9044277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9044367Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9044587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9044661Z layer_outputs = layer_module( 2025-08-14T21:41:10.9044864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9044936Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9045162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9045242Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9045461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9045573Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9045792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.9045877Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.9045880Z 2025-08-14T21:41:10.9045972Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9046154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9046220Z return mod(**inputs) 2025-08-14T21:41:10.9046439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9046512Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9046732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9046796Z layer_outputs = layer_module( 2025-08-14T21:41:10.9047004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9047077Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9047293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9047380Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9047597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9047709Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9047940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.9048014Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.9048018Z 2025-08-14T21:41:10.9048096Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9048203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9048390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9048450Z return mod(**inputs) 2025-08-14T21:41:10.9048670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9048756Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9048981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9049046Z layer_outputs = layer_module( 2025-08-14T21:41:10.9049256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9049327Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9049551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9049626Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9049863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.9049966Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9050184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9050252Z return self.weight * hidden_states 2025-08-14T21:41:10.9050263Z 2025-08-14T21:41:10.9050354Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9050533Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9050600Z return mod(**inputs) 2025-08-14T21:41:10.9050820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9050887Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9051114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9051179Z layer_outputs = layer_module( 2025-08-14T21:41:10.9051383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9051456Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9051671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9051749Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9051964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9052039Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9052263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9052334Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9052338Z 2025-08-14T21:41:10.9052437Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9052616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9052673Z return mod(**inputs) 2025-08-14T21:41:10.9052901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9052965Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9053201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9053268Z layer_outputs = layer_module( 2025-08-14T21:41:10.9053466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9053560Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9053776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9053848Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9054070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9054160Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9054387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9054456Z key_states = self.k(current_states) 2025-08-14T21:41:10.9054459Z 2025-08-14T21:41:10.9054551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9054738Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9054795Z return mod(**inputs) 2025-08-14T21:41:10.9055016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9055103Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9055323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9055394Z layer_outputs = layer_module( 2025-08-14T21:41:10.9055594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9055666Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9055890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9055962Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9056185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9056261Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9056476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9056603Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9056606Z 2025-08-14T21:41:10.9056700Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9056881Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9056948Z return mod(**inputs) 2025-08-14T21:41:10.9057166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9057240Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9057459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9057521Z layer_outputs = layer_module( 2025-08-14T21:41:10.9057728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9057801Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9058029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9058099Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9058315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9058396Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9058629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9058773Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9058777Z 2025-08-14T21:41:10.9058878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9059072Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9059138Z return mod(**inputs) 2025-08-14T21:41:10.9059358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9059422Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9059660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9059725Z layer_outputs = layer_module( 2025-08-14T21:41:10.9059926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9060005Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9060220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9060301Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9060515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9060616Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9060839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9060909Z value_states = self.v(current_states) 2025-08-14T21:41:10.9060912Z 2025-08-14T21:41:10.9061011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9061189Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9061250Z return mod(**inputs) 2025-08-14T21:41:10.9061478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9061542Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9061760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9061832Z layer_outputs = layer_module( 2025-08-14T21:41:10.9062033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9062111Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9062326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9062397Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9062622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9062696Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9062918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9063016Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9063021Z 2025-08-14T21:41:10.9063113Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9063300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9063359Z return mod(**inputs) 2025-08-14T21:41:10.9063580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9063655Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9063876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9063966Z layer_outputs = layer_module( 2025-08-14T21:41:10.9064170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9064242Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9064486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9064561Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9064867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9064963Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9065202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9065310Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9065314Z 2025-08-14T21:41:10.9065408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9065586Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9065653Z return mod(**inputs) 2025-08-14T21:41:10.9065874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9065965Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9066186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9066252Z layer_outputs = layer_module( 2025-08-14T21:41:10.9066462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9066535Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9066751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9066835Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9067051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9067133Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9067351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9067448Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9067452Z 2025-08-14T21:41:10.9067553Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9067737Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9067803Z return mod(**inputs) 2025-08-14T21:41:10.9068025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9068091Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9068322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9068386Z layer_outputs = layer_module( 2025-08-14T21:41:10.9068585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9068668Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9068885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9068965Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9069181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9069256Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9069479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9069563Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9069568Z 2025-08-14T21:41:10.9069670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9069850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9069925Z return mod(**inputs) 2025-08-14T21:41:10.9070158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9070226Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9070491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9070567Z layer_outputs = layer_module( 2025-08-14T21:41:10.9070767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9070846Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9071063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9071134Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9071358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 485, in forward 2025-08-14T21:41:10.9071496Z hidden_states = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:41:10.9071500Z 2025-08-14T21:41:10.9071571Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9071669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9071849Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9071916Z return mod(**inputs) 2025-08-14T21:41:10.9072134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9072200Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9072427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9072490Z layer_outputs = layer_module( 2025-08-14T21:41:10.9072691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9072771Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9072987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9073066Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9073283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 511, in forward 2025-08-14T21:41:10.9073377Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9073603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9073672Z return self.weight * hidden_states 2025-08-14T21:41:10.9073676Z 2025-08-14T21:41:10.9073777Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9073958Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9074020Z return mod(**inputs) 2025-08-14T21:41:10.9074246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9074311Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9074530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9074601Z layer_outputs = layer_module( 2025-08-14T21:41:10.9074801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9074890Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9075111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9075181Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9075426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9075504Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9075730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9075801Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9075805Z 2025-08-14T21:41:10.9075912Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9076103Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9076162Z return mod(**inputs) 2025-08-14T21:41:10.9076383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9076456Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9076675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9076761Z layer_outputs = layer_module( 2025-08-14T21:41:10.9076963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9077033Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9077258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9077329Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9077546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9077633Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9077852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9077928Z key_states = self.k(current_states) 2025-08-14T21:41:10.9077932Z 2025-08-14T21:41:10.9078023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9078205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9078270Z return mod(**inputs) 2025-08-14T21:41:10.9078492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9078564Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9078785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9078848Z layer_outputs = layer_module( 2025-08-14T21:41:10.9079057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9079127Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9079344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9079426Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9079645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9079728Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9079946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9080063Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9080067Z 2025-08-14T21:41:10.9080169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9080364Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9080432Z return mod(**inputs) 2025-08-14T21:41:10.9080657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9080738Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9080969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9081033Z layer_outputs = layer_module( 2025-08-14T21:41:10.9081233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9081325Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9081548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9081627Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9081845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9081921Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9082144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9082302Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9082306Z 2025-08-14T21:41:10.9082406Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9082584Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9082644Z return mod(**inputs) 2025-08-14T21:41:10.9082870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9082937Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9083157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9083229Z layer_outputs = layer_module( 2025-08-14T21:41:10.9083428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9083507Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9083724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9083795Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9084018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9084095Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9084308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9084387Z value_states = self.v(current_states) 2025-08-14T21:41:10.9084390Z 2025-08-14T21:41:10.9084481Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9084887Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9084957Z return mod(**inputs) 2025-08-14T21:41:10.9085188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9085264Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9085490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9085567Z layer_outputs = layer_module( 2025-08-14T21:41:10.9085774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9085846Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9086112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9086190Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9086466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9086589Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9086807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9086909Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9086913Z 2025-08-14T21:41:10.9087028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9087211Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9087278Z return mod(**inputs) 2025-08-14T21:41:10.9087541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9087616Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9087840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9087909Z layer_outputs = layer_module( 2025-08-14T21:41:10.9088146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9088218Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9088443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9088526Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9088750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9088833Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9089058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9089156Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9089161Z 2025-08-14T21:41:10.9089264Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9089453Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9089512Z return mod(**inputs) 2025-08-14T21:41:10.9089750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9089818Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9090052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9090117Z layer_outputs = layer_module( 2025-08-14T21:41:10.9090324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9090403Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9090626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9090708Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9090932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9091008Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9091241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9091338Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9091341Z 2025-08-14T21:41:10.9091435Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9091644Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9091707Z return mod(**inputs) 2025-08-14T21:41:10.9091942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9092035Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9092268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9092341Z layer_outputs = layer_module( 2025-08-14T21:41:10.9092551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9092637Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9092871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9092945Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9093178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9093254Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9093477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9093570Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9093574Z 2025-08-14T21:41:10.9093648Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9093750Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9093935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9093996Z return mod(**inputs) 2025-08-14T21:41:10.9094226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9094292Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9094517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9094590Z layer_outputs = layer_module( 2025-08-14T21:41:10.9094793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9094874Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9095096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9095180Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9095408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.9095497Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9095727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9095800Z return self.weight * hidden_states 2025-08-14T21:41:10.9095803Z 2025-08-14T21:41:10.9095897Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9096088Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9096152Z return mod(**inputs) 2025-08-14T21:41:10.9096377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9096452Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9096675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9096748Z layer_outputs = layer_module( 2025-08-14T21:41:10.9096954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9097026Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9097270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9097356Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9097582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9097719Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9097948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.9098047Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.9098051Z 2025-08-14T21:41:10.9098160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9098351Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9098420Z return mod(**inputs) 2025-08-14T21:41:10.9098645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9098719Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9098943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9099009Z layer_outputs = layer_module( 2025-08-14T21:41:10.9099238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9099310Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9099535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9099626Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9099849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9099975Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9100191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.9100261Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.9100266Z 2025-08-14T21:41:10.9100365Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9100546Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9100612Z return mod(**inputs) 2025-08-14T21:41:10.9100832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9100899Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9101126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9101191Z layer_outputs = layer_module( 2025-08-14T21:41:10.9101392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9101469Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9101683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9101771Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9101987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9102089Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9102313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.9102393Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.9102396Z 2025-08-14T21:41:10.9102493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9102687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9102749Z return mod(**inputs) 2025-08-14T21:41:10.9102976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9103057Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9103280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9103355Z layer_outputs = layer_module( 2025-08-14T21:41:10.9103556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9103646Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9103862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9103941Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9104165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9104266Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9104481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.9104576Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.9104579Z 2025-08-14T21:41:10.9104649Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9104823Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9105030Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9105092Z return mod(**inputs) 2025-08-14T21:41:10.9105321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9105387Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9105612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9105676Z layer_outputs = layer_module( 2025-08-14T21:41:10.9105879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9105961Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9106176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9106249Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9106474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.9106568Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9106794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9106863Z return self.weight * hidden_states 2025-08-14T21:41:10.9106867Z 2025-08-14T21:41:10.9106959Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9107147Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9107209Z return mod(**inputs) 2025-08-14T21:41:10.9107428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9107504Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9107722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9107796Z layer_outputs = layer_module( 2025-08-14T21:41:10.9107995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9108085Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9108312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9108384Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9108625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9108704Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9108919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9108996Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9108999Z 2025-08-14T21:41:10.9109227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9109413Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9109481Z return mod(**inputs) 2025-08-14T21:41:10.9109704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9109780Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9110002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9110083Z layer_outputs = layer_module( 2025-08-14T21:41:10.9110294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9110367Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9110595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9110668Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9110887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9110972Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9111191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9111262Z key_states = self.k(current_states) 2025-08-14T21:41:10.9111267Z 2025-08-14T21:41:10.9111370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9111554Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9111625Z return mod(**inputs) 2025-08-14T21:41:10.9111844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9111911Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9112140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9112203Z layer_outputs = layer_module( 2025-08-14T21:41:10.9112403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9112482Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9112699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9112779Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9112994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9113067Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9113292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9113410Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9113413Z 2025-08-14T21:41:10.9113512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9113718Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9113777Z return mod(**inputs) 2025-08-14T21:41:10.9114005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9114087Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9114313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9114384Z layer_outputs = layer_module( 2025-08-14T21:41:10.9114588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9114679Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9114901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9114972Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9115196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9115270Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9115492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9115649Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9115652Z 2025-08-14T21:41:10.9115744Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9115935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9115995Z return mod(**inputs) 2025-08-14T21:41:10.9116221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9116294Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9116518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9116589Z layer_outputs = layer_module( 2025-08-14T21:41:10.9116789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9116863Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9117089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9117160Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9117380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9117461Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9117678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9117756Z value_states = self.v(current_states) 2025-08-14T21:41:10.9117759Z 2025-08-14T21:41:10.9117851Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9118031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9118099Z return mod(**inputs) 2025-08-14T21:41:10.9118321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9118394Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9118615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9118681Z layer_outputs = layer_module( 2025-08-14T21:41:10.9118887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9118957Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9119191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9119272Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9119488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9119588Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9119803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9119902Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9119906Z 2025-08-14T21:41:10.9120019Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9120201Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9120266Z return mod(**inputs) 2025-08-14T21:41:10.9120485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9120551Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9120776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9120841Z layer_outputs = layer_module( 2025-08-14T21:41:10.9121053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9121131Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9121346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9121425Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9121643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9121715Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9121939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9122036Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9122039Z 2025-08-14T21:41:10.9122140Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9122321Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9122379Z return mod(**inputs) 2025-08-14T21:41:10.9122604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9122670Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9122890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9122962Z layer_outputs = layer_module( 2025-08-14T21:41:10.9123164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9123243Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9123458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9123532Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9123756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9123828Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9124045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9124149Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9124153Z 2025-08-14T21:41:10.9124244Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9124446Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9124507Z return mod(**inputs) 2025-08-14T21:41:10.9124730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9124829Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9125051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9125128Z layer_outputs = layer_module( 2025-08-14T21:41:10.9125332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9125421Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9125647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9125720Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9125937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9126018Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9126232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9126325Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9126329Z 2025-08-14T21:41:10.9126401Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9126493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9126685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9126746Z return mod(**inputs) 2025-08-14T21:41:10.9126969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9127043Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9127269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9127341Z layer_outputs = layer_module( 2025-08-14T21:41:10.9127545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9127619Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9127851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9127925Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9128152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 511, in forward 2025-08-14T21:41:10.9128249Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9128471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9128549Z return self.weight * hidden_states 2025-08-14T21:41:10.9128553Z 2025-08-14T21:41:10.9128646Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9128832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9128898Z return mod(**inputs) 2025-08-14T21:41:10.9129126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9129198Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9129420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9129487Z layer_outputs = layer_module( 2025-08-14T21:41:10.9129697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9129768Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9130004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9130087Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9130304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9130404Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9130621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9130692Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9130695Z 2025-08-14T21:41:10.9130798Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9130994Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9131060Z return mod(**inputs) 2025-08-14T21:41:10.9131283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9131352Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9131582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9131647Z layer_outputs = layer_module( 2025-08-14T21:41:10.9131873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9131947Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9132162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9132241Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9132457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9132532Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9132757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9132824Z key_states = self.k(current_states) 2025-08-14T21:41:10.9132828Z 2025-08-14T21:41:10.9132929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9133111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9133170Z return mod(**inputs) 2025-08-14T21:41:10.9133396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9133460Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9133679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9133749Z layer_outputs = layer_module( 2025-08-14T21:41:10.9133949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9134027Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9134242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9134313Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9134538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9134613Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9134826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9134953Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9134956Z 2025-08-14T21:41:10.9135048Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9135234Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9135305Z return mod(**inputs) 2025-08-14T21:41:10.9135528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9135601Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9135835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9135908Z layer_outputs = layer_module( 2025-08-14T21:41:10.9136106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9136175Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9136410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9136485Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9136701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9136784Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9136999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9137148Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9137166Z 2025-08-14T21:41:10.9137261Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9137443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9137510Z return mod(**inputs) 2025-08-14T21:41:10.9137736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9137812Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9138039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9138104Z layer_outputs = layer_module( 2025-08-14T21:41:10.9138315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9138387Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9138610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9138690Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9138911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9138996Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9139219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9139290Z value_states = self.v(current_states) 2025-08-14T21:41:10.9139293Z 2025-08-14T21:41:10.9139396Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9139581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9139645Z return mod(**inputs) 2025-08-14T21:41:10.9139873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9139940Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9140172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9140237Z layer_outputs = layer_module( 2025-08-14T21:41:10.9140441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9140517Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9140752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9140836Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9141056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9141158Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9141385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9141480Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9141484Z 2025-08-14T21:41:10.9141575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9141778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9141842Z return mod(**inputs) 2025-08-14T21:41:10.9142071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9142139Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9142359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9142432Z layer_outputs = layer_module( 2025-08-14T21:41:10.9142635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9142725Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9142943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9143019Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9143242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9143318Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9143537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9143640Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9143643Z 2025-08-14T21:41:10.9143735Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9143922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9143985Z return mod(**inputs) 2025-08-14T21:41:10.9144208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9144284Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9144506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9144570Z layer_outputs = layer_module( 2025-08-14T21:41:10.9144869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9144956Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9145183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9145260Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9145479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9145564Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9145783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9145888Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9145892Z 2025-08-14T21:41:10.9145983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9146163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9146251Z return mod(**inputs) 2025-08-14T21:41:10.9146472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9146537Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9146780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9146846Z layer_outputs = layer_module( 2025-08-14T21:41:10.9147051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9147121Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9147352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9147433Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9147652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9147736Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9147951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9148022Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9148039Z 2025-08-14T21:41:10.9148140Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9148321Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9148378Z return mod(**inputs) 2025-08-14T21:41:10.9148607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9148673Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9148901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9148967Z layer_outputs = layer_module( 2025-08-14T21:41:10.9149169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9149245Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9149465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9149538Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9149761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 524, in forward 2025-08-14T21:41:10.9149880Z layer_output = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:41:10.9149885Z 2025-08-14T21:41:10.9149964Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9150055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9150234Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9150301Z return mod(**inputs) 2025-08-14T21:41:10.9150520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9150597Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9150818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9150884Z layer_outputs = layer_module( 2025-08-14T21:41:10.9151092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9151160Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9151377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9151469Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9151701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.9151797Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9152014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9152099Z return self.weight * hidden_states 2025-08-14T21:41:10.9152104Z 2025-08-14T21:41:10.9152204Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9152386Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9152452Z return mod(**inputs) 2025-08-14T21:41:10.9152697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9152763Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9152993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9153059Z layer_outputs = layer_module( 2025-08-14T21:41:10.9153262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9153339Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9153562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9153667Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9153884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9153991Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9154217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.9154307Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.9154310Z 2025-08-14T21:41:10.9154410Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9154593Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9154653Z return mod(**inputs) 2025-08-14T21:41:10.9154881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9154948Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9155168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9155239Z layer_outputs = layer_module( 2025-08-14T21:41:10.9155441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9155519Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9155734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9155814Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9156037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9156143Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9156360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.9156438Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.9156442Z 2025-08-14T21:41:10.9156532Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9156724Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9156782Z return mod(**inputs) 2025-08-14T21:41:10.9157004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9157093Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9157312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9157385Z layer_outputs = layer_module( 2025-08-14T21:41:10.9157625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9157696Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9157921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9157999Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9158231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9158342Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9158560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.9158648Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.9158652Z 2025-08-14T21:41:10.9158743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9158924Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9159006Z return mod(**inputs) 2025-08-14T21:41:10.9159230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9159300Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9159526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9159590Z layer_outputs = layer_module( 2025-08-14T21:41:10.9159799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9159870Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9160095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9160182Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9160404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9160513Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9160736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.9160808Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.9160811Z 2025-08-14T21:41:10.9160890Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9160981Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9161168Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9161234Z return mod(**inputs) 2025-08-14T21:41:10.9161456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9161529Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9161752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9161816Z layer_outputs = layer_module( 2025-08-14T21:41:10.9162029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9162100Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9162329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9162402Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9162636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.9162740Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9162954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9163040Z return self.weight * hidden_states 2025-08-14T21:41:10.9163044Z 2025-08-14T21:41:10.9163143Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9163324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9163390Z return mod(**inputs) 2025-08-14T21:41:10.9163622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9163689Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9163916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9163979Z layer_outputs = layer_module( 2025-08-14T21:41:10.9164180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9164260Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9164493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9164579Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9164795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9164872Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9165096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9165164Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9165168Z 2025-08-14T21:41:10.9165266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9165448Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9165507Z return mod(**inputs) 2025-08-14T21:41:10.9165734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9165799Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9166017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9166090Z layer_outputs = layer_module( 2025-08-14T21:41:10.9166291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9166366Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9166583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9166656Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9166877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9166956Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9167179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9167248Z key_states = self.k(current_states) 2025-08-14T21:41:10.9167251Z 2025-08-14T21:41:10.9167342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9167532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9167590Z return mod(**inputs) 2025-08-14T21:41:10.9167810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9167900Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9168121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9168192Z layer_outputs = layer_module( 2025-08-14T21:41:10.9168407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9168479Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9168700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9168771Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9168999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9169083Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9169300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9169426Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9169430Z 2025-08-14T21:41:10.9169521Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9169701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9169784Z return mod(**inputs) 2025-08-14T21:41:10.9170002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9170075Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9170294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9170359Z layer_outputs = layer_module( 2025-08-14T21:41:10.9170564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9170635Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9170849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9170930Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9171147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9171228Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9171443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9171585Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9171589Z 2025-08-14T21:41:10.9171688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9171869Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9171934Z return mod(**inputs) 2025-08-14T21:41:10.9172153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9172217Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9172443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9172508Z layer_outputs = layer_module( 2025-08-14T21:41:10.9172704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9172783Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9172999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9173080Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9173309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9173385Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9173606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9173703Z value_states = self.v(current_states) 2025-08-14T21:41:10.9173707Z 2025-08-14T21:41:10.9173807Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9173986Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9174044Z return mod(**inputs) 2025-08-14T21:41:10.9174282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9174348Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9174570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9174643Z layer_outputs = layer_module( 2025-08-14T21:41:10.9174844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9174922Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9175140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9175231Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9175453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9175525Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9175744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9175850Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9175854Z 2025-08-14T21:41:10.9175947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9176133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9176193Z return mod(**inputs) 2025-08-14T21:41:10.9176413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9176487Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9176706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9176775Z layer_outputs = layer_module( 2025-08-14T21:41:10.9176976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9177047Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9177273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9177346Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9177562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9177643Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9177860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9177966Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9177969Z 2025-08-14T21:41:10.9178061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9178243Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9178310Z return mod(**inputs) 2025-08-14T21:41:10.9178528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9178606Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9178833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9178897Z layer_outputs = layer_module( 2025-08-14T21:41:10.9179124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9179196Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9179412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9179491Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9179723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9179806Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9180022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9180119Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9180122Z 2025-08-14T21:41:10.9180221Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9180403Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9180477Z return mod(**inputs) 2025-08-14T21:41:10.9180707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9180772Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9180999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9181062Z layer_outputs = layer_module( 2025-08-14T21:41:10.9181261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9181340Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9181558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9181638Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9181855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9181931Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9182155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9182224Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9182227Z 2025-08-14T21:41:10.9182301Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9182402Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9182583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9182649Z return mod(**inputs) 2025-08-14T21:41:10.9182868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9182933Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9183164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9183228Z layer_outputs = layer_module( 2025-08-14T21:41:10.9183428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9183507Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9183722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9183802Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9184032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 511, in forward 2025-08-14T21:41:10.9184131Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9184353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9184444Z return self.weight * hidden_states 2025-08-14T21:41:10.9184448Z 2025-08-14T21:41:10.9184549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9184930Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9185001Z return mod(**inputs) 2025-08-14T21:41:10.9185276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9185346Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9185579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9185657Z layer_outputs = layer_module( 2025-08-14T21:41:10.9185866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9185945Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9186175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9186273Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9186497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9186573Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9186788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9186865Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9186868Z 2025-08-14T21:41:10.9186962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9187157Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9187217Z return mod(**inputs) 2025-08-14T21:41:10.9187444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9187523Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9187749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9187825Z layer_outputs = layer_module( 2025-08-14T21:41:10.9188031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9188103Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9188333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9188408Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9188631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9188715Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9188938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9189017Z key_states = self.k(current_states) 2025-08-14T21:41:10.9189020Z 2025-08-14T21:41:10.9189114Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9189302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9189371Z return mod(**inputs) 2025-08-14T21:41:10.9189597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9189673Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9189921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9189988Z layer_outputs = layer_module( 2025-08-14T21:41:10.9190204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9190301Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9190524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9190607Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9190842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9190929Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9191155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9191277Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9191281Z 2025-08-14T21:41:10.9191384Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9191573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9191651Z return mod(**inputs) 2025-08-14T21:41:10.9191887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9191954Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9192192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9192258Z layer_outputs = layer_module( 2025-08-14T21:41:10.9192467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9192546Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9192772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9192855Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9193083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9193161Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9193393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9193540Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9193545Z 2025-08-14T21:41:10.9193640Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9193837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9193899Z return mod(**inputs) 2025-08-14T21:41:10.9194138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9194204Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9194433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9194508Z layer_outputs = layer_module( 2025-08-14T21:41:10.9194716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9194795Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9195019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9195093Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9195326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9195426Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9195653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9195733Z value_states = self.v(current_states) 2025-08-14T21:41:10.9195750Z 2025-08-14T21:41:10.9195848Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9196045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9196104Z return mod(**inputs) 2025-08-14T21:41:10.9196329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9196424Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9196650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9196713Z layer_outputs = layer_module( 2025-08-14T21:41:10.9196927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9196998Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9197231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9197326Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9197551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9197634Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9197858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9197963Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9197966Z 2025-08-14T21:41:10.9198060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9198302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9198366Z return mod(**inputs) 2025-08-14T21:41:10.9198588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9198653Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9198881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9198944Z layer_outputs = layer_module( 2025-08-14T21:41:10.9199151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9199220Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9199437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9199514Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9199731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9199813Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9200032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9200130Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9200133Z 2025-08-14T21:41:10.9200232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9200413Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9200472Z return mod(**inputs) 2025-08-14T21:41:10.9200703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9200768Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9201012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9201079Z layer_outputs = layer_module( 2025-08-14T21:41:10.9201280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9201372Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9201599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9201671Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9201912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9201990Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9202219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9202317Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9202320Z 2025-08-14T21:41:10.9202410Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9202600Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9202659Z return mod(**inputs) 2025-08-14T21:41:10.9202905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9202971Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9203188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9203261Z layer_outputs = layer_module( 2025-08-14T21:41:10.9203458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9203530Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9203755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9203827Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9204050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9204128Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9204345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9204423Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9204427Z 2025-08-14T21:41:10.9204501Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9204602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9204784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9204842Z return mod(**inputs) 2025-08-14T21:41:10.9205069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9205134Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9205353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9205427Z layer_outputs = layer_module( 2025-08-14T21:41:10.9205627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9205704Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9205923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9206005Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9206228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.9206329Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9206549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9206624Z return self.weight * hidden_states 2025-08-14T21:41:10.9206643Z 2025-08-14T21:41:10.9206736Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9206926Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9206983Z return mod(**inputs) 2025-08-14T21:41:10.9207202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9207290Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9207514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9207584Z layer_outputs = layer_module( 2025-08-14T21:41:10.9207786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9207856Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9208079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9208179Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9208397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9208512Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9208729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.9208827Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.9208830Z 2025-08-14T21:41:10.9208922Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9209105Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9209170Z return mod(**inputs) 2025-08-14T21:41:10.9209391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9209467Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9209688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9209752Z layer_outputs = layer_module( 2025-08-14T21:41:10.9209962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9210033Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9210249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9210339Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9210556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9210667Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9210886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.9210959Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.9210962Z 2025-08-14T21:41:10.9211060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9211243Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9211301Z return mod(**inputs) 2025-08-14T21:41:10.9211527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9211593Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9211837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9211904Z layer_outputs = layer_module( 2025-08-14T21:41:10.9212103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9212200Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9212420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9212505Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9212734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9212839Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9213064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.9213145Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.9213148Z 2025-08-14T21:41:10.9213240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9213430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9213512Z return mod(**inputs) 2025-08-14T21:41:10.9213744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9213810Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9214033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9214105Z layer_outputs = layer_module( 2025-08-14T21:41:10.9214308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9214385Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9214605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9214684Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9214913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9215015Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9215237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.9215317Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.9215322Z 2025-08-14T21:41:10.9215415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9215608Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9215668Z return mod(**inputs) 2025-08-14T21:41:10.9215893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9215965Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9216187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9216261Z layer_outputs = layer_module( 2025-08-14T21:41:10.9216463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9216533Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9216759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9216838Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9217054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 217, in forward 2025-08-14T21:41:10.9217193Z hidden_states = hidden_states + self.dropout(forwarded_states) 2025-08-14T21:41:10.9217197Z 2025-08-14T21:41:10.9217272Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9217372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9217571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9217631Z return mod(**inputs) 2025-08-14T21:41:10.9217863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9217928Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9218162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9218236Z layer_outputs = layer_module( 2025-08-14T21:41:10.9218436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9218514Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9218731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9218803Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9219028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.9219139Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9219366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9219436Z return self.weight * hidden_states 2025-08-14T21:41:10.9219439Z 2025-08-14T21:41:10.9219529Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9219715Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9219774Z return mod(**inputs) 2025-08-14T21:41:10.9219997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9220069Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9220291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9220363Z layer_outputs = layer_module( 2025-08-14T21:41:10.9220561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9220631Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9220856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9220928Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9221143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9221226Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9221439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9221517Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9221521Z 2025-08-14T21:41:10.9221614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9221794Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9221862Z return mod(**inputs) 2025-08-14T21:41:10.9222081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9222155Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9222375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9222438Z layer_outputs = layer_module( 2025-08-14T21:41:10.9222660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9222732Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9222949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9223045Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9223263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9223341Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9223569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9223639Z key_states = self.k(current_states) 2025-08-14T21:41:10.9223643Z 2025-08-14T21:41:10.9223743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9223923Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9223990Z return mod(**inputs) 2025-08-14T21:41:10.9224209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9224275Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9224515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9224578Z layer_outputs = layer_module( 2025-08-14T21:41:10.9224855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9224956Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9225173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9225255Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9225473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9225549Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9225775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9225897Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9225900Z 2025-08-14T21:41:10.9226001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9226182Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9226244Z return mod(**inputs) 2025-08-14T21:41:10.9226471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9226536Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9226757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9226830Z layer_outputs = layer_module( 2025-08-14T21:41:10.9227033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9227116Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9227337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9227409Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9227635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9227711Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9227929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9228095Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9228099Z 2025-08-14T21:41:10.9228192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9228379Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9228461Z return mod(**inputs) 2025-08-14T21:41:10.9228684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9228758Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9228994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9229067Z layer_outputs = layer_module( 2025-08-14T21:41:10.9229270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9229341Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9229570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9229642Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9229858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9229956Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9230174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9230249Z value_states = self.v(current_states) 2025-08-14T21:41:10.9230252Z 2025-08-14T21:41:10.9230345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9230527Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9230593Z return mod(**inputs) 2025-08-14T21:41:10.9230816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9230880Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9231105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9231170Z layer_outputs = layer_module( 2025-08-14T21:41:10.9231377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9231447Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9231664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9231744Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9231960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9232038Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9232253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9232350Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9232354Z 2025-08-14T21:41:10.9232455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9232635Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9232694Z return mod(**inputs) 2025-08-14T21:41:10.9232920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9232986Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9233210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9233273Z layer_outputs = layer_module( 2025-08-14T21:41:10.9233488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9233569Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9233789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9233885Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9234108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9234182Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9234435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9234534Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9234537Z 2025-08-14T21:41:10.9234630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9234822Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9234883Z return mod(**inputs) 2025-08-14T21:41:10.9235113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9235181Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9235416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9235486Z layer_outputs = layer_module( 2025-08-14T21:41:10.9235686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9235756Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9235977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9236049Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9236273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9236346Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9236558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9236665Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9236668Z 2025-08-14T21:41:10.9236759Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9236943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9237001Z return mod(**inputs) 2025-08-14T21:41:10.9237220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9237294Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9237511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9237574Z layer_outputs = layer_module( 2025-08-14T21:41:10.9237777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9237848Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9238072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9238143Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9238358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9238440Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9238655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9238733Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9238755Z 2025-08-14T21:41:10.9238831Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9238925Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9239115Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9239189Z return mod(**inputs) 2025-08-14T21:41:10.9239413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9239486Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9239704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9239789Z layer_outputs = layer_module( 2025-08-14T21:41:10.9239989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9240059Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9240283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9240355Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9240572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 511, in forward 2025-08-14T21:41:10.9240690Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9240905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9240979Z return self.weight * hidden_states 2025-08-14T21:41:10.9240982Z 2025-08-14T21:41:10.9241074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9241253Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9241320Z return mod(**inputs) 2025-08-14T21:41:10.9241542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9241613Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9241832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9241896Z layer_outputs = layer_module( 2025-08-14T21:41:10.9242105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9242174Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9242388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9242468Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9242682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9242766Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9242981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9243050Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9243053Z 2025-08-14T21:41:10.9243154Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9243336Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9243401Z return mod(**inputs) 2025-08-14T21:41:10.9243618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9243683Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9243908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9243972Z layer_outputs = layer_module( 2025-08-14T21:41:10.9244210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9244290Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9244510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9244606Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9244827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9244902Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9245147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9245216Z key_states = self.k(current_states) 2025-08-14T21:41:10.9245220Z 2025-08-14T21:41:10.9245312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9245505Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9245563Z return mod(**inputs) 2025-08-14T21:41:10.9245795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9245861Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9246081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9246170Z layer_outputs = layer_module( 2025-08-14T21:41:10.9246369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9246445Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9246661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9246734Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9246957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9247031Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9247244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9247372Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9247377Z 2025-08-14T21:41:10.9247468Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9247656Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9247715Z return mod(**inputs) 2025-08-14T21:41:10.9247938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9248013Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9248232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9248296Z layer_outputs = layer_module( 2025-08-14T21:41:10.9248505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9248575Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9248802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9248875Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9249091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9249177Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9249392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9249538Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9249554Z 2025-08-14T21:41:10.9249648Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9249829Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9249910Z return mod(**inputs) 2025-08-14T21:41:10.9250133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9250199Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9250428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9250492Z layer_outputs = layer_module( 2025-08-14T21:41:10.9250716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9250788Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9251011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9251089Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9251306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9251389Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9251617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9251687Z value_states = self.v(current_states) 2025-08-14T21:41:10.9251690Z 2025-08-14T21:41:10.9251788Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9251967Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9252025Z return mod(**inputs) 2025-08-14T21:41:10.9252253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9252317Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9252542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9252608Z layer_outputs = layer_module( 2025-08-14T21:41:10.9252807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9252884Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9253098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9253170Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9253393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9253468Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9253692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9253789Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9253792Z 2025-08-14T21:41:10.9253883Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9254073Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9254135Z return mod(**inputs) 2025-08-14T21:41:10.9254362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9254427Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9254651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9254725Z layer_outputs = layer_module( 2025-08-14T21:41:10.9254940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9255014Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9255243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9255341Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9255569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9255646Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9255862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9255980Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9255983Z 2025-08-14T21:41:10.9256078Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9256268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9256329Z return mod(**inputs) 2025-08-14T21:41:10.9256549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9256622Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9256843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9256925Z layer_outputs = layer_module( 2025-08-14T21:41:10.9257137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9257209Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9257437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9257510Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9257729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9257814Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9258031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9258130Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9258142Z 2025-08-14T21:41:10.9258235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9258418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9258485Z return mod(**inputs) 2025-08-14T21:41:10.9258706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9258773Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9258998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9259065Z layer_outputs = layer_module( 2025-08-14T21:41:10.9259275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9259345Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9259562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9259644Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9259862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9259940Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9260164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9260234Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9260237Z 2025-08-14T21:41:10.9260331Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9260428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9260610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9260695Z return mod(**inputs) 2025-08-14T21:41:10.9260922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9260999Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9261223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9261288Z layer_outputs = layer_module( 2025-08-14T21:41:10.9261517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9261589Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9261807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9261896Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9262117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.9262216Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9262449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9262517Z return self.weight * hidden_states 2025-08-14T21:41:10.9262521Z 2025-08-14T21:41:10.9262620Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9262803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9262864Z return mod(**inputs) 2025-08-14T21:41:10.9263092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9263158Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9263382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9263446Z layer_outputs = layer_module( 2025-08-14T21:41:10.9263646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9263726Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9263941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9264029Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9264244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9264348Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9264570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.9264659Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.9264662Z 2025-08-14T21:41:10.9264828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9265039Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9265104Z return mod(**inputs) 2025-08-14T21:41:10.9265335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9265403Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9265628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9265701Z layer_outputs = layer_module( 2025-08-14T21:41:10.9265925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9266008Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9266228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9266326Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9266555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9266659Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9266891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.9266975Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.9266978Z 2025-08-14T21:41:10.9267071Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9267260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9267320Z return mod(**inputs) 2025-08-14T21:41:10.9267539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9267616Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9267837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9267920Z layer_outputs = layer_module( 2025-08-14T21:41:10.9268128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9268198Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9268424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9268505Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9268726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9268839Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9269055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.9269145Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.9269148Z 2025-08-14T21:41:10.9269242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9269428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9269496Z return mod(**inputs) 2025-08-14T21:41:10.9269716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9269782Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9270012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9270078Z layer_outputs = layer_module( 2025-08-14T21:41:10.9270283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9270354Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9270574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9270668Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9270885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9270994Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9271211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.9271282Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.9271300Z 2025-08-14T21:41:10.9271374Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9271466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9271645Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9271726Z return mod(**inputs) 2025-08-14T21:41:10.9271945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9272010Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9272227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9272307Z layer_outputs = layer_module( 2025-08-14T21:41:10.9272516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9272587Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9272803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9272885Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9273098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.9273220Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9273438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9273508Z return self.weight * hidden_states 2025-08-14T21:41:10.9273512Z 2025-08-14T21:41:10.9273612Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9273791Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9273860Z return mod(**inputs) 2025-08-14T21:41:10.9274086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9274155Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9274383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9274451Z layer_outputs = layer_module( 2025-08-14T21:41:10.9274658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9274740Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9274956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9275038Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9275259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9275337Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9275565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9275636Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9275639Z 2025-08-14T21:41:10.9275741Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9275926Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9275988Z return mod(**inputs) 2025-08-14T21:41:10.9276220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9276290Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9276511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9276586Z layer_outputs = layer_module( 2025-08-14T21:41:10.9276809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9276888Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9277106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9277193Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9293778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9293902Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9294267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9294351Z key_states = self.k(current_states) 2025-08-14T21:41:10.9294359Z 2025-08-14T21:41:10.9294470Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9294701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9294769Z return mod(**inputs) 2025-08-14T21:41:10.9295020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9295094Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9295330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9295440Z layer_outputs = layer_module( 2025-08-14T21:41:10.9295662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9295741Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9295977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9296051Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9296284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9296361Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9296595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9296721Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9296727Z 2025-08-14T21:41:10.9296833Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9297030Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9297089Z return mod(**inputs) 2025-08-14T21:41:10.9297327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9297396Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9297631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9297693Z layer_outputs = layer_module( 2025-08-14T21:41:10.9297915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9297988Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9298222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9298294Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9298521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9298605Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9298844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9298990Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9299029Z 2025-08-14T21:41:10.9299128Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9299316Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9299411Z return mod(**inputs) 2025-08-14T21:41:10.9299637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9299706Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9299935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9300001Z layer_outputs = layer_module( 2025-08-14T21:41:10.9300226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9300300Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9300522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9300601Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9300820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9300898Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9301141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9301212Z value_states = self.v(current_states) 2025-08-14T21:41:10.9301215Z 2025-08-14T21:41:10.9301315Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9301495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9301555Z return mod(**inputs) 2025-08-14T21:41:10.9301779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9301847Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9302075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9302141Z layer_outputs = layer_module( 2025-08-14T21:41:10.9302344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9302423Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9302641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9302711Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9302941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9303015Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9303241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9303343Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9303347Z 2025-08-14T21:41:10.9303439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9303628Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9303690Z return mod(**inputs) 2025-08-14T21:41:10.9303913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9303984Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9304204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9304274Z layer_outputs = layer_module( 2025-08-14T21:41:10.9304473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9304565Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9304885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9304992Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9305217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9305291Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9305509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9305630Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9305635Z 2025-08-14T21:41:10.9305730Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9305912Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9305982Z return mod(**inputs) 2025-08-14T21:41:10.9306203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9306277Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9306499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9306582Z layer_outputs = layer_module( 2025-08-14T21:41:10.9306792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9306864Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9307092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9307165Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9307387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9307470Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9307689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9307790Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9307795Z 2025-08-14T21:41:10.9307896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9308079Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9308143Z return mod(**inputs) 2025-08-14T21:41:10.9308366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9308433Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9308660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9308725Z layer_outputs = layer_module( 2025-08-14T21:41:10.9308925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9309003Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9309221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9309300Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9309519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9309591Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9309817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9309888Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9309892Z 2025-08-14T21:41:10.9310007Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9310191Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9310250Z return mod(**inputs) 2025-08-14T21:41:10.9310478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9310561Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9310786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9310858Z layer_outputs = layer_module( 2025-08-14T21:41:10.9311075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9311154Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9311370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9311442Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9311666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 485, in forward 2025-08-14T21:41:10.9311789Z hidden_states = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:41:10.9311793Z 2025-08-14T21:41:10.9311905Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9311997Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9312178Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9312243Z return mod(**inputs) 2025-08-14T21:41:10.9312465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9312531Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9312758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9312822Z layer_outputs = layer_module( 2025-08-14T21:41:10.9313032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9313105Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9313324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9313408Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9313625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 511, in forward 2025-08-14T21:41:10.9313720Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9313947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9314017Z return self.weight * hidden_states 2025-08-14T21:41:10.9314020Z 2025-08-14T21:41:10.9314122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9314300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9314357Z return mod(**inputs) 2025-08-14T21:41:10.9314588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9314654Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9314878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9314941Z layer_outputs = layer_module( 2025-08-14T21:41:10.9315143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9315220Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9315439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9315525Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9315751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9315845Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9316071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9316144Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9316148Z 2025-08-14T21:41:10.9316238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9316442Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9316503Z return mod(**inputs) 2025-08-14T21:41:10.9316728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9316806Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9317030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9317100Z layer_outputs = layer_module( 2025-08-14T21:41:10.9317308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9317395Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9317621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9317691Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9317915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9317992Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9318210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9318285Z key_states = self.k(current_states) 2025-08-14T21:41:10.9318289Z 2025-08-14T21:41:10.9318380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9318561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9318629Z return mod(**inputs) 2025-08-14T21:41:10.9318847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9318922Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9319143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9319206Z layer_outputs = layer_module( 2025-08-14T21:41:10.9319413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9319483Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9319698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9319776Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9319995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9320080Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9320297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9320416Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9320421Z 2025-08-14T21:41:10.9320521Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9320702Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9320769Z return mod(**inputs) 2025-08-14T21:41:10.9321002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9321071Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9321301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9321382Z layer_outputs = layer_module( 2025-08-14T21:41:10.9321583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9321659Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9321890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9321971Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9322187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9322265Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9322487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9322628Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9322646Z 2025-08-14T21:41:10.9322748Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9322931Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9322991Z return mod(**inputs) 2025-08-14T21:41:10.9323220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9323286Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9323505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9323580Z layer_outputs = layer_module( 2025-08-14T21:41:10.9323783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9323860Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9324078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9324153Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9324375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9324464Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9324689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9324765Z value_states = self.v(current_states) 2025-08-14T21:41:10.9324769Z 2025-08-14T21:41:10.9324861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9325046Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9325113Z return mod(**inputs) 2025-08-14T21:41:10.9325337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9325413Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9325637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9325698Z layer_outputs = layer_module( 2025-08-14T21:41:10.9325910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9325980Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9326201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9326294Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9326512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9326592Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9326823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9326923Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9326926Z 2025-08-14T21:41:10.9327025Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9327203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9327282Z return mod(**inputs) 2025-08-14T21:41:10.9327501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9327566Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9327791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9327855Z layer_outputs = layer_module( 2025-08-14T21:41:10.9328053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9328151Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9328372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9328453Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9328673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9328751Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9328978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9329076Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9329080Z 2025-08-14T21:41:10.9329181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9329365Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9329428Z return mod(**inputs) 2025-08-14T21:41:10.9329661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9329728Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9329954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9330029Z layer_outputs = layer_module( 2025-08-14T21:41:10.9330235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9330315Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9330536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9330611Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9330838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9330919Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9331140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9331245Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9331248Z 2025-08-14T21:41:10.9331345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9331534Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9331594Z return mod(**inputs) 2025-08-14T21:41:10.9331836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9331910Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9332129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9332219Z layer_outputs = layer_module( 2025-08-14T21:41:10.9332419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9332488Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9332736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9332811Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9333027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9333111Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9333327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9333403Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9333407Z 2025-08-14T21:41:10.9333478Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9333587Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9333775Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9333834Z return mod(**inputs) 2025-08-14T21:41:10.9334055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9334127Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9334345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9334415Z layer_outputs = layer_module( 2025-08-14T21:41:10.9334614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9334683Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9334908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9334993Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9335216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.9335302Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9335519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9335598Z return self.weight * hidden_states 2025-08-14T21:41:10.9335601Z 2025-08-14T21:41:10.9335692Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9335874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9335939Z return mod(**inputs) 2025-08-14T21:41:10.9336157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9336232Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9336451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9336514Z layer_outputs = layer_module( 2025-08-14T21:41:10.9336721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9336790Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9337005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9337110Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9337332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9337447Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9337795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.9337889Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.9337893Z 2025-08-14T21:41:10.9337994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9338192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9338264Z return mod(**inputs) 2025-08-14T21:41:10.9338485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9338553Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9338783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9338848Z layer_outputs = layer_module( 2025-08-14T21:41:10.9339048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9339148Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9339364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9339453Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9339671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9339779Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9340007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.9340080Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.9340083Z 2025-08-14T21:41:10.9340181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9340359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9340420Z return mod(**inputs) 2025-08-14T21:41:10.9340645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9340709Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9340926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9340997Z layer_outputs = layer_module( 2025-08-14T21:41:10.9341196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9341274Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9341488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9341568Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9341790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9341895Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9342118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.9342198Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.9342203Z 2025-08-14T21:41:10.9342293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9342481Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9342540Z return mod(**inputs) 2025-08-14T21:41:10.9342799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9342874Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9343094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9343181Z layer_outputs = layer_module( 2025-08-14T21:41:10.9343389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9343460Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9343703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9343785Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9344000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9344110Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9344323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.9344402Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.9344405Z 2025-08-14T21:41:10.9344520Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9344614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9344911Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9344980Z return mod(**inputs) 2025-08-14T21:41:10.9345211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9345277Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9345494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9345569Z layer_outputs = layer_module( 2025-08-14T21:41:10.9345769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9345842Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9346067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9346140Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9346369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 474, in forward 2025-08-14T21:41:10.9346472Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9346690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9346768Z return self.weight * hidden_states 2025-08-14T21:41:10.9346772Z 2025-08-14T21:41:10.9346865Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9347057Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9347116Z return mod(**inputs) 2025-08-14T21:41:10.9347342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9347418Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9347640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9347704Z layer_outputs = layer_module( 2025-08-14T21:41:10.9347915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9347985Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9348236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9348311Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9348529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9348629Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9348847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9348918Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9348929Z 2025-08-14T21:41:10.9349021Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9349216Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9349282Z return mod(**inputs) 2025-08-14T21:41:10.9349504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9349570Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9349797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9349859Z layer_outputs = layer_module( 2025-08-14T21:41:10.9350066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9350155Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9350370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9350449Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9350667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9350742Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9350967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9351036Z key_states = self.k(current_states) 2025-08-14T21:41:10.9351039Z 2025-08-14T21:41:10.9351137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9351316Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9351377Z return mod(**inputs) 2025-08-14T21:41:10.9351603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9351669Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9351889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9351960Z layer_outputs = layer_module( 2025-08-14T21:41:10.9352158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9352236Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9352453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9352524Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9352749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9352823Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9353047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9353164Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9353168Z 2025-08-14T21:41:10.9353259Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9353445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9353502Z return mod(**inputs) 2025-08-14T21:41:10.9353733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9353807Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9354024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9354123Z layer_outputs = layer_module( 2025-08-14T21:41:10.9354328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9354400Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9354645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9354720Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9354948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9355026Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9355245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9355394Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9355399Z 2025-08-14T21:41:10.9355507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9355685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9355748Z return mod(**inputs) 2025-08-14T21:41:10.9355969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9356039Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9356258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9356320Z layer_outputs = layer_module( 2025-08-14T21:41:10.9356530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9356600Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9356825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9356897Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9357114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9357192Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9357408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9357476Z value_states = self.v(current_states) 2025-08-14T21:41:10.9357479Z 2025-08-14T21:41:10.9357578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9357759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9357824Z return mod(**inputs) 2025-08-14T21:41:10.9358041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9358109Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9358334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9358397Z layer_outputs = layer_module( 2025-08-14T21:41:10.9358597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9358674Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9358890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9358968Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9359198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9359274Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9359515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9359613Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9359616Z 2025-08-14T21:41:10.9359714Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9359892Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9359965Z return mod(**inputs) 2025-08-14T21:41:10.9360197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9360262Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9360485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9360556Z layer_outputs = layer_module( 2025-08-14T21:41:10.9360757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9360854Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9361069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9361140Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9361362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9361435Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9361656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9361754Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9361757Z 2025-08-14T21:41:10.9361848Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9362035Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9362096Z return mod(**inputs) 2025-08-14T21:41:10.9362318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9362392Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9362613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9362682Z layer_outputs = layer_module( 2025-08-14T21:41:10.9362882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9362953Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9363178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9363250Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9363467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9363554Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9363768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9363872Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9363875Z 2025-08-14T21:41:10.9363970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9364152Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9364219Z return mod(**inputs) 2025-08-14T21:41:10.9364452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9364527Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9364750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9364832Z layer_outputs = layer_module( 2025-08-14T21:41:10.9365040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9365111Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9365326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 559, in forward 2025-08-14T21:41:10.9365420Z self_attention_outputs = self.layer[0]( 2025-08-14T21:41:10.9365638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 475, in forward 2025-08-14T21:41:10.9365717Z attention_output = self.SelfAttention( 2025-08-14T21:41:10.9365934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9366004Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9366008Z 2025-08-14T21:41:10.9366088Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9366200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9366390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9366448Z return mod(**inputs) 2025-08-14T21:41:10.9366669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9366741Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9366963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9367025Z layer_outputs = layer_module( 2025-08-14T21:41:10.9367233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9367301Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9367525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9367601Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9367817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 511, in forward 2025-08-14T21:41:10.9367920Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9368140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9368208Z return self.weight * hidden_states 2025-08-14T21:41:10.9368218Z 2025-08-14T21:41:10.9368309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9368490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9368554Z return mod(**inputs) 2025-08-14T21:41:10.9368775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9368841Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9369071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9369134Z layer_outputs = layer_module( 2025-08-14T21:41:10.9369343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9369413Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9369630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9369710Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9369942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9370022Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9370264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 365, in forward 2025-08-14T21:41:10.9370335Z query_states = self.q(hidden_states) 2025-08-14T21:41:10.9370338Z 2025-08-14T21:41:10.9370435Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9370613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9370684Z return mod(**inputs) 2025-08-14T21:41:10.9370912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9370977Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9371197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9371265Z layer_outputs = layer_module( 2025-08-14T21:41:10.9371464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9371540Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9371772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9371842Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9372066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9372141Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9372364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 385, in forward 2025-08-14T21:41:10.9372434Z key_states = self.k(current_states) 2025-08-14T21:41:10.9372438Z 2025-08-14T21:41:10.9372528Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9372713Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9372773Z return mod(**inputs) 2025-08-14T21:41:10.9372994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9373066Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9373287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9373357Z layer_outputs = layer_module( 2025-08-14T21:41:10.9373555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9373627Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9373852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9373924Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9374142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9374226Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9374446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 401, in forward 2025-08-14T21:41:10.9374564Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:41:10.9374575Z 2025-08-14T21:41:10.9374666Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9374846Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9374910Z return mod(**inputs) 2025-08-14T21:41:10.9375154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9375222Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9375449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9375527Z layer_outputs = layer_module( 2025-08-14T21:41:10.9375736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9375804Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9376021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9376113Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9376328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9376403Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9376621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 433, in forward 2025-08-14T21:41:10.9376762Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:41:10.9376767Z 2025-08-14T21:41:10.9376862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9377056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9377113Z return mod(**inputs) 2025-08-14T21:41:10.9377334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9377398Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9377621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9377683Z layer_outputs = layer_module( 2025-08-14T21:41:10.9377881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9377954Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9378168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9378240Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9378460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9378535Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9378757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 386, in forward 2025-08-14T21:41:10.9378824Z value_states = self.v(current_states) 2025-08-14T21:41:10.9378827Z 2025-08-14T21:41:10.9378914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9379101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9379158Z return mod(**inputs) 2025-08-14T21:41:10.9379382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9379447Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9379663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9379734Z layer_outputs = layer_module( 2025-08-14T21:41:10.9379932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9380002Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9380222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9380291Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9380526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9380602Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9380818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9380943Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9380947Z 2025-08-14T21:41:10.9381037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9381221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9381279Z return mod(**inputs) 2025-08-14T21:41:10.9381511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9381581Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9381804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9381865Z layer_outputs = layer_module( 2025-08-14T21:41:10.9382068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9382138Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9382377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9382446Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9382663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9382743Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9382962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 440, in forward 2025-08-14T21:41:10.9383056Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:41:10.9383067Z 2025-08-14T21:41:10.9383157Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9383339Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9383401Z return mod(**inputs) 2025-08-14T21:41:10.9383624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9383689Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9383916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9383976Z layer_outputs = layer_module( 2025-08-14T21:41:10.9384182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9384250Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9384470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9384544Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9385031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9385116Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9385345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 442, in forward 2025-08-14T21:41:10.9385441Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:41:10.9385447Z 2025-08-14T21:41:10.9385543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9385727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9385787Z return mod(**inputs) 2025-08-14T21:41:10.9386095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9386161Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9386379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9386479Z layer_outputs = layer_module( 2025-08-14T21:41:10.9386680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9386754Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9386968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9387060Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9387284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 512, in forward 2025-08-14T21:41:10.9387358Z attention_output = self.EncDecAttention( 2025-08-14T21:41:10.9387578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 444, in forward 2025-08-14T21:41:10.9387646Z attn_output = self.o(attn_output) 2025-08-14T21:41:10.9387649Z 2025-08-14T21:41:10.9387740Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9387925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9388008Z return mod(**inputs) 2025-08-14T21:41:10.9388229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9388297Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9388521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9388587Z layer_outputs = layer_module( 2025-08-14T21:41:10.9388789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9388857Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9389079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 583, in forward 2025-08-14T21:41:10.9389149Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:41:10.9389368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 524, in forward 2025-08-14T21:41:10.9389494Z layer_output = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:41:10.9389497Z 2025-08-14T21:41:10.9389565Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9389661Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9389842Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9389898Z return mod(**inputs) 2025-08-14T21:41:10.9390123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9390184Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9390406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9390470Z layer_outputs = layer_module( 2025-08-14T21:41:10.9390669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9390743Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9390959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9391039Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9391261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 215, in forward 2025-08-14T21:41:10.9391343Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:41:10.9391581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9391650Z return self.weight * hidden_states 2025-08-14T21:41:10.9391653Z 2025-08-14T21:41:10.9391761Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9391946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9392004Z return mod(**inputs) 2025-08-14T21:41:10.9392228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9392291Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9392525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9392592Z layer_outputs = layer_module( 2025-08-14T21:41:10.9392792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9392860Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9393081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9393162Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9393406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9393510Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9393724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 183, in forward 2025-08-14T21:41:10.9393816Z hidden_gelu = self.act(self.wi_0(hidden_states)) 2025-08-14T21:41:10.9393819Z 2025-08-14T21:41:10.9393909Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9394095Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9394153Z return mod(**inputs) 2025-08-14T21:41:10.9394368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9394439Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9394653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9394715Z layer_outputs = layer_module( 2025-08-14T21:41:10.9394915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9394983Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9395200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9395279Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9395493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9395599Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9395814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 184, in forward 2025-08-14T21:41:10.9395884Z hidden_linear = self.wi_1(hidden_states) 2025-08-14T21:41:10.9395891Z 2025-08-14T21:41:10.9395979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9396157Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9396223Z return mod(**inputs) 2025-08-14T21:41:10.9396451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9396518Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9396760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9396827Z layer_outputs = layer_module( 2025-08-14T21:41:10.9397027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9397125Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9397350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9397431Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9397659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9397780Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9398006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 185, in forward 2025-08-14T21:41:10.9398084Z hidden_states = hidden_gelu * hidden_linear 2025-08-14T21:41:10.9398089Z 2025-08-14T21:41:10.9398181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9398369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9398429Z return mod(**inputs) 2025-08-14T21:41:10.9398654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9398737Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9398957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1079, in forward 2025-08-14T21:41:10.9399028Z layer_outputs = layer_module( 2025-08-14T21:41:10.9399231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:10.9399302Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:10.9399524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 609, in forward 2025-08-14T21:41:10.9399604Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:41:10.9399826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 216, in forward 2025-08-14T21:41:10.9399929Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:41:10.9400145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 198, in forward 2025-08-14T21:41:10.9400225Z hidden_states = self.wo(hidden_states) 2025-08-14T21:41:10.9400229Z 2025-08-14T21:41:10.9400300Z cudagraph partition due to non gpu ops 2025-08-14T21:41:10.9400401Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9400579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9400637Z return mod(**inputs) 2025-08-14T21:41:10.9400865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1787, in forward 2025-08-14T21:41:10.9400930Z decoder_outputs = self.decoder( 2025-08-14T21:41:10.9401147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1115, in forward 2025-08-14T21:41:10.9401253Z hidden_states = self.final_layer_norm(hidden_states) 2025-08-14T21:41:10.9401470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 146, in forward 2025-08-14T21:41:10.9401546Z return self.weight * hidden_states 2025-08-14T21:41:10.9401550Z 2025-08-14T21:41:10.9401642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9401821Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9401887Z return mod(**inputs) 2025-08-14T21:41:10.9402118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1816, in forward 2025-08-14T21:41:10.9402201Z lm_logits = self.lm_head(sequence_output) 2025-08-14T21:41:10.9402210Z 2025-08-14T21:41:10.9402300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9402495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9402562Z return mod(**inputs) 2025-08-14T21:41:10.9402784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1823, in forward 2025-08-14T21:41:10.9402913Z loss = loss_fct(lm_logits.view(-1, lm_logits.size(-1)), labels.view(-1)) 2025-08-14T21:41:10.9402917Z 2025-08-14T21:41:10.9403029Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9403205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9403271Z return mod(**inputs) 2025-08-14T21:41:10.9403493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1823, in forward 2025-08-14T21:41:10.9403613Z loss = loss_fct(lm_logits.view(-1, lm_logits.size(-1)), labels.view(-1)) 2025-08-14T21:41:10.9403616Z 2025-08-14T21:41:10.9403713Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:10.9403889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:10.9403970Z return mod(**inputs) 2025-08-14T21:41:10.9404193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mt5/modeling_mt5.py", line 1823, in forward 2025-08-14T21:41:10.9404311Z loss = loss_fct(lm_logits.view(-1, lm_logits.size(-1)), labels.view(-1)) 2025-08-14T21:41:10.9404314Z 2025-08-14T21:41:20.4776652Z Compilation time (from dynamo_timed): 19.849412768 2025-08-14T21:41:20.4993342Z pass 2025-08-14T21:41:20.4996786Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:41:20.5000670Z TIMING: _recursive_pre_grad_passes:0.01321 _recursive_joint_graph_passes:0.68683 _recursive_post_grad_passes:0.52475 async_compile.wait:0.71305 code_gen:9.0472 inductor_compile:11.4332 backend_compile:16.08293 gc:0.0001 entire_frame_compile:19.84941 total_wall_time:19.84941 2025-08-14T21:41:20.5001836Z STATS: call_* op count: 1189 | FakeTensorMode.__torch_dispatch__:29419 | FakeTensor.__torch_dispatch__:8702 | ProxyTorchDispatchMode.__torch_dispatch__:10618 2025-08-14T21:41:20.5002496Z Dynamo produced 1 graphs covering 1189 ops with 0 graph breaks (0 unique) 2025-08-14T21:41:24.8159580Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:41:24.8160647Z from pkg_resources import resource_filename 2025-08-14T21:41:25.3633269Z 2025-08-14T21:41:25.3740106Z loading model: 0it [00:00, ?it/s]If you want to use `MegatronBertForCausalLM` as a standalone, add `is_decoder=True.` 2025-08-14T21:41:25.3742431Z WARNING:transformers.models.megatron_bert.modeling_megatron_bert:If you want to use `MegatronBertForCausalLM` as a standalone, add `is_decoder=True.` 2025-08-14T21:41:28.3865249Z 2025-08-14T21:41:28.3870305Z loading model: 0it [00:03, ?it/s] 2025-08-14T21:41:28.3891771Z cpu eval MegatronBertForCausalLM 2025-08-14T21:41:29.6389333Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:41:30.1261705Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:41:30.5988045Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:41:43.3628897Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3631376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3631867Z return mod(**inputs) 2025-08-14T21:41:43.3634003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3634814Z outputs = self.bert( 2025-08-14T21:41:43.3639078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3640930Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3641678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3646087Z layer_outputs = layer_module( 2025-08-14T21:41:43.3648259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3648752Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3653204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3655181Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3655710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3656313Z self_outputs = self.self( 2025-08-14T21:41:43.3660708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3661186Z return func(*args, **kwargs) 2025-08-14T21:41:43.3663711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.3664122Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.3664318Z 2025-08-14T21:41:43.3668658Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3670759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3671198Z return mod(**inputs) 2025-08-14T21:41:43.3675731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3680099Z outputs = self.bert( 2025-08-14T21:41:43.3684243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3685239Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3685665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3686056Z layer_outputs = layer_module( 2025-08-14T21:41:43.3686391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3686737Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3687135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3687532Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3687930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3688326Z self_outputs = self.self( 2025-08-14T21:41:43.3688670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3689013Z return func(*args, **kwargs) 2025-08-14T21:41:43.3689396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.3689935Z key_layer = self.key(current_states) 2025-08-14T21:41:43.3690072Z 2025-08-14T21:41:43.3690180Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3690537Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3690896Z return mod(**inputs) 2025-08-14T21:41:43.3691270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3691648Z outputs = self.bert( 2025-08-14T21:41:43.3692153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3692574Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3692963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3693345Z layer_outputs = layer_module( 2025-08-14T21:41:43.3693672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3694007Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3694391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3694823Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3695215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3695600Z self_outputs = self.self( 2025-08-14T21:41:43.3695930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3696274Z return func(*args, **kwargs) 2025-08-14T21:41:43.3696651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.3697044Z value_layer = self.value(current_states) 2025-08-14T21:41:43.3697169Z 2025-08-14T21:41:43.3697247Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3697447Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3697677Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3698014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3698331Z return mod(**inputs) 2025-08-14T21:41:43.3698714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3699103Z outputs = self.bert( 2025-08-14T21:41:43.3699458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3699846Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3700231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3700611Z layer_outputs = layer_module( 2025-08-14T21:41:43.3700932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3701266Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3701655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3702041Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3702437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.3702898Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.3703368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.3703771Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3703898Z 2025-08-14T21:41:43.3704000Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3704383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3704685Z return mod(**inputs) 2025-08-14T21:41:43.3705167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3705551Z outputs = self.bert( 2025-08-14T21:41:43.3705937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3706336Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3706722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3707102Z layer_outputs = layer_module( 2025-08-14T21:41:43.3707423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3707759Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3708167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3708557Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3708930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3709290Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3709705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3710153Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3710564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.3710961Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3711086Z 2025-08-14T21:41:43.3711183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3711512Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3711807Z return mod(**inputs) 2025-08-14T21:41:43.3712177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3712557Z outputs = self.bert( 2025-08-14T21:41:43.3712918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3713304Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3713677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3714061Z layer_outputs = layer_module( 2025-08-14T21:41:43.3714381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3714711Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3715092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3715489Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3715860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3716221Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3716645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3717093Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3717502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.3717944Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.3718299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.3718613Z return self.act(input) 2025-08-14T21:41:43.3718715Z 2025-08-14T21:41:43.3718835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3719161Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3719461Z return mod(**inputs) 2025-08-14T21:41:43.3719831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3720215Z outputs = self.bert( 2025-08-14T21:41:43.3720573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3721005Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3721389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3721769Z layer_outputs = layer_module( 2025-08-14T21:41:43.3722090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3722421Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3722811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3723202Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3723573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3723938Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3724344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.3724827Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.3725273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.3725668Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3725793Z 2025-08-14T21:41:43.3725888Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3726223Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3726522Z return mod(**inputs) 2025-08-14T21:41:43.3726887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3727263Z outputs = self.bert( 2025-08-14T21:41:43.3727626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3728010Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3728395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3728780Z layer_outputs = layer_module( 2025-08-14T21:41:43.3729105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3729438Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3729861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3730259Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3730655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3731061Z self_outputs = self.self( 2025-08-14T21:41:43.3731392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3731740Z return func(*args, **kwargs) 2025-08-14T21:41:43.3732135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.3732522Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.3732655Z 2025-08-14T21:41:43.3732750Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3733088Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3733388Z return mod(**inputs) 2025-08-14T21:41:43.3733747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3734151Z outputs = self.bert( 2025-08-14T21:41:43.3734515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3734911Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3735300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3735699Z layer_outputs = layer_module( 2025-08-14T21:41:43.3736032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3736377Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3736779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3737188Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3737597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3737990Z self_outputs = self.self( 2025-08-14T21:41:43.3738338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3738696Z return func(*args, **kwargs) 2025-08-14T21:41:43.3739082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.3739483Z key_layer = self.key(current_states) 2025-08-14T21:41:43.3739617Z 2025-08-14T21:41:43.3739720Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3740073Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3740371Z return mod(**inputs) 2025-08-14T21:41:43.3740752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3741150Z outputs = self.bert( 2025-08-14T21:41:43.3741517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3741899Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3742285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3742671Z layer_outputs = layer_module( 2025-08-14T21:41:43.3743007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3743337Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3743727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3744144Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3744527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3744989Z self_outputs = self.self( 2025-08-14T21:41:43.3745345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3745694Z return func(*args, **kwargs) 2025-08-14T21:41:43.3746066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.3746463Z value_layer = self.value(current_states) 2025-08-14T21:41:43.3746588Z 2025-08-14T21:41:43.3746671Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3746862Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3747084Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3747417Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3747739Z return mod(**inputs) 2025-08-14T21:41:43.3748098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3748483Z outputs = self.bert( 2025-08-14T21:41:43.3748848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3749241Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3749626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3750008Z layer_outputs = layer_module( 2025-08-14T21:41:43.3750323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3750647Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3751036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3751429Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3751820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.3752249Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.3752684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.3753081Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3753206Z 2025-08-14T21:41:43.3753308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3753633Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3753934Z return mod(**inputs) 2025-08-14T21:41:43.3754300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3754672Z outputs = self.bert( 2025-08-14T21:41:43.3755035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3755421Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3755830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3756211Z layer_outputs = layer_module( 2025-08-14T21:41:43.3756527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3756860Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3757258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3757655Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3758026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3758402Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3758813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3759256Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3759668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.3760064Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3760192Z 2025-08-14T21:41:43.3760287Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3760632Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3760927Z return mod(**inputs) 2025-08-14T21:41:43.3761290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3761666Z outputs = self.bert( 2025-08-14T21:41:43.3762029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3762417Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3762792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3763184Z layer_outputs = layer_module( 2025-08-14T21:41:43.3763501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3763834Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3764215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3764610Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3764983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3765342Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3765750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3766191Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3766599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.3767019Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.3767369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.3767681Z return self.act(input) 2025-08-14T21:41:43.3767782Z 2025-08-14T21:41:43.3767885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3768209Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3768504Z return mod(**inputs) 2025-08-14T21:41:43.3768885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3769267Z outputs = self.bert( 2025-08-14T21:41:43.3769631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3770038Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3770422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3770798Z layer_outputs = layer_module( 2025-08-14T21:41:43.3771118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3771462Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3771851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3772241Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3772609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3772971Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3773378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.3773859Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.3774297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.3774692Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3774818Z 2025-08-14T21:41:43.3774914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3775245Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3775545Z return mod(**inputs) 2025-08-14T21:41:43.3775913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3776288Z outputs = self.bert( 2025-08-14T21:41:43.3776649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3777038Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3777416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3777802Z layer_outputs = layer_module( 2025-08-14T21:41:43.3778124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3778459Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3778845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3779240Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3779609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3779975Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3780383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.3780848Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.3781291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.3781684Z return input_tensor + hidden_states 2025-08-14T21:41:43.3781806Z 2025-08-14T21:41:43.3781902Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3782245Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3782543Z return mod(**inputs) 2025-08-14T21:41:43.3782901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3783300Z outputs = self.bert( 2025-08-14T21:41:43.3783662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3784047Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3784436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3785037Z layer_outputs = layer_module( 2025-08-14T21:41:43.3785369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3785698Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3786092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3786489Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3786927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3787304Z self_outputs = self.self( 2025-08-14T21:41:43.3787643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3787989Z return func(*args, **kwargs) 2025-08-14T21:41:43.3788366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.3788757Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.3788891Z 2025-08-14T21:41:43.3788990Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3789325Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3789620Z return mod(**inputs) 2025-08-14T21:41:43.3789990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3790376Z outputs = self.bert( 2025-08-14T21:41:43.3790735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3791116Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3791496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3791880Z layer_outputs = layer_module( 2025-08-14T21:41:43.3792197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3792523Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3792911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3793310Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3793694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3794080Z self_outputs = self.self( 2025-08-14T21:41:43.3794414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3794758Z return func(*args, **kwargs) 2025-08-14T21:41:43.3795125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.3795543Z key_layer = self.key(current_states) 2025-08-14T21:41:43.3795670Z 2025-08-14T21:41:43.3795774Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3796099Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3796420Z return mod(**inputs) 2025-08-14T21:41:43.3796791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3797174Z outputs = self.bert( 2025-08-14T21:41:43.3797552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3797946Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3798331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3798717Z layer_outputs = layer_module( 2025-08-14T21:41:43.3799027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3799360Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3799752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3800170Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3800565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3800950Z self_outputs = self.self( 2025-08-14T21:41:43.3801284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3801618Z return func(*args, **kwargs) 2025-08-14T21:41:43.3801996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.3802391Z value_layer = self.value(current_states) 2025-08-14T21:41:43.3802515Z 2025-08-14T21:41:43.3802596Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3802788Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3803011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3803347Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3803639Z return mod(**inputs) 2025-08-14T21:41:43.3804010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3804394Z outputs = self.bert( 2025-08-14T21:41:43.3804759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3805139Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3805520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3805903Z layer_outputs = layer_module( 2025-08-14T21:41:43.3806216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3806545Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3806933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3807324Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3807709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.3808147Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.3808601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.3809004Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3809130Z 2025-08-14T21:41:43.3809243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3809573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3809873Z return mod(**inputs) 2025-08-14T21:41:43.3810235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3810618Z outputs = self.bert( 2025-08-14T21:41:43.3811000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3811389Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3811765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3812150Z layer_outputs = layer_module( 2025-08-14T21:41:43.3812471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3812802Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3813204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3813602Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3813978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3814333Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3814752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3815193Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3815606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.3815994Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3816128Z 2025-08-14T21:41:43.3816224Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3816555Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3816854Z return mod(**inputs) 2025-08-14T21:41:43.3817213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3817593Z outputs = self.bert( 2025-08-14T21:41:43.3817957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3818336Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3818717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3819105Z layer_outputs = layer_module( 2025-08-14T21:41:43.3819422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3819747Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3820134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3820534Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3820897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3821260Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3821685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3822131Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3822549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.3822971Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.3823324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.3823634Z return self.act(input) 2025-08-14T21:41:43.3823734Z 2025-08-14T21:41:43.3823843Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3824177Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3824474Z return mod(**inputs) 2025-08-14T21:41:43.3824906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3825302Z outputs = self.bert( 2025-08-14T21:41:43.3825668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3826089Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3826479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3826874Z layer_outputs = layer_module( 2025-08-14T21:41:43.3827206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3827547Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3827941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3828346Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3828726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3829090Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3829511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.3829985Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.3830438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.3830837Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3830976Z 2025-08-14T21:41:43.3831075Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3831423Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3831729Z return mod(**inputs) 2025-08-14T21:41:43.3832101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3832498Z outputs = self.bert( 2025-08-14T21:41:43.3832874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3833263Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3833659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3834054Z layer_outputs = layer_module( 2025-08-14T21:41:43.3834384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3834716Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3835131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3835535Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3835942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3836324Z self_outputs = self.self( 2025-08-14T21:41:43.3836660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3837006Z return func(*args, **kwargs) 2025-08-14T21:41:43.3837392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.3837794Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.3837928Z 2025-08-14T21:41:43.3838028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3838359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3838651Z return mod(**inputs) 2025-08-14T21:41:43.3839017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3839417Z outputs = self.bert( 2025-08-14T21:41:43.3839779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3840160Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3840543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3840924Z layer_outputs = layer_module( 2025-08-14T21:41:43.3841239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3841571Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3841960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3842352Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3842735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3843117Z self_outputs = self.self( 2025-08-14T21:41:43.3843450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3843794Z return func(*args, **kwargs) 2025-08-14T21:41:43.3844161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.3844550Z key_layer = self.key(current_states) 2025-08-14T21:41:43.3844672Z 2025-08-14T21:41:43.3844775Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3845098Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3845399Z return mod(**inputs) 2025-08-14T21:41:43.3845765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3846149Z outputs = self.bert( 2025-08-14T21:41:43.3846504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3846891Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3847274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3847650Z layer_outputs = layer_module( 2025-08-14T21:41:43.3847983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3848318Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3848710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3849120Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3849519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3849909Z self_outputs = self.self( 2025-08-14T21:41:43.3850266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3850605Z return func(*args, **kwargs) 2025-08-14T21:41:43.3850984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.3851381Z value_layer = self.value(current_states) 2025-08-14T21:41:43.3851505Z 2025-08-14T21:41:43.3851578Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3851776Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3851994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3852344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3852636Z return mod(**inputs) 2025-08-14T21:41:43.3853001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3853385Z outputs = self.bert( 2025-08-14T21:41:43.3853736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3854121Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3854506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3854892Z layer_outputs = layer_module( 2025-08-14T21:41:43.3855204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3855538Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3855926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3856318Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3856702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.3857141Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.3857575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.3857965Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3858099Z 2025-08-14T21:41:43.3858194Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3858521Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3858819Z return mod(**inputs) 2025-08-14T21:41:43.3859177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3859559Z outputs = self.bert( 2025-08-14T21:41:43.3859920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3860308Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3860698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3861089Z layer_outputs = layer_module( 2025-08-14T21:41:43.3861416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3861757Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3862153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3862555Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3862943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3863302Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3863716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3864165Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3864580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.3865074Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3865216Z 2025-08-14T21:41:43.3865334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3865674Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3865967Z return mod(**inputs) 2025-08-14T21:41:43.3866340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3866726Z outputs = self.bert( 2025-08-14T21:41:43.3867091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3867475Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3867860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3868243Z layer_outputs = layer_module( 2025-08-14T21:41:43.3868563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3868889Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3869277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3869674Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3870038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3870403Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3870814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3871253Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3871658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.3872084Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.3872435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.3872746Z return self.act(input) 2025-08-14T21:41:43.3872846Z 2025-08-14T21:41:43.3872943Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3873278Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3873576Z return mod(**inputs) 2025-08-14T21:41:43.3874026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3874409Z outputs = self.bert( 2025-08-14T21:41:43.3874794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3875319Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3875912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3876522Z layer_outputs = layer_module( 2025-08-14T21:41:43.3877061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3877595Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3878228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3878883Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3879486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3880103Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3880857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.3881766Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.3882532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.3883226Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3883457Z 2025-08-14T21:41:43.3883612Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3884182Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3884902Z return mod(**inputs) 2025-08-14T21:41:43.3885567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3886279Z outputs = self.bert( 2025-08-14T21:41:43.3886944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3887656Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3888370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3889068Z layer_outputs = layer_module( 2025-08-14T21:41:43.3889638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3890235Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3890962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3891696Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3892361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3893029Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3893789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.3894653Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.3895461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.3896184Z return input_tensor + hidden_states 2025-08-14T21:41:43.3896404Z 2025-08-14T21:41:43.3896672Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3897276Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3897816Z return mod(**inputs) 2025-08-14T21:41:43.3898502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3899266Z outputs = self.bert( 2025-08-14T21:41:43.3899927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3900652Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3901404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3902116Z layer_outputs = layer_module( 2025-08-14T21:41:43.3902685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3903283Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3903992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3904808Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3905571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3906287Z self_outputs = self.self( 2025-08-14T21:41:43.3906901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3907515Z return func(*args, **kwargs) 2025-08-14T21:41:43.3908209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.3908929Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.3909148Z 2025-08-14T21:41:43.3909317Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3909911Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3910448Z return mod(**inputs) 2025-08-14T21:41:43.3911130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3911833Z outputs = self.bert( 2025-08-14T21:41:43.3912492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3913209Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3913912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3914610Z layer_outputs = layer_module( 2025-08-14T21:41:43.3915187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3915785Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3916502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3917230Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3917952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3918654Z self_outputs = self.self( 2025-08-14T21:41:43.3919273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3919899Z return func(*args, **kwargs) 2025-08-14T21:41:43.3920630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.3921364Z key_layer = self.key(current_states) 2025-08-14T21:41:43.3921586Z 2025-08-14T21:41:43.3921760Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3922382Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3922967Z return mod(**inputs) 2025-08-14T21:41:43.3923647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3924352Z outputs = self.bert( 2025-08-14T21:41:43.3925042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3925752Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3926447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3927154Z layer_outputs = layer_module( 2025-08-14T21:41:43.3927726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3928326Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3929057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3929813Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3930544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3931255Z self_outputs = self.self( 2025-08-14T21:41:43.3931859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3932491Z return func(*args, **kwargs) 2025-08-14T21:41:43.3933184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.3933901Z value_layer = self.value(current_states) 2025-08-14T21:41:43.3934136Z 2025-08-14T21:41:43.3934262Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3934605Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.3934987Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3935588Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3936123Z return mod(**inputs) 2025-08-14T21:41:43.3936805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3937511Z outputs = self.bert( 2025-08-14T21:41:43.3938178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3938893Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3939605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3940312Z layer_outputs = layer_module( 2025-08-14T21:41:43.3940895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3941504Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3942224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3942959Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3943681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.3944496Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.3945426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.3946158Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3946432Z 2025-08-14T21:41:43.3946599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3947203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3947737Z return mod(**inputs) 2025-08-14T21:41:43.3948419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3949144Z outputs = self.bert( 2025-08-14T21:41:43.3949818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3950523Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3951214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3951924Z layer_outputs = layer_module( 2025-08-14T21:41:43.3952494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3953122Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3953833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3954562Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3955230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3955894Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3956654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3957480Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3958242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.3958976Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3959201Z 2025-08-14T21:41:43.3959372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3959963Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3960506Z return mod(**inputs) 2025-08-14T21:41:43.3961187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3961891Z outputs = self.bert( 2025-08-14T21:41:43.3962553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3963278Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3963981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3964700Z layer_outputs = layer_module( 2025-08-14T21:41:43.3965271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3965873Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3966590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3967306Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3967975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3968645Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3969463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.3970277Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.3971088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.3971864Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.3972493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.3973077Z return self.act(input) 2025-08-14T21:41:43.3973261Z 2025-08-14T21:41:43.3973430Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3974031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3974571Z return mod(**inputs) 2025-08-14T21:41:43.3975250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3975970Z outputs = self.bert( 2025-08-14T21:41:43.3976648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3977427Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3978145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3978860Z layer_outputs = layer_module( 2025-08-14T21:41:43.3979437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3980031Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3980753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.3981479Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.3982148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.3982826Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.3983590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.3984465Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.3985510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.3986250Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.3986473Z 2025-08-14T21:41:43.3986645Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3987249Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.3987776Z return mod(**inputs) 2025-08-14T21:41:43.3988454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.3989170Z outputs = self.bert( 2025-08-14T21:41:43.3989839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.3990555Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.3991260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.3991967Z layer_outputs = layer_module( 2025-08-14T21:41:43.3992544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.3993228Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.3993952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.3994699Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.3995468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.3996180Z self_outputs = self.self( 2025-08-14T21:41:43.3996805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.3997471Z return func(*args, **kwargs) 2025-08-14T21:41:43.3998163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.3998895Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.3999118Z 2025-08-14T21:41:43.3999295Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.3999890Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4000429Z return mod(**inputs) 2025-08-14T21:41:43.4001110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4001857Z outputs = self.bert( 2025-08-14T21:41:43.4002516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4003230Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4003943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4004642Z layer_outputs = layer_module( 2025-08-14T21:41:43.4005218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4005822Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4006551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4007272Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4008000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4008710Z self_outputs = self.self( 2025-08-14T21:41:43.4009324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4009957Z return func(*args, **kwargs) 2025-08-14T21:41:43.4010653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4011374Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4011590Z 2025-08-14T21:41:43.4011754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4012357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4012895Z return mod(**inputs) 2025-08-14T21:41:43.4013570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4014269Z outputs = self.bert( 2025-08-14T21:41:43.4014933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4015657Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4016354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4017062Z layer_outputs = layer_module( 2025-08-14T21:41:43.4017696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4018301Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4019009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4019761Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4020486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4021199Z self_outputs = self.self( 2025-08-14T21:41:43.4021809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4022428Z return func(*args, **kwargs) 2025-08-14T21:41:43.4023123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4023836Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4024062Z 2025-08-14T21:41:43.4024186Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4024525Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4025001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4025610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4026146Z return mod(**inputs) 2025-08-14T21:41:43.4026816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4027511Z outputs = self.bert( 2025-08-14T21:41:43.4028178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4028894Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4029607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4030309Z layer_outputs = layer_module( 2025-08-14T21:41:43.4030888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4031490Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4032700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4033425Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4034149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4034954Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4035754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4036480Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4036715Z 2025-08-14T21:41:43.4036882Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4037480Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4038019Z return mod(**inputs) 2025-08-14T21:41:43.4038689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4039406Z outputs = self.bert( 2025-08-14T21:41:43.4040071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4040768Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4041518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4042247Z layer_outputs = layer_module( 2025-08-14T21:41:43.4042816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4043457Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4044185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4044918Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4045618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4046285Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4047049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4047864Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4048616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4049348Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4049598Z 2025-08-14T21:41:43.4049776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4050371Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4050907Z return mod(**inputs) 2025-08-14T21:41:43.4051589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4052293Z outputs = self.bert( 2025-08-14T21:41:43.4052955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4053658Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4054371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4055086Z layer_outputs = layer_module( 2025-08-14T21:41:43.4055666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4056268Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4056992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4057715Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4058390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4059057Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4059821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4060632Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4061402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4062187Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4062827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4063389Z return self.act(input) 2025-08-14T21:41:43.4063578Z 2025-08-14T21:41:43.4063746Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4064349Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4064986Z return mod(**inputs) 2025-08-14T21:41:43.4065700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4066409Z outputs = self.bert( 2025-08-14T21:41:43.4067079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4067817Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4068532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4069247Z layer_outputs = layer_module( 2025-08-14T21:41:43.4069837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4070441Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4071166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4071899Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4072558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4073225Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4073992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4074883Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4075688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4076430Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4076651Z 2025-08-14T21:41:43.4076825Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4077432Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4077967Z return mod(**inputs) 2025-08-14T21:41:43.4078650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4079360Z outputs = self.bert( 2025-08-14T21:41:43.4080029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4080751Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4081456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4082178Z layer_outputs = layer_module( 2025-08-14T21:41:43.4082752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4083358Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4084088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4085019Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4085691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4086367Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4087133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4087986Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4088797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4089520Z return input_tensor + hidden_states 2025-08-14T21:41:43.4089737Z 2025-08-14T21:41:43.4089991Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4090590Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4091139Z return mod(**inputs) 2025-08-14T21:41:43.4091866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4092572Z outputs = self.bert( 2025-08-14T21:41:43.4093236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4093941Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4094686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4095394Z layer_outputs = layer_module( 2025-08-14T21:41:43.4095983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4096595Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4097318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4098045Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4098827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4099543Z self_outputs = self.self( 2025-08-14T21:41:43.4100145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4100770Z return func(*args, **kwargs) 2025-08-14T21:41:43.4101458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4102190Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4102413Z 2025-08-14T21:41:43.4102579Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4103184Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4103725Z return mod(**inputs) 2025-08-14T21:41:43.4104403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4105185Z outputs = self.bert( 2025-08-14T21:41:43.4105872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4106575Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4107268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4107976Z layer_outputs = layer_module( 2025-08-14T21:41:43.4108559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4109163Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4109881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4110616Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4111340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4112061Z self_outputs = self.self( 2025-08-14T21:41:43.4112658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4113288Z return func(*args, **kwargs) 2025-08-14T21:41:43.4114026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4114744Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4114971Z 2025-08-14T21:41:43.4115136Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4115766Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4116306Z return mod(**inputs) 2025-08-14T21:41:43.4116968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4117676Z outputs = self.bert( 2025-08-14T21:41:43.4118364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4119075Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4119782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4120490Z layer_outputs = layer_module( 2025-08-14T21:41:43.4121069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4121658Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4122405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4123131Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4123857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4124560Z self_outputs = self.self( 2025-08-14T21:41:43.4125171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4125788Z return func(*args, **kwargs) 2025-08-14T21:41:43.4126473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4127197Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4127432Z 2025-08-14T21:41:43.4127555Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4127896Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4128259Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4128860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4129395Z return mod(**inputs) 2025-08-14T21:41:43.4130066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4130768Z outputs = self.bert( 2025-08-14T21:41:43.4131438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4132155Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4132858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4133580Z layer_outputs = layer_module( 2025-08-14T21:41:43.4134156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4134751Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4135464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4136191Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4136922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4137761Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4138557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4139290Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4139552Z 2025-08-14T21:41:43.4139726Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4140319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4140850Z return mod(**inputs) 2025-08-14T21:41:43.4141541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4142260Z outputs = self.bert( 2025-08-14T21:41:43.4142918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4143639Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4144355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4145158Z layer_outputs = layer_module( 2025-08-14T21:41:43.4145741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4146377Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4147092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4147815Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4148504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4149165Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4149935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4150750Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4151515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4152248Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4152471Z 2025-08-14T21:41:43.4152628Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4153221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4153758Z return mod(**inputs) 2025-08-14T21:41:43.4154443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4155144Z outputs = self.bert( 2025-08-14T21:41:43.4155814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4156530Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4157242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4157955Z layer_outputs = layer_module( 2025-08-14T21:41:43.4158544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4159139Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4159849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4160586Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4161259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4161953Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4162725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4163588Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4164360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4165160Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4165789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4166394Z return self.act(input) 2025-08-14T21:41:43.4166571Z 2025-08-14T21:41:43.4166744Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4167356Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4167889Z return mod(**inputs) 2025-08-14T21:41:43.4168566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4169272Z outputs = self.bert( 2025-08-14T21:41:43.4169934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4170676Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4171387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4172088Z layer_outputs = layer_module( 2025-08-14T21:41:43.4172666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4173265Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4173976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4174707Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4175386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4176053Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4176817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4177667Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4178493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4179223Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4179457Z 2025-08-14T21:41:43.4179632Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4180220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4180753Z return mod(**inputs) 2025-08-14T21:41:43.4181430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4182149Z outputs = self.bert( 2025-08-14T21:41:43.4182817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4183529Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4184243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4185177Z layer_outputs = layer_module( 2025-08-14T21:41:43.4185756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4186444Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4187159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4187934Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4188668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4189380Z self_outputs = self.self( 2025-08-14T21:41:43.4189987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4190673Z return func(*args, **kwargs) 2025-08-14T21:41:43.4191373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4192101Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4192329Z 2025-08-14T21:41:43.4192494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4193090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4193627Z return mod(**inputs) 2025-08-14T21:41:43.4194295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4194435Z outputs = self.bert( 2025-08-14T21:41:43.4194943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4195069Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4195568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4195675Z layer_outputs = layer_module( 2025-08-14T21:41:43.4196078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4196194Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4196698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4196834Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4197333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4197447Z self_outputs = self.self( 2025-08-14T21:41:43.4197872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4197976Z return func(*args, **kwargs) 2025-08-14T21:41:43.4198492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4198613Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4198619Z 2025-08-14T21:41:43.4198790Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4199128Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4199228Z return mod(**inputs) 2025-08-14T21:41:43.4199744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4199845Z outputs = self.bert( 2025-08-14T21:41:43.4200341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4200461Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4200960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4201104Z layer_outputs = layer_module( 2025-08-14T21:41:43.4201484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4201598Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4202125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4202247Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4202744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4202871Z self_outputs = self.self( 2025-08-14T21:41:43.4203294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4203407Z return func(*args, **kwargs) 2025-08-14T21:41:43.4203903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4204024Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4204039Z 2025-08-14T21:41:43.4204162Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4204289Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4204483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4204822Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4204915Z return mod(**inputs) 2025-08-14T21:41:43.4205430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4205524Z outputs = self.bert( 2025-08-14T21:41:43.4206031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4206140Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4206636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4206751Z layer_outputs = layer_module( 2025-08-14T21:41:43.4207126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4207245Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4207749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4207870Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4208376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4208578Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4209076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4209207Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4209212Z 2025-08-14T21:41:43.4209379Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4209722Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4209819Z return mod(**inputs) 2025-08-14T21:41:43.4210318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4210422Z outputs = self.bert( 2025-08-14T21:41:43.4210918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4211032Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4211560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4211664Z layer_outputs = layer_module( 2025-08-14T21:41:43.4212052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4212195Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4212694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4212826Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4213303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4213429Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4213982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4214149Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4214660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4214788Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4214817Z 2025-08-14T21:41:43.4214984Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4215322Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4215417Z return mod(**inputs) 2025-08-14T21:41:43.4215938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4216034Z outputs = self.bert( 2025-08-14T21:41:43.4216538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4216658Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4217154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4217272Z layer_outputs = layer_module( 2025-08-14T21:41:43.4217661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4217776Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4218281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4218407Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4218859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4218983Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4219543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4219711Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4220212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4220391Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4220759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4220864Z return self.act(input) 2025-08-14T21:41:43.4220872Z 2025-08-14T21:41:43.4221039Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4221372Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4221468Z return mod(**inputs) 2025-08-14T21:41:43.4222006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4222101Z outputs = self.bert( 2025-08-14T21:41:43.4222601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4222750Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4223244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4223356Z layer_outputs = layer_module( 2025-08-14T21:41:43.4223754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4223875Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4224390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4224517Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4225069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4225193Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4225759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4225967Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4226450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4226572Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4226586Z 2025-08-14T21:41:43.4226748Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4227073Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4227176Z return mod(**inputs) 2025-08-14T21:41:43.4227676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4227772Z outputs = self.bert( 2025-08-14T21:41:43.4228255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4228364Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4228869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4228974Z layer_outputs = layer_module( 2025-08-14T21:41:43.4229350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4229479Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4229979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4230103Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4230564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4230679Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4231245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4231444Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4231900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4232012Z return input_tensor + hidden_states 2025-08-14T21:41:43.4232059Z 2025-08-14T21:41:43.4232219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4232558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4232699Z return mod(**inputs) 2025-08-14T21:41:43.4233192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4233301Z outputs = self.bert( 2025-08-14T21:41:43.4233801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4233943Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4234441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4234544Z layer_outputs = layer_module( 2025-08-14T21:41:43.4234940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4235055Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4235557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4235716Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4236208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4236324Z self_outputs = self.self( 2025-08-14T21:41:43.4236747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4236852Z return func(*args, **kwargs) 2025-08-14T21:41:43.4237362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4237487Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4237493Z 2025-08-14T21:41:43.4237667Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4237992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4238088Z return mod(**inputs) 2025-08-14T21:41:43.4238591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4238685Z outputs = self.bert( 2025-08-14T21:41:43.4239186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4239304Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4239796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4239912Z layer_outputs = layer_module( 2025-08-14T21:41:43.4240297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4240410Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4240911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4241036Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4241533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4241647Z self_outputs = self.self( 2025-08-14T21:41:43.4242063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4242173Z return func(*args, **kwargs) 2025-08-14T21:41:43.4242691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4242805Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4242811Z 2025-08-14T21:41:43.4242978Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4243331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4243437Z return mod(**inputs) 2025-08-14T21:41:43.4243939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4244036Z outputs = self.bert( 2025-08-14T21:41:43.4244554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4244663Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4245159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4245269Z layer_outputs = layer_module( 2025-08-14T21:41:43.4245642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4245769Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4246275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4246396Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4246902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4247005Z self_outputs = self.self( 2025-08-14T21:41:43.4247428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4247532Z return func(*args, **kwargs) 2025-08-14T21:41:43.4248030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4248157Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4248164Z 2025-08-14T21:41:43.4248285Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4248407Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4248576Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4248910Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4249013Z return mod(**inputs) 2025-08-14T21:41:43.4249513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4249608Z outputs = self.bert( 2025-08-14T21:41:43.4250118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4250224Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4250724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4250832Z layer_outputs = layer_module( 2025-08-14T21:41:43.4251205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4251326Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4251819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4251938Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4252443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4252665Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4253173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4253318Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4253324Z 2025-08-14T21:41:43.4253488Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4253831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4253926Z return mod(**inputs) 2025-08-14T21:41:43.4254455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4254552Z outputs = self.bert( 2025-08-14T21:41:43.4255049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4255166Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4255660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4255765Z layer_outputs = layer_module( 2025-08-14T21:41:43.4256157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4256292Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4256795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4256922Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4257357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4257479Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4258036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4258207Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4258709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4258834Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4258840Z 2025-08-14T21:41:43.4259008Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4259340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4259438Z return mod(**inputs) 2025-08-14T21:41:43.4259936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4260030Z outputs = self.bert( 2025-08-14T21:41:43.4260540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4260646Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4261147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4261262Z layer_outputs = layer_module( 2025-08-14T21:41:43.4261631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4261761Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4262239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4262362Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4262833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4262945Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4263501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4263710Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4264211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4264393Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4264926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4265037Z return self.act(input) 2025-08-14T21:41:43.4265043Z 2025-08-14T21:41:43.4265213Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4265552Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4265655Z return mod(**inputs) 2025-08-14T21:41:43.4266163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4266258Z outputs = self.bert( 2025-08-14T21:41:43.4266765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4266896Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4267391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4267504Z layer_outputs = layer_module( 2025-08-14T21:41:43.4267881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4268003Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4268495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4268623Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4269076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4269191Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4269747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4269959Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4270462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4270593Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4270599Z 2025-08-14T21:41:43.4270759Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4271096Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4271192Z return mod(**inputs) 2025-08-14T21:41:43.4271696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4271803Z outputs = self.bert( 2025-08-14T21:41:43.4272298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4272406Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4272915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4273016Z layer_outputs = layer_module( 2025-08-14T21:41:43.4273439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4273557Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4274051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4274202Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4274697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4274802Z self_outputs = self.self( 2025-08-14T21:41:43.4275221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4275323Z return func(*args, **kwargs) 2025-08-14T21:41:43.4275809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4275923Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4275929Z 2025-08-14T21:41:43.4276068Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4276377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4276474Z return mod(**inputs) 2025-08-14T21:41:43.4276963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4277056Z outputs = self.bert( 2025-08-14T21:41:43.4277515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4277625Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4278076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4278175Z layer_outputs = layer_module( 2025-08-14T21:41:43.4278505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4278597Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4279033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4279140Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4279574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4279664Z self_outputs = self.self( 2025-08-14T21:41:43.4280033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4280153Z return func(*args, **kwargs) 2025-08-14T21:41:43.4280595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4280692Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4280697Z 2025-08-14T21:41:43.4280835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4281137Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4281225Z return mod(**inputs) 2025-08-14T21:41:43.4281716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4281808Z outputs = self.bert( 2025-08-14T21:41:43.4282269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4282371Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4282819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4282968Z layer_outputs = layer_module( 2025-08-14T21:41:43.4283320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4283440Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4283913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4284024Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4284464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4284817Z self_outputs = self.self( 2025-08-14T21:41:43.4285216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4285313Z return func(*args, **kwargs) 2025-08-14T21:41:43.4285726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4285838Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4285845Z 2025-08-14T21:41:43.4285959Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4286066Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4286286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4286604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4286684Z return mod(**inputs) 2025-08-14T21:41:43.4287110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4287189Z outputs = self.bert( 2025-08-14T21:41:43.4287609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4287703Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4288114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4288211Z layer_outputs = layer_module( 2025-08-14T21:41:43.4288527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4288635Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4289040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4289139Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4289558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4289732Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4290144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4290250Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4290257Z 2025-08-14T21:41:43.4290388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4290661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4290733Z return mod(**inputs) 2025-08-14T21:41:43.4291117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4291190Z outputs = self.bert( 2025-08-14T21:41:43.4291460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4291534Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4291861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4291930Z layer_outputs = layer_module( 2025-08-14T21:41:43.4292190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4292295Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4292569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4292655Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4292930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4293012Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4293318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4293415Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4293695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4294509Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4294513Z 2025-08-14T21:41:43.4294618Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4294805Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4294866Z return mod(**inputs) 2025-08-14T21:41:43.4295149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4295211Z outputs = self.bert( 2025-08-14T21:41:43.4295491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4295561Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4295831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4295904Z layer_outputs = layer_module( 2025-08-14T21:41:43.4296121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4296220Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4296628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4296706Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4296961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4297049Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4297382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4297486Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4297759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4297876Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4298078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4298145Z return self.act(input) 2025-08-14T21:41:43.4298148Z 2025-08-14T21:41:43.4298251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4298439Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4298517Z return mod(**inputs) 2025-08-14T21:41:43.4298800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4298862Z outputs = self.bert( 2025-08-14T21:41:43.4299209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4299281Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4299548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4299623Z layer_outputs = layer_module( 2025-08-14T21:41:43.4299849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4299930Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4300202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4300277Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4300525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4300596Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4300925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4301057Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4301328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4301413Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4301416Z 2025-08-14T21:41:43.4301512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4301700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4301770Z return mod(**inputs) 2025-08-14T21:41:43.4302038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4302109Z outputs = self.bert( 2025-08-14T21:41:43.4302378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4302445Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4302723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4302790Z layer_outputs = layer_module( 2025-08-14T21:41:43.4302997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4303078Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4303346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4303429Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4303672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4303744Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4304051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4304176Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4304457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4304544Z return input_tensor + hidden_states 2025-08-14T21:41:43.4304548Z 2025-08-14T21:41:43.4304646Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4304912Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4304994Z return mod(**inputs) 2025-08-14T21:41:43.4305272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4305338Z outputs = self.bert( 2025-08-14T21:41:43.4305614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4305704Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4306032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4306099Z layer_outputs = layer_module( 2025-08-14T21:41:43.4306318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4306391Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4306670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4306765Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4307038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4307113Z self_outputs = self.self( 2025-08-14T21:41:43.4307348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4307417Z return func(*args, **kwargs) 2025-08-14T21:41:43.4307703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4307781Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4307784Z 2025-08-14T21:41:43.4307887Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4308079Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4308141Z return mod(**inputs) 2025-08-14T21:41:43.4308428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4308488Z outputs = self.bert( 2025-08-14T21:41:43.4308757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4308825Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4309088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4309160Z layer_outputs = layer_module( 2025-08-14T21:41:43.4309363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4309437Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4309707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4309783Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4310051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4310116Z self_outputs = self.self( 2025-08-14T21:41:43.4310342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4310413Z return func(*args, **kwargs) 2025-08-14T21:41:43.4310693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4310774Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4310778Z 2025-08-14T21:41:43.4310890Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4311074Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4311139Z return mod(**inputs) 2025-08-14T21:41:43.4311402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4311461Z outputs = self.bert( 2025-08-14T21:41:43.4311743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4311814Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4312084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4312150Z layer_outputs = layer_module( 2025-08-14T21:41:43.4312351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4312431Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4312709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4312789Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4313050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4313113Z self_outputs = self.self( 2025-08-14T21:41:43.4313341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4313406Z return func(*args, **kwargs) 2025-08-14T21:41:43.4313669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4313749Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4313754Z 2025-08-14T21:41:43.4313829Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4313908Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4313999Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4314179Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4314243Z return mod(**inputs) 2025-08-14T21:41:43.4314506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4314565Z outputs = self.bert( 2025-08-14T21:41:43.4314829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4314895Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4315163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4315228Z layer_outputs = layer_module( 2025-08-14T21:41:43.4315428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4315506Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4315767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4315846Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4316107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4316241Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4316513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4316607Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4316613Z 2025-08-14T21:41:43.4316712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4316892Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4316950Z return mod(**inputs) 2025-08-14T21:41:43.4317237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4317298Z outputs = self.bert( 2025-08-14T21:41:43.4317560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4317634Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4317896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4317968Z layer_outputs = layer_module( 2025-08-14T21:41:43.4318170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4318257Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4318524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4318599Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4318834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4318910Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4319199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4319300Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4319559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4319635Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4319638Z 2025-08-14T21:41:43.4319736Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4319914Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4319979Z return mod(**inputs) 2025-08-14T21:41:43.4320240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4320298Z outputs = self.bert( 2025-08-14T21:41:43.4320565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4320631Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4320898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4320964Z layer_outputs = layer_module( 2025-08-14T21:41:43.4321163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4321237Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4321501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4321574Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4321830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4321900Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4322193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4322303Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4322566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4322676Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4322892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4322963Z return self.act(input) 2025-08-14T21:41:43.4322967Z 2025-08-14T21:41:43.4323060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4323241Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4323306Z return mod(**inputs) 2025-08-14T21:41:43.4323572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4323633Z outputs = self.bert( 2025-08-14T21:41:43.4323917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4323983Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4324253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4324317Z layer_outputs = layer_module( 2025-08-14T21:41:43.4324516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4324593Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4324856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4324937Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4325173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4325244Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4325536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4325657Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4325919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4325999Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4326002Z 2025-08-14T21:41:43.4326094Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4326280Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4326338Z return mod(**inputs) 2025-08-14T21:41:43.4326603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4326672Z outputs = self.bert( 2025-08-14T21:41:43.4326930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4327001Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4327264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4327328Z layer_outputs = layer_module( 2025-08-14T21:41:43.4327552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4327626Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4327887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4327985Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4328247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4328317Z self_outputs = self.self( 2025-08-14T21:41:43.4328555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4328620Z return func(*args, **kwargs) 2025-08-14T21:41:43.4328890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4328963Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4328966Z 2025-08-14T21:41:43.4329066Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4329248Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4329308Z return mod(**inputs) 2025-08-14T21:41:43.4329593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4329650Z outputs = self.bert( 2025-08-14T21:41:43.4329911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4329983Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4330243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4330315Z layer_outputs = layer_module( 2025-08-14T21:41:43.4330516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4330587Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4330859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4330934Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4331201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4331264Z self_outputs = self.self( 2025-08-14T21:41:43.4331489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4331558Z return func(*args, **kwargs) 2025-08-14T21:41:43.4331821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4331891Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4331894Z 2025-08-14T21:41:43.4331991Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4332171Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4332238Z return mod(**inputs) 2025-08-14T21:41:43.4332502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4332560Z outputs = self.bert( 2025-08-14T21:41:43.4332829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4332893Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4333176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4333243Z layer_outputs = layer_module( 2025-08-14T21:41:43.4333441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4333536Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4333800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4333872Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4334158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4334225Z self_outputs = self.self( 2025-08-14T21:41:43.4334460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4334521Z return func(*args, **kwargs) 2025-08-14T21:41:43.4334788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4334864Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4334869Z 2025-08-14T21:41:43.4334941Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4335028Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4335127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4335302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4335365Z return mod(**inputs) 2025-08-14T21:41:43.4335629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4335686Z outputs = self.bert( 2025-08-14T21:41:43.4335952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4336018Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4336284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4336349Z layer_outputs = layer_module( 2025-08-14T21:41:43.4336550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4336627Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4336887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4336961Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4337231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4337348Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4337617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4337690Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4337695Z 2025-08-14T21:41:43.4337787Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4337969Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4338026Z return mod(**inputs) 2025-08-14T21:41:43.4338296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4338355Z outputs = self.bert( 2025-08-14T21:41:43.4338614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4338708Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4338971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4339036Z layer_outputs = layer_module( 2025-08-14T21:41:43.4339262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4339332Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4339603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4339679Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4339935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4340012Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4340305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4340407Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4340671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4340761Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4340765Z 2025-08-14T21:41:43.4340866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4341045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4341104Z return mod(**inputs) 2025-08-14T21:41:43.4341374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4341431Z outputs = self.bert( 2025-08-14T21:41:43.4341699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4341779Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4342039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4342113Z layer_outputs = layer_module( 2025-08-14T21:41:43.4342314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4342390Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4342652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4342724Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4342968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4343037Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4343326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4343429Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4343691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4343800Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4343992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4344057Z return self.act(input) 2025-08-14T21:41:43.4344060Z 2025-08-14T21:41:43.4344160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4344339Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4344421Z return mod(**inputs) 2025-08-14T21:41:43.4344687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4344828Z outputs = self.bert( 2025-08-14T21:41:43.4345133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4345201Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4345468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4345559Z layer_outputs = layer_module( 2025-08-14T21:41:43.4345770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4345849Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4346174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4346250Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4346496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4346583Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4346878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4346996Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4347259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4347341Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4347345Z 2025-08-14T21:41:43.4347439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4347624Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4347681Z return mod(**inputs) 2025-08-14T21:41:43.4347942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4348008Z outputs = self.bert( 2025-08-14T21:41:43.4348269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4348334Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4348600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4348662Z layer_outputs = layer_module( 2025-08-14T21:41:43.4348869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4348939Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4349198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4349280Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4349516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4349588Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4349877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4349994Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4350262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4350352Z return input_tensor + hidden_states 2025-08-14T21:41:43.4350356Z 2025-08-14T21:41:43.4350451Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4350636Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4350712Z return mod(**inputs) 2025-08-14T21:41:43.4350982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4351042Z outputs = self.bert( 2025-08-14T21:41:43.4351316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4351391Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4351655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4351725Z layer_outputs = layer_module( 2025-08-14T21:41:43.4351928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4351998Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4352271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4352362Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4352626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4352699Z self_outputs = self.self( 2025-08-14T21:41:43.4352924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4352996Z return func(*args, **kwargs) 2025-08-14T21:41:43.4353260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4353335Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4353338Z 2025-08-14T21:41:43.4353439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4353621Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4353690Z return mod(**inputs) 2025-08-14T21:41:43.4353958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4354020Z outputs = self.bert( 2025-08-14T21:41:43.4354291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4354358Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4354625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4354698Z layer_outputs = layer_module( 2025-08-14T21:41:43.4354900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4354980Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4355244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4355318Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4355592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4355656Z self_outputs = self.self( 2025-08-14T21:41:43.4355887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4355950Z return func(*args, **kwargs) 2025-08-14T21:41:43.4356230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4356310Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4356327Z 2025-08-14T21:41:43.4356423Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4356604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4356670Z return mod(**inputs) 2025-08-14T21:41:43.4356933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4357011Z outputs = self.bert( 2025-08-14T21:41:43.4357275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4357340Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4357610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4357672Z layer_outputs = layer_module( 2025-08-14T21:41:43.4357871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4357968Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4358235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4358318Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4358582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4358647Z self_outputs = self.self( 2025-08-14T21:41:43.4358881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4358947Z return func(*args, **kwargs) 2025-08-14T21:41:43.4359218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4359295Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4359299Z 2025-08-14T21:41:43.4359375Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4359456Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4359549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4359732Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4359800Z return mod(**inputs) 2025-08-14T21:41:43.4360067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4360135Z outputs = self.bert( 2025-08-14T21:41:43.4360400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4360467Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4360739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4360808Z layer_outputs = layer_module( 2025-08-14T21:41:43.4361016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4361088Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4361351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4361432Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4361711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4361827Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4362095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4362189Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4362194Z 2025-08-14T21:41:43.4362291Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4362470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4362526Z return mod(**inputs) 2025-08-14T21:41:43.4362810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4362871Z outputs = self.bert( 2025-08-14T21:41:43.4363139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4363203Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4363465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4363537Z layer_outputs = layer_module( 2025-08-14T21:41:43.4363751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4363821Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4364088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4364165Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4364410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4364477Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4364765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4364866Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4365128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4365211Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4365214Z 2025-08-14T21:41:43.4365304Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4365483Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4365549Z return mod(**inputs) 2025-08-14T21:41:43.4365813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4365873Z outputs = self.bert( 2025-08-14T21:41:43.4366139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4366204Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4366472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4366536Z layer_outputs = layer_module( 2025-08-14T21:41:43.4366737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4366815Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4367076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4367157Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4367414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4367485Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4367784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4367924Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4368185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4368295Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4368502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4368576Z return self.act(input) 2025-08-14T21:41:43.4368580Z 2025-08-14T21:41:43.4368671Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4368852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4368918Z return mod(**inputs) 2025-08-14T21:41:43.4369183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4369251Z outputs = self.bert( 2025-08-14T21:41:43.4369527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4369593Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4369861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4369926Z layer_outputs = layer_module( 2025-08-14T21:41:43.4370126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4370206Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4370467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4370548Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4370785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4370855Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4371152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4371271Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4371541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4371614Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4371619Z 2025-08-14T21:41:43.4371711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4371896Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4371958Z return mod(**inputs) 2025-08-14T21:41:43.4372228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4372289Z outputs = self.bert( 2025-08-14T21:41:43.4372550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4372624Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4372887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4372951Z layer_outputs = layer_module( 2025-08-14T21:41:43.4373183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4373257Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4373527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4373619Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4373882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4373953Z self_outputs = self.self( 2025-08-14T21:41:43.4374191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4374256Z return func(*args, **kwargs) 2025-08-14T21:41:43.4374532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4374604Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4374607Z 2025-08-14T21:41:43.4374704Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4374885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4374962Z return mod(**inputs) 2025-08-14T21:41:43.4375237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4375295Z outputs = self.bert( 2025-08-14T21:41:43.4375568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4375633Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4375898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4375970Z layer_outputs = layer_module( 2025-08-14T21:41:43.4376173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4376242Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4376515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4376590Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4376860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4376925Z self_outputs = self.self( 2025-08-14T21:41:43.4377148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4377221Z return func(*args, **kwargs) 2025-08-14T21:41:43.4377486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4377562Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4377565Z 2025-08-14T21:41:43.4377657Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4377840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4377906Z return mod(**inputs) 2025-08-14T21:41:43.4378174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4378231Z outputs = self.bert( 2025-08-14T21:41:43.4378499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4378564Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4378848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4378915Z layer_outputs = layer_module( 2025-08-14T21:41:43.4379115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4379207Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4379470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4379550Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4379824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4379888Z self_outputs = self.self( 2025-08-14T21:41:43.4380124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4380189Z return func(*args, **kwargs) 2025-08-14T21:41:43.4380451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4380531Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4380535Z 2025-08-14T21:41:43.4380606Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4380700Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4380793Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4380972Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4381039Z return mod(**inputs) 2025-08-14T21:41:43.4381306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4381366Z outputs = self.bert( 2025-08-14T21:41:43.4381638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4381705Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4381979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4382045Z layer_outputs = layer_module( 2025-08-14T21:41:43.4382251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4382328Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4382594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4382676Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4382940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4383057Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4383329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4383406Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4383410Z 2025-08-14T21:41:43.4383507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4383690Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4383749Z return mod(**inputs) 2025-08-14T21:41:43.4384023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4384083Z outputs = self.bert( 2025-08-14T21:41:43.4384346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4384434Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4385011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4385147Z layer_outputs = layer_module( 2025-08-14T21:41:43.4385362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4385440Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4385718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4385822Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4386093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4386165Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4386460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4386568Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4386831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4386945Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4386956Z 2025-08-14T21:41:43.4387050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4387239Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4387309Z return mod(**inputs) 2025-08-14T21:41:43.4387582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4387643Z outputs = self.bert( 2025-08-14T21:41:43.4387921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4387990Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4388266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4388334Z layer_outputs = layer_module( 2025-08-14T21:41:43.4388542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4388621Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4388892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4388969Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4389223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4389292Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4389601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4389699Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4389971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4390082Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4390283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4390356Z return self.act(input) 2025-08-14T21:41:43.4390360Z 2025-08-14T21:41:43.4390454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4390664Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4390734Z return mod(**inputs) 2025-08-14T21:41:43.4391006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4391085Z outputs = self.bert( 2025-08-14T21:41:43.4391362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4391433Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4391708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4391787Z layer_outputs = layer_module( 2025-08-14T21:41:43.4391996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4392077Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4392345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4392428Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4392673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4392761Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4393067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4393189Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4393460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4393539Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4393543Z 2025-08-14T21:41:43.4393638Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4393832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4393890Z return mod(**inputs) 2025-08-14T21:41:43.4394161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4394232Z outputs = self.bert( 2025-08-14T21:41:43.4394500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4394574Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4394844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4394909Z layer_outputs = layer_module( 2025-08-14T21:41:43.4395121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4395191Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4395459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4395543Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4395792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4395867Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4396167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4396288Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4396580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4396655Z return input_tensor + hidden_states 2025-08-14T21:41:43.4396658Z 2025-08-14T21:41:43.4396759Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4396946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4397025Z return mod(**inputs) 2025-08-14T21:41:43.4397305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4397367Z outputs = self.bert( 2025-08-14T21:41:43.4397654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4397734Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4398027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4398102Z layer_outputs = layer_module( 2025-08-14T21:41:43.4398332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4398407Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4398709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4398806Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4399150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4399214Z self_outputs = self.self( 2025-08-14T21:41:43.4399437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4399509Z return func(*args, **kwargs) 2025-08-14T21:41:43.4399773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4399846Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4399856Z 2025-08-14T21:41:43.4399947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4400129Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4400194Z return mod(**inputs) 2025-08-14T21:41:43.4400457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4400516Z outputs = self.bert( 2025-08-14T21:41:43.4400785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4400850Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4401129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4401194Z layer_outputs = layer_module( 2025-08-14T21:41:43.4401393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4401473Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4401738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4401812Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4402081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4402143Z self_outputs = self.self( 2025-08-14T21:41:43.4402372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4402450Z return func(*args, **kwargs) 2025-08-14T21:41:43.4402713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4402789Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4402809Z 2025-08-14T21:41:43.4402904Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4403094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4403153Z return mod(**inputs) 2025-08-14T21:41:43.4403431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4403502Z outputs = self.bert( 2025-08-14T21:41:43.4403765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4403829Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4404096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4404160Z layer_outputs = layer_module( 2025-08-14T21:41:43.4404368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4404454Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4404716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4404796Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4405064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4405132Z self_outputs = self.self( 2025-08-14T21:41:43.4405357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4405420Z return func(*args, **kwargs) 2025-08-14T21:41:43.4405690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4405762Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4405766Z 2025-08-14T21:41:43.4405840Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4405920Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4406013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4406202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4406260Z return mod(**inputs) 2025-08-14T21:41:43.4406525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4406591Z outputs = self.bert( 2025-08-14T21:41:43.4406852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4406919Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4407185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4407250Z layer_outputs = layer_module( 2025-08-14T21:41:43.4407456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4407525Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4407786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4407865Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4408141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4408266Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4408528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4408622Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4408625Z 2025-08-14T21:41:43.4408723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4408904Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4408969Z return mod(**inputs) 2025-08-14T21:41:43.4409256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4409318Z outputs = self.bert( 2025-08-14T21:41:43.4409588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4409653Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4409916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4410005Z layer_outputs = layer_module( 2025-08-14T21:41:43.4410207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4410283Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4410545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4410621Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4410865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4410935Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4411230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4411327Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4411588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4411670Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4411673Z 2025-08-14T21:41:43.4411764Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4411946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4412012Z return mod(**inputs) 2025-08-14T21:41:43.4412274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4412342Z outputs = self.bert( 2025-08-14T21:41:43.4412605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4412673Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4412942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4413007Z layer_outputs = layer_module( 2025-08-14T21:41:43.4413217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4413288Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4413550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4413632Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4413882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4413952Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4414251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4414362Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4414633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4414735Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4414942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4415013Z return self.act(input) 2025-08-14T21:41:43.4415017Z 2025-08-14T21:41:43.4415111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4415302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4415360Z return mod(**inputs) 2025-08-14T21:41:43.4415626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4415709Z outputs = self.bert( 2025-08-14T21:41:43.4415975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4416040Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4416315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4416377Z layer_outputs = layer_module( 2025-08-14T21:41:43.4416589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4416661Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4416921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4417003Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4417238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4417313Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4417598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4417716Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4417981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4418055Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4418058Z 2025-08-14T21:41:43.4418150Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4418337Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4418396Z return mod(**inputs) 2025-08-14T21:41:43.4418671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4418728Z outputs = self.bert( 2025-08-14T21:41:43.4418989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4419063Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4419323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4419395Z layer_outputs = layer_module( 2025-08-14T21:41:43.4419611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4419686Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4419952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4420043Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4420305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4420376Z self_outputs = self.self( 2025-08-14T21:41:43.4420615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4420686Z return func(*args, **kwargs) 2025-08-14T21:41:43.4420949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4421020Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4421023Z 2025-08-14T21:41:43.4421122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4421303Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4421386Z return mod(**inputs) 2025-08-14T21:41:43.4421656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4421713Z outputs = self.bert( 2025-08-14T21:41:43.4421988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4422054Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4422324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4422394Z layer_outputs = layer_module( 2025-08-14T21:41:43.4422598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4422675Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4422942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4423016Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4423288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4423352Z self_outputs = self.self( 2025-08-14T21:41:43.4423583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4423645Z return func(*args, **kwargs) 2025-08-14T21:41:43.4423915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4423993Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4423996Z 2025-08-14T21:41:43.4424089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4424272Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4424336Z return mod(**inputs) 2025-08-14T21:41:43.4424608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4424675Z outputs = self.bert( 2025-08-14T21:41:43.4425011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4425082Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4425376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4425443Z layer_outputs = layer_module( 2025-08-14T21:41:43.4425653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4425742Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4426004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4426084Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4426364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4426429Z self_outputs = self.self( 2025-08-14T21:41:43.4426660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4426724Z return func(*args, **kwargs) 2025-08-14T21:41:43.4426997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4427073Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4427076Z 2025-08-14T21:41:43.4427165Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4427247Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4427341Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4427521Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4427590Z return mod(**inputs) 2025-08-14T21:41:43.4427855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4427922Z outputs = self.bert( 2025-08-14T21:41:43.4428187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4428252Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4428519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4428587Z layer_outputs = layer_module( 2025-08-14T21:41:43.4428794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4428864Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4429124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4429205Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4429467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4429582Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4429848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4429923Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4429928Z 2025-08-14T21:41:43.4430025Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4430202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4430259Z return mod(**inputs) 2025-08-14T21:41:43.4430529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4430587Z outputs = self.bert( 2025-08-14T21:41:43.4430871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4430939Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4431206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4431303Z layer_outputs = layer_module( 2025-08-14T21:41:43.4431506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4431576Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4431840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4431931Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4432177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4432245Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4432534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4432635Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4432896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4432993Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4432996Z 2025-08-14T21:41:43.4433088Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4433268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4433335Z return mod(**inputs) 2025-08-14T21:41:43.4433598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4433658Z outputs = self.bert( 2025-08-14T21:41:43.4433927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4433992Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4434261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4434328Z layer_outputs = layer_module( 2025-08-14T21:41:43.4434530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4434608Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4434868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4434951Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4435190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4435257Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4435556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4435650Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4435916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4436015Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4436208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4436278Z return self.act(input) 2025-08-14T21:41:43.4436281Z 2025-08-14T21:41:43.4436372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4436570Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4436638Z return mod(**inputs) 2025-08-14T21:41:43.4436905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4436989Z outputs = self.bert( 2025-08-14T21:41:43.4437252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4437316Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4437597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4437662Z layer_outputs = layer_module( 2025-08-14T21:41:43.4437863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4437940Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4438204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4438284Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4438521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4438604Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4438902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4439024Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4439295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4439370Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4439373Z 2025-08-14T21:41:43.4439465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4439656Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4439714Z return mod(**inputs) 2025-08-14T21:41:43.4439987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4440047Z outputs = self.bert( 2025-08-14T21:41:43.4440311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4440385Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4440647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4440711Z layer_outputs = layer_module( 2025-08-14T21:41:43.4440920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4440991Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4441258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4441334Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4441572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4441647Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4441940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4442068Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4442354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4442427Z return input_tensor + hidden_states 2025-08-14T21:41:43.4442431Z 2025-08-14T21:41:43.4442529Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4442726Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4442786Z return mod(**inputs) 2025-08-14T21:41:43.4443057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4443116Z outputs = self.bert( 2025-08-14T21:41:43.4443397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4443466Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4443728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4443801Z layer_outputs = layer_module( 2025-08-14T21:41:43.4444002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4444080Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4444359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4444435Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4444706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4444770Z self_outputs = self.self( 2025-08-14T21:41:43.4444994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4445067Z return func(*args, **kwargs) 2025-08-14T21:41:43.4445333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4445413Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4445417Z 2025-08-14T21:41:43.4445511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4445693Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4445759Z return mod(**inputs) 2025-08-14T21:41:43.4446026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4446093Z outputs = self.bert( 2025-08-14T21:41:43.4446356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4446424Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4446695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4446758Z layer_outputs = layer_module( 2025-08-14T21:41:43.4446961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4447041Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4447303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4447385Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4447649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4447711Z self_outputs = self.self( 2025-08-14T21:41:43.4447946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4448024Z return func(*args, **kwargs) 2025-08-14T21:41:43.4448299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4448387Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4448390Z 2025-08-14T21:41:43.4448486Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4448679Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4448737Z return mod(**inputs) 2025-08-14T21:41:43.4449021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4449092Z outputs = self.bert( 2025-08-14T21:41:43.4449358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4449436Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4449704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4449770Z layer_outputs = layer_module( 2025-08-14T21:41:43.4449986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4450073Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4450339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4450410Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4450671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4450738Z self_outputs = self.self( 2025-08-14T21:41:43.4450959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4451022Z return func(*args, **kwargs) 2025-08-14T21:41:43.4451290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4451363Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4451366Z 2025-08-14T21:41:43.4451443Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4451514Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4451607Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4451795Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4451854Z return mod(**inputs) 2025-08-14T21:41:43.4452117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4452184Z outputs = self.bert( 2025-08-14T21:41:43.4452444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4452516Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4452777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4452842Z layer_outputs = layer_module( 2025-08-14T21:41:43.4453049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4453120Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4453389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4453464Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4453752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4453877Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4454138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4454232Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4454243Z 2025-08-14T21:41:43.4454335Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4454517Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4454594Z return mod(**inputs) 2025-08-14T21:41:43.4454858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4454916Z outputs = self.bert( 2025-08-14T21:41:43.4455187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4455253Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4455526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4455610Z layer_outputs = layer_module( 2025-08-14T21:41:43.4455810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4455889Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4456151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4456226Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4456474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4456544Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4456840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4456934Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4457198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4457279Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4457283Z 2025-08-14T21:41:43.4457374Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4457560Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4457618Z return mod(**inputs) 2025-08-14T21:41:43.4457883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4457949Z outputs = self.bert( 2025-08-14T21:41:43.4458211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4458278Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4458550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4458613Z layer_outputs = layer_module( 2025-08-14T21:41:43.4458820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4458890Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4459158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4459239Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4459515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4459591Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4459881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4459992Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4460261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4460377Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4460578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4460640Z return self.act(input) 2025-08-14T21:41:43.4460643Z 2025-08-14T21:41:43.4460736Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4460921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4460979Z return mod(**inputs) 2025-08-14T21:41:43.4461242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4461340Z outputs = self.bert( 2025-08-14T21:41:43.4461602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4461674Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4461939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4462003Z layer_outputs = layer_module( 2025-08-14T21:41:43.4462213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4462285Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4462544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4462629Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4462866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4462941Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4463230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4463348Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4463619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4463694Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4463697Z 2025-08-14T21:41:43.4463796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4463976Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4464038Z return mod(**inputs) 2025-08-14T21:41:43.4464308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4464367Z outputs = self.bert( 2025-08-14T21:41:43.4464634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4464701Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4465029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4465122Z layer_outputs = layer_module( 2025-08-14T21:41:43.4465326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4465398Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4465685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4465762Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4466031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4466108Z self_outputs = self.self( 2025-08-14T21:41:43.4466333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4466404Z return func(*args, **kwargs) 2025-08-14T21:41:43.4466665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4466748Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4466751Z 2025-08-14T21:41:43.4466846Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4467029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4467110Z return mod(**inputs) 2025-08-14T21:41:43.4467378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4467437Z outputs = self.bert( 2025-08-14T21:41:43.4467706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4467772Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4468040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4468104Z layer_outputs = layer_module( 2025-08-14T21:41:43.4468305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4468383Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4468645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4468718Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4468987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4469049Z self_outputs = self.self( 2025-08-14T21:41:43.4469277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4469340Z return func(*args, **kwargs) 2025-08-14T21:41:43.4469603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4469682Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4469686Z 2025-08-14T21:41:43.4469779Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4469968Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4470026Z return mod(**inputs) 2025-08-14T21:41:43.4470288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4470356Z outputs = self.bert( 2025-08-14T21:41:43.4470616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4470681Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4470965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4471032Z layer_outputs = layer_module( 2025-08-14T21:41:43.4471239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4471327Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4471590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4471673Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4471948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4472020Z self_outputs = self.self( 2025-08-14T21:41:43.4472249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4472312Z return func(*args, **kwargs) 2025-08-14T21:41:43.4472580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4472652Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4472678Z 2025-08-14T21:41:43.4472753Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4472834Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4472928Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4473114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4473176Z return mod(**inputs) 2025-08-14T21:41:43.4473439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4473506Z outputs = self.bert( 2025-08-14T21:41:43.4473769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4473841Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4474104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4474171Z layer_outputs = layer_module( 2025-08-14T21:41:43.4474379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4474448Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4474709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4474788Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4475050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4475172Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4475434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4475512Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4475515Z 2025-08-14T21:41:43.4475614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4475794Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4475859Z return mod(**inputs) 2025-08-14T21:41:43.4476124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4476181Z outputs = self.bert( 2025-08-14T21:41:43.4476466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4476536Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4476801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4476888Z layer_outputs = layer_module( 2025-08-14T21:41:43.4477089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4477166Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4477439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4477518Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4477765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4477834Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4478134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4478228Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4478490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4478587Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4478590Z 2025-08-14T21:41:43.4478684Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4478867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4478933Z return mod(**inputs) 2025-08-14T21:41:43.4479198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4479264Z outputs = self.bert( 2025-08-14T21:41:43.4479526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4479590Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4479863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4479928Z layer_outputs = layer_module( 2025-08-14T21:41:43.4480133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4480202Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4480462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4480541Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4480777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4480844Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4481141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4481236Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4481505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4481604Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4481800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4481869Z return self.act(input) 2025-08-14T21:41:43.4481872Z 2025-08-14T21:41:43.4481963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4482165Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4482226Z return mod(**inputs) 2025-08-14T21:41:43.4482488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4482573Z outputs = self.bert( 2025-08-14T21:41:43.4482840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4482905Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4483195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4483261Z layer_outputs = layer_module( 2025-08-14T21:41:43.4483469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4483542Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4483806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4483887Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4484125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4484219Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4484506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4484836Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4485136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4485216Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4485220Z 2025-08-14T21:41:43.4485325Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4485513Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4485575Z return mod(**inputs) 2025-08-14T21:41:43.4485864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4485927Z outputs = self.bert( 2025-08-14T21:41:43.4486192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4486268Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4486531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4486604Z layer_outputs = layer_module( 2025-08-14T21:41:43.4486807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4486879Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4487151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4487229Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4487473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4487550Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4487849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4487978Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4488289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4488363Z return input_tensor + hidden_states 2025-08-14T21:41:43.4488367Z 2025-08-14T21:41:43.4488471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4488687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4488756Z return mod(**inputs) 2025-08-14T21:41:43.4489028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4489089Z outputs = self.bert( 2025-08-14T21:41:43.4489384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4489454Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4489735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4489802Z layer_outputs = layer_module( 2025-08-14T21:41:43.4490010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4490092Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4490383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4490459Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4490736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4490801Z self_outputs = self.self( 2025-08-14T21:41:43.4491039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4491107Z return func(*args, **kwargs) 2025-08-14T21:41:43.4491379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4491462Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4491467Z 2025-08-14T21:41:43.4491563Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4491760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4491820Z return mod(**inputs) 2025-08-14T21:41:43.4492094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4492162Z outputs = self.bert( 2025-08-14T21:41:43.4492432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4492500Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4492781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4492847Z layer_outputs = layer_module( 2025-08-14T21:41:43.4493063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4493139Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4493408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4493492Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4493761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4493827Z self_outputs = self.self( 2025-08-14T21:41:43.4494078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4494145Z return func(*args, **kwargs) 2025-08-14T21:41:43.4494426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4494514Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4494519Z 2025-08-14T21:41:43.4494613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4494806Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4494864Z return mod(**inputs) 2025-08-14T21:41:43.4495161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4495222Z outputs = self.bert( 2025-08-14T21:41:43.4495491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4495564Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4495832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4495898Z layer_outputs = layer_module( 2025-08-14T21:41:43.4496109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4496196Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4496472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4496548Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4496821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4496891Z self_outputs = self.self( 2025-08-14T21:41:43.4497123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4497194Z return func(*args, **kwargs) 2025-08-14T21:41:43.4497466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4497544Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4497547Z 2025-08-14T21:41:43.4497627Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4497698Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4497794Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4497988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4498049Z return mod(**inputs) 2025-08-14T21:41:43.4498329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4498390Z outputs = self.bert( 2025-08-14T21:41:43.4498675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4498751Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4499015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4499080Z layer_outputs = layer_module( 2025-08-14T21:41:43.4499289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4499359Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4499628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4499701Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4499980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4500105Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4500384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4500468Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4500471Z 2025-08-14T21:41:43.4500562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4500739Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4500820Z return mod(**inputs) 2025-08-14T21:41:43.4501083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4501152Z outputs = self.bert( 2025-08-14T21:41:43.4501415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4501480Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4501749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4501830Z layer_outputs = layer_module( 2025-08-14T21:41:43.4502029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4502108Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4502369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4502450Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4502688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4502756Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4503051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4503146Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4503416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4503489Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4503492Z 2025-08-14T21:41:43.4503584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4503769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4503826Z return mod(**inputs) 2025-08-14T21:41:43.4504089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4504154Z outputs = self.bert( 2025-08-14T21:41:43.4504412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4504485Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4504823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4504896Z layer_outputs = layer_module( 2025-08-14T21:41:43.4505111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4505181Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4505448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4505540Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4505779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4505856Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4506165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4506258Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4506533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4506648Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4506853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4506918Z return self.act(input) 2025-08-14T21:41:43.4506922Z 2025-08-14T21:41:43.4507015Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4507208Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4507266Z return mod(**inputs) 2025-08-14T21:41:43.4507545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4507620Z outputs = self.bert( 2025-08-14T21:41:43.4507888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4507959Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4508228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4508293Z layer_outputs = layer_module( 2025-08-14T21:41:43.4508507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4508578Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4508851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4508927Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4509169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4509245Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4509541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4509667Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4509937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4510008Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4510012Z 2025-08-14T21:41:43.4510112Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4510296Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4510354Z return mod(**inputs) 2025-08-14T21:41:43.4510631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4510688Z outputs = self.bert( 2025-08-14T21:41:43.4510962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4511027Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4511294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4511380Z layer_outputs = layer_module( 2025-08-14T21:41:43.4511582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4511658Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4511933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4512007Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4512272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4512349Z self_outputs = self.self( 2025-08-14T21:41:43.4512571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4512641Z return func(*args, **kwargs) 2025-08-14T21:41:43.4512900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4512980Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4512985Z 2025-08-14T21:41:43.4513080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4513260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4513341Z return mod(**inputs) 2025-08-14T21:41:43.4513607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4513673Z outputs = self.bert( 2025-08-14T21:41:43.4513936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4514002Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4514274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4514338Z layer_outputs = layer_module( 2025-08-14T21:41:43.4514540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4514619Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4514884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4514967Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4515230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4515293Z self_outputs = self.self( 2025-08-14T21:41:43.4515525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4515590Z return func(*args, **kwargs) 2025-08-14T21:41:43.4515861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4515932Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4515937Z 2025-08-14T21:41:43.4516034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4516224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4516281Z return mod(**inputs) 2025-08-14T21:41:43.4516547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4516612Z outputs = self.bert( 2025-08-14T21:41:43.4516877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4516964Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4517229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4517293Z layer_outputs = layer_module( 2025-08-14T21:41:43.4517525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4517599Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4517867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4517948Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4518711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4518786Z self_outputs = self.self( 2025-08-14T21:41:43.4519009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4519072Z return func(*args, **kwargs) 2025-08-14T21:41:43.4519342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4519416Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4519436Z 2025-08-14T21:41:43.4519516Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4519587Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4519680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4519871Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4519930Z return mod(**inputs) 2025-08-14T21:41:43.4520196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4520263Z outputs = self.bert( 2025-08-14T21:41:43.4520527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4520600Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4520862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4520927Z layer_outputs = layer_module( 2025-08-14T21:41:43.4521134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4521203Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4521475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4521548Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4521810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4521933Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4522194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4522272Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4522285Z 2025-08-14T21:41:43.4522377Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4522557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4522621Z return mod(**inputs) 2025-08-14T21:41:43.4522883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4522942Z outputs = self.bert( 2025-08-14T21:41:43.4523227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4523293Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4523566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4523649Z layer_outputs = layer_module( 2025-08-14T21:41:43.4523848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4523922Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4524193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4524271Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4524516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4524585Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4524880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4524975Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4525251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4525332Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4525335Z 2025-08-14T21:41:43.4525427Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4525616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4525676Z return mod(**inputs) 2025-08-14T21:41:43.4525942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4526009Z outputs = self.bert( 2025-08-14T21:41:43.4526273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4526346Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4526607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4526674Z layer_outputs = layer_module( 2025-08-14T21:41:43.4526880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4526952Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4527214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4527294Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4527531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4527607Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4527900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4527995Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4528264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4528365Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4528567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4528630Z return self.act(input) 2025-08-14T21:41:43.4528634Z 2025-08-14T21:41:43.4528725Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4528928Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4528988Z return mod(**inputs) 2025-08-14T21:41:43.4529254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4529337Z outputs = self.bert( 2025-08-14T21:41:43.4544647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4544876Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4545303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4545380Z layer_outputs = layer_module( 2025-08-14T21:41:43.4545599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4545694Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4545976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4546069Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4546348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4546422Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4546731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4546855Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4547124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4547211Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4547219Z 2025-08-14T21:41:43.4547323Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4547527Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4547594Z return mod(**inputs) 2025-08-14T21:41:43.4547867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4547939Z outputs = self.bert( 2025-08-14T21:41:43.4548205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4548283Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4548547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4548614Z layer_outputs = layer_module( 2025-08-14T21:41:43.4548833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4548909Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4549176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4549262Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4549501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4549579Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4549874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4549997Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4550299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4550372Z return input_tensor + hidden_states 2025-08-14T21:41:43.4550376Z 2025-08-14T21:41:43.4550502Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4550694Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4550755Z return mod(**inputs) 2025-08-14T21:41:43.4551034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4551095Z outputs = self.bert( 2025-08-14T21:41:43.4551380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4551449Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4551718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4551793Z layer_outputs = layer_module( 2025-08-14T21:41:43.4552001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4552077Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4552370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4552444Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4552713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4552777Z self_outputs = self.self( 2025-08-14T21:41:43.4553004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4553077Z return func(*args, **kwargs) 2025-08-14T21:41:43.4553338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4553417Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4553423Z 2025-08-14T21:41:43.4553520Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4553707Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4553771Z return mod(**inputs) 2025-08-14T21:41:43.4554038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4554096Z outputs = self.bert( 2025-08-14T21:41:43.4554363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4554431Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4554701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4554765Z layer_outputs = layer_module( 2025-08-14T21:41:43.4554967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4555047Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4555307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4555380Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4555648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4555712Z self_outputs = self.self( 2025-08-14T21:41:43.4555971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4556038Z return func(*args, **kwargs) 2025-08-14T21:41:43.4556302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4556400Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4556405Z 2025-08-14T21:41:43.4556499Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4556686Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4556745Z return mod(**inputs) 2025-08-14T21:41:43.4557023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4557091Z outputs = self.bert( 2025-08-14T21:41:43.4557358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4557427Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4557698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4557764Z layer_outputs = layer_module( 2025-08-14T21:41:43.4557994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4558065Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4558330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4558411Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4558677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4558746Z self_outputs = self.self( 2025-08-14T21:41:43.4558975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4559037Z return func(*args, **kwargs) 2025-08-14T21:41:43.4559313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4559387Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4559390Z 2025-08-14T21:41:43.4559464Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4559542Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4559637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4559831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4559890Z return mod(**inputs) 2025-08-14T21:41:43.4560157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4560222Z outputs = self.bert( 2025-08-14T21:41:43.4560487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4560553Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4560828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4560891Z layer_outputs = layer_module( 2025-08-14T21:41:43.4561103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4561174Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4561440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4561519Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4561803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4561934Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4562228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4562306Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4562309Z 2025-08-14T21:41:43.4562409Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4562603Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4562673Z return mod(**inputs) 2025-08-14T21:41:43.4562941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4563001Z outputs = self.bert( 2025-08-14T21:41:43.4563276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4563344Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4563613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4563725Z layer_outputs = layer_module( 2025-08-14T21:41:43.4563927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4564005Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4564266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4564343Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4564593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4564664Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4564962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4565066Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4565330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4565411Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4565415Z 2025-08-14T21:41:43.4565508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4565690Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4565759Z return mod(**inputs) 2025-08-14T21:41:43.4566025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4566093Z outputs = self.bert( 2025-08-14T21:41:43.4566353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4566424Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4566697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4566761Z layer_outputs = layer_module( 2025-08-14T21:41:43.4566974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4567046Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4567311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4567408Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4567646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4567730Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4568035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4568134Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4568422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4568531Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4568728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4568800Z return self.act(input) 2025-08-14T21:41:43.4568805Z 2025-08-14T21:41:43.4568896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4569082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4569143Z return mod(**inputs) 2025-08-14T21:41:43.4569408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4569489Z outputs = self.bert( 2025-08-14T21:41:43.4569753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4569822Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4570095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4570158Z layer_outputs = layer_module( 2025-08-14T21:41:43.4570367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4570437Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4570697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4570781Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4571020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4571096Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4571386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4571506Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4571777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4571849Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4571852Z 2025-08-14T21:41:43.4571950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4572132Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4572193Z return mod(**inputs) 2025-08-14T21:41:43.4572465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4572523Z outputs = self.bert( 2025-08-14T21:41:43.4572787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4572862Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4573136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4573211Z layer_outputs = layer_module( 2025-08-14T21:41:43.4573414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4573500Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4573770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4573843Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4574119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4574192Z self_outputs = self.self( 2025-08-14T21:41:43.4574412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4574484Z return func(*args, **kwargs) 2025-08-14T21:41:43.4574746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4574818Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4574823Z 2025-08-14T21:41:43.4574923Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4575125Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4575191Z return mod(**inputs) 2025-08-14T21:41:43.4575454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4575515Z outputs = self.bert( 2025-08-14T21:41:43.4575785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4575851Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4576112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4576184Z layer_outputs = layer_module( 2025-08-14T21:41:43.4576386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4576466Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4576726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4576798Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4577070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4577133Z self_outputs = self.self( 2025-08-14T21:41:43.4577362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4577426Z return func(*args, **kwargs) 2025-08-14T21:41:43.4577687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4577767Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4577772Z 2025-08-14T21:41:43.4577866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4578045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4578112Z return mod(**inputs) 2025-08-14T21:41:43.4578379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4578439Z outputs = self.bert( 2025-08-14T21:41:43.4578710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4578789Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4579052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4579125Z layer_outputs = layer_module( 2025-08-14T21:41:43.4579343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4579421Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4579680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4579766Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4580036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4580100Z self_outputs = self.self( 2025-08-14T21:41:43.4580327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4580389Z return func(*args, **kwargs) 2025-08-14T21:41:43.4580648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4580743Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4580746Z 2025-08-14T21:41:43.4580818Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4580889Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4580988Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4581172Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4581240Z return mod(**inputs) 2025-08-14T21:41:43.4581504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4581565Z outputs = self.bert( 2025-08-14T21:41:43.4581838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4581907Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4582173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4582247Z layer_outputs = layer_module( 2025-08-14T21:41:43.4582452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4582530Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4582794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4582869Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4583146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4583267Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4583539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4583620Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4583623Z 2025-08-14T21:41:43.4583717Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4583906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4583969Z return mod(**inputs) 2025-08-14T21:41:43.4584242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4584302Z outputs = self.bert( 2025-08-14T21:41:43.4584831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4584953Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4585242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4585340Z layer_outputs = layer_module( 2025-08-14T21:41:43.4585556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4585628Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4585929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4586009Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4586261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4586337Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4586626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4586728Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4587015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4587093Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4587097Z 2025-08-14T21:41:43.4587199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4587386Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4587447Z return mod(**inputs) 2025-08-14T21:41:43.4587728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4587787Z outputs = self.bert( 2025-08-14T21:41:43.4588065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4588133Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4588407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4588480Z layer_outputs = layer_module( 2025-08-14T21:41:43.4588687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4588765Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4589035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4589110Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4589362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4589431Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4589729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4589834Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4590103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4590217Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4590417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4590484Z return self.act(input) 2025-08-14T21:41:43.4590487Z 2025-08-14T21:41:43.4590603Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4590790Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4590859Z return mod(**inputs) 2025-08-14T21:41:43.4591147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4591210Z outputs = self.bert( 2025-08-14T21:41:43.4591487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4591554Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4591836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4591910Z layer_outputs = layer_module( 2025-08-14T21:41:43.4592118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4592196Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4592465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4592543Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4592809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4592876Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4593179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4593301Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4593572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4593655Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4593658Z 2025-08-14T21:41:43.4593753Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4593939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4594009Z return mod(**inputs) 2025-08-14T21:41:43.4594281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4594350Z outputs = self.bert( 2025-08-14T21:41:43.4594619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4594686Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4594961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4595028Z layer_outputs = layer_module( 2025-08-14T21:41:43.4595240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4595312Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4595582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4595665Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4595905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4595975Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4596276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4596397Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4596685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4596760Z return input_tensor + hidden_states 2025-08-14T21:41:43.4596777Z 2025-08-14T21:41:43.4596872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4597067Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4597125Z return mod(**inputs) 2025-08-14T21:41:43.4597401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4597487Z outputs = self.bert( 2025-08-14T21:41:43.4597761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4597833Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4598108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4598172Z layer_outputs = layer_module( 2025-08-14T21:41:43.4598382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4598469Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4598750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4598827Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4599102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4599171Z self_outputs = self.self( 2025-08-14T21:41:43.4599394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4599462Z return func(*args, **kwargs) 2025-08-14T21:41:43.4599722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4599796Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4599801Z 2025-08-14T21:41:43.4599901Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4600080Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4600139Z return mod(**inputs) 2025-08-14T21:41:43.4600410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4600468Z outputs = self.bert( 2025-08-14T21:41:43.4600738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4600804Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4601063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4601135Z layer_outputs = layer_module( 2025-08-14T21:41:43.4601336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4601413Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4601674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4601748Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4602015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4602077Z self_outputs = self.self( 2025-08-14T21:41:43.4602311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4602382Z return func(*args, **kwargs) 2025-08-14T21:41:43.4602648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4602742Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4602746Z 2025-08-14T21:41:43.4602838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4603020Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4603086Z return mod(**inputs) 2025-08-14T21:41:43.4603376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4603445Z outputs = self.bert( 2025-08-14T21:41:43.4603706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4603771Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4604039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4604118Z layer_outputs = layer_module( 2025-08-14T21:41:43.4604320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4604398Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4604659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4604739Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4604998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4605060Z self_outputs = self.self( 2025-08-14T21:41:43.4605288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4605349Z return func(*args, **kwargs) 2025-08-14T21:41:43.4605621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4605694Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4605697Z 2025-08-14T21:41:43.4605770Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4605846Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4605941Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4606119Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4606185Z return mod(**inputs) 2025-08-14T21:41:43.4606450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4606514Z outputs = self.bert( 2025-08-14T21:41:43.4606774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4606839Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4607109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4607172Z layer_outputs = layer_module( 2025-08-14T21:41:43.4607372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4607449Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4607710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4607804Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4608067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4608185Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4608473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4608550Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4608553Z 2025-08-14T21:41:43.4608650Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4608845Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4608907Z return mod(**inputs) 2025-08-14T21:41:43.4609177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4609237Z outputs = self.bert( 2025-08-14T21:41:43.4609500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4609575Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4609837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4609921Z layer_outputs = layer_module( 2025-08-14T21:41:43.4610122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4610192Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4610462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4610536Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4610778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4610846Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4611134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4611238Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4611497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4611577Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4611581Z 2025-08-14T21:41:43.4611672Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4611853Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4611917Z return mod(**inputs) 2025-08-14T21:41:43.4612182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4612240Z outputs = self.bert( 2025-08-14T21:41:43.4612506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4612573Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4612839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4612904Z layer_outputs = layer_module( 2025-08-14T21:41:43.4613104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4613182Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4613455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4613539Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4613775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4613857Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4614157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4614251Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4614526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4614637Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4614829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4614900Z return self.act(input) 2025-08-14T21:41:43.4614903Z 2025-08-14T21:41:43.4614993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4615172Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4615239Z return mod(**inputs) 2025-08-14T21:41:43.4615518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4615583Z outputs = self.bert( 2025-08-14T21:41:43.4615844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4615911Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4616180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4616244Z layer_outputs = layer_module( 2025-08-14T21:41:43.4616443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4616520Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4616778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4616863Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4617098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4617164Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4617460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4617578Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4617848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4617921Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4617924Z 2025-08-14T21:41:43.4618015Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4618204Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4618264Z return mod(**inputs) 2025-08-14T21:41:43.4618525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4618592Z outputs = self.bert( 2025-08-14T21:41:43.4618852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4618924Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4619197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4619264Z layer_outputs = layer_module( 2025-08-14T21:41:43.4619471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4619564Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4619835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4619910Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4620185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4620258Z self_outputs = self.self( 2025-08-14T21:41:43.4620481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4620544Z return func(*args, **kwargs) 2025-08-14T21:41:43.4620811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:41:43.4620884Z query_layer = self.query(hidden_states) 2025-08-14T21:41:43.4620889Z 2025-08-14T21:41:43.4620986Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4621181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4621240Z return mod(**inputs) 2025-08-14T21:41:43.4621512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4621569Z outputs = self.bert( 2025-08-14T21:41:43.4621839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4621904Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4622167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4622237Z layer_outputs = layer_module( 2025-08-14T21:41:43.4622439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4622510Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4622779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4622851Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4623119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4623181Z self_outputs = self.self( 2025-08-14T21:41:43.4623402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4623470Z return func(*args, **kwargs) 2025-08-14T21:41:43.4623732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:41:43.4623809Z key_layer = self.key(current_states) 2025-08-14T21:41:43.4623814Z 2025-08-14T21:41:43.4623905Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4624087Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4624150Z return mod(**inputs) 2025-08-14T21:41:43.4624417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4624475Z outputs = self.bert( 2025-08-14T21:41:43.4624827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4624902Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4625174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4625254Z layer_outputs = layer_module( 2025-08-14T21:41:43.4625454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4625532Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4625793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4625889Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4626153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:41:43.4626217Z self_outputs = self.self( 2025-08-14T21:41:43.4626447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:41:43.4626511Z return func(*args, **kwargs) 2025-08-14T21:41:43.4626773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:41:43.4626874Z value_layer = self.value(current_states) 2025-08-14T21:41:43.4626877Z 2025-08-14T21:41:43.4626951Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4627028Z cudagraph partition due to non gpu ops 2025-08-14T21:41:43.4627121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4627304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4627368Z return mod(**inputs) 2025-08-14T21:41:43.4627637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4627697Z outputs = self.bert( 2025-08-14T21:41:43.4627969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4628035Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4628306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4628370Z layer_outputs = layer_module( 2025-08-14T21:41:43.4628574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4628653Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4628917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:41:43.4628997Z self_attention_outputs = self.attention( 2025-08-14T21:41:43.4629262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:41:43.4629378Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:41:43.4629651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:41:43.4629726Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4629730Z 2025-08-14T21:41:43.4629819Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4630008Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4630067Z return mod(**inputs) 2025-08-14T21:41:43.4630337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4630395Z outputs = self.bert( 2025-08-14T21:41:43.4630677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4630751Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4631030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4631102Z layer_outputs = layer_module( 2025-08-14T21:41:43.4631304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4631373Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4631658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4631736Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4631976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4632051Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4632339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4632441Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4632720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:41:43.4632793Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4632796Z 2025-08-14T21:41:43.4632896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4633075Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4633141Z return mod(**inputs) 2025-08-14T21:41:43.4633405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4633464Z outputs = self.bert( 2025-08-14T21:41:43.4633732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4633798Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4634061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4634131Z layer_outputs = layer_module( 2025-08-14T21:41:43.4634332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4634410Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4634669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4634744Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4634988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4635055Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4635350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:41:43.4635444Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:41:43.4635705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:41:43.4635814Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:41:43.4636007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:41:43.4636069Z return self.act(input) 2025-08-14T21:41:43.4636080Z 2025-08-14T21:41:43.4636184Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4636369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4636435Z return mod(**inputs) 2025-08-14T21:41:43.4636715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4636776Z outputs = self.bert( 2025-08-14T21:41:43.4637038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4637117Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4637392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4637464Z layer_outputs = layer_module( 2025-08-14T21:41:43.4637669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4637739Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4638007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4638100Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4638337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4638411Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4638701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4638827Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4639090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:41:43.4639162Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4639165Z 2025-08-14T21:41:43.4639263Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4639443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4639510Z return mod(**inputs) 2025-08-14T21:41:43.4639773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1064, in forward 2025-08-14T21:41:43.4639831Z outputs = self.bert( 2025-08-14T21:41:43.4640099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:41:43.4640163Z encoder_outputs = self.encoder( 2025-08-14T21:41:43.4640423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:41:43.4640494Z layer_outputs = layer_module( 2025-08-14T21:41:43.4640692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:41:43.4640769Z return super().__call__(*args, **kwargs) 2025-08-14T21:41:43.4641030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:41:43.4641104Z layer_output = apply_chunking_to_forward( 2025-08-14T21:41:43.4641348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:41:43.4641416Z return forward_fn(*input_tensors) 2025-08-14T21:41:43.4641711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:41:43.4641857Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:41:43.4642121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:41:43.4642201Z return input_tensor + hidden_states 2025-08-14T21:41:43.4642218Z 2025-08-14T21:41:43.4642311Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4642498Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4642556Z return mod(**inputs) 2025-08-14T21:41:43.4642834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1082, in forward 2025-08-14T21:41:43.4642930Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:41:43.4643197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 652, in forward 2025-08-14T21:41:43.4643303Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:41:43.4643575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 640, in forward 2025-08-14T21:41:43.4643658Z hidden_states = self.transform(hidden_states) 2025-08-14T21:41:43.4643930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 615, in forward 2025-08-14T21:41:43.4644022Z hidden_states = self.dense(hidden_states) 2025-08-14T21:41:43.4644025Z 2025-08-14T21:41:43.4644117Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4644309Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4644369Z return mod(**inputs) 2025-08-14T21:41:43.4644643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1082, in forward 2025-08-14T21:41:43.4644728Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:41:43.4644994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 652, in forward 2025-08-14T21:41:43.4645103Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:41:43.4645370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 641, in forward 2025-08-14T21:41:43.4645451Z hidden_states = self.decoder(hidden_states) 2025-08-14T21:41:43.4645461Z 2025-08-14T21:41:43.4645551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:41:43.4645735Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:41:43.4645801Z return mod(**inputs) 2025-08-14T21:41:43.4646067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1086, in forward 2025-08-14T21:41:43.4646132Z lm_loss = self.loss_function( 2025-08-14T21:41:43.4646364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 67, in ForCausalLMLoss 2025-08-14T21:41:43.4646526Z loss = fixed_cross_entropy(logits, shift_labels, num_items_in_batch, ignore_index, **kwargs) 2025-08-14T21:41:43.4646766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 36, in fixed_cross_entropy 2025-08-14T21:41:43.4646950Z loss = nn.functional.cross_entropy(source, target, ignore_index=ignore_index, reduction=reduction) 2025-08-14T21:41:43.4646953Z 2025-08-14T21:41:53.3248985Z Compilation time (from dynamo_timed): 21.250251395 2025-08-14T21:41:53.3277556Z pass 2025-08-14T21:41:53.3282773Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:41:53.3284298Z TIMING: _recursive_pre_grad_passes:0.00925 _recursive_joint_graph_passes:0.96051 _recursive_post_grad_passes:0.11949 async_compile.wait:0.75086 code_gen:8.61503 inductor_compile:10.64448 backend_compile:16.37223 gc:0.00042 entire_frame_compile:21.25025 total_wall_time:21.25025 2025-08-14T21:41:53.3285346Z STATS: call_* op count: 723 | FakeTensorMode.__torch_dispatch__:28473 | FakeTensor.__torch_dispatch__:8903 | ProxyTorchDispatchMode.__torch_dispatch__:10946 2025-08-14T21:41:53.3285893Z Dynamo produced 1 graphs covering 723 ops with 0 graph breaks (0 unique) 2025-08-14T21:41:57.6126369Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:41:57.6127416Z from pkg_resources import resource_filename 2025-08-14T21:41:58.1536821Z 2025-08-14T21:42:00.8724672Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:42:00.8728982Z loading model: 0it [00:02, ?it/s] 2025-08-14T21:42:00.8748949Z cpu eval MegatronBertForQuestionAnswering 2025-08-14T21:42:02.1660024Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:42:02.5912308Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:42:03.0365553Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:42:15.8296721Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8297263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8301897Z return mod(**inputs) 2025-08-14T21:42:15.8302538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8303082Z outputs = self.bert( 2025-08-14T21:42:15.8310115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8314122Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8318816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8323021Z layer_outputs = layer_module( 2025-08-14T21:42:15.8327720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8329653Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8333173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8333731Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8338655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8339168Z self_outputs = self.self( 2025-08-14T21:42:15.8339551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8339937Z return func(*args, **kwargs) 2025-08-14T21:42:15.8340345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8340758Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8340894Z 2025-08-14T21:42:15.8341005Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8341356Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8341673Z return mod(**inputs) 2025-08-14T21:42:15.8342055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8342736Z outputs = self.bert( 2025-08-14T21:42:15.8343113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8343571Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8343969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8344369Z layer_outputs = layer_module( 2025-08-14T21:42:15.8344700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8345228Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8345630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8346028Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8346424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8346812Z self_outputs = self.self( 2025-08-14T21:42:15.8347145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8347538Z return func(*args, **kwargs) 2025-08-14T21:42:15.8347916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8348310Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8348435Z 2025-08-14T21:42:15.8348543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8348874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8349172Z return mod(**inputs) 2025-08-14T21:42:15.8349544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8349919Z outputs = self.bert( 2025-08-14T21:42:15.8350282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8350682Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8351066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8351448Z layer_outputs = layer_module( 2025-08-14T21:42:15.8351771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8352105Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8352485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8352884Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8353279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8353666Z self_outputs = self.self( 2025-08-14T21:42:15.8353993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8354339Z return func(*args, **kwargs) 2025-08-14T21:42:15.8354718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8355113Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8355237Z 2025-08-14T21:42:15.8355315Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8355516Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8355741Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8356098Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8356413Z return mod(**inputs) 2025-08-14T21:42:15.8356800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8357219Z outputs = self.bert( 2025-08-14T21:42:15.8357578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8357970Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8358404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8358788Z layer_outputs = layer_module( 2025-08-14T21:42:15.8359110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8359446Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8359835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8360233Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8360644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8361109Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8361543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8361934Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8362070Z 2025-08-14T21:42:15.8362168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8362500Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8362793Z return mod(**inputs) 2025-08-14T21:42:15.8363159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8363541Z outputs = self.bert( 2025-08-14T21:42:15.8363901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8364283Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8364664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8365049Z layer_outputs = layer_module( 2025-08-14T21:42:15.8365371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8365693Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8366079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8366476Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8366841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8367209Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8367624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8368070Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8368479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8368882Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8369014Z 2025-08-14T21:42:15.8369129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8369465Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8369761Z return mod(**inputs) 2025-08-14T21:42:15.8370131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8370534Z outputs = self.bert( 2025-08-14T21:42:15.8370890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8371279Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8371678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8372072Z layer_outputs = layer_module( 2025-08-14T21:42:15.8372391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8372726Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8373112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8373509Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8373895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8374259Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8374675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8375125Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8375550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8375978Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8376334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8376643Z return self.act(input) 2025-08-14T21:42:15.8376756Z 2025-08-14T21:42:15.8376853Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8377186Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8377485Z return mod(**inputs) 2025-08-14T21:42:15.8377848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8378233Z outputs = self.bert( 2025-08-14T21:42:15.8378596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8378980Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8379366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8379750Z layer_outputs = layer_module( 2025-08-14T21:42:15.8380067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8380393Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8380782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8381179Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8381550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8381911Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8382350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8382824Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8383260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8383673Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8383806Z 2025-08-14T21:42:15.8383903Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8384238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8384531Z return mod(**inputs) 2025-08-14T21:42:15.8385185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8385583Z outputs = self.bert( 2025-08-14T21:42:15.8385951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8386346Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8386744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8387172Z layer_outputs = layer_module( 2025-08-14T21:42:15.8387490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8387828Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8388614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8389008Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8389393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8389782Z self_outputs = self.self( 2025-08-14T21:42:15.8390127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8390476Z return func(*args, **kwargs) 2025-08-14T21:42:15.8390846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8391245Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8391371Z 2025-08-14T21:42:15.8391477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8391812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8392106Z return mod(**inputs) 2025-08-14T21:42:15.8392474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8392854Z outputs = self.bert( 2025-08-14T21:42:15.8393212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8393600Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8393985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8394371Z layer_outputs = layer_module( 2025-08-14T21:42:15.8394686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8395018Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8395424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8395826Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8396259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8396651Z self_outputs = self.self( 2025-08-14T21:42:15.8396986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8397351Z return func(*args, **kwargs) 2025-08-14T21:42:15.8397737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8398134Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8398259Z 2025-08-14T21:42:15.8398364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8398727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8399035Z return mod(**inputs) 2025-08-14T21:42:15.8399408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8399794Z outputs = self.bert( 2025-08-14T21:42:15.8400161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8400555Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8400962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8401342Z layer_outputs = layer_module( 2025-08-14T21:42:15.8401663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8401998Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8402391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8402783Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8403179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8403565Z self_outputs = self.self( 2025-08-14T21:42:15.8403897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8404248Z return func(*args, **kwargs) 2025-08-14T21:42:15.8404629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8405039Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8405170Z 2025-08-14T21:42:15.8405245Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8405447Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8405668Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8405992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8406297Z return mod(**inputs) 2025-08-14T21:42:15.8406667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8407051Z outputs = self.bert( 2025-08-14T21:42:15.8407408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8407800Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8408186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8408575Z layer_outputs = layer_module( 2025-08-14T21:42:15.8408888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8409223Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8409631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8410023Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8410417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8410890Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8411322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8411739Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8411878Z 2025-08-14T21:42:15.8411976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8412307Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8412611Z return mod(**inputs) 2025-08-14T21:42:15.8412976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8413362Z outputs = self.bert( 2025-08-14T21:42:15.8413728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8414135Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8414521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8414910Z layer_outputs = layer_module( 2025-08-14T21:42:15.8415238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8415573Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8415970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8416370Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8416745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8417102Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8417518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8417963Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8418372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8418774Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8418916Z 2025-08-14T21:42:15.8419012Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8419345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8419641Z return mod(**inputs) 2025-08-14T21:42:15.8420009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8420395Z outputs = self.bert( 2025-08-14T21:42:15.8420759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8421142Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8421532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8421920Z layer_outputs = layer_module( 2025-08-14T21:42:15.8422233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8422586Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8422993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8423388Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8423770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8424135Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8424548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8425085Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8425501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8425930Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8426290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8426600Z return self.act(input) 2025-08-14T21:42:15.8426714Z 2025-08-14T21:42:15.8426814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8427174Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8427477Z return mod(**inputs) 2025-08-14T21:42:15.8427841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8428229Z outputs = self.bert( 2025-08-14T21:42:15.8428593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8428980Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8429357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8429741Z layer_outputs = layer_module( 2025-08-14T21:42:15.8430064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8430394Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8430789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8431184Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8431558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8431915Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8432333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8432800Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8433240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8433631Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8433766Z 2025-08-14T21:42:15.8433861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8434191Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8434484Z return mod(**inputs) 2025-08-14T21:42:15.8434853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8435236Z outputs = self.bert( 2025-08-14T21:42:15.8435611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8435994Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8436376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8436779Z layer_outputs = layer_module( 2025-08-14T21:42:15.8437096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8437430Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8437823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8438237Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8438602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8438966Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8439377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8439843Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8440276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8440685Z return input_tensor + hidden_states 2025-08-14T21:42:15.8440808Z 2025-08-14T21:42:15.8440913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8441244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8441533Z return mod(**inputs) 2025-08-14T21:42:15.8441896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8442278Z outputs = self.bert( 2025-08-14T21:42:15.8442635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8443172Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8443570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8443962Z layer_outputs = layer_module( 2025-08-14T21:42:15.8444278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8444612Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8445006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8445406Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8445797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8446187Z self_outputs = self.self( 2025-08-14T21:42:15.8446527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8446870Z return func(*args, **kwargs) 2025-08-14T21:42:15.8447249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8447640Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8447767Z 2025-08-14T21:42:15.8447870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8448195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8448497Z return mod(**inputs) 2025-08-14T21:42:15.8448882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8449259Z outputs = self.bert( 2025-08-14T21:42:15.8449641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8450048Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8450434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8450814Z layer_outputs = layer_module( 2025-08-14T21:42:15.8451135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8451513Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8451903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8452291Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8452686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8453073Z self_outputs = self.self( 2025-08-14T21:42:15.8453402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8453762Z return func(*args, **kwargs) 2025-08-14T21:42:15.8454138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8454525Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8454645Z 2025-08-14T21:42:15.8454744Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8455075Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8455377Z return mod(**inputs) 2025-08-14T21:42:15.8455741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8456122Z outputs = self.bert( 2025-08-14T21:42:15.8456486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8456878Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8457254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8457636Z layer_outputs = layer_module( 2025-08-14T21:42:15.8457952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8458286Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8458667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8459058Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8459448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8459826Z self_outputs = self.self( 2025-08-14T21:42:15.8460160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8460500Z return func(*args, **kwargs) 2025-08-14T21:42:15.8460876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8461262Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8461392Z 2025-08-14T21:42:15.8461466Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8461664Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8461880Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8462233Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8462539Z return mod(**inputs) 2025-08-14T21:42:15.8462910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8463304Z outputs = self.bert( 2025-08-14T21:42:15.8463668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8464055Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8464452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8464913Z layer_outputs = layer_module( 2025-08-14T21:42:15.8465239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8465582Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8465973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8466383Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8466792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8467254Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8467682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8468083Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8468220Z 2025-08-14T21:42:15.8468319Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8468653Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8468945Z return mod(**inputs) 2025-08-14T21:42:15.8469309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8469694Z outputs = self.bert( 2025-08-14T21:42:15.8470048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8470436Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8470818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8471204Z layer_outputs = layer_module( 2025-08-14T21:42:15.8471516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8471846Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8472236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8472631Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8472999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8473364Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8473773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8474206Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8474622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8475014Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8475140Z 2025-08-14T21:42:15.8475258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8475590Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8475897Z return mod(**inputs) 2025-08-14T21:42:15.8476286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8476680Z outputs = self.bert( 2025-08-14T21:42:15.8477042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8477437Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8477844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8478228Z layer_outputs = layer_module( 2025-08-14T21:42:15.8478554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8478888Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8479278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8479672Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8480060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8480421Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8480834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8481271Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8481684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8482114Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8482462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8482782Z return self.act(input) 2025-08-14T21:42:15.8482892Z 2025-08-14T21:42:15.8482991Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8483329Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8483622Z return mod(**inputs) 2025-08-14T21:42:15.8483990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8484377Z outputs = self.bert( 2025-08-14T21:42:15.8484900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8485294Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8485686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8486077Z layer_outputs = layer_module( 2025-08-14T21:42:15.8486391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8486725Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8487117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8487516Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8487883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8488248Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8488716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8489182Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8489617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8490044Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8490172Z 2025-08-14T21:42:15.8490274Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8490600Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8490926Z return mod(**inputs) 2025-08-14T21:42:15.8491301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8491685Z outputs = self.bert( 2025-08-14T21:42:15.8492047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8492437Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8492821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8493238Z layer_outputs = layer_module( 2025-08-14T21:42:15.8493556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8493894Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8494290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8494686Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8495087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8495477Z self_outputs = self.self( 2025-08-14T21:42:15.8495822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8496166Z return func(*args, **kwargs) 2025-08-14T21:42:15.8496549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8496954Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8497084Z 2025-08-14T21:42:15.8497190Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8497525Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8497832Z return mod(**inputs) 2025-08-14T21:42:15.8498201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8498583Z outputs = self.bert( 2025-08-14T21:42:15.8498953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8499348Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8499740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8500128Z layer_outputs = layer_module( 2025-08-14T21:42:15.8500453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8500793Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8501182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8501584Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8502002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8502390Z self_outputs = self.self( 2025-08-14T21:42:15.8502716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8503078Z return func(*args, **kwargs) 2025-08-14T21:42:15.8503456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8503848Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8503971Z 2025-08-14T21:42:15.8504082Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8504418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8504764Z return mod(**inputs) 2025-08-14T21:42:15.8505139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8505529Z outputs = self.bert( 2025-08-14T21:42:15.8505894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8506285Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8506690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8507076Z layer_outputs = layer_module( 2025-08-14T21:42:15.8507401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8507728Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8508120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8508518Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8508914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8509295Z self_outputs = self.self( 2025-08-14T21:42:15.8509627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8509971Z return func(*args, **kwargs) 2025-08-14T21:42:15.8510346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8510730Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8510863Z 2025-08-14T21:42:15.8510937Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8511135Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8511344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8511675Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8511975Z return mod(**inputs) 2025-08-14T21:42:15.8512343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8512720Z outputs = self.bert( 2025-08-14T21:42:15.8513084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8513474Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8513849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8514237Z layer_outputs = layer_module( 2025-08-14T21:42:15.8514555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8514888Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8515290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8515688Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8516099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8516535Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8516966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8517376Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8517513Z 2025-08-14T21:42:15.8517616Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8517939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8518240Z return mod(**inputs) 2025-08-14T21:42:15.8518605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8518987Z outputs = self.bert( 2025-08-14T21:42:15.8519346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8519754Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8520134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8520521Z layer_outputs = layer_module( 2025-08-14T21:42:15.8520830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8521158Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8521550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8521942Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8522314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8522680Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8523090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8523525Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8523938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8524331Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8524457Z 2025-08-14T21:42:15.8524558Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8524878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8525176Z return mod(**inputs) 2025-08-14T21:42:15.8525540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8525914Z outputs = self.bert( 2025-08-14T21:42:15.8526278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8526665Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8527041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8527416Z layer_outputs = layer_module( 2025-08-14T21:42:15.8527730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8528084Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8528479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8528885Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8529257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8529622Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8530026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8530479Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8530892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8531314Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8531655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8531966Z return self.act(input) 2025-08-14T21:42:15.8532069Z 2025-08-14T21:42:15.8532172Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8532526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8532817Z return mod(**inputs) 2025-08-14T21:42:15.8533183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8533568Z outputs = self.bert( 2025-08-14T21:42:15.8533924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8534312Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8534695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8535077Z layer_outputs = layer_module( 2025-08-14T21:42:15.8535387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8535721Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8536108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8536501Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8536861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8537222Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8537634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8538092Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8538532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8538930Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8539054Z 2025-08-14T21:42:15.8539158Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8539483Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8539781Z return mod(**inputs) 2025-08-14T21:42:15.8540147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8540529Z outputs = self.bert( 2025-08-14T21:42:15.8540897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8541291Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8541675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8542073Z layer_outputs = layer_module( 2025-08-14T21:42:15.8542400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8542734Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8543136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8543527Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8543897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8544261Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8544673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8545202Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8545649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8546069Z return input_tensor + hidden_states 2025-08-14T21:42:15.8546196Z 2025-08-14T21:42:15.8546298Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8546644Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8546954Z return mod(**inputs) 2025-08-14T21:42:15.8547332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8547717Z outputs = self.bert( 2025-08-14T21:42:15.8548092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8548489Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8548883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8549272Z layer_outputs = layer_module( 2025-08-14T21:42:15.8549603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8549942Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8550335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8550740Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8551143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8551534Z self_outputs = self.self( 2025-08-14T21:42:15.8551874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8552233Z return func(*args, **kwargs) 2025-08-14T21:42:15.8552622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8553015Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8553152Z 2025-08-14T21:42:15.8553254Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8553596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8553900Z return mod(**inputs) 2025-08-14T21:42:15.8554285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8554675Z outputs = self.bert( 2025-08-14T21:42:15.8555039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8555445Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8555825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8556210Z layer_outputs = layer_module( 2025-08-14T21:42:15.8556561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8556890Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8557283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8557678Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8558071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8558445Z self_outputs = self.self( 2025-08-14T21:42:15.8558781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8559141Z return func(*args, **kwargs) 2025-08-14T21:42:15.8559514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8559898Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8560029Z 2025-08-14T21:42:15.8560125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8560452Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8560740Z return mod(**inputs) 2025-08-14T21:42:15.8561107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8561488Z outputs = self.bert( 2025-08-14T21:42:15.8561848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8562230Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8562608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8562993Z layer_outputs = layer_module( 2025-08-14T21:42:15.8563305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8563636Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8564024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8564417Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8564798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8565183Z self_outputs = self.self( 2025-08-14T21:42:15.8565518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8565859Z return func(*args, **kwargs) 2025-08-14T21:42:15.8566226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8566616Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8566737Z 2025-08-14T21:42:15.8566816Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8567006Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8567238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8567574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8567872Z return mod(**inputs) 2025-08-14T21:42:15.8568234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8568639Z outputs = self.bert( 2025-08-14T21:42:15.8569003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8569385Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8569779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8570169Z layer_outputs = layer_module( 2025-08-14T21:42:15.8570490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8570813Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8571201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8571598Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8572000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8572429Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8572864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8573257Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8573383Z 2025-08-14T21:42:15.8573484Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8573810Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8574108Z return mod(**inputs) 2025-08-14T21:42:15.8574472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8574849Z outputs = self.bert( 2025-08-14T21:42:15.8575210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8575596Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8575978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8576359Z layer_outputs = layer_module( 2025-08-14T21:42:15.8576676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8577006Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8577386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8577784Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8578149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8578510Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8578912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8579353Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8579766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8580159Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8580285Z 2025-08-14T21:42:15.8580404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8580737Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8581037Z return mod(**inputs) 2025-08-14T21:42:15.8581414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8581801Z outputs = self.bert( 2025-08-14T21:42:15.8582166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8582575Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8582953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8583338Z layer_outputs = layer_module( 2025-08-14T21:42:15.8583662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8583993Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8584374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8584983Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8585361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8585728Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8586149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8586595Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8587016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8587436Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8587793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8588116Z return self.act(input) 2025-08-14T21:42:15.8588221Z 2025-08-14T21:42:15.8588326Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8588653Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8588953Z return mod(**inputs) 2025-08-14T21:42:15.8589325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8589700Z outputs = self.bert( 2025-08-14T21:42:15.8590063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8590452Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8590834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8591211Z layer_outputs = layer_module( 2025-08-14T21:42:15.8591529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8591866Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8592259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8592648Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8593021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8593382Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8593837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8594309Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8594773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8595168Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8595296Z 2025-08-14T21:42:15.8595390Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8595721Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8596055Z return mod(**inputs) 2025-08-14T21:42:15.8596425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8596804Z outputs = self.bert( 2025-08-14T21:42:15.8597169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8597554Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8597927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8598348Z layer_outputs = layer_module( 2025-08-14T21:42:15.8598670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8599007Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8599393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8599789Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8600189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8600575Z self_outputs = self.self( 2025-08-14T21:42:15.8600908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8601258Z return func(*args, **kwargs) 2025-08-14T21:42:15.8601639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8602030Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8602166Z 2025-08-14T21:42:15.8602263Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8602599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8602900Z return mod(**inputs) 2025-08-14T21:42:15.8603263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8603648Z outputs = self.bert( 2025-08-14T21:42:15.8604015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8604400Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8604787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8605178Z layer_outputs = layer_module( 2025-08-14T21:42:15.8605505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8605837Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8606231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8606629Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8607037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8607417Z self_outputs = self.self( 2025-08-14T21:42:15.8607750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8608124Z return func(*args, **kwargs) 2025-08-14T21:42:15.8608494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8608886Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8609015Z 2025-08-14T21:42:15.8609125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8609456Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8609746Z return mod(**inputs) 2025-08-14T21:42:15.8610117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8610499Z outputs = self.bert( 2025-08-14T21:42:15.8610857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8611257Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8611638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8612018Z layer_outputs = layer_module( 2025-08-14T21:42:15.8612328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8612660Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8613047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8613442Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8613824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8614209Z self_outputs = self.self( 2025-08-14T21:42:15.8614541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8614874Z return func(*args, **kwargs) 2025-08-14T21:42:15.8615248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8615640Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8615763Z 2025-08-14T21:42:15.8615842Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8616033Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8616251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8616587Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8616879Z return mod(**inputs) 2025-08-14T21:42:15.8617246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8617634Z outputs = self.bert( 2025-08-14T21:42:15.8617993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8618374Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8618758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8619148Z layer_outputs = layer_module( 2025-08-14T21:42:15.8619466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8619806Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8620201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8620591Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8620993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8621432Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8621870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8622287Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8622416Z 2025-08-14T21:42:15.8622511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8622844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8623145Z return mod(**inputs) 2025-08-14T21:42:15.8623512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8623893Z outputs = self.bert( 2025-08-14T21:42:15.8624256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8624668Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8625116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8625509Z layer_outputs = layer_module( 2025-08-14T21:42:15.8625835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8626174Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8626564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8626970Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8627348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8627722Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8628135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8628582Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8629001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8629394Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8629532Z 2025-08-14T21:42:15.8629631Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8629968Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8630270Z return mod(**inputs) 2025-08-14T21:42:15.8630634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8631025Z outputs = self.bert( 2025-08-14T21:42:15.8631389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8631781Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8632161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8632553Z layer_outputs = layer_module( 2025-08-14T21:42:15.8632892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8633221Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8633612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8634025Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8634396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8634751Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8635183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8635631Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8636045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8636466Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8636820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8637135Z return self.act(input) 2025-08-14T21:42:15.8637238Z 2025-08-14T21:42:15.8637334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8637688Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8637988Z return mod(**inputs) 2025-08-14T21:42:15.8638355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8638732Z outputs = self.bert( 2025-08-14T21:42:15.8639097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8639488Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8639871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8640250Z layer_outputs = layer_module( 2025-08-14T21:42:15.8640573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8640910Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8641291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8641690Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8642058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8642419Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8642829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8643292Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8643729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8644130Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8644256Z 2025-08-14T21:42:15.8644351Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8644681Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8644977Z return mod(**inputs) 2025-08-14T21:42:15.8645336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8645718Z outputs = self.bert( 2025-08-14T21:42:15.8646098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8646495Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8646878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8647279Z layer_outputs = layer_module( 2025-08-14T21:42:15.8647597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8647926Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8648324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8648724Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8649090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8649447Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8649858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8650319Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8650772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8651154Z return input_tensor + hidden_states 2025-08-14T21:42:15.8651281Z 2025-08-14T21:42:15.8651378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8651709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8652006Z return mod(**inputs) 2025-08-14T21:42:15.8652366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8652749Z outputs = self.bert( 2025-08-14T21:42:15.8653112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8653493Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8653880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8654263Z layer_outputs = layer_module( 2025-08-14T21:42:15.8654581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8654905Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8655291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8655685Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8656076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8656453Z self_outputs = self.self( 2025-08-14T21:42:15.8656787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8657135Z return func(*args, **kwargs) 2025-08-14T21:42:15.8657500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8657895Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8658028Z 2025-08-14T21:42:15.8658126Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8658455Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8658745Z return mod(**inputs) 2025-08-14T21:42:15.8659143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8659529Z outputs = self.bert( 2025-08-14T21:42:15.8659887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8660314Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8660695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8661081Z layer_outputs = layer_module( 2025-08-14T21:42:15.8661408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8661743Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8662133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8662527Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8662913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8663296Z self_outputs = self.self( 2025-08-14T21:42:15.8663628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8663976Z return func(*args, **kwargs) 2025-08-14T21:42:15.8664348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8664801Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8664933Z 2025-08-14T21:42:15.8665039Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8665366Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8665672Z return mod(**inputs) 2025-08-14T21:42:15.8666043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8666427Z outputs = self.bert( 2025-08-14T21:42:15.8666789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8667183Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8667570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8667952Z layer_outputs = layer_module( 2025-08-14T21:42:15.8668276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8668612Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8669001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8669393Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8669789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8670180Z self_outputs = self.self( 2025-08-14T21:42:15.8670511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8670858Z return func(*args, **kwargs) 2025-08-14T21:42:15.8671236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8671631Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8671754Z 2025-08-14T21:42:15.8671828Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8672024Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8672258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8672591Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8672883Z return mod(**inputs) 2025-08-14T21:42:15.8673274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8673662Z outputs = self.bert( 2025-08-14T21:42:15.8674026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8674421Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8674825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8675215Z layer_outputs = layer_module( 2025-08-14T21:42:15.8675527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8675855Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8676244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8676630Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8677038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8677471Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8677904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8678290Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8678422Z 2025-08-14T21:42:15.8678518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8678846Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8679145Z return mod(**inputs) 2025-08-14T21:42:15.8679503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8679883Z outputs = self.bert( 2025-08-14T21:42:15.8680244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8680621Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8680999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8681382Z layer_outputs = layer_module( 2025-08-14T21:42:15.8681699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8682024Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8682413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8682808Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8683175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8683528Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8683935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8684377Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8684925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8685369Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8685506Z 2025-08-14T21:42:15.8685606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8685944Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8686255Z return mod(**inputs) 2025-08-14T21:42:15.8686628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8687017Z outputs = self.bert( 2025-08-14T21:42:15.8687382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8687788Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8688182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8688575Z layer_outputs = layer_module( 2025-08-14T21:42:15.8688893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8689229Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8689623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8690048Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8690411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8690776Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8691194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8691637Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8692045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8692472Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8692827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8693136Z return self.act(input) 2025-08-14T21:42:15.8693247Z 2025-08-14T21:42:15.8693343Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8693674Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8693975Z return mod(**inputs) 2025-08-14T21:42:15.8694336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8694718Z outputs = self.bert( 2025-08-14T21:42:15.8695082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8695471Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8695846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8696231Z layer_outputs = layer_module( 2025-08-14T21:42:15.8696553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8696875Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8697264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8697664Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8698035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8698392Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8698817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8699287Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8699742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8700133Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8700267Z 2025-08-14T21:42:15.8700363Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8700709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8701004Z return mod(**inputs) 2025-08-14T21:42:15.8701372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8701756Z outputs = self.bert( 2025-08-14T21:42:15.8702122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8702501Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8702887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8703307Z layer_outputs = layer_module( 2025-08-14T21:42:15.8703630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8703958Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8704352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8704799Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8705196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8705584Z self_outputs = self.self( 2025-08-14T21:42:15.8705923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8706270Z return func(*args, **kwargs) 2025-08-14T21:42:15.8706641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8707035Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8707161Z 2025-08-14T21:42:15.8707267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8707599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8707893Z return mod(**inputs) 2025-08-14T21:42:15.8708263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8708324Z outputs = self.bert( 2025-08-14T21:42:15.8708596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8708667Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8708931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8709005Z layer_outputs = layer_module( 2025-08-14T21:42:15.8709210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8709290Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8709553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8709647Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8709920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8709983Z self_outputs = self.self( 2025-08-14T21:42:15.8710230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8710297Z return func(*args, **kwargs) 2025-08-14T21:42:15.8710560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8710638Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8710641Z 2025-08-14T21:42:15.8710749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8710935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8711002Z return mod(**inputs) 2025-08-14T21:42:15.8711269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8711335Z outputs = self.bert( 2025-08-14T21:42:15.8711599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8711684Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8711958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8712022Z layer_outputs = layer_module( 2025-08-14T21:42:15.8712235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8712307Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8712571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8712652Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8712915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8712978Z self_outputs = self.self( 2025-08-14T21:42:15.8713210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8713273Z return func(*args, **kwargs) 2025-08-14T21:42:15.8713545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8713619Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8713622Z 2025-08-14T21:42:15.8713695Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8713773Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8713868Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8714052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8714118Z return mod(**inputs) 2025-08-14T21:42:15.8714384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8714453Z outputs = self.bert( 2025-08-14T21:42:15.8714714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8714781Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8715053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8715118Z layer_outputs = layer_module( 2025-08-14T21:42:15.8715327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8715413Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8715676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8715774Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8716040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8716157Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8716442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8716518Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8716521Z 2025-08-14T21:42:15.8716626Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8716813Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8716872Z return mod(**inputs) 2025-08-14T21:42:15.8717145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8717203Z outputs = self.bert( 2025-08-14T21:42:15.8717490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8717557Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8717825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8717903Z layer_outputs = layer_module( 2025-08-14T21:42:15.8718110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8718185Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8718460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8718541Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8718792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8718868Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8719161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8719270Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8719536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8719628Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8719631Z 2025-08-14T21:42:15.8719730Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8719913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8719983Z return mod(**inputs) 2025-08-14T21:42:15.8720251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8720324Z outputs = self.bert( 2025-08-14T21:42:15.8720588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8720656Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8720930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8720998Z layer_outputs = layer_module( 2025-08-14T21:42:15.8721225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8721305Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8721570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8721668Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8721910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8721978Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8722298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8722395Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8722665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8722769Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8722962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8723035Z return self.act(input) 2025-08-14T21:42:15.8723038Z 2025-08-14T21:42:15.8723148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8723329Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8723396Z return mod(**inputs) 2025-08-14T21:42:15.8723661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8723730Z outputs = self.bert( 2025-08-14T21:42:15.8723991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8724060Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8724328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8724392Z layer_outputs = layer_module( 2025-08-14T21:42:15.8724595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8724674Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8724934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8725019Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8725257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8725326Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8725627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8725746Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8726015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8726090Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8726094Z 2025-08-14T21:42:15.8726188Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8726376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8726436Z return mod(**inputs) 2025-08-14T21:42:15.8726709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8726768Z outputs = self.bert( 2025-08-14T21:42:15.8727044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8727117Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8727386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8727471Z layer_outputs = layer_module( 2025-08-14T21:42:15.8727681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8727749Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8728043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8728120Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8728361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8728436Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8728726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8728853Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8729130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8729200Z return input_tensor + hidden_states 2025-08-14T21:42:15.8729203Z 2025-08-14T21:42:15.8729303Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8729487Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8729547Z return mod(**inputs) 2025-08-14T21:42:15.8729821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8729879Z outputs = self.bert( 2025-08-14T21:42:15.8730149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8730215Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8730478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8730549Z layer_outputs = layer_module( 2025-08-14T21:42:15.8730751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8730831Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8731091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8731167Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8731436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8731499Z self_outputs = self.self( 2025-08-14T21:42:15.8731724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8731798Z return func(*args, **kwargs) 2025-08-14T21:42:15.8732059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8732138Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8732143Z 2025-08-14T21:42:15.8732237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8732417Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8732483Z return mod(**inputs) 2025-08-14T21:42:15.8732762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8732835Z outputs = self.bert( 2025-08-14T21:42:15.8733100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8733185Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8733456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8733522Z layer_outputs = layer_module( 2025-08-14T21:42:15.8733738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8733818Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8734081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8734163Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8734427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8734491Z self_outputs = self.self( 2025-08-14T21:42:15.8734740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8734804Z return func(*args, **kwargs) 2025-08-14T21:42:15.8735074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8735146Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8735149Z 2025-08-14T21:42:15.8735242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8735428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8735488Z return mod(**inputs) 2025-08-14T21:42:15.8735752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8735820Z outputs = self.bert( 2025-08-14T21:42:15.8736082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8736157Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8736417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8736482Z layer_outputs = layer_module( 2025-08-14T21:42:15.8736692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8736763Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8737032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8737107Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8737370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8737442Z self_outputs = self.self( 2025-08-14T21:42:15.8737664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8737726Z return func(*args, **kwargs) 2025-08-14T21:42:15.8737997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8738070Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8738074Z 2025-08-14T21:42:15.8738154Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8738241Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8738336Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8738524Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8738601Z return mod(**inputs) 2025-08-14T21:42:15.8738872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8738939Z outputs = self.bert( 2025-08-14T21:42:15.8739205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8739291Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8739555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8739618Z layer_outputs = layer_module( 2025-08-14T21:42:15.8739830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8739899Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8740167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8740266Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8740526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8740650Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8740913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8740987Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8740998Z 2025-08-14T21:42:15.8741090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8741271Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8741335Z return mod(**inputs) 2025-08-14T21:42:15.8741601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8741662Z outputs = self.bert( 2025-08-14T21:42:15.8741932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8741997Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8742265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8742328Z layer_outputs = layer_module( 2025-08-14T21:42:15.8742531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8742608Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8742873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8742948Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8743193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8743261Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8743563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8743656Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8743918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8744013Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8744017Z 2025-08-14T21:42:15.8744110Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8744295Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8744370Z return mod(**inputs) 2025-08-14T21:42:15.8744635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8744702Z outputs = self.bert( 2025-08-14T21:42:15.8745057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8745126Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8745405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8745471Z layer_outputs = layer_module( 2025-08-14T21:42:15.8745682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8745754Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8746019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8746120Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8746362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8746439Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8746731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8746826Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8747101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8747206Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8747410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8747477Z return self.act(input) 2025-08-14T21:42:15.8747481Z 2025-08-14T21:42:15.8747575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8747769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8747828Z return mod(**inputs) 2025-08-14T21:42:15.8748098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8748167Z outputs = self.bert( 2025-08-14T21:42:15.8748431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8748505Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8748770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8748836Z layer_outputs = layer_module( 2025-08-14T21:42:15.8749047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8749117Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8749382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8749464Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8749701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8749791Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8750083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8750201Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8750495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8750569Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8750572Z 2025-08-14T21:42:15.8750670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8750864Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8750925Z return mod(**inputs) 2025-08-14T21:42:15.8751203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8751263Z outputs = self.bert( 2025-08-14T21:42:15.8751536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8751601Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8751868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8751956Z layer_outputs = layer_module( 2025-08-14T21:42:15.8752160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8752233Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8752506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8752581Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8752854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8752917Z self_outputs = self.self( 2025-08-14T21:42:15.8753143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8753217Z return func(*args, **kwargs) 2025-08-14T21:42:15.8753482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8753562Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8753566Z 2025-08-14T21:42:15.8753660Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8753841Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8753908Z return mod(**inputs) 2025-08-14T21:42:15.8754176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8754234Z outputs = self.bert( 2025-08-14T21:42:15.8754506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8754574Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8754848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8754911Z layer_outputs = layer_module( 2025-08-14T21:42:15.8755115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8755191Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8755452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8755548Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8755812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8755873Z self_outputs = self.self( 2025-08-14T21:42:15.8756121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8756187Z return func(*args, **kwargs) 2025-08-14T21:42:15.8756455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8756545Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8756549Z 2025-08-14T21:42:15.8756645Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8756832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8756891Z return mod(**inputs) 2025-08-14T21:42:15.8757162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8757228Z outputs = self.bert( 2025-08-14T21:42:15.8757494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8757575Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8757850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8757914Z layer_outputs = layer_module( 2025-08-14T21:42:15.8758128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8758197Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8758463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8758542Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8758808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8758878Z self_outputs = self.self( 2025-08-14T21:42:15.8759105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8759167Z return func(*args, **kwargs) 2025-08-14T21:42:15.8759444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8759517Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8759520Z 2025-08-14T21:42:15.8759593Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8759672Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8759766Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8759955Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8760013Z return mod(**inputs) 2025-08-14T21:42:15.8760281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8760349Z outputs = self.bert( 2025-08-14T21:42:15.8760613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8760685Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8760952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8761017Z layer_outputs = layer_module( 2025-08-14T21:42:15.8762132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8762211Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8762474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8762577Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8762844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8762969Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8763247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8763325Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8763328Z 2025-08-14T21:42:15.8763431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8763618Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8763688Z return mod(**inputs) 2025-08-14T21:42:15.8763954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8764038Z outputs = self.bert( 2025-08-14T21:42:15.8764314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8764382Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8764650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8764723Z layer_outputs = layer_module( 2025-08-14T21:42:15.8764929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8765007Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8765274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8765348Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8765599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8765670Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8765973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8766069Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8766337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8766418Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8766422Z 2025-08-14T21:42:15.8766516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8766700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8766768Z return mod(**inputs) 2025-08-14T21:42:15.8767037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8767106Z outputs = self.bert( 2025-08-14T21:42:15.8767372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8767440Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8767714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8767779Z layer_outputs = layer_module( 2025-08-14T21:42:15.8768006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8768078Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8768344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8768448Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8768688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8768755Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8769069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8769166Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8769438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8769541Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8769740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8769814Z return self.act(input) 2025-08-14T21:42:15.8769835Z 2025-08-14T21:42:15.8769929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8770117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8770175Z return mod(**inputs) 2025-08-14T21:42:15.8770442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8770509Z outputs = self.bert( 2025-08-14T21:42:15.8770772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8770840Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8771111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8771176Z layer_outputs = layer_module( 2025-08-14T21:42:15.8771384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8771455Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8771715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8771800Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8772037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8772110Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8772400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8772518Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8772790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8772865Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8772868Z 2025-08-14T21:42:15.8772965Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8773145Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8773203Z return mod(**inputs) 2025-08-14T21:42:15.8773474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8773532Z outputs = self.bert( 2025-08-14T21:42:15.8773815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8773888Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8774164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8774237Z layer_outputs = layer_module( 2025-08-14T21:42:15.8774438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8774508Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8774793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8774870Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8775111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8775186Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8775476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8775600Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8775879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8775951Z return input_tensor + hidden_states 2025-08-14T21:42:15.8775954Z 2025-08-14T21:42:15.8776052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8776235Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8776300Z return mod(**inputs) 2025-08-14T21:42:15.8776567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8776626Z outputs = self.bert( 2025-08-14T21:42:15.8776898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8776967Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8777236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8777302Z layer_outputs = layer_module( 2025-08-14T21:42:15.8777503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8777582Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8777842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8777917Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8778187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8778252Z self_outputs = self.self( 2025-08-14T21:42:15.8778483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8778548Z return func(*args, **kwargs) 2025-08-14T21:42:15.8778811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8778893Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8778896Z 2025-08-14T21:42:15.8778988Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8779185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8779244Z return mod(**inputs) 2025-08-14T21:42:15.8779555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8779623Z outputs = self.bert( 2025-08-14T21:42:15.8779903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8779971Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8780240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8780304Z layer_outputs = layer_module( 2025-08-14T21:42:15.8780527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8780600Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8780863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8780944Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8781203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8781266Z self_outputs = self.self( 2025-08-14T21:42:15.8781514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8781577Z return func(*args, **kwargs) 2025-08-14T21:42:15.8781851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8781923Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8781926Z 2025-08-14T21:42:15.8782020Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8782210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8782269Z return mod(**inputs) 2025-08-14T21:42:15.8782543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8782604Z outputs = self.bert( 2025-08-14T21:42:15.8782870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8782943Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8783207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8783272Z layer_outputs = layer_module( 2025-08-14T21:42:15.8783483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8783554Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8783825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8783899Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8784160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8784231Z self_outputs = self.self( 2025-08-14T21:42:15.8784455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8784525Z return func(*args, **kwargs) 2025-08-14T21:42:15.8784993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8785070Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8785073Z 2025-08-14T21:42:15.8785155Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8785268Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8785366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8785556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8785639Z return mod(**inputs) 2025-08-14T21:42:15.8785921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8785980Z outputs = self.bert( 2025-08-14T21:42:15.8786251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8786346Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8786612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8786688Z layer_outputs = layer_module( 2025-08-14T21:42:15.8786894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8786965Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8787237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8787337Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8787602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8787728Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8787992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8788074Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8788077Z 2025-08-14T21:42:15.8788171Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8788352Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8788419Z return mod(**inputs) 2025-08-14T21:42:15.8788687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8788755Z outputs = self.bert( 2025-08-14T21:42:15.8789049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8789115Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8789400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8789468Z layer_outputs = layer_module( 2025-08-14T21:42:15.8789768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8789868Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8790268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8790382Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8790753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8790847Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8791299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8791434Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8791771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8791876Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8791879Z 2025-08-14T21:42:15.8791974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8792167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8792249Z return mod(**inputs) 2025-08-14T21:42:15.8792524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8792594Z outputs = self.bert( 2025-08-14T21:42:15.8792882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8792960Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8793231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8793299Z layer_outputs = layer_module( 2025-08-14T21:42:15.8793513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8793585Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8793879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8794021Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8794267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8794346Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8794648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8794743Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8795018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8795125Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8795332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8795399Z return self.act(input) 2025-08-14T21:42:15.8795402Z 2025-08-14T21:42:15.8795497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8795688Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8795750Z return mod(**inputs) 2025-08-14T21:42:15.8796033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8796093Z outputs = self.bert( 2025-08-14T21:42:15.8796365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8796440Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8796709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8796778Z layer_outputs = layer_module( 2025-08-14T21:42:15.8796993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8797063Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8797338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8797414Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8797679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8797805Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8798183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8798330Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8798665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8798769Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8798773Z 2025-08-14T21:42:15.8798913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8799204Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8799278Z return mod(**inputs) 2025-08-14T21:42:15.8799694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8799769Z outputs = self.bert( 2025-08-14T21:42:15.8800191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8800276Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8800714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8800803Z layer_outputs = layer_module( 2025-08-14T21:42:15.8801120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8801217Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8801626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8801705Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8801985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8802051Z self_outputs = self.self( 2025-08-14T21:42:15.8802342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8802417Z return func(*args, **kwargs) 2025-08-14T21:42:15.8802685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8802765Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8802769Z 2025-08-14T21:42:15.8802864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8803046Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8803113Z return mod(**inputs) 2025-08-14T21:42:15.8803385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8803450Z outputs = self.bert( 2025-08-14T21:42:15.8803717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8803787Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8804060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8804124Z layer_outputs = layer_module( 2025-08-14T21:42:15.8804338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8804415Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8804676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8804776Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8805038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8805119Z self_outputs = self.self( 2025-08-14T21:42:15.8805354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8805417Z return func(*args, **kwargs) 2025-08-14T21:42:15.8805687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8805770Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8805774Z 2025-08-14T21:42:15.8805868Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8806051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8806112Z return mod(**inputs) 2025-08-14T21:42:15.8806376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8806442Z outputs = self.bert( 2025-08-14T21:42:15.8806702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8806790Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8807056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8807120Z layer_outputs = layer_module( 2025-08-14T21:42:15.8807329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8807398Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8807668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8807741Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8808003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8808078Z self_outputs = self.self( 2025-08-14T21:42:15.8808299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8808361Z return func(*args, **kwargs) 2025-08-14T21:42:15.8808635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8808705Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8808709Z 2025-08-14T21:42:15.8808790Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8808861Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8808953Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8809141Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8809201Z return mod(**inputs) 2025-08-14T21:42:15.8809467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8809535Z outputs = self.bert( 2025-08-14T21:42:15.8809798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8809871Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8810133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8810198Z layer_outputs = layer_module( 2025-08-14T21:42:15.8810423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8810495Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8810763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8810854Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8811116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8811236Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8811511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8811588Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8811598Z 2025-08-14T21:42:15.8811688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8811871Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8811937Z return mod(**inputs) 2025-08-14T21:42:15.8812203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8812278Z outputs = self.bert( 2025-08-14T21:42:15.8812550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8812614Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8812886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8812950Z layer_outputs = layer_module( 2025-08-14T21:42:15.8813150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8813227Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8813490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8813567Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8813813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8813883Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8814182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8814277Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8814540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8814623Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8814627Z 2025-08-14T21:42:15.8814720Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8814908Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8814968Z return mod(**inputs) 2025-08-14T21:42:15.8815236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8815304Z outputs = self.bert( 2025-08-14T21:42:15.8815567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8815641Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8815906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8815970Z layer_outputs = layer_module( 2025-08-14T21:42:15.8816198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8816270Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8816535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8816635Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8816875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8816950Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8817263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8817358Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8817628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8817729Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8817931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8818013Z return self.act(input) 2025-08-14T21:42:15.8818017Z 2025-08-14T21:42:15.8818108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8818296Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8818355Z return mod(**inputs) 2025-08-14T21:42:15.8818621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8818687Z outputs = self.bert( 2025-08-14T21:42:15.8818951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8819024Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8819285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8819350Z layer_outputs = layer_module( 2025-08-14T21:42:15.8819558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8819627Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8819896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8819971Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8820206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8820280Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8820570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8820688Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8820959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8821033Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8821037Z 2025-08-14T21:42:15.8821135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8821316Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8821374Z return mod(**inputs) 2025-08-14T21:42:15.8821645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8821719Z outputs = self.bert( 2025-08-14T21:42:15.8821990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8822055Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8822330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8822403Z layer_outputs = layer_module( 2025-08-14T21:42:15.8822604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8822674Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8822957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8823035Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8823287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8823354Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8823648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8823790Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8824050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8824127Z return input_tensor + hidden_states 2025-08-14T21:42:15.8824131Z 2025-08-14T21:42:15.8824226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8824406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8824472Z return mod(**inputs) 2025-08-14T21:42:15.8824826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8824901Z outputs = self.bert( 2025-08-14T21:42:15.8825178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8825251Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8825525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8825592Z layer_outputs = layer_module( 2025-08-14T21:42:15.8825798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8825880Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8826145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8826233Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8826495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8826560Z self_outputs = self.self( 2025-08-14T21:42:15.8826794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8826860Z return func(*args, **kwargs) 2025-08-14T21:42:15.8827123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8827209Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8827212Z 2025-08-14T21:42:15.8827309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8827500Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8827578Z return mod(**inputs) 2025-08-14T21:42:15.8827844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8827912Z outputs = self.bert( 2025-08-14T21:42:15.8828225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8828300Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8828561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8828638Z layer_outputs = layer_module( 2025-08-14T21:42:15.8828853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8828924Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8829191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8829272Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8829537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8829624Z self_outputs = self.self( 2025-08-14T21:42:15.8829848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8829912Z return func(*args, **kwargs) 2025-08-14T21:42:15.8830186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8830256Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8830260Z 2025-08-14T21:42:15.8830359Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8830543Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8830602Z return mod(**inputs) 2025-08-14T21:42:15.8830879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8830938Z outputs = self.bert( 2025-08-14T21:42:15.8831204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8831277Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8831538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8831609Z layer_outputs = layer_module( 2025-08-14T21:42:15.8831810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8831882Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8832153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8832226Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8832496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8832559Z self_outputs = self.self( 2025-08-14T21:42:15.8832780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8832847Z return func(*args, **kwargs) 2025-08-14T21:42:15.8833110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8833182Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8833185Z 2025-08-14T21:42:15.8833276Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8833350Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8833447Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8833624Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8833700Z return mod(**inputs) 2025-08-14T21:42:15.8833973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8834031Z outputs = self.bert( 2025-08-14T21:42:15.8834307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8834382Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8834646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8834718Z layer_outputs = layer_module( 2025-08-14T21:42:15.8834920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8834992Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8835261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8835351Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8835622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8835740Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8836005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8836086Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8836090Z 2025-08-14T21:42:15.8836183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8836369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8836426Z return mod(**inputs) 2025-08-14T21:42:15.8836696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8836764Z outputs = self.bert( 2025-08-14T21:42:15.8837029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8837096Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8837368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8837431Z layer_outputs = layer_module( 2025-08-14T21:42:15.8837644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8837713Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8837976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8838062Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8838303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8838370Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8838670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8838764Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8839051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8839128Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8839131Z 2025-08-14T21:42:15.8839222Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8839433Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8839492Z return mod(**inputs) 2025-08-14T21:42:15.8839763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8839836Z outputs = self.bert( 2025-08-14T21:42:15.8840113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8840190Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8840455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8840528Z layer_outputs = layer_module( 2025-08-14T21:42:15.8840729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8840802Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8841073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8841164Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8841401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8841477Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8841770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8841869Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8842130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8842232Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8842434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8842497Z return self.act(input) 2025-08-14T21:42:15.8842500Z 2025-08-14T21:42:15.8842599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8842779Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8842839Z return mod(**inputs) 2025-08-14T21:42:15.8843109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8843167Z outputs = self.bert( 2025-08-14T21:42:15.8843427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8843500Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8843761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8843835Z layer_outputs = layer_module( 2025-08-14T21:42:15.8844034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8844103Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8844371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8844445Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8844703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8844773Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8845062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8845205Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8845468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8845541Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8845553Z 2025-08-14T21:42:15.8845659Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8845840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8845906Z return mod(**inputs) 2025-08-14T21:42:15.8846170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8846229Z outputs = self.bert( 2025-08-14T21:42:15.8846498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8846564Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8846849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8846913Z layer_outputs = layer_module( 2025-08-14T21:42:15.8847117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8847194Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8847454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8847530Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8847797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8847860Z self_outputs = self.self( 2025-08-14T21:42:15.8848090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8848155Z return func(*args, **kwargs) 2025-08-14T21:42:15.8848416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8848496Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8848501Z 2025-08-14T21:42:15.8848594Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8848782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8848841Z return mod(**inputs) 2025-08-14T21:42:15.8849106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8849173Z outputs = self.bert( 2025-08-14T21:42:15.8849432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8849501Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8849769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8849833Z layer_outputs = layer_module( 2025-08-14T21:42:15.8850043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8850115Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8850391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8853205Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8853487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8854640Z self_outputs = self.self( 2025-08-14T21:42:15.8854881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8854958Z return func(*args, **kwargs) 2025-08-14T21:42:15.8855254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8855338Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8855342Z 2025-08-14T21:42:15.8855442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8855635Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8855707Z return mod(**inputs) 2025-08-14T21:42:15.8856011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8856076Z outputs = self.bert( 2025-08-14T21:42:15.8856357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8856433Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8856697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8856765Z layer_outputs = layer_module( 2025-08-14T21:42:15.8856982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8857056Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8857328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8857405Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8857669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8857742Z self_outputs = self.self( 2025-08-14T21:42:15.8857967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8858032Z return func(*args, **kwargs) 2025-08-14T21:42:15.8858304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8858377Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8858381Z 2025-08-14T21:42:15.8858461Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8858534Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8858629Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8858826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8858886Z return mod(**inputs) 2025-08-14T21:42:15.8859154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8859222Z outputs = self.bert( 2025-08-14T21:42:15.8859486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8859562Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8859825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8859890Z layer_outputs = layer_module( 2025-08-14T21:42:15.8860126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8860251Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8860518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8860615Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8860880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8861006Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8861301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8861386Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8861390Z 2025-08-14T21:42:15.8861488Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8861672Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8861740Z return mod(**inputs) 2025-08-14T21:42:15.8862007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8862070Z outputs = self.bert( 2025-08-14T21:42:15.8862344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8862413Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8862687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8862753Z layer_outputs = layer_module( 2025-08-14T21:42:15.8862958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8863037Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8863302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8863379Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8863635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8863705Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8864008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8864105Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8864369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8864453Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8864458Z 2025-08-14T21:42:15.8864552Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8864852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8864925Z return mod(**inputs) 2025-08-14T21:42:15.8865193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8865260Z outputs = self.bert( 2025-08-14T21:42:15.8865527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8865600Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8865864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8865950Z layer_outputs = layer_module( 2025-08-14T21:42:15.8866164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8866266Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8866527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8866633Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8866876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8866954Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8867262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8867358Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8867631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8867734Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8867934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8868000Z return self.act(input) 2025-08-14T21:42:15.8868004Z 2025-08-14T21:42:15.8868097Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8868284Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8868342Z return mod(**inputs) 2025-08-14T21:42:15.8868610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8868676Z outputs = self.bert( 2025-08-14T21:42:15.8868939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8869014Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8869276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8869341Z layer_outputs = layer_module( 2025-08-14T21:42:15.8869550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8869620Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8869890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8869964Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8870202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8870277Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8870569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8870692Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8870963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8871036Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8871039Z 2025-08-14T21:42:15.8871138Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8871319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8871378Z return mod(**inputs) 2025-08-14T21:42:15.8871650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8871722Z outputs = self.bert( 2025-08-14T21:42:15.8872008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8872074Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8872372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8872444Z layer_outputs = layer_module( 2025-08-14T21:42:15.8872647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8872731Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8873003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8873078Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8873324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8873393Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8873681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8873810Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8874074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8874150Z return input_tensor + hidden_states 2025-08-14T21:42:15.8874155Z 2025-08-14T21:42:15.8874247Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8874426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8874491Z return mod(**inputs) 2025-08-14T21:42:15.8874757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8874818Z outputs = self.bert( 2025-08-14T21:42:15.8875085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8875152Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8875418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8875483Z layer_outputs = layer_module( 2025-08-14T21:42:15.8875689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8875772Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8876034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8876118Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8876382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8876446Z self_outputs = self.self( 2025-08-14T21:42:15.8876677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8876740Z return func(*args, **kwargs) 2025-08-14T21:42:15.8877000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8877082Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8877085Z 2025-08-14T21:42:15.8877177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8877381Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8877442Z return mod(**inputs) 2025-08-14T21:42:15.8877722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8877789Z outputs = self.bert( 2025-08-14T21:42:15.8878069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8878141Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8878403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8878480Z layer_outputs = layer_module( 2025-08-14T21:42:15.8878692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8878761Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8879024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8879107Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8879368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8879441Z self_outputs = self.self( 2025-08-14T21:42:15.8879663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8879726Z return func(*args, **kwargs) 2025-08-14T21:42:15.8879996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8880067Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8880070Z 2025-08-14T21:42:15.8880169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8880351Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8880411Z return mod(**inputs) 2025-08-14T21:42:15.8880682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8880744Z outputs = self.bert( 2025-08-14T21:42:15.8881004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8881078Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8881344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8881415Z layer_outputs = layer_module( 2025-08-14T21:42:15.8881616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8881687Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8881958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8882031Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8882300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8882362Z self_outputs = self.self( 2025-08-14T21:42:15.8882582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8882651Z return func(*args, **kwargs) 2025-08-14T21:42:15.8882915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8882985Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8882994Z 2025-08-14T21:42:15.8883080Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8883168Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8883265Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8883447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8883530Z return mod(**inputs) 2025-08-14T21:42:15.8883802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8883860Z outputs = self.bert( 2025-08-14T21:42:15.8884136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8884211Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8884474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8884547Z layer_outputs = layer_module( 2025-08-14T21:42:15.8885041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8885140Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8885416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8885496Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8885767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8885887Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8886149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8886232Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8886237Z 2025-08-14T21:42:15.8886330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8886521Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8886580Z return mod(**inputs) 2025-08-14T21:42:15.8886846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8886915Z outputs = self.bert( 2025-08-14T21:42:15.8887178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8887246Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8887517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8887582Z layer_outputs = layer_module( 2025-08-14T21:42:15.8887794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8887867Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8888129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8888215Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8888453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8888522Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8888819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8888913Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8889227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8889331Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8889335Z 2025-08-14T21:42:15.8889428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8889620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8889710Z return mod(**inputs) 2025-08-14T21:42:15.8889990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8890053Z outputs = self.bert( 2025-08-14T21:42:15.8890345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8890423Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8890688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8890761Z layer_outputs = layer_module( 2025-08-14T21:42:15.8890965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8891034Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8891304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8891379Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8891619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8891695Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8891985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8892087Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8892350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8892452Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8892657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8892721Z return self.act(input) 2025-08-14T21:42:15.8892725Z 2025-08-14T21:42:15.8892821Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8893003Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8893064Z return mod(**inputs) 2025-08-14T21:42:15.8893336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8893393Z outputs = self.bert( 2025-08-14T21:42:15.8893658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8893734Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8893995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8894070Z layer_outputs = layer_module( 2025-08-14T21:42:15.8894274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8894342Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8894614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8894689Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8894947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8895032Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8895320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8895465Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8895726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8895798Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8895809Z 2025-08-14T21:42:15.8895913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8896096Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8896163Z return mod(**inputs) 2025-08-14T21:42:15.8896427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8896487Z outputs = self.bert( 2025-08-14T21:42:15.8896755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8896820Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8897087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8897150Z layer_outputs = layer_module( 2025-08-14T21:42:15.8897352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8897429Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8897687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8897763Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8898035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8898099Z self_outputs = self.self( 2025-08-14T21:42:15.8898328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8898390Z return func(*args, **kwargs) 2025-08-14T21:42:15.8898652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8898734Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8898737Z 2025-08-14T21:42:15.8898828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8899013Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8899073Z return mod(**inputs) 2025-08-14T21:42:15.8899339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8899405Z outputs = self.bert( 2025-08-14T21:42:15.8899666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8899733Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8900001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8900065Z layer_outputs = layer_module( 2025-08-14T21:42:15.8900275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8900346Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8900621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8900723Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8900991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8901080Z self_outputs = self.self( 2025-08-14T21:42:15.8901304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8901367Z return func(*args, **kwargs) 2025-08-14T21:42:15.8901653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8901726Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8901730Z 2025-08-14T21:42:15.8901825Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8902014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8902097Z return mod(**inputs) 2025-08-14T21:42:15.8902365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8902426Z outputs = self.bert( 2025-08-14T21:42:15.8902704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8902771Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8903043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8903108Z layer_outputs = layer_module( 2025-08-14T21:42:15.8903312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8903392Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8903657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8903732Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8904004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8904068Z self_outputs = self.self( 2025-08-14T21:42:15.8904299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8904361Z return func(*args, **kwargs) 2025-08-14T21:42:15.8904627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8904790Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8904801Z 2025-08-14T21:42:15.8904884Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8904965Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8905061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8905244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8905311Z return mod(**inputs) 2025-08-14T21:42:15.8905579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8905639Z outputs = self.bert( 2025-08-14T21:42:15.8905910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8905978Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8906250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8906315Z layer_outputs = layer_module( 2025-08-14T21:42:15.8906535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8906642Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8906903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8906999Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8907271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8907402Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8907672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8907744Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8907748Z 2025-08-14T21:42:15.8907839Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8908025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8908082Z return mod(**inputs) 2025-08-14T21:42:15.8908350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8908410Z outputs = self.bert( 2025-08-14T21:42:15.8908668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8908737Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8908997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8909060Z layer_outputs = layer_module( 2025-08-14T21:42:15.8909270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8909341Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8909606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8909681Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8909917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8909991Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8910280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8910380Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8910640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8910714Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8910718Z 2025-08-14T21:42:15.8910816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8910992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8911060Z return mod(**inputs) 2025-08-14T21:42:15.8911323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8911381Z outputs = self.bert( 2025-08-14T21:42:15.8911648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8911712Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8911970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8912057Z layer_outputs = layer_module( 2025-08-14T21:42:15.8912273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8912351Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8912615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8912711Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8912956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8913040Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8913341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8913436Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8913701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8913815Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8914014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8914080Z return self.act(input) 2025-08-14T21:42:15.8914083Z 2025-08-14T21:42:15.8914183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8914366Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8914432Z return mod(**inputs) 2025-08-14T21:42:15.8914698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8914757Z outputs = self.bert( 2025-08-14T21:42:15.8915028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8915097Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8915368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8915433Z layer_outputs = layer_module( 2025-08-14T21:42:15.8915633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8915712Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8915976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8916051Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8916297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8916368Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8916668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8916788Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8917053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8917132Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8917135Z 2025-08-14T21:42:15.8917226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8917415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8917473Z return mod(**inputs) 2025-08-14T21:42:15.8917753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8917820Z outputs = self.bert( 2025-08-14T21:42:15.8918100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8918167Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8918452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8918515Z layer_outputs = layer_module( 2025-08-14T21:42:15.8918723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8918809Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8919071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8919154Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8919391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8919468Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8919757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8919876Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8920145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8920217Z return input_tensor + hidden_states 2025-08-14T21:42:15.8920220Z 2025-08-14T21:42:15.8920312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8920500Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8920560Z return mod(**inputs) 2025-08-14T21:42:15.8920831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8920890Z outputs = self.bert( 2025-08-14T21:42:15.8921151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8921225Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8921484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8921553Z layer_outputs = layer_module( 2025-08-14T21:42:15.8921755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8921825Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8922092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8922167Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8922427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8922500Z self_outputs = self.self( 2025-08-14T21:42:15.8922722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8922791Z return func(*args, **kwargs) 2025-08-14T21:42:15.8923051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8923124Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8923128Z 2025-08-14T21:42:15.8923227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8923420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8923501Z return mod(**inputs) 2025-08-14T21:42:15.8923765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8923824Z outputs = self.bert( 2025-08-14T21:42:15.8924106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8924171Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8924444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8924516Z layer_outputs = layer_module( 2025-08-14T21:42:15.8924721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8924798Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8925061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8925136Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8925408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8925471Z self_outputs = self.self( 2025-08-14T21:42:15.8925706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8925771Z return func(*args, **kwargs) 2025-08-14T21:42:15.8926036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8926111Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8926114Z 2025-08-14T21:42:15.8926206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8926388Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8926452Z return mod(**inputs) 2025-08-14T21:42:15.8926717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8926784Z outputs = self.bert( 2025-08-14T21:42:15.8927047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8927111Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8927385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8927448Z layer_outputs = layer_module( 2025-08-14T21:42:15.8927660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8927730Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8927996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8928077Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8928343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8928405Z self_outputs = self.self( 2025-08-14T21:42:15.8928637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8928701Z return func(*args, **kwargs) 2025-08-14T21:42:15.8928973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8929046Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8929073Z 2025-08-14T21:42:15.8929147Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8929243Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8929530Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8929718Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8929797Z return mod(**inputs) 2025-08-14T21:42:15.8930064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8930131Z outputs = self.bert( 2025-08-14T21:42:15.8930406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8930475Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8930752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8930818Z layer_outputs = layer_module( 2025-08-14T21:42:15.8931034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8931106Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8931373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8931457Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8931723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8931847Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8932114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8932190Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8932194Z 2025-08-14T21:42:15.8932294Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8932473Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8932531Z return mod(**inputs) 2025-08-14T21:42:15.8932808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8932867Z outputs = self.bert( 2025-08-14T21:42:15.8933140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8933205Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8933465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8933536Z layer_outputs = layer_module( 2025-08-14T21:42:15.8933738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8933816Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8934078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8934154Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8934403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8934471Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8934768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8934869Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8935146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8935247Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8935250Z 2025-08-14T21:42:15.8935342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8935539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8935607Z return mod(**inputs) 2025-08-14T21:42:15.8935873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8935939Z outputs = self.bert( 2025-08-14T21:42:15.8936212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8936280Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8936550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8936615Z layer_outputs = layer_module( 2025-08-14T21:42:15.8936819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8936897Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8937159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8937239Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8937477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8937545Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8937840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8937935Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8938205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8938307Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8938502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8938572Z return self.act(input) 2025-08-14T21:42:15.8938576Z 2025-08-14T21:42:15.8938667Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8938850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8938917Z return mod(**inputs) 2025-08-14T21:42:15.8939180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8939248Z outputs = self.bert( 2025-08-14T21:42:15.8939510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8939577Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8939848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8939913Z layer_outputs = layer_module( 2025-08-14T21:42:15.8940121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8940192Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8940455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8940538Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8940793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8940879Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8941180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8941315Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8941588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8941663Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8941666Z 2025-08-14T21:42:15.8941778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8941969Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8942027Z return mod(**inputs) 2025-08-14T21:42:15.8942300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8942362Z outputs = self.bert( 2025-08-14T21:42:15.8942626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8942700Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8942965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8943030Z layer_outputs = layer_module( 2025-08-14T21:42:15.8943243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8943315Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8943584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8943660Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8943925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8943996Z self_outputs = self.self( 2025-08-14T21:42:15.8944222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8944293Z return func(*args, **kwargs) 2025-08-14T21:42:15.8944556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8944631Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8944634Z 2025-08-14T21:42:15.8944823Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8945022Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8945107Z return mod(**inputs) 2025-08-14T21:42:15.8945520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8945606Z outputs = self.bert( 2025-08-14T21:42:15.8945891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8945962Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8946221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8946293Z layer_outputs = layer_module( 2025-08-14T21:42:15.8946495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8946572Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8946856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8946946Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8947216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8947296Z self_outputs = self.self( 2025-08-14T21:42:15.8947521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8947590Z return func(*args, **kwargs) 2025-08-14T21:42:15.8947873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8947955Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8947958Z 2025-08-14T21:42:15.8948053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8948236Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8948302Z return mod(**inputs) 2025-08-14T21:42:15.8948567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8948635Z outputs = self.bert( 2025-08-14T21:42:15.8948898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8948963Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8949232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8949296Z layer_outputs = layer_module( 2025-08-14T21:42:15.8949499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8949579Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8949838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8949919Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8950181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8950244Z self_outputs = self.self( 2025-08-14T21:42:15.8950474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8950535Z return func(*args, **kwargs) 2025-08-14T21:42:15.8950805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8950876Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8950879Z 2025-08-14T21:42:15.8950952Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8951030Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8951123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8951303Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8951370Z return mod(**inputs) 2025-08-14T21:42:15.8951634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8951698Z outputs = self.bert( 2025-08-14T21:42:15.8951959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8952024Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8952293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8952372Z layer_outputs = layer_module( 2025-08-14T21:42:15.8952577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8952692Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8952951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8953061Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8953321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8953450Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8953723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8953799Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8953804Z 2025-08-14T21:42:15.8953901Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8954083Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8954141Z return mod(**inputs) 2025-08-14T21:42:15.8954414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8954474Z outputs = self.bert( 2025-08-14T21:42:15.8954736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8954810Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8955073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8955146Z layer_outputs = layer_module( 2025-08-14T21:42:15.8955352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8955424Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8955695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8955773Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8956021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8956090Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8956384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8956488Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8956754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8956837Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8956840Z 2025-08-14T21:42:15.8956931Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8957111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8957178Z return mod(**inputs) 2025-08-14T21:42:15.8957445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8957504Z outputs = self.bert( 2025-08-14T21:42:15.8957777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8957844Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8958129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8958195Z layer_outputs = layer_module( 2025-08-14T21:42:15.8958417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8958497Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8958778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8958860Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8959099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8959183Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8959482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8959577Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8959840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8959950Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8960142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8960213Z return self.act(input) 2025-08-14T21:42:15.8960217Z 2025-08-14T21:42:15.8960321Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8960503Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8960569Z return mod(**inputs) 2025-08-14T21:42:15.8960835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8960900Z outputs = self.bert( 2025-08-14T21:42:15.8961162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8961228Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8961494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8961558Z layer_outputs = layer_module( 2025-08-14T21:42:15.8961760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8961838Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8962101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8962182Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8962420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8962488Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8962785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8962905Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8963174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8963247Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8963251Z 2025-08-14T21:42:15.8963345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8963531Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8963589Z return mod(**inputs) 2025-08-14T21:42:15.8963868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8963979Z outputs = self.bert( 2025-08-14T21:42:15.8964242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8964331Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8964592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8964656Z layer_outputs = layer_module( 2025-08-14T21:42:15.8964882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8964955Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8965226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8965304Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8965538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8965614Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8965904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8966022Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8966292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.8966364Z return input_tensor + hidden_states 2025-08-14T21:42:15.8966367Z 2025-08-14T21:42:15.8966467Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8966648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8966708Z return mod(**inputs) 2025-08-14T21:42:15.8966980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8967039Z outputs = self.bert( 2025-08-14T21:42:15.8967307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8967373Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8967636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8967707Z layer_outputs = layer_module( 2025-08-14T21:42:15.8967909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8967979Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8968247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8968323Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8968590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8968656Z self_outputs = self.self( 2025-08-14T21:42:15.8968883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8968954Z return func(*args, **kwargs) 2025-08-14T21:42:15.8969215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8969293Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8969297Z 2025-08-14T21:42:15.8969389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8969585Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8969666Z return mod(**inputs) 2025-08-14T21:42:15.8969930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8970010Z outputs = self.bert( 2025-08-14T21:42:15.8970281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8970348Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8970635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8970700Z layer_outputs = layer_module( 2025-08-14T21:42:15.8970904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8970985Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8971249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8971331Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8971594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8971658Z self_outputs = self.self( 2025-08-14T21:42:15.8971893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8971955Z return func(*args, **kwargs) 2025-08-14T21:42:15.8972220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8972298Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8972302Z 2025-08-14T21:42:15.8972396Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8972587Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8972645Z return mod(**inputs) 2025-08-14T21:42:15.8972913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8972981Z outputs = self.bert( 2025-08-14T21:42:15.8973247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8973320Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8973586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8973649Z layer_outputs = layer_module( 2025-08-14T21:42:15.8973862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8973934Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8974199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8974282Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8974546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8974616Z self_outputs = self.self( 2025-08-14T21:42:15.8974842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8974905Z return func(*args, **kwargs) 2025-08-14T21:42:15.8975175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8975259Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8975263Z 2025-08-14T21:42:15.8975357Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8975428Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8975521Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8975707Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8975781Z return mod(**inputs) 2025-08-14T21:42:15.8976045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8976113Z outputs = self.bert( 2025-08-14T21:42:15.8976387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8976461Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8976726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8976792Z layer_outputs = layer_module( 2025-08-14T21:42:15.8977004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8977074Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8977339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8977417Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8977679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.8977800Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.8978062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.8978140Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8978145Z 2025-08-14T21:42:15.8978246Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8978426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8978492Z return mod(**inputs) 2025-08-14T21:42:15.8978758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8978816Z outputs = self.bert( 2025-08-14T21:42:15.8979084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8979150Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8979411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8979484Z layer_outputs = layer_module( 2025-08-14T21:42:15.8979687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8979764Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8980024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8980100Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8980346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8980415Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8980713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8980807Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8981083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.8981182Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8981185Z 2025-08-14T21:42:15.8981278Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8981485Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8981543Z return mod(**inputs) 2025-08-14T21:42:15.8981806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8981887Z outputs = self.bert( 2025-08-14T21:42:15.8982148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8982213Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8982479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8982545Z layer_outputs = layer_module( 2025-08-14T21:42:15.8982754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8982826Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8983084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8983165Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8983403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8983478Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8983769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.8983861Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.8984133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.8984236Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.8984429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.8984499Z return self.act(input) 2025-08-14T21:42:15.8984503Z 2025-08-14T21:42:15.8984788Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8985042Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8985104Z return mod(**inputs) 2025-08-14T21:42:15.8985380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8985453Z outputs = self.bert( 2025-08-14T21:42:15.8985724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8985802Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8986075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8986142Z layer_outputs = layer_module( 2025-08-14T21:42:15.8986361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8986436Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8986709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.8986796Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.8987083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.8987187Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.8987476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.8988350Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.8988620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.8988725Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.8988729Z 2025-08-14T21:42:15.8988831Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8989016Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8989077Z return mod(**inputs) 2025-08-14T21:42:15.8989352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8989418Z outputs = self.bert( 2025-08-14T21:42:15.8989683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8989760Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8990024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8990098Z layer_outputs = layer_module( 2025-08-14T21:42:15.8990303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8990374Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8990643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8990719Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8990987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8991051Z self_outputs = self.self( 2025-08-14T21:42:15.8991277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8991346Z return func(*args, **kwargs) 2025-08-14T21:42:15.8991610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.8991683Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.8991693Z 2025-08-14T21:42:15.8991788Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8991970Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8992038Z return mod(**inputs) 2025-08-14T21:42:15.8992303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8992363Z outputs = self.bert( 2025-08-14T21:42:15.8992635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8992703Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8992973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8993039Z layer_outputs = layer_module( 2025-08-14T21:42:15.8993245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8993323Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8993601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8993700Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8993969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8994047Z self_outputs = self.self( 2025-08-14T21:42:15.8994279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8994342Z return func(*args, **kwargs) 2025-08-14T21:42:15.8994620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.8994700Z key_layer = self.key(current_states) 2025-08-14T21:42:15.8994704Z 2025-08-14T21:42:15.8994796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8994984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8995043Z return mod(**inputs) 2025-08-14T21:42:15.8995309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8995375Z outputs = self.bert( 2025-08-14T21:42:15.8995636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8995701Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8995971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8996035Z layer_outputs = layer_module( 2025-08-14T21:42:15.8996242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8996313Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8996578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8996658Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.8996922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.8996990Z self_outputs = self.self( 2025-08-14T21:42:15.8997212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.8997275Z return func(*args, **kwargs) 2025-08-14T21:42:15.8997555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.8997625Z value_layer = self.value(current_states) 2025-08-14T21:42:15.8997629Z 2025-08-14T21:42:15.8997701Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8997779Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.8997871Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.8998056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.8998115Z return mod(**inputs) 2025-08-14T21:42:15.8998379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.8998444Z outputs = self.bert( 2025-08-14T21:42:15.8998705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.8998772Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.8999040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.8999117Z layer_outputs = layer_module( 2025-08-14T21:42:15.8999340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.8999410Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.8999691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.8999770Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9000033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.9000168Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.9000436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.9000511Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9000515Z 2025-08-14T21:42:15.9000617Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9000799Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9000859Z return mod(**inputs) 2025-08-14T21:42:15.9001135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9001193Z outputs = self.bert( 2025-08-14T21:42:15.9001467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9001536Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9001799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9001873Z layer_outputs = layer_module( 2025-08-14T21:42:15.9002078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9002157Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9002420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9002499Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9002746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9002816Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9003113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9003216Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9003480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.9003563Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9003566Z 2025-08-14T21:42:15.9003662Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9003843Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9003912Z return mod(**inputs) 2025-08-14T21:42:15.9004177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9004244Z outputs = self.bert( 2025-08-14T21:42:15.9004509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9004575Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9004860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9004942Z layer_outputs = layer_module( 2025-08-14T21:42:15.9005143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9005222Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9005507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9005588Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9005839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9005910Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9006205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9006300Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9006572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.9006673Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.9006868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.9006940Z return self.act(input) 2025-08-14T21:42:15.9006943Z 2025-08-14T21:42:15.9007035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9007223Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9007281Z return mod(**inputs) 2025-08-14T21:42:15.9007544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9007611Z outputs = self.bert( 2025-08-14T21:42:15.9007873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9007942Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9008211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9008276Z layer_outputs = layer_module( 2025-08-14T21:42:15.9008485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9008556Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9008817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9008900Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9009135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9009205Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9009499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.9009620Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.9009891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.9009964Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9009968Z 2025-08-14T21:42:15.9010060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9010247Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9010304Z return mod(**inputs) 2025-08-14T21:42:15.9010585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9010663Z outputs = self.bert( 2025-08-14T21:42:15.9010931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9011019Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9011285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9011355Z layer_outputs = layer_module( 2025-08-14T21:42:15.9011570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9011642Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9011909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9011983Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9012220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9012295Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9012584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.9012707Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.9012971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.9013040Z return input_tensor + hidden_states 2025-08-14T21:42:15.9013044Z 2025-08-14T21:42:15.9013143Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9013324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9013392Z return mod(**inputs) 2025-08-14T21:42:15.9013658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9013716Z outputs = self.bert( 2025-08-14T21:42:15.9013986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9014052Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9014313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9014385Z layer_outputs = layer_module( 2025-08-14T21:42:15.9014585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9014664Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9014929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9015005Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9015274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9015338Z self_outputs = self.self( 2025-08-14T21:42:15.9015566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9015629Z return func(*args, **kwargs) 2025-08-14T21:42:15.9015890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.9015968Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.9015972Z 2025-08-14T21:42:15.9016064Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9016269Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9016352Z return mod(**inputs) 2025-08-14T21:42:15.9016624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9016706Z outputs = self.bert( 2025-08-14T21:42:15.9016970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9017036Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9017325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9017391Z layer_outputs = layer_module( 2025-08-14T21:42:15.9017601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9017674Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9017939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9018022Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9018288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9018352Z self_outputs = self.self( 2025-08-14T21:42:15.9018583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9018650Z return func(*args, **kwargs) 2025-08-14T21:42:15.9018920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.9018991Z key_layer = self.key(current_states) 2025-08-14T21:42:15.9018995Z 2025-08-14T21:42:15.9019089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9019277Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9019335Z return mod(**inputs) 2025-08-14T21:42:15.9019619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9019679Z outputs = self.bert( 2025-08-14T21:42:15.9019941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9020015Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9020277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9020342Z layer_outputs = layer_module( 2025-08-14T21:42:15.9020555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9020628Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9020900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9020975Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9021237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9021308Z self_outputs = self.self( 2025-08-14T21:42:15.9021531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9021601Z return func(*args, **kwargs) 2025-08-14T21:42:15.9021863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.9021951Z value_layer = self.value(current_states) 2025-08-14T21:42:15.9021969Z 2025-08-14T21:42:15.9022053Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.9022123Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.9022216Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9022422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9022481Z return mod(**inputs) 2025-08-14T21:42:15.9022755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9022813Z outputs = self.bert( 2025-08-14T21:42:15.9023092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9023168Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9023435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9023500Z layer_outputs = layer_module( 2025-08-14T21:42:15.9023710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9023782Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9024055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9024129Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9024393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.9024517Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.9024851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.9024941Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9024946Z 2025-08-14T21:42:15.9025040Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9025224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9025292Z return mod(**inputs) 2025-08-14T21:42:15.9025559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9025627Z outputs = self.bert( 2025-08-14T21:42:15.9025893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9025959Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9026231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9026297Z layer_outputs = layer_module( 2025-08-14T21:42:15.9026502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9026580Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9026843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9040706Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9041112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9041208Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9041532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9041737Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9042023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.9042144Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9042152Z 2025-08-14T21:42:15.9042256Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9042499Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9042566Z return mod(**inputs) 2025-08-14T21:42:15.9042843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9042946Z outputs = self.bert( 2025-08-14T21:42:15.9043212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9043293Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9043559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9043630Z layer_outputs = layer_module( 2025-08-14T21:42:15.9043847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9043927Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9044192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9044279Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9044523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9044604Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9044895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9044994Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9045265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.9045372Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.9045578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.9045644Z return self.act(input) 2025-08-14T21:42:15.9045648Z 2025-08-14T21:42:15.9045745Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9045945Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9046006Z return mod(**inputs) 2025-08-14T21:42:15.9046276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9046350Z outputs = self.bert( 2025-08-14T21:42:15.9046611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9046687Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9046950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9047015Z layer_outputs = layer_module( 2025-08-14T21:42:15.9047229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9047304Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9047571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9047663Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9047906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9048004Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9048300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.9048444Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.9048707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.9048808Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9048812Z 2025-08-14T21:42:15.9048915Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9049100Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9049162Z return mod(**inputs) 2025-08-14T21:42:15.9049437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9049499Z outputs = self.bert( 2025-08-14T21:42:15.9049768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9049837Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9050100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9050173Z layer_outputs = layer_module( 2025-08-14T21:42:15.9050377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9050457Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9050723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9050800Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9051069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9051136Z self_outputs = self.self( 2025-08-14T21:42:15.9051367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9051441Z return func(*args, **kwargs) 2025-08-14T21:42:15.9051706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.9051788Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.9051791Z 2025-08-14T21:42:15.9051885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9052067Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9052136Z return mod(**inputs) 2025-08-14T21:42:15.9052402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9052470Z outputs = self.bert( 2025-08-14T21:42:15.9052732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9052801Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9053074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9053140Z layer_outputs = layer_module( 2025-08-14T21:42:15.9053342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9053437Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9053702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9053800Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9054064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9054143Z self_outputs = self.self( 2025-08-14T21:42:15.9054374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9054438Z return func(*args, **kwargs) 2025-08-14T21:42:15.9054719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.9054801Z key_layer = self.key(current_states) 2025-08-14T21:42:15.9054804Z 2025-08-14T21:42:15.9054902Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9055093Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9055153Z return mod(**inputs) 2025-08-14T21:42:15.9055419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9055486Z outputs = self.bert( 2025-08-14T21:42:15.9055750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9055825Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9056089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9056153Z layer_outputs = layer_module( 2025-08-14T21:42:15.9056366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9056436Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9056699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9056780Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9057043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9057113Z self_outputs = self.self( 2025-08-14T21:42:15.9057335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9057399Z return func(*args, **kwargs) 2025-08-14T21:42:15.9057670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.9057742Z value_layer = self.value(current_states) 2025-08-14T21:42:15.9057746Z 2025-08-14T21:42:15.9057830Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.9057902Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.9057995Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9058185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9058245Z return mod(**inputs) 2025-08-14T21:42:15.9058511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9058579Z outputs = self.bert( 2025-08-14T21:42:15.9058844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9058916Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9059191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9059276Z layer_outputs = layer_module( 2025-08-14T21:42:15.9059490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9059560Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9059839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9059922Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9060197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.9060326Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.9060591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.9060668Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9060673Z 2025-08-14T21:42:15.9060776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9060956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9061024Z return mod(**inputs) 2025-08-14T21:42:15.9061286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9061345Z outputs = self.bert( 2025-08-14T21:42:15.9061615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9061681Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9061947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9062013Z layer_outputs = layer_module( 2025-08-14T21:42:15.9062214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9062292Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9062555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9062632Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9062878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9062945Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9063242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9063337Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9063599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.9063683Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9063687Z 2025-08-14T21:42:15.9063779Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9063966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9064024Z return mod(**inputs) 2025-08-14T21:42:15.9064286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9064353Z outputs = self.bert( 2025-08-14T21:42:15.9064613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9064678Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9065051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9065138Z layer_outputs = layer_module( 2025-08-14T21:42:15.9065352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9065443Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9065705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9065788Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9066039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9066118Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9066413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9066507Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9066784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.9066888Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.9067090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.9067162Z return self.act(input) 2025-08-14T21:42:15.9067165Z 2025-08-14T21:42:15.9067259Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9067452Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9067512Z return mod(**inputs) 2025-08-14T21:42:15.9067783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9067852Z outputs = self.bert( 2025-08-14T21:42:15.9068123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9068198Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9068466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9068531Z layer_outputs = layer_module( 2025-08-14T21:42:15.9068743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9068817Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9069086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9069169Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9069410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9069489Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9069783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.9069907Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.9070184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.9070258Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9070263Z 2025-08-14T21:42:15.9070364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9070549Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9070609Z return mod(**inputs) 2025-08-14T21:42:15.9070928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9071003Z outputs = self.bert( 2025-08-14T21:42:15.9071272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9071370Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9071633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9071704Z layer_outputs = layer_module( 2025-08-14T21:42:15.9071920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9071992Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9072265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9072338Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9072587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9072655Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9072947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.9073076Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.9073340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.9073417Z return input_tensor + hidden_states 2025-08-14T21:42:15.9073421Z 2025-08-14T21:42:15.9073515Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9073697Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9073765Z return mod(**inputs) 2025-08-14T21:42:15.9074031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9074093Z outputs = self.bert( 2025-08-14T21:42:15.9074367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9074433Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9074703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9074768Z layer_outputs = layer_module( 2025-08-14T21:42:15.9074969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9075047Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9075320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9075396Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9075666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9075732Z self_outputs = self.self( 2025-08-14T21:42:15.9075957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9076029Z return func(*args, **kwargs) 2025-08-14T21:42:15.9076292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.9076372Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.9076375Z 2025-08-14T21:42:15.9076483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9076667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9076756Z return mod(**inputs) 2025-08-14T21:42:15.9077023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9077098Z outputs = self.bert( 2025-08-14T21:42:15.9077368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9077434Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9077715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9077780Z layer_outputs = layer_module( 2025-08-14T21:42:15.9077982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9078061Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9078325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9078405Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9078668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9078731Z self_outputs = self.self( 2025-08-14T21:42:15.9078964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9079027Z return func(*args, **kwargs) 2025-08-14T21:42:15.9079287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.9079364Z key_layer = self.key(current_states) 2025-08-14T21:42:15.9079369Z 2025-08-14T21:42:15.9079461Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9079648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9079706Z return mod(**inputs) 2025-08-14T21:42:15.9079973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9080038Z outputs = self.bert( 2025-08-14T21:42:15.9080298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9080364Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9080635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9080697Z layer_outputs = layer_module( 2025-08-14T21:42:15.9080905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9080976Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9081239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9081320Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9081583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9081652Z self_outputs = self.self( 2025-08-14T21:42:15.9081880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9081943Z return func(*args, **kwargs) 2025-08-14T21:42:15.9082229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.9082303Z value_layer = self.value(current_states) 2025-08-14T21:42:15.9082320Z 2025-08-14T21:42:15.9082401Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.9082476Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.9082568Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9082772Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9082830Z return mod(**inputs) 2025-08-14T21:42:15.9083095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9083175Z outputs = self.bert( 2025-08-14T21:42:15.9083437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9083513Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9083777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9083843Z layer_outputs = layer_module( 2025-08-14T21:42:15.9084051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9084123Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9084385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9084467Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9085011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.9085149Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.9085413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.9085493Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9085496Z 2025-08-14T21:42:15.9085597Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9085780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9085849Z return mod(**inputs) 2025-08-14T21:42:15.9086113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9086171Z outputs = self.bert( 2025-08-14T21:42:15.9086443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9086510Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9086772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9086846Z layer_outputs = layer_module( 2025-08-14T21:42:15.9087047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9087126Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9087389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9087465Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9087714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9087783Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9088080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9088232Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9088527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.9088607Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9088634Z 2025-08-14T21:42:15.9088729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9088910Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9088981Z return mod(**inputs) 2025-08-14T21:42:15.9089273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9089345Z outputs = self.bert( 2025-08-14T21:42:15.9089609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9089677Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9089949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9090015Z layer_outputs = layer_module( 2025-08-14T21:42:15.9090228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9090302Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9090571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9090654Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9090894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9090963Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9091262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9091358Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9091626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.9091731Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.9091926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.9091999Z return self.act(input) 2025-08-14T21:42:15.9092002Z 2025-08-14T21:42:15.9092096Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9092282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9092340Z return mod(**inputs) 2025-08-14T21:42:15.9092604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9092674Z outputs = self.bert( 2025-08-14T21:42:15.9092936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9093002Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9093271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9093334Z layer_outputs = layer_module( 2025-08-14T21:42:15.9093542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9093613Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9093874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9093971Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9094229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9094301Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9094589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.9094725Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.9094998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.9095086Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9095089Z 2025-08-14T21:42:15.9095189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9095369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9095427Z return mod(**inputs) 2025-08-14T21:42:15.9095699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9095760Z outputs = self.bert( 2025-08-14T21:42:15.9096025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9096100Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9096362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9096435Z layer_outputs = layer_module( 2025-08-14T21:42:15.9096638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9096708Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9096981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9097059Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9097320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9097391Z self_outputs = self.self( 2025-08-14T21:42:15.9097612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9097682Z return func(*args, **kwargs) 2025-08-14T21:42:15.9097947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 223, in forward 2025-08-14T21:42:15.9098020Z query_layer = self.query(hidden_states) 2025-08-14T21:42:15.9098023Z 2025-08-14T21:42:15.9098124Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9098305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9098373Z return mod(**inputs) 2025-08-14T21:42:15.9098640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9098701Z outputs = self.bert( 2025-08-14T21:42:15.9098970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9099037Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9099300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9099374Z layer_outputs = layer_module( 2025-08-14T21:42:15.9099575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9099669Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9099944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9100018Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9100303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9100366Z self_outputs = self.self( 2025-08-14T21:42:15.9100598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9100677Z return func(*args, **kwargs) 2025-08-14T21:42:15.9100942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 252, in forward 2025-08-14T21:42:15.9101020Z key_layer = self.key(current_states) 2025-08-14T21:42:15.9101023Z 2025-08-14T21:42:15.9101118Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9101301Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9101367Z return mod(**inputs) 2025-08-14T21:42:15.9101632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9101700Z outputs = self.bert( 2025-08-14T21:42:15.9101961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9102028Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9102297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9102360Z layer_outputs = layer_module( 2025-08-14T21:42:15.9102569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9102639Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9102900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9102982Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9103244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 372, in forward 2025-08-14T21:42:15.9103306Z self_outputs = self.self( 2025-08-14T21:42:15.9103536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:42:15.9103597Z return func(*args, **kwargs) 2025-08-14T21:42:15.9103866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 256, in forward 2025-08-14T21:42:15.9103938Z value_layer = self.value(current_states) 2025-08-14T21:42:15.9103941Z 2025-08-14T21:42:15.9104014Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.9104091Z cudagraph partition due to non gpu ops 2025-08-14T21:42:15.9104183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9104373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9104431Z return mod(**inputs) 2025-08-14T21:42:15.9104694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9104818Z outputs = self.bert( 2025-08-14T21:42:15.9105090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9105157Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9105450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9105534Z layer_outputs = layer_module( 2025-08-14T21:42:15.9105744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9105814Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9106091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 444, in forward 2025-08-14T21:42:15.9106172Z self_attention_outputs = self.attention( 2025-08-14T21:42:15.9106446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 381, in forward 2025-08-14T21:42:15.9106568Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:42:15.9106844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 329, in forward 2025-08-14T21:42:15.9106918Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9106923Z 2025-08-14T21:42:15.9107023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9107203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9107266Z return mod(**inputs) 2025-08-14T21:42:15.9107542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9107599Z outputs = self.bert( 2025-08-14T21:42:15.9107868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9107935Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9108198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9108269Z layer_outputs = layer_module( 2025-08-14T21:42:15.9108473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9108541Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9108811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9108887Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9109130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9109201Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9109491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9109592Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9109854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 397, in forward 2025-08-14T21:42:15.9109936Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9109939Z 2025-08-14T21:42:15.9110031Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9110209Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9110274Z return mod(**inputs) 2025-08-14T21:42:15.9110538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9110606Z outputs = self.bert( 2025-08-14T21:42:15.9110868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9110935Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9111221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9111303Z layer_outputs = layer_module( 2025-08-14T21:42:15.9111505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9111601Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9111868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9111952Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9112206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9112275Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9112577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 481, in feed_forward_chunk 2025-08-14T21:42:15.9112671Z intermediate_output = self.intermediate(ln_output) 2025-08-14T21:42:15.9112939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 398, in forward 2025-08-14T21:42:15.9113045Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:15.9113238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:42:15.9113308Z return self.act(input) 2025-08-14T21:42:15.9113312Z 2025-08-14T21:42:15.9113404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9113586Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9113652Z return mod(**inputs) 2025-08-14T21:42:15.9113919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9113985Z outputs = self.bert( 2025-08-14T21:42:15.9114249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9114315Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9114588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9114653Z layer_outputs = layer_module( 2025-08-14T21:42:15.9114859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9114931Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9115194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9115275Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9115514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9115583Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9115880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.9116002Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.9116271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 410, in forward 2025-08-14T21:42:15.9116346Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:15.9116349Z 2025-08-14T21:42:15.9116442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9116629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9116687Z return mod(**inputs) 2025-08-14T21:42:15.9116984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1597, in forward 2025-08-14T21:42:15.9117059Z outputs = self.bert( 2025-08-14T21:42:15.9117321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 856, in forward 2025-08-14T21:42:15.9117413Z encoder_outputs = self.encoder( 2025-08-14T21:42:15.9117683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 537, in forward 2025-08-14T21:42:15.9117749Z layer_outputs = layer_module( 2025-08-14T21:42:15.9117977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:42:15.9118050Z return super().__call__(*args, **kwargs) 2025-08-14T21:42:15.9118321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 474, in forward 2025-08-14T21:42:15.9118397Z layer_output = apply_chunking_to_forward( 2025-08-14T21:42:15.9118634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:42:15.9118708Z return forward_fn(*input_tensors) 2025-08-14T21:42:15.9118996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 482, in feed_forward_chunk 2025-08-14T21:42:15.9119121Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:42:15.9119388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 412, in forward 2025-08-14T21:42:15.9119457Z return input_tensor + hidden_states 2025-08-14T21:42:15.9119460Z 2025-08-14T21:42:15.9119559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9119744Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9119806Z return mod(**inputs) 2025-08-14T21:42:15.9120079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1611, in forward 2025-08-14T21:42:15.9120157Z logits = self.qa_outputs(sequence_output) 2025-08-14T21:42:15.9120160Z 2025-08-14T21:42:15.9120258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9120438Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9120497Z return mod(**inputs) 2025-08-14T21:42:15.9120774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1629, in forward 2025-08-14T21:42:15.9120870Z start_loss = loss_fct(start_logits, start_positions) 2025-08-14T21:42:15.9120874Z 2025-08-14T21:42:15.9120973Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:15.9121154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:15.9121212Z return mod(**inputs) 2025-08-14T21:42:15.9121485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/megatron_bert/modeling_megatron_bert.py", line 1630, in forward 2025-08-14T21:42:15.9121569Z end_loss = loss_fct(end_logits, end_positions) 2025-08-14T21:42:15.9121573Z 2025-08-14T21:42:24.4405774Z Compilation time (from dynamo_timed): 20.121112277 2025-08-14T21:42:24.4407256Z pass 2025-08-14T21:42:24.4407697Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:42:24.4412374Z TIMING: _recursive_pre_grad_passes:0.0093 _recursive_joint_graph_passes:0.95653 _recursive_post_grad_passes:0.12263 async_compile.wait:0.00269 code_gen:7.50261 inductor_compile:9.49652 backend_compile:15.23236 gc:0.00071 entire_frame_compile:20.12111 total_wall_time:20.12111 2025-08-14T21:42:24.4413457Z STATS: call_* op count: 724 | FakeTensorMode.__torch_dispatch__:28476 | FakeTensor.__torch_dispatch__:8921 | ProxyTorchDispatchMode.__torch_dispatch__:10973 2025-08-14T21:42:24.4413996Z Dynamo produced 1 graphs covering 724 ops with 0 graph breaks (0 unique) 2025-08-14T21:42:28.7806325Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:42:28.7807634Z from pkg_resources import resource_filename 2025-08-14T21:42:29.3112974Z 2025-08-14T21:42:29.9022651Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:42:29.9023153Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:42:29.9085337Z cpu eval MobileBertForMaskedLM 2025-08-14T21:42:30.1390468Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:42:30.2815068Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:42:30.4121893Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:42:54.6265623Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6269772Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6271878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6272372Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6277139Z return mod(**inputs) 2025-08-14T21:42:54.6279537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6280083Z outputs = self.mobilebert( 2025-08-14T21:42:54.6284989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 791, in forward 2025-08-14T21:42:54.6286644Z embedding_output = self.embeddings( 2025-08-14T21:42:54.6287244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 199, in forward 2025-08-14T21:42:54.6287728Z inputs_embeds = torch.cat( 2025-08-14T21:42:54.6290545Z 2025-08-14T21:42:54.6294621Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6296865Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6297261Z return mod(**inputs) 2025-08-14T21:42:54.6302160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 989, in forward 2025-08-14T21:42:54.6305557Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:42:54.6310231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 643, in forward 2025-08-14T21:42:54.6314670Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:42:54.6319231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 632, in forward 2025-08-14T21:42:54.6323657Z hidden_states = hidden_states.matmul(torch.cat([self.decoder.weight.t(), self.dense.weight], dim=0)) 2025-08-14T21:42:54.6328026Z 2025-08-14T21:42:54.6332571Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6336948Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6339145Z return mod(**inputs) 2025-08-14T21:42:54.6339757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6340278Z outputs = self.mobilebert( 2025-08-14T21:42:54.6340989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 791, in forward 2025-08-14T21:42:54.6341456Z embedding_output = self.embeddings( 2025-08-14T21:42:54.6341954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 208, in forward 2025-08-14T21:42:54.6342472Z inputs_embeds = self.embedding_transformation(inputs_embeds) 2025-08-14T21:42:54.6342655Z 2025-08-14T21:42:54.6342764Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6343137Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6343496Z return mod(**inputs) 2025-08-14T21:42:54.6343870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6344261Z outputs = self.mobilebert( 2025-08-14T21:42:54.6344643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 791, in forward 2025-08-14T21:42:54.6345166Z embedding_output = self.embeddings( 2025-08-14T21:42:54.6345574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 215, in forward 2025-08-14T21:42:54.6346056Z embeddings = self.LayerNorm(embeddings) 2025-08-14T21:42:54.6346459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6346869Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6347023Z 2025-08-14T21:42:54.6347132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6347481Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6347793Z return mod(**inputs) 2025-08-14T21:42:54.6348215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6348613Z outputs = self.mobilebert( 2025-08-14T21:42:54.6349010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6349406Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6349813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6350200Z layer_outputs = layer_module( 2025-08-14T21:42:54.6350584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6351054Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6351548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6352050Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6352475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6352869Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6353005Z 2025-08-14T21:42:54.6353103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6353449Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6353757Z return mod(**inputs) 2025-08-14T21:42:54.6354143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6354536Z outputs = self.mobilebert( 2025-08-14T21:42:54.6354950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6355336Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6355746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6356135Z layer_outputs = layer_module( 2025-08-14T21:42:54.6356544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6356941Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6357341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6357776Z self_outputs = self.self( 2025-08-14T21:42:54.6358149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.6358535Z self.value(value_tensor) 2025-08-14T21:42:54.6358649Z 2025-08-14T21:42:54.6358749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6359087Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6359381Z return mod(**inputs) 2025-08-14T21:42:54.6359752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6360137Z outputs = self.mobilebert( 2025-08-14T21:42:54.6360505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6360893Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6361282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6361679Z layer_outputs = layer_module( 2025-08-14T21:42:54.6362060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6362543Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6363023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.6363460Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.6363884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6364286Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6364419Z 2025-08-14T21:42:54.6364525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6364872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6365187Z return mod(**inputs) 2025-08-14T21:42:54.6365576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6365976Z outputs = self.mobilebert( 2025-08-14T21:42:54.6366352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6366761Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6367168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6367565Z layer_outputs = layer_module( 2025-08-14T21:42:54.6367948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6368426Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6368924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6369386Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6369813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.6370249Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.6370680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6371088Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6371236Z 2025-08-14T21:42:54.6371352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6371696Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6372004Z return mod(**inputs) 2025-08-14T21:42:54.6372367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6372760Z outputs = self.mobilebert( 2025-08-14T21:42:54.6373151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6373552Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6373945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6374346Z layer_outputs = layer_module( 2025-08-14T21:42:54.6374742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6375156Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6375565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6375958Z self_outputs = self.self( 2025-08-14T21:42:54.6376341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.6376725Z self.query(query_tensor) 2025-08-14T21:42:54.6376841Z 2025-08-14T21:42:54.6376942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6377281Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6377589Z return mod(**inputs) 2025-08-14T21:42:54.6377958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6378362Z outputs = self.mobilebert( 2025-08-14T21:42:54.6378758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6379162Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6379563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6379980Z layer_outputs = layer_module( 2025-08-14T21:42:54.6380389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6380801Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6381215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6381616Z self_outputs = self.self( 2025-08-14T21:42:54.6382000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.6382397Z self.key(key_tensor) 2025-08-14T21:42:54.6382506Z 2025-08-14T21:42:54.6382587Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6382816Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6383063Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6383418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6383738Z return mod(**inputs) 2025-08-14T21:42:54.6384142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6384552Z outputs = self.mobilebert( 2025-08-14T21:42:54.6385289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6385767Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6386172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6386612Z layer_outputs = layer_module( 2025-08-14T21:42:54.6387030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6387465Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6387885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6388356Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6388791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.6389182Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6389324Z 2025-08-14T21:42:54.6389422Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6389760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6390064Z return mod(**inputs) 2025-08-14T21:42:54.6390419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6390809Z outputs = self.mobilebert( 2025-08-14T21:42:54.6391190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6391580Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6391952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6392332Z layer_outputs = layer_module( 2025-08-14T21:42:54.6392713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6393100Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6393491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6393916Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6394348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.6394776Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6395209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6395610Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6395748Z 2025-08-14T21:42:54.6395854Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6396181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6396484Z return mod(**inputs) 2025-08-14T21:42:54.6396877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6397306Z outputs = self.mobilebert( 2025-08-14T21:42:54.6397670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6398073Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6398455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6398825Z layer_outputs = layer_module( 2025-08-14T21:42:54.6399215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6399617Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6400015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6400425Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6400839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6401232Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6401363Z 2025-08-14T21:42:54.6401465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6401786Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6402084Z return mod(**inputs) 2025-08-14T21:42:54.6402442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6402815Z outputs = self.mobilebert( 2025-08-14T21:42:54.6403180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6403558Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6403940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6404335Z layer_outputs = layer_module( 2025-08-14T21:42:54.6404721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6405146Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6405559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6405982Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6406415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6406842Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6407001Z 2025-08-14T21:42:54.6407098Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6407435Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6407736Z return mod(**inputs) 2025-08-14T21:42:54.6408098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6408483Z outputs = self.mobilebert( 2025-08-14T21:42:54.6408856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6409245Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6409617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6410005Z layer_outputs = layer_module( 2025-08-14T21:42:54.6410412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6410843Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6411240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6411716Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6412148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6412541Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6412671Z 2025-08-14T21:42:54.6412783Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6413121Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6413426Z return mod(**inputs) 2025-08-14T21:42:54.6413784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6414173Z outputs = self.mobilebert( 2025-08-14T21:42:54.6414551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6414936Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6415308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6415692Z layer_outputs = layer_module( 2025-08-14T21:42:54.6416071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6416477Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6416878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6417312Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6417751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6418179Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6418605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6419007Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6419144Z 2025-08-14T21:42:54.6419251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6419580Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6419885Z return mod(**inputs) 2025-08-14T21:42:54.6420255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6420645Z outputs = self.mobilebert( 2025-08-14T21:42:54.6421024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6421403Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6421775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6422147Z layer_outputs = layer_module( 2025-08-14T21:42:54.6422512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6422904Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6423300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6423724Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6424150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6424542Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6424759Z 2025-08-14T21:42:54.6424885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6425214Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6425521Z return mod(**inputs) 2025-08-14T21:42:54.6425903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6426282Z outputs = self.mobilebert( 2025-08-14T21:42:54.6426646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6427030Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6427409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6427785Z layer_outputs = layer_module( 2025-08-14T21:42:54.6428157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6428562Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6428962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6429375Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6429790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6430210Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6430364Z 2025-08-14T21:42:54.6430471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6430797Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6431098Z return mod(**inputs) 2025-08-14T21:42:54.6431458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6431831Z outputs = self.mobilebert( 2025-08-14T21:42:54.6432204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6432588Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6432963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6433340Z layer_outputs = layer_module( 2025-08-14T21:42:54.6433715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6434121Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6434524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6434945Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6435372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6435771Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6435902Z 2025-08-14T21:42:54.6436001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6436338Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6436639Z return mod(**inputs) 2025-08-14T21:42:54.6437018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6437400Z outputs = self.mobilebert( 2025-08-14T21:42:54.6437766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6438160Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6438527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6438892Z layer_outputs = layer_module( 2025-08-14T21:42:54.6439275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6439678Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6440066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6440489Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6440930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6441361Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6441792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6442185Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6442320Z 2025-08-14T21:42:54.6442424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6442759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6443050Z return mod(**inputs) 2025-08-14T21:42:54.6443416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6443804Z outputs = self.mobilebert( 2025-08-14T21:42:54.6444172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6444563Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6444947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6445336Z layer_outputs = layer_module( 2025-08-14T21:42:54.6445709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6446118Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6446537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6446962Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6447374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6447773Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6447902Z 2025-08-14T21:42:54.6448008Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6448335Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6448643Z return mod(**inputs) 2025-08-14T21:42:54.6449008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6449399Z outputs = self.mobilebert( 2025-08-14T21:42:54.6449764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6450154Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6451211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6451624Z layer_outputs = layer_module( 2025-08-14T21:42:54.6451993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6452420Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6452822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6453256Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6453672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6454101Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6454258Z 2025-08-14T21:42:54.6454365Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6454700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6455004Z return mod(**inputs) 2025-08-14T21:42:54.6455365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6455745Z outputs = self.mobilebert( 2025-08-14T21:42:54.6456107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6456490Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6456873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6457234Z layer_outputs = layer_module( 2025-08-14T21:42:54.6457599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6457994Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6459191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6459610Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6460020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6460406Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6460534Z 2025-08-14T21:42:54.6460637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6460958Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6461247Z return mod(**inputs) 2025-08-14T21:42:54.6461602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6461978Z outputs = self.mobilebert( 2025-08-14T21:42:54.6462332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6462710Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6463076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6463444Z layer_outputs = layer_module( 2025-08-14T21:42:54.6463806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6464200Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6464594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6465141Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6465597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6466020Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6466456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6466848Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6466995Z 2025-08-14T21:42:54.6467095Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6467444Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6467747Z return mod(**inputs) 2025-08-14T21:42:54.6468097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6468478Z outputs = self.mobilebert( 2025-08-14T21:42:54.6468844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6469218Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6469586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6469961Z layer_outputs = layer_module( 2025-08-14T21:42:54.6470326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6470740Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6471159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6471549Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6471675Z 2025-08-14T21:42:54.6471779Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6472099Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6472395Z return mod(**inputs) 2025-08-14T21:42:54.6472752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6473127Z outputs = self.mobilebert( 2025-08-14T21:42:54.6473486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6473868Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6474243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6474611Z layer_outputs = layer_module( 2025-08-14T21:42:54.6474982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6475401Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6475818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6476224Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6476384Z 2025-08-14T21:42:54.6476479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6476811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6477110Z return mod(**inputs) 2025-08-14T21:42:54.6477461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6477834Z outputs = self.mobilebert( 2025-08-14T21:42:54.6478213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6478594Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6478979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6479375Z layer_outputs = layer_module( 2025-08-14T21:42:54.6479737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6480181Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6480657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.6481052Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.6481187Z 2025-08-14T21:42:54.6481287Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6481605Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6481900Z return mod(**inputs) 2025-08-14T21:42:54.6482249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6482612Z outputs = self.mobilebert( 2025-08-14T21:42:54.6482973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6483347Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6483713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6484077Z layer_outputs = layer_module( 2025-08-14T21:42:54.6484443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6485030Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6485496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.6485916Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.6486340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6486736Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6486869Z 2025-08-14T21:42:54.6486966Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6487297Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6487594Z return mod(**inputs) 2025-08-14T21:42:54.6487954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6488323Z outputs = self.mobilebert( 2025-08-14T21:42:54.6488692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6489072Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6489446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6489817Z layer_outputs = layer_module( 2025-08-14T21:42:54.6490188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6490644Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6491145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6491603Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6492025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.6492446Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6492574Z 2025-08-14T21:42:54.6492670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6492998Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6493300Z return mod(**inputs) 2025-08-14T21:42:54.6493686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6494055Z outputs = self.mobilebert( 2025-08-14T21:42:54.6494418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6494794Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6495156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6495528Z layer_outputs = layer_module( 2025-08-14T21:42:54.6495898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6496356Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6496802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6497226Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6497650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.6498070Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6498480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6498871Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6499006Z 2025-08-14T21:42:54.6499109Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6499433Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6499717Z return mod(**inputs) 2025-08-14T21:42:54.6500070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6500440Z outputs = self.mobilebert( 2025-08-14T21:42:54.6500792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6501169Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6501538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6501912Z layer_outputs = layer_module( 2025-08-14T21:42:54.6502278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6502733Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6503196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6503607Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6504011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6504422Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6504567Z 2025-08-14T21:42:54.6504672Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6505059Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6505390Z return mod(**inputs) 2025-08-14T21:42:54.6505757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6506144Z outputs = self.mobilebert( 2025-08-14T21:42:54.6506533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6506918Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6507291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6507667Z layer_outputs = layer_module( 2025-08-14T21:42:54.6508031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6508432Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6508816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6509187Z self_outputs = self.self( 2025-08-14T21:42:54.6509553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.6509929Z self.value(value_tensor) 2025-08-14T21:42:54.6510034Z 2025-08-14T21:42:54.6510135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6510454Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6510752Z return mod(**inputs) 2025-08-14T21:42:54.6511109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6511487Z outputs = self.mobilebert( 2025-08-14T21:42:54.6511859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6512240Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6512610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6512976Z layer_outputs = layer_module( 2025-08-14T21:42:54.6513346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6513799Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6514258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.6514658Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.6515067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6515450Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6515574Z 2025-08-14T21:42:54.6515673Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6515990Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6516286Z return mod(**inputs) 2025-08-14T21:42:54.6516647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6517016Z outputs = self.mobilebert( 2025-08-14T21:42:54.6517412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6517806Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6518175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6518560Z layer_outputs = layer_module( 2025-08-14T21:42:54.6518930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6519387Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6519858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6520276Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6520696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.6521090Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.6521473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6521873Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6522011Z 2025-08-14T21:42:54.6522105Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6522430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6522720Z return mod(**inputs) 2025-08-14T21:42:54.6523073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6523462Z outputs = self.mobilebert( 2025-08-14T21:42:54.6523831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6524204Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6524577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6524955Z layer_outputs = layer_module( 2025-08-14T21:42:54.6525317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6525707Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6526092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6526470Z self_outputs = self.self( 2025-08-14T21:42:54.6526825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.6527199Z self.query(query_tensor) 2025-08-14T21:42:54.6527302Z 2025-08-14T21:42:54.6527403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6527735Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6528022Z return mod(**inputs) 2025-08-14T21:42:54.6528374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6528749Z outputs = self.mobilebert( 2025-08-14T21:42:54.6529126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6529512Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6529889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6530268Z layer_outputs = layer_module( 2025-08-14T21:42:54.6530647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6531052Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6531436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6531824Z self_outputs = self.self( 2025-08-14T21:42:54.6532191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.6532566Z self.key(key_tensor) 2025-08-14T21:42:54.6532664Z 2025-08-14T21:42:54.6532747Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6532955Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6533182Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6533516Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6533812Z return mod(**inputs) 2025-08-14T21:42:54.6534175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6534553Z outputs = self.mobilebert( 2025-08-14T21:42:54.6534919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6535295Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6535669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6536047Z layer_outputs = layer_module( 2025-08-14T21:42:54.6536428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6536809Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6537198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6537624Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6538055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.6538443Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6538580Z 2025-08-14T21:42:54.6538674Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6539002Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6539297Z return mod(**inputs) 2025-08-14T21:42:54.6539657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6540031Z outputs = self.mobilebert( 2025-08-14T21:42:54.6540398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6540776Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6541166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6541543Z layer_outputs = layer_module( 2025-08-14T21:42:54.6541905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6542300Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6542686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6543110Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6543528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.6543976Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6544420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6544886Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6545055Z 2025-08-14T21:42:54.6545148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6545476Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6545776Z return mod(**inputs) 2025-08-14T21:42:54.6546156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6546529Z outputs = self.mobilebert( 2025-08-14T21:42:54.6546904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6547290Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6547659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6548042Z layer_outputs = layer_module( 2025-08-14T21:42:54.6548413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6548815Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6549210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6549622Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6550035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6550418Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6550547Z 2025-08-14T21:42:54.6550641Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6550973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6551269Z return mod(**inputs) 2025-08-14T21:42:54.6551619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6552003Z outputs = self.mobilebert( 2025-08-14T21:42:54.6552370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6552746Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6553109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6553483Z layer_outputs = layer_module( 2025-08-14T21:42:54.6553856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6554253Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6554640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6555047Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6555460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6555862Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6556020Z 2025-08-14T21:42:54.6556113Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6556438Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6556730Z return mod(**inputs) 2025-08-14T21:42:54.6557093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6557482Z outputs = self.mobilebert( 2025-08-14T21:42:54.6557845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6558236Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6558607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6558980Z layer_outputs = layer_module( 2025-08-14T21:42:54.6559373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6559764Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6560165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6560594Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6561019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6561402Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6561538Z 2025-08-14T21:42:54.6561632Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6561962Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6562259Z return mod(**inputs) 2025-08-14T21:42:54.6562608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6562981Z outputs = self.mobilebert( 2025-08-14T21:42:54.6563347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6563723Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6564102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6564477Z layer_outputs = layer_module( 2025-08-14T21:42:54.6564847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6565234Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6565632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6566056Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6566479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6566896Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6567320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6567716Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6567850Z 2025-08-14T21:42:54.6567951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6568271Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6568569Z return mod(**inputs) 2025-08-14T21:42:54.6568926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6569296Z outputs = self.mobilebert( 2025-08-14T21:42:54.6569661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6570054Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6570444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6570812Z layer_outputs = layer_module( 2025-08-14T21:42:54.6571180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6571594Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6571979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6572406Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6572819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6573202Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6573330Z 2025-08-14T21:42:54.6573423Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6573747Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6574039Z return mod(**inputs) 2025-08-14T21:42:54.6574392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6574761Z outputs = self.mobilebert( 2025-08-14T21:42:54.6575123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6575498Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6575862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6576233Z layer_outputs = layer_module( 2025-08-14T21:42:54.6576603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6577000Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6577384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6577796Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6578203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6578615Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6578767Z 2025-08-14T21:42:54.6578861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6579185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6579485Z return mod(**inputs) 2025-08-14T21:42:54.6579833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6580207Z outputs = self.mobilebert( 2025-08-14T21:42:54.6580565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6580943Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6581303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6581677Z layer_outputs = layer_module( 2025-08-14T21:42:54.6582043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6582442Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6582844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6583271Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6583717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6584115Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6584250Z 2025-08-14T21:42:54.6584344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6584905Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6585210Z return mod(**inputs) 2025-08-14T21:42:54.6585609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6586012Z outputs = self.mobilebert( 2025-08-14T21:42:54.6586376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6586749Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6587114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6587491Z layer_outputs = layer_module( 2025-08-14T21:42:54.6587862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6588253Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6588648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6589068Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6589493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6589906Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6590327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6590713Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6590849Z 2025-08-14T21:42:54.6590951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6591270Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6591564Z return mod(**inputs) 2025-08-14T21:42:54.6591920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6592284Z outputs = self.mobilebert( 2025-08-14T21:42:54.6592645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6593022Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6593388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6593756Z layer_outputs = layer_module( 2025-08-14T21:42:54.6594124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6594521Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6594916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6595323Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6595733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6596120Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6596274Z 2025-08-14T21:42:54.6596378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6596723Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6597018Z return mod(**inputs) 2025-08-14T21:42:54.6597370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6597762Z outputs = self.mobilebert( 2025-08-14T21:42:54.6598127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6598519Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6598889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6599258Z layer_outputs = layer_module( 2025-08-14T21:42:54.6599624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6600023Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6600412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6600826Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6601234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6601644Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6601794Z 2025-08-14T21:42:54.6601889Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6602215Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6602510Z return mod(**inputs) 2025-08-14T21:42:54.6602863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6603232Z outputs = self.mobilebert( 2025-08-14T21:42:54.6603595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6603973Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6604334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6604707Z layer_outputs = layer_module( 2025-08-14T21:42:54.6605077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6605472Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6605858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6606277Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6606697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6607079Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6607204Z 2025-08-14T21:42:54.6607297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6607619Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6607910Z return mod(**inputs) 2025-08-14T21:42:54.6608256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6608627Z outputs = self.mobilebert( 2025-08-14T21:42:54.6609005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6609387Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6609796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6610167Z layer_outputs = layer_module( 2025-08-14T21:42:54.6610576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6610973Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6611400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6611826Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6612249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6612664Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6613086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6613482Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6613618Z 2025-08-14T21:42:54.6613723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6614044Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6614339Z return mod(**inputs) 2025-08-14T21:42:54.6614692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6615069Z outputs = self.mobilebert( 2025-08-14T21:42:54.6615422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6615798Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6616171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6616539Z layer_outputs = layer_module( 2025-08-14T21:42:54.6616907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6617324Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6617740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6618119Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6618253Z 2025-08-14T21:42:54.6618347Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6618669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6618968Z return mod(**inputs) 2025-08-14T21:42:54.6619318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6619693Z outputs = self.mobilebert( 2025-08-14T21:42:54.6620052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6620423Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6620790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6621166Z layer_outputs = layer_module( 2025-08-14T21:42:54.6621528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6621935Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6622364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6622796Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6622948Z 2025-08-14T21:42:54.6623048Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6623384Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6623680Z return mod(**inputs) 2025-08-14T21:42:54.6624031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6624395Z outputs = self.mobilebert( 2025-08-14T21:42:54.6624854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6625241Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6625616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6625989Z layer_outputs = layer_module( 2025-08-14T21:42:54.6626364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6626830Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6627296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.6627690Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.6627838Z 2025-08-14T21:42:54.6627933Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6628266Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6628562Z return mod(**inputs) 2025-08-14T21:42:54.6628922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6629300Z outputs = self.mobilebert( 2025-08-14T21:42:54.6629665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6630039Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6630416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6630792Z layer_outputs = layer_module( 2025-08-14T21:42:54.6631159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6631618Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6632080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.6632501Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.6632918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6633325Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6633463Z 2025-08-14T21:42:54.6633559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6633915Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6634212Z return mod(**inputs) 2025-08-14T21:42:54.6634586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6634962Z outputs = self.mobilebert( 2025-08-14T21:42:54.6635340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6635716Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6636107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6636483Z layer_outputs = layer_module( 2025-08-14T21:42:54.6636859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6637310Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6637775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6638206Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6638625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.6639014Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6639150Z 2025-08-14T21:42:54.6639246Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6639574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6639865Z return mod(**inputs) 2025-08-14T21:42:54.6640217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6640594Z outputs = self.mobilebert( 2025-08-14T21:42:54.6640953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6641332Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6641703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6642083Z layer_outputs = layer_module( 2025-08-14T21:42:54.6642445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6642902Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6643359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6643780Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6644192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.6644612Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6645034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6645428Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6645562Z 2025-08-14T21:42:54.6645656Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6645984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6646286Z return mod(**inputs) 2025-08-14T21:42:54.6646636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6647012Z outputs = self.mobilebert( 2025-08-14T21:42:54.6647379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6647759Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6648128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6648534Z layer_outputs = layer_module( 2025-08-14T21:42:54.6648907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6649388Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6649855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6650265Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6650670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6651060Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6651197Z 2025-08-14T21:42:54.6651290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6651616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6651912Z return mod(**inputs) 2025-08-14T21:42:54.6652258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6652630Z outputs = self.mobilebert( 2025-08-14T21:42:54.6652993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6653370Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6653735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6654109Z layer_outputs = layer_module( 2025-08-14T21:42:54.6654474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6654855Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6655241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6655620Z self_outputs = self.self( 2025-08-14T21:42:54.6655985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.6656357Z self.value(value_tensor) 2025-08-14T21:42:54.6656469Z 2025-08-14T21:42:54.6656563Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6656888Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6657173Z return mod(**inputs) 2025-08-14T21:42:54.6657526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6657899Z outputs = self.mobilebert( 2025-08-14T21:42:54.6658258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6658628Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6658998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6659372Z layer_outputs = layer_module( 2025-08-14T21:42:54.6659737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6660185Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6660644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.6661055Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.6661474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6661858Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6662015Z 2025-08-14T21:42:54.6662107Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6662429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6662734Z return mod(**inputs) 2025-08-14T21:42:54.6663092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6663463Z outputs = self.mobilebert( 2025-08-14T21:42:54.6663841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6664214Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6664588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6665034Z layer_outputs = layer_module( 2025-08-14T21:42:54.6665405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6665866Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6666332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6666750Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6667156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.6667552Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.6667945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6668345Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6668483Z 2025-08-14T21:42:54.6668578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6668906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6669205Z return mod(**inputs) 2025-08-14T21:42:54.6669555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6669929Z outputs = self.mobilebert( 2025-08-14T21:42:54.6670295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6670672Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6671039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6671412Z layer_outputs = layer_module( 2025-08-14T21:42:54.6671781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6672171Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6672551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6672926Z self_outputs = self.self( 2025-08-14T21:42:54.6673293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.6673660Z self.query(query_tensor) 2025-08-14T21:42:54.6673771Z 2025-08-14T21:42:54.6673867Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6674193Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6674490Z return mod(**inputs) 2025-08-14T21:42:54.6674855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6675252Z outputs = self.mobilebert( 2025-08-14T21:42:54.6675615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6676011Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6676373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6676745Z layer_outputs = layer_module( 2025-08-14T21:42:54.6677127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6677512Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6677898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6678273Z self_outputs = self.self( 2025-08-14T21:42:54.6678636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.6678999Z self.key(key_tensor) 2025-08-14T21:42:54.6679104Z 2025-08-14T21:42:54.6679180Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6679380Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6679589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6679916Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6680213Z return mod(**inputs) 2025-08-14T21:42:54.6680567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6680933Z outputs = self.mobilebert( 2025-08-14T21:42:54.6681299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6681679Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6682045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6682425Z layer_outputs = layer_module( 2025-08-14T21:42:54.6682794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6683181Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6683562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6683983Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6684404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.6684899Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6685034Z 2025-08-14T21:42:54.6685130Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6685462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6685767Z return mod(**inputs) 2025-08-14T21:42:54.6686118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6686498Z outputs = self.mobilebert( 2025-08-14T21:42:54.6686868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6687249Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6687615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6688036Z layer_outputs = layer_module( 2025-08-14T21:42:54.6688432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6688827Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6689233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6689661Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6690087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.6690530Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6690956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6691349Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6691482Z 2025-08-14T21:42:54.6691586Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6691906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6692199Z return mod(**inputs) 2025-08-14T21:42:54.6692553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6692924Z outputs = self.mobilebert( 2025-08-14T21:42:54.6693277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6693650Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6694019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6694382Z layer_outputs = layer_module( 2025-08-14T21:42:54.6694750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6695151Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6695549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6695955Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6696366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6696752Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6696879Z 2025-08-14T21:42:54.6696979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6697298Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6697596Z return mod(**inputs) 2025-08-14T21:42:54.6697948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6698318Z outputs = self.mobilebert( 2025-08-14T21:42:54.6698680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6699062Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6699427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6699795Z layer_outputs = layer_module( 2025-08-14T21:42:54.6700162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6700560Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6700967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6701389Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6701803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6702236Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6702386Z 2025-08-14T21:42:54.6702482Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6702810Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6703108Z return mod(**inputs) 2025-08-14T21:42:54.6703478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6703850Z outputs = self.mobilebert( 2025-08-14T21:42:54.6704222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6704602Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6705032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6705410Z layer_outputs = layer_module( 2025-08-14T21:42:54.6705784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6706189Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6706583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6707013Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6707440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6707835Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6707966Z 2025-08-14T21:42:54.6708062Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6708390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6708689Z return mod(**inputs) 2025-08-14T21:42:54.6709040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6709412Z outputs = self.mobilebert( 2025-08-14T21:42:54.6709777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6710157Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6710524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6710904Z layer_outputs = layer_module( 2025-08-14T21:42:54.6711276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6711678Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6712071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6712500Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6712926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6713350Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6713766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6714204Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6714342Z 2025-08-14T21:42:54.6714464Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6714648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6714716Z return mod(**inputs) 2025-08-14T21:42:54.6714987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6715052Z outputs = self.mobilebert( 2025-08-14T21:42:54.6715311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6715393Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6715648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6715721Z layer_outputs = layer_module( 2025-08-14T21:42:54.6715974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6716069Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6716324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6716426Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6716689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6716767Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6716773Z 2025-08-14T21:42:54.6716875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6717055Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6717113Z return mod(**inputs) 2025-08-14T21:42:54.6717374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6717438Z outputs = self.mobilebert( 2025-08-14T21:42:54.6717691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6717763Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6718018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6718086Z layer_outputs = layer_module( 2025-08-14T21:42:54.6718341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6718425Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6718687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6718787Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6719047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6719148Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6719152Z 2025-08-14T21:42:54.6719243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6719426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6719484Z return mod(**inputs) 2025-08-14T21:42:54.6719737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6719806Z outputs = self.mobilebert( 2025-08-14T21:42:54.6720074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6720164Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6720419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6720499Z layer_outputs = layer_module( 2025-08-14T21:42:54.6720760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6720845Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6721119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6721233Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6721485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6721571Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6721576Z 2025-08-14T21:42:54.6721669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6721850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6721920Z return mod(**inputs) 2025-08-14T21:42:54.6722174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6722245Z outputs = self.mobilebert( 2025-08-14T21:42:54.6722497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6722562Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6722820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6722884Z layer_outputs = layer_module( 2025-08-14T21:42:54.6723144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6723230Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6723481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6723601Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6723853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6723963Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6724223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6724308Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6724311Z 2025-08-14T21:42:54.6724410Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6724587Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6724645Z return mod(**inputs) 2025-08-14T21:42:54.6724904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6724966Z outputs = self.mobilebert( 2025-08-14T21:42:54.6725223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6725289Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6725545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6725616Z layer_outputs = layer_module( 2025-08-14T21:42:54.6725882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6725981Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6726241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6726360Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6726618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6726691Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6726710Z 2025-08-14T21:42:54.6726805Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6726991Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6727050Z return mod(**inputs) 2025-08-14T21:42:54.6727312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6727377Z outputs = self.mobilebert( 2025-08-14T21:42:54.6727630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6727704Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6727959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6728023Z layer_outputs = layer_module( 2025-08-14T21:42:54.6728284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6728367Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6728629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6728730Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6728984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6729095Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6729099Z 2025-08-14T21:42:54.6729190Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6729376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6729435Z return mod(**inputs) 2025-08-14T21:42:54.6729691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6729762Z outputs = self.mobilebert( 2025-08-14T21:42:54.6730019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6730092Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6730350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6730413Z layer_outputs = layer_module( 2025-08-14T21:42:54.6730678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6730763Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6731019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6731139Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6731392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6731488Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6731507Z 2025-08-14T21:42:54.6731601Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6731778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6731864Z return mod(**inputs) 2025-08-14T21:42:54.6732120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6732190Z outputs = self.mobilebert( 2025-08-14T21:42:54.6732463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6732531Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6732794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6732857Z layer_outputs = layer_module( 2025-08-14T21:42:54.6733112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6733205Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6733458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6733577Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6733830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6733939Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6734202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6734284Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6734289Z 2025-08-14T21:42:54.6734390Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6734575Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6734633Z return mod(**inputs) 2025-08-14T21:42:54.6734895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6734958Z outputs = self.mobilebert( 2025-08-14T21:42:54.6735211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6735282Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6735536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6735603Z layer_outputs = layer_module( 2025-08-14T21:42:54.6735862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6735971Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6736233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6736309Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6736312Z 2025-08-14T21:42:54.6736413Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6736592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6736651Z return mod(**inputs) 2025-08-14T21:42:54.6736912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6736975Z outputs = self.mobilebert( 2025-08-14T21:42:54.6737249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6737340Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6737590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6737679Z layer_outputs = layer_module( 2025-08-14T21:42:54.6737931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6738036Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6738313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6738416Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6738419Z 2025-08-14T21:42:54.6738518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6738698Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6738758Z return mod(**inputs) 2025-08-14T21:42:54.6739016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6739081Z outputs = self.mobilebert( 2025-08-14T21:42:54.6739332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6739403Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6739655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6739722Z layer_outputs = layer_module( 2025-08-14T21:42:54.6739977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6740121Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6740382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.6740468Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.6740472Z 2025-08-14T21:42:54.6740569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6740744Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6740801Z return mod(**inputs) 2025-08-14T21:42:54.6741063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6741125Z outputs = self.mobilebert( 2025-08-14T21:42:54.6741383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6741446Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6741702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6741771Z layer_outputs = layer_module( 2025-08-14T21:42:54.6742026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6742167Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6742426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.6742536Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.6742795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6742892Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6742909Z 2025-08-14T21:42:54.6743002Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6743190Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6743265Z return mod(**inputs) 2025-08-14T21:42:54.6743529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6743592Z outputs = self.mobilebert( 2025-08-14T21:42:54.6743860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6743934Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6744189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6744252Z layer_outputs = layer_module( 2025-08-14T21:42:54.6744514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6744659Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6744984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6745101Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6745358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.6745444Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6745448Z 2025-08-14T21:42:54.6745541Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6745728Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6745788Z return mod(**inputs) 2025-08-14T21:42:54.6746045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6746118Z outputs = self.mobilebert( 2025-08-14T21:42:54.6746375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6746442Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6746704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6746770Z layer_outputs = layer_module( 2025-08-14T21:42:54.6747033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6747178Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6747433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6747555Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6747809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.6747929Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6748184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6748268Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6748271Z 2025-08-14T21:42:54.6748374Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6748556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6748643Z return mod(**inputs) 2025-08-14T21:42:54.6748915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6748977Z outputs = self.mobilebert( 2025-08-14T21:42:54.6749241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6749324Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6749581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6749676Z layer_outputs = layer_module( 2025-08-14T21:42:54.6749933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6750086Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6750343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6750444Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6750707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6750783Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6750787Z 2025-08-14T21:42:54.6750882Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6751062Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6751119Z return mod(**inputs) 2025-08-14T21:42:54.6751381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6751444Z outputs = self.mobilebert( 2025-08-14T21:42:54.6751697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6751770Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6752022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6752095Z layer_outputs = layer_module( 2025-08-14T21:42:54.6752347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6752424Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6752683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6752748Z self_outputs = self.self( 2025-08-14T21:42:54.6753008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.6753073Z self.value(value_tensor) 2025-08-14T21:42:54.6753076Z 2025-08-14T21:42:54.6753169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6753353Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6753413Z return mod(**inputs) 2025-08-14T21:42:54.6753667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6753737Z outputs = self.mobilebert( 2025-08-14T21:42:54.6753991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6754063Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6754314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6754394Z layer_outputs = layer_module( 2025-08-14T21:42:54.6754676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6754821Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6755111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.6755209Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.6755483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6755567Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6755570Z 2025-08-14T21:42:54.6755660Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6755839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6755905Z return mod(**inputs) 2025-08-14T21:42:54.6756157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6756226Z outputs = self.mobilebert( 2025-08-14T21:42:54.6756482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6756546Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6756806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6756870Z layer_outputs = layer_module( 2025-08-14T21:42:54.6757128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6757274Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6757531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6757635Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6757889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.6757973Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.6758228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6758311Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6758314Z 2025-08-14T21:42:54.6758413Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6758594Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6758655Z return mod(**inputs) 2025-08-14T21:42:54.6758920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6758982Z outputs = self.mobilebert( 2025-08-14T21:42:54.6759244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6759309Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6759564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6759636Z layer_outputs = layer_module( 2025-08-14T21:42:54.6759890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6759971Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6760240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6760322Z self_outputs = self.self( 2025-08-14T21:42:54.6760587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.6760671Z self.query(query_tensor) 2025-08-14T21:42:54.6760674Z 2025-08-14T21:42:54.6760767Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6760954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6761013Z return mod(**inputs) 2025-08-14T21:42:54.6761289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6761354Z outputs = self.mobilebert( 2025-08-14T21:42:54.6761609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6761683Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6761935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6761999Z layer_outputs = layer_module( 2025-08-14T21:42:54.6762260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6762335Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6762596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6762658Z self_outputs = self.self( 2025-08-14T21:42:54.6762910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.6762983Z self.key(key_tensor) 2025-08-14T21:42:54.6762988Z 2025-08-14T21:42:54.6763062Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6763144Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6763238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6763414Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6763481Z return mod(**inputs) 2025-08-14T21:42:54.6763735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6763797Z outputs = self.mobilebert( 2025-08-14T21:42:54.6764057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6764122Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6764380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6764442Z layer_outputs = layer_module( 2025-08-14T21:42:54.6764696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6764778Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6765031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6765143Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6765404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.6765480Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6765483Z 2025-08-14T21:42:54.6765580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6765774Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6765847Z return mod(**inputs) 2025-08-14T21:42:54.6766114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6766175Z outputs = self.mobilebert( 2025-08-14T21:42:54.6766450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6766513Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6766766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6766852Z layer_outputs = layer_module( 2025-08-14T21:42:54.6767106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6767181Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6767442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6767552Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6767812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.6767925Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6768178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6768268Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6768272Z 2025-08-14T21:42:54.6768364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6768548Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6768608Z return mod(**inputs) 2025-08-14T21:42:54.6768861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6768932Z outputs = self.mobilebert( 2025-08-14T21:42:54.6769183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6769253Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6769505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6769568Z layer_outputs = layer_module( 2025-08-14T21:42:54.6769827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6769912Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6770164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6770272Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6770525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6770604Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6770607Z 2025-08-14T21:42:54.6770698Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6770874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6770938Z return mod(**inputs) 2025-08-14T21:42:54.6771190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6771258Z outputs = self.mobilebert( 2025-08-14T21:42:54.6771525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6771604Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6771866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6771948Z layer_outputs = layer_module( 2025-08-14T21:42:54.6772200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6772292Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6772561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6772671Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6772925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6773027Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6773032Z 2025-08-14T21:42:54.6773131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6773310Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6773378Z return mod(**inputs) 2025-08-14T21:42:54.6773630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6773693Z outputs = self.mobilebert( 2025-08-14T21:42:54.6773953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6774018Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6774272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6774343Z layer_outputs = layer_module( 2025-08-14T21:42:54.6774600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6774692Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6774948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6775062Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6775325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6775400Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6775403Z 2025-08-14T21:42:54.6775500Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6775680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6775738Z return mod(**inputs) 2025-08-14T21:42:54.6776002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6776063Z outputs = self.mobilebert( 2025-08-14T21:42:54.6776318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6776388Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6776640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6776710Z layer_outputs = layer_module( 2025-08-14T21:42:54.6776963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6777046Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6777323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6777456Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6777715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6777838Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6778092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6778194Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6778198Z 2025-08-14T21:42:54.6778290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6778475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6778535Z return mod(**inputs) 2025-08-14T21:42:54.6778786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6778860Z outputs = self.mobilebert( 2025-08-14T21:42:54.6779112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6779177Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6779434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6779498Z layer_outputs = layer_module( 2025-08-14T21:42:54.6779755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6779839Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6780093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6780201Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6780454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6780539Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6780542Z 2025-08-14T21:42:54.6780633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6780809Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6780874Z return mod(**inputs) 2025-08-14T21:42:54.6781127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6781189Z outputs = self.mobilebert( 2025-08-14T21:42:54.6781449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6781515Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6781774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6781840Z layer_outputs = layer_module( 2025-08-14T21:42:54.6782091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6782183Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6782436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6782540Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6782805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6782908Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6782927Z 2025-08-14T21:42:54.6783027Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6783205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6783282Z return mod(**inputs) 2025-08-14T21:42:54.6783553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6783614Z outputs = self.mobilebert( 2025-08-14T21:42:54.6783896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6783964Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6784217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6784288Z layer_outputs = layer_module( 2025-08-14T21:42:54.6784543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6784778Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6785044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6785156Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6785419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6785497Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6785500Z 2025-08-14T21:42:54.6785593Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6785779Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6785840Z return mod(**inputs) 2025-08-14T21:42:54.6786101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6786166Z outputs = self.mobilebert( 2025-08-14T21:42:54.6786421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6786494Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6786749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6786821Z layer_outputs = layer_module( 2025-08-14T21:42:54.6787073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6787157Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6787421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6787536Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6787788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6787905Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6788158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6788247Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6788251Z 2025-08-14T21:42:54.6788342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6788520Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6788622Z return mod(**inputs) 2025-08-14T21:42:54.6788879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6788973Z outputs = self.mobilebert( 2025-08-14T21:42:54.6789229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6789317Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6789582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6789645Z layer_outputs = layer_module( 2025-08-14T21:42:54.6789923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6790020Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6790272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6790381Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6790638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6790715Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6790718Z 2025-08-14T21:42:54.6790817Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6790993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6791061Z return mod(**inputs) 2025-08-14T21:42:54.6791314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6791376Z outputs = self.mobilebert( 2025-08-14T21:42:54.6791638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6791703Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6791955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6792026Z layer_outputs = layer_module( 2025-08-14T21:42:54.6792277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6792367Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6792620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6792718Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6792979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6793078Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6793083Z 2025-08-14T21:42:54.6793181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6793361Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6793419Z return mod(**inputs) 2025-08-14T21:42:54.6793679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6793741Z outputs = self.mobilebert( 2025-08-14T21:42:54.6794003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6794067Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6794318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6794402Z layer_outputs = layer_module( 2025-08-14T21:42:54.6794666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6794768Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6795047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6795158Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6795421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6795510Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6795513Z 2025-08-14T21:42:54.6795611Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6795799Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6795862Z return mod(**inputs) 2025-08-14T21:42:54.6796127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6796192Z outputs = self.mobilebert( 2025-08-14T21:42:54.6796450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6796525Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6796780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6796848Z layer_outputs = layer_module( 2025-08-14T21:42:54.6797110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6797195Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6797456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6797571Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6797827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6797947Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6798203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6798294Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6798297Z 2025-08-14T21:42:54.6798391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6798572Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6798640Z return mod(**inputs) 2025-08-14T21:42:54.6798898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6798964Z outputs = self.mobilebert( 2025-08-14T21:42:54.6799228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6799298Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6799560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6799625Z layer_outputs = layer_module( 2025-08-14T21:42:54.6799880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6799997Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6800275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6800371Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6800374Z 2025-08-14T21:42:54.6800466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6800647Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6800728Z return mod(**inputs) 2025-08-14T21:42:54.6800986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6801048Z outputs = self.mobilebert( 2025-08-14T21:42:54.6801326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6801394Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6801662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6801726Z layer_outputs = layer_module( 2025-08-14T21:42:54.6801983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6802097Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6802358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6802464Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6802467Z 2025-08-14T21:42:54.6802559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6802741Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6802806Z return mod(**inputs) 2025-08-14T21:42:54.6803069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6803134Z outputs = self.mobilebert( 2025-08-14T21:42:54.6803398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6803463Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6803732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6803794Z layer_outputs = layer_module( 2025-08-14T21:42:54.6804053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6804210Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6804469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.6804564Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.6804568Z 2025-08-14T21:42:54.6804659Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6804840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6804910Z return mod(**inputs) 2025-08-14T21:42:54.6805170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6805232Z outputs = self.mobilebert( 2025-08-14T21:42:54.6805499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6805564Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6805831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6805912Z layer_outputs = layer_module( 2025-08-14T21:42:54.6806167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6806337Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6806606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.6806723Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.6806989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6807075Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6807079Z 2025-08-14T21:42:54.6807176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6807354Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6807419Z return mod(**inputs) 2025-08-14T21:42:54.6807673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6807734Z outputs = self.mobilebert( 2025-08-14T21:42:54.6807995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6808058Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6808309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6808380Z layer_outputs = layer_module( 2025-08-14T21:42:54.6808631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6808781Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6809032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6809144Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6809403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.6809481Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6809484Z 2025-08-14T21:42:54.6809580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6809759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6809817Z return mod(**inputs) 2025-08-14T21:42:54.6810076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6810139Z outputs = self.mobilebert( 2025-08-14T21:42:54.6810394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6810466Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6810719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6810792Z layer_outputs = layer_module( 2025-08-14T21:42:54.6811045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6811188Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6811448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6811557Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6811829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.6811953Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6812208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6812315Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6812318Z 2025-08-14T21:42:54.6812408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6812606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6812666Z return mod(**inputs) 2025-08-14T21:42:54.6812919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6812986Z outputs = self.mobilebert( 2025-08-14T21:42:54.6813239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6813304Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6813568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6813632Z layer_outputs = layer_module( 2025-08-14T21:42:54.6813894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6814041Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6814303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6814410Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6814664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6814745Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6814749Z 2025-08-14T21:42:54.6814838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6815019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6815085Z return mod(**inputs) 2025-08-14T21:42:54.6815339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6815400Z outputs = self.mobilebert( 2025-08-14T21:42:54.6815662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6815726Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6815988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6816051Z layer_outputs = layer_module( 2025-08-14T21:42:54.6816306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6816392Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6816646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6816715Z self_outputs = self.self( 2025-08-14T21:42:54.6816971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.6817035Z self.value(value_tensor) 2025-08-14T21:42:54.6817038Z 2025-08-14T21:42:54.6817135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6817331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6817404Z return mod(**inputs) 2025-08-14T21:42:54.6817668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6817729Z outputs = self.mobilebert( 2025-08-14T21:42:54.6818012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6818076Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6818331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6818875Z layer_outputs = layer_module( 2025-08-14T21:42:54.6819130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6819283Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6819538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.6819640Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.6819903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6819978Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6819982Z 2025-08-14T21:42:54.6820073Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6820262Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6820321Z return mod(**inputs) 2025-08-14T21:42:54.6820583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6820647Z outputs = self.mobilebert( 2025-08-14T21:42:54.6820900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6820975Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6821228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6821298Z layer_outputs = layer_module( 2025-08-14T21:42:54.6821549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6821694Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6821960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6822056Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6822311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.6822397Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.6822651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6822748Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6822751Z 2025-08-14T21:42:54.6822843Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6823023Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6823090Z return mod(**inputs) 2025-08-14T21:42:54.6823344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6823412Z outputs = self.mobilebert( 2025-08-14T21:42:54.6823683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6823767Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6824027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6824106Z layer_outputs = layer_module( 2025-08-14T21:42:54.6824365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6824449Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6824785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6824869Z self_outputs = self.self( 2025-08-14T21:42:54.6825127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.6825191Z self.query(query_tensor) 2025-08-14T21:42:54.6825197Z 2025-08-14T21:42:54.6825300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6825478Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6825545Z return mod(**inputs) 2025-08-14T21:42:54.6825801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6825866Z outputs = self.mobilebert( 2025-08-14T21:42:54.6826129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6826195Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6826450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6826522Z layer_outputs = layer_module( 2025-08-14T21:42:54.6826778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6826861Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6827117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6827181Z self_outputs = self.self( 2025-08-14T21:42:54.6827444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.6827506Z self.key(key_tensor) 2025-08-14T21:42:54.6827510Z 2025-08-14T21:42:54.6827593Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6827664Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6827756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6827940Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6827999Z return mod(**inputs) 2025-08-14T21:42:54.6828255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6828326Z outputs = self.mobilebert( 2025-08-14T21:42:54.6828579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6828650Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6828903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6828964Z layer_outputs = layer_module( 2025-08-14T21:42:54.6829222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6829315Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6829573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6829710Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6829978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.6830059Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6830062Z 2025-08-14T21:42:54.6830152Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6830342Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6830409Z return mod(**inputs) 2025-08-14T21:42:54.6830664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6830734Z outputs = self.mobilebert( 2025-08-14T21:42:54.6830988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6831054Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6831314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6831379Z layer_outputs = layer_module( 2025-08-14T21:42:54.6831631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6831715Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6831969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6832085Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6832342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.6832456Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6832719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6832805Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6832808Z 2025-08-14T21:42:54.6832906Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6833086Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6833146Z return mod(**inputs) 2025-08-14T21:42:54.6833407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6833470Z outputs = self.mobilebert( 2025-08-14T21:42:54.6833731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6833799Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6834053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6834125Z layer_outputs = layer_module( 2025-08-14T21:42:54.6834379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6834465Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6834727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6834830Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6835129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6835226Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6835229Z 2025-08-14T21:42:54.6835322Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6835507Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6835587Z return mod(**inputs) 2025-08-14T21:42:54.6835854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6835917Z outputs = self.mobilebert( 2025-08-14T21:42:54.6836191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6836265Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6836523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6836585Z layer_outputs = layer_module( 2025-08-14T21:42:54.6836851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6836935Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6837198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6837297Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6837552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6837660Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6837664Z 2025-08-14T21:42:54.6837758Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6837947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6838009Z return mod(**inputs) 2025-08-14T21:42:54.6838266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6838337Z outputs = self.mobilebert( 2025-08-14T21:42:54.6838592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6838656Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6838915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6838978Z layer_outputs = layer_module( 2025-08-14T21:42:54.6839236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6839320Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6839576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6839700Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6839955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6840039Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6840042Z 2025-08-14T21:42:54.6840133Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6840312Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6840377Z return mod(**inputs) 2025-08-14T21:42:54.6840630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6840693Z outputs = self.mobilebert( 2025-08-14T21:42:54.6840966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6841047Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6841312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6841400Z layer_outputs = layer_module( 2025-08-14T21:42:54.6841656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6841747Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6842016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6842135Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6842396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6842507Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6842767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6842849Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6842852Z 2025-08-14T21:42:54.6842949Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6843128Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6843188Z return mod(**inputs) 2025-08-14T21:42:54.6843452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6843513Z outputs = self.mobilebert( 2025-08-14T21:42:54.6843770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6843843Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6844100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6844169Z layer_outputs = layer_module( 2025-08-14T21:42:54.6844422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6844503Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6844765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6844864Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6845124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6845199Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6845202Z 2025-08-14T21:42:54.6845293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6845477Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6845536Z return mod(**inputs) 2025-08-14T21:42:54.6845789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6845861Z outputs = self.mobilebert( 2025-08-14T21:42:54.6846117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6846189Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6846444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6846523Z layer_outputs = layer_module( 2025-08-14T21:42:54.6846801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6846885Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6847160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6847259Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6847524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6847633Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6847637Z 2025-08-14T21:42:54.6847729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6847907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6847973Z return mod(**inputs) 2025-08-14T21:42:54.6848228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6848298Z outputs = self.mobilebert( 2025-08-14T21:42:54.6848555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6848618Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6848878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6848941Z layer_outputs = layer_module( 2025-08-14T21:42:54.6849201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6849283Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6849538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6849657Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6849910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6849988Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6849999Z 2025-08-14T21:42:54.6850092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6850273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6850339Z return mod(**inputs) 2025-08-14T21:42:54.6850592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6850654Z outputs = self.mobilebert( 2025-08-14T21:42:54.6850914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6850979Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6851241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6851305Z layer_outputs = layer_module( 2025-08-14T21:42:54.6851556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6851648Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6851900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6852010Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6852286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6852417Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6852681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6852783Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6852786Z 2025-08-14T21:42:54.6852878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6853065Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6853140Z return mod(**inputs) 2025-08-14T21:42:54.6853408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6853470Z outputs = self.mobilebert( 2025-08-14T21:42:54.6853726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6853800Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6854056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6854120Z layer_outputs = layer_module( 2025-08-14T21:42:54.6854381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6854463Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6854724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6854824Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6855081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6855164Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6855169Z 2025-08-14T21:42:54.6855263Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6855448Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6855509Z return mod(**inputs) 2025-08-14T21:42:54.6855762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6855832Z outputs = self.mobilebert( 2025-08-14T21:42:54.6856089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6856153Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6856414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6856477Z layer_outputs = layer_module( 2025-08-14T21:42:54.6856738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6856819Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6857075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6857181Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6857436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6857541Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6857544Z 2025-08-14T21:42:54.6857636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6857831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6857897Z return mod(**inputs) 2025-08-14T21:42:54.6858170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6858232Z outputs = self.mobilebert( 2025-08-14T21:42:54.6858511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6858575Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6858834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6858910Z layer_outputs = layer_module( 2025-08-14T21:42:54.6859167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6859256Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6859512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6859627Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6859884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6859960Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6859963Z 2025-08-14T21:42:54.6860061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6860240Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6860304Z return mod(**inputs) 2025-08-14T21:42:54.6860561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6860622Z outputs = self.mobilebert( 2025-08-14T21:42:54.6860887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6860952Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6861206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6861277Z layer_outputs = layer_module( 2025-08-14T21:42:54.6861532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6861621Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6861876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6861986Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6862252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6862361Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6862627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6862710Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6862713Z 2025-08-14T21:42:54.6862804Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6862992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6863050Z return mod(**inputs) 2025-08-14T21:42:54.6863306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6863378Z outputs = self.mobilebert( 2025-08-14T21:42:54.6863651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6863752Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6864008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6864091Z layer_outputs = layer_module( 2025-08-14T21:42:54.6864351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6864459Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6864800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6864886Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6864890Z 2025-08-14T21:42:54.6864983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6865173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6865234Z return mod(**inputs) 2025-08-14T21:42:54.6865489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6865562Z outputs = self.mobilebert( 2025-08-14T21:42:54.6865816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6865891Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6866150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6866215Z layer_outputs = layer_module( 2025-08-14T21:42:54.6866478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6866587Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6866852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6866954Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6866958Z 2025-08-14T21:42:54.6867052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6867244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6867305Z return mod(**inputs) 2025-08-14T21:42:54.6867564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6867638Z outputs = self.mobilebert( 2025-08-14T21:42:54.6867895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6867968Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6868225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6868288Z layer_outputs = layer_module( 2025-08-14T21:42:54.6868550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6868696Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6868957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.6869043Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.6869046Z 2025-08-14T21:42:54.6869138Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6869323Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6869399Z return mod(**inputs) 2025-08-14T21:42:54.6869671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6869741Z outputs = self.mobilebert( 2025-08-14T21:42:54.6870016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6870088Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6870345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6870420Z layer_outputs = layer_module( 2025-08-14T21:42:54.6870684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6870828Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6871090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.6871203Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.6871456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6871546Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6871549Z 2025-08-14T21:42:54.6871641Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6871827Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6871884Z return mod(**inputs) 2025-08-14T21:42:54.6872138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6872208Z outputs = self.mobilebert( 2025-08-14T21:42:54.6872464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6872531Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6872791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6872854Z layer_outputs = layer_module( 2025-08-14T21:42:54.6873112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6873255Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6873507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6873624Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6873880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.6873963Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6873967Z 2025-08-14T21:42:54.6874056Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6874236Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6874302Z return mod(**inputs) 2025-08-14T21:42:54.6874551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6874615Z outputs = self.mobilebert( 2025-08-14T21:42:54.6874875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6874938Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6875211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6875296Z layer_outputs = layer_module( 2025-08-14T21:42:54.6875549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6875715Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6875974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6876089Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6876358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.6876469Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6876729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6876811Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6876814Z 2025-08-14T21:42:54.6876912Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6877090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6877148Z return mod(**inputs) 2025-08-14T21:42:54.6877404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6877466Z outputs = self.mobilebert( 2025-08-14T21:42:54.6877719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6877791Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6878046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6878118Z layer_outputs = layer_module( 2025-08-14T21:42:54.6878368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6878515Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6878777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6878875Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6879134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6879206Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6879209Z 2025-08-14T21:42:54.6879300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6879486Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6879544Z return mod(**inputs) 2025-08-14T21:42:54.6879796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6879866Z outputs = self.mobilebert( 2025-08-14T21:42:54.6880119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6880190Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6880442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6880505Z layer_outputs = layer_module( 2025-08-14T21:42:54.6880779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6880857Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6881136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6881200Z self_outputs = self.self( 2025-08-14T21:42:54.6881470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.6881542Z self.value(value_tensor) 2025-08-14T21:42:54.6881545Z 2025-08-14T21:42:54.6881637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6881831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6881905Z return mod(**inputs) 2025-08-14T21:42:54.6882159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6882229Z outputs = self.mobilebert( 2025-08-14T21:42:54.6882482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6882547Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6882807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6882870Z layer_outputs = layer_module( 2025-08-14T21:42:54.6883129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6883273Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6883527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.6883633Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.6883887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6883963Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6883973Z 2025-08-14T21:42:54.6884065Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6884244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6884309Z return mod(**inputs) 2025-08-14T21:42:54.6884560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6884762Z outputs = self.mobilebert( 2025-08-14T21:42:54.6885032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6885097Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6885359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6885424Z layer_outputs = layer_module( 2025-08-14T21:42:54.6885680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6885833Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6886089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6886187Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6886452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.6886530Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.6886842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6886951Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6886954Z 2025-08-14T21:42:54.6887046Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6887257Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6887317Z return mod(**inputs) 2025-08-14T21:42:54.6887580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6887643Z outputs = self.mobilebert( 2025-08-14T21:42:54.6887920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6887994Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6888248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6888313Z layer_outputs = layer_module( 2025-08-14T21:42:54.6888571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6888648Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6888907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6888968Z self_outputs = self.self( 2025-08-14T21:42:54.6889220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.6889291Z self.query(query_tensor) 2025-08-14T21:42:54.6889295Z 2025-08-14T21:42:54.6889386Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6889571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6889630Z return mod(**inputs) 2025-08-14T21:42:54.6889882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6889949Z outputs = self.mobilebert( 2025-08-14T21:42:54.6890200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6890265Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6890522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6890584Z layer_outputs = layer_module( 2025-08-14T21:42:54.6890841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6890915Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6891168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6891240Z self_outputs = self.self( 2025-08-14T21:42:54.6891492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.6891561Z self.key(key_tensor) 2025-08-14T21:42:54.6891564Z 2025-08-14T21:42:54.6891638Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6891708Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6891805Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6891982Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6892039Z return mod(**inputs) 2025-08-14T21:42:54.6892298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6892376Z outputs = self.mobilebert( 2025-08-14T21:42:54.6892650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6892715Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6892985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6893055Z layer_outputs = layer_module( 2025-08-14T21:42:54.6893305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6893394Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6893653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6893764Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6894023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.6894099Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6894102Z 2025-08-14T21:42:54.6894193Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6894380Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6894438Z return mod(**inputs) 2025-08-14T21:42:54.6894697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6894760Z outputs = self.mobilebert( 2025-08-14T21:42:54.6895013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6895086Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6895338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6895404Z layer_outputs = layer_module( 2025-08-14T21:42:54.6895661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6895737Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6895996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6896104Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6896356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.6896476Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6896729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6896819Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6896822Z 2025-08-14T21:42:54.6896914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6897092Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6897159Z return mod(**inputs) 2025-08-14T21:42:54.6897411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6897481Z outputs = self.mobilebert( 2025-08-14T21:42:54.6897732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6897797Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6898071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6898150Z layer_outputs = layer_module( 2025-08-14T21:42:54.6898404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6898498Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6898773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6898881Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6899153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6899231Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6899234Z 2025-08-14T21:42:54.6899333Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6899514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6899581Z return mod(**inputs) 2025-08-14T21:42:54.6899836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6899902Z outputs = self.mobilebert( 2025-08-14T21:42:54.6900164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6900228Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6900486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6900556Z layer_outputs = layer_module( 2025-08-14T21:42:54.6900811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6900902Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6901155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6901255Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6901515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6901616Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6901619Z 2025-08-14T21:42:54.6901716Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6901895Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6901952Z return mod(**inputs) 2025-08-14T21:42:54.6902216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6902280Z outputs = self.mobilebert( 2025-08-14T21:42:54.6902532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6902605Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6902859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6902930Z layer_outputs = layer_module( 2025-08-14T21:42:54.6903184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6903269Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6903528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6903639Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6903919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6904009Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6904012Z 2025-08-14T21:42:54.6904102Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6904300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6904359Z return mod(**inputs) 2025-08-14T21:42:54.6904610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6904749Z outputs = self.mobilebert( 2025-08-14T21:42:54.6905016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6905087Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6905340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6905403Z layer_outputs = layer_module( 2025-08-14T21:42:54.6905662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6905746Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6906005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6906117Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6906374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6906490Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6906746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6906829Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6906840Z 2025-08-14T21:42:54.6906934Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6907114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6907182Z return mod(**inputs) 2025-08-14T21:42:54.6907437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6907499Z outputs = self.mobilebert( 2025-08-14T21:42:54.6907763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6907828Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6908092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6908157Z layer_outputs = layer_module( 2025-08-14T21:42:54.6908411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6908503Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6908760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6908858Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6909120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6909194Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6909197Z 2025-08-14T21:42:54.6909296Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6909490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6909564Z return mod(**inputs) 2025-08-14T21:42:54.6909834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6909898Z outputs = self.mobilebert( 2025-08-14T21:42:54.6910179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6910244Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6910515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6910589Z layer_outputs = layer_module( 2025-08-14T21:42:54.6910843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6910927Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6911187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6911286Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6911550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6911653Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6911656Z 2025-08-14T21:42:54.6911749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6911938Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6911996Z return mod(**inputs) 2025-08-14T21:42:54.6912256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6912320Z outputs = self.mobilebert( 2025-08-14T21:42:54.6912572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6912646Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6912900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6912963Z layer_outputs = layer_module( 2025-08-14T21:42:54.6913223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6913306Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6913567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6913679Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6913934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6914018Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6914021Z 2025-08-14T21:42:54.6914112Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6914300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6914357Z return mod(**inputs) 2025-08-14T21:42:54.6914610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6914678Z outputs = self.mobilebert( 2025-08-14T21:42:54.6914933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6915006Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6915274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6915353Z layer_outputs = layer_module( 2025-08-14T21:42:54.6915616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6915717Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6915967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6916086Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6916350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6916469Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6916726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6916809Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6916813Z 2025-08-14T21:42:54.6916912Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6917089Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6917155Z return mod(**inputs) 2025-08-14T21:42:54.6917407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6917470Z outputs = self.mobilebert( 2025-08-14T21:42:54.6917731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6917795Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6918049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6918118Z layer_outputs = layer_module( 2025-08-14T21:42:54.6918371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6918458Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6918713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6918811Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6919072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6919146Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6919149Z 2025-08-14T21:42:54.6919248Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6919427Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6919487Z return mod(**inputs) 2025-08-14T21:42:54.6919747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6919809Z outputs = self.mobilebert( 2025-08-14T21:42:54.6920062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6920134Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6920383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6920452Z layer_outputs = layer_module( 2025-08-14T21:42:54.6920701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6920783Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6921051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6921166Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6921428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6921541Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6921544Z 2025-08-14T21:42:54.6921637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6921838Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6921897Z return mod(**inputs) 2025-08-14T21:42:54.6922151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6922219Z outputs = self.mobilebert( 2025-08-14T21:42:54.6922480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6922554Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6922807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6922871Z layer_outputs = layer_module( 2025-08-14T21:42:54.6923129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6923211Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6923472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6923583Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6923838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6923923Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6923926Z 2025-08-14T21:42:54.6924018Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6924205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6924262Z return mod(**inputs) 2025-08-14T21:42:54.6924512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6924581Z outputs = self.mobilebert( 2025-08-14T21:42:54.6924831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6924897Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6925160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6925224Z layer_outputs = layer_module( 2025-08-14T21:42:54.6925480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6925565Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6925817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6925936Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6926191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6926307Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6926578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6926676Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6926680Z 2025-08-14T21:42:54.6926780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6926959Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6927043Z return mod(**inputs) 2025-08-14T21:42:54.6927311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6927372Z outputs = self.mobilebert( 2025-08-14T21:42:54.6927652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6927719Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6927973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6928043Z layer_outputs = layer_module( 2025-08-14T21:42:54.6928299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6928415Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6928673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6928747Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6928750Z 2025-08-14T21:42:54.6928849Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6929028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6929086Z return mod(**inputs) 2025-08-14T21:42:54.6929350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6929412Z outputs = self.mobilebert( 2025-08-14T21:42:54.6929671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6929736Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6929991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6930060Z layer_outputs = layer_module( 2025-08-14T21:42:54.6930315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6930431Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6930685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6930785Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6930788Z 2025-08-14T21:42:54.6930914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6931093Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6931152Z return mod(**inputs) 2025-08-14T21:42:54.6931412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6931476Z outputs = self.mobilebert( 2025-08-14T21:42:54.6931732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6931796Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6932047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6932114Z layer_outputs = layer_module( 2025-08-14T21:42:54.6932382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6932548Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6932802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.6932906Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.6932910Z 2025-08-14T21:42:54.6933008Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6933200Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6933260Z return mod(**inputs) 2025-08-14T21:42:54.6933517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6933578Z outputs = self.mobilebert( 2025-08-14T21:42:54.6933838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6933904Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6934156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6934227Z layer_outputs = layer_module( 2025-08-14T21:42:54.6934480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6934628Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6934883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.6934994Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.6935254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6935339Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6935342Z 2025-08-14T21:42:54.6935439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6935619Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6935675Z return mod(**inputs) 2025-08-14T21:42:54.6935934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6935998Z outputs = self.mobilebert( 2025-08-14T21:42:54.6936252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6936323Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6936574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6936645Z layer_outputs = layer_module( 2025-08-14T21:42:54.6936896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6937039Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6937300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6937410Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6937671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.6937747Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6937750Z 2025-08-14T21:42:54.6937861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6938051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6938128Z return mod(**inputs) 2025-08-14T21:42:54.6938386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6938474Z outputs = self.mobilebert( 2025-08-14T21:42:54.6938730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6938804Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6939071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6939136Z layer_outputs = layer_module( 2025-08-14T21:42:54.6939397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6939539Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6939800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.6939911Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.6940163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.6940280Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6940534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6940619Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6940629Z 2025-08-14T21:42:54.6940722Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6940905Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6940977Z return mod(**inputs) 2025-08-14T21:42:54.6941229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6941293Z outputs = self.mobilebert( 2025-08-14T21:42:54.6941554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6941618Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6941877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6941940Z layer_outputs = layer_module( 2025-08-14T21:42:54.6942193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6942347Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6942601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6942707Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6942959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6943032Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6943035Z 2025-08-14T21:42:54.6943135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6943312Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6943369Z return mod(**inputs) 2025-08-14T21:42:54.6943644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6943726Z outputs = self.mobilebert( 2025-08-14T21:42:54.6943993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6944055Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6944329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6944399Z layer_outputs = layer_module( 2025-08-14T21:42:54.6944666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6944814Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6945075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6945142Z self_outputs = self.self( 2025-08-14T21:42:54.6945407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.6945473Z self.value(value_tensor) 2025-08-14T21:42:54.6945477Z 2025-08-14T21:42:54.6945571Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6945760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6945819Z return mod(**inputs) 2025-08-14T21:42:54.6946081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6946144Z outputs = self.mobilebert( 2025-08-14T21:42:54.6946398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6946472Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6946726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6946796Z layer_outputs = layer_module( 2025-08-14T21:42:54.6947047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6947193Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6947456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.6947555Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.6947808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.6947888Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.6947891Z 2025-08-14T21:42:54.6947983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6948170Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6948229Z return mod(**inputs) 2025-08-14T21:42:54.6948480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6948553Z outputs = self.mobilebert( 2025-08-14T21:42:54.6948805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6948875Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6949126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6949189Z layer_outputs = layer_module( 2025-08-14T21:42:54.6949466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.6949634Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.6949887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.6950006Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.6950260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.6950345Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.6950612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6950696Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6950699Z 2025-08-14T21:42:54.6950797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6950979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6951046Z return mod(**inputs) 2025-08-14T21:42:54.6951299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6951361Z outputs = self.mobilebert( 2025-08-14T21:42:54.6951618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6951681Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6951935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6952003Z layer_outputs = layer_module( 2025-08-14T21:42:54.6952255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6952342Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6952595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6952657Z self_outputs = self.self( 2025-08-14T21:42:54.6952920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.6952983Z self.query(query_tensor) 2025-08-14T21:42:54.6952987Z 2025-08-14T21:42:54.6953084Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6953263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6953323Z return mod(**inputs) 2025-08-14T21:42:54.6953581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6953646Z outputs = self.mobilebert( 2025-08-14T21:42:54.6953898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6953972Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6954225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6954294Z layer_outputs = layer_module( 2025-08-14T21:42:54.6954547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6954624Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6954883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.6954945Z self_outputs = self.self( 2025-08-14T21:42:54.6955219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.6955296Z self.key(key_tensor) 2025-08-14T21:42:54.6955299Z 2025-08-14T21:42:54.6955373Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6955453Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.6955562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6955740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6955805Z return mod(**inputs) 2025-08-14T21:42:54.6956056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6956142Z outputs = self.mobilebert( 2025-08-14T21:42:54.6956400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6956464Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6956725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6956789Z layer_outputs = layer_module( 2025-08-14T21:42:54.6957041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6957126Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6957378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6957495Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6957751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.6957827Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6957830Z 2025-08-14T21:42:54.6957929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6958110Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6958175Z return mod(**inputs) 2025-08-14T21:42:54.6958429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6958493Z outputs = self.mobilebert( 2025-08-14T21:42:54.6958754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6958820Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6959073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6959142Z layer_outputs = layer_module( 2025-08-14T21:42:54.6959396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.6959478Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.6959729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.6959841Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.6960101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.6960212Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6960474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6960558Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6960562Z 2025-08-14T21:42:54.6960651Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6960850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6960927Z return mod(**inputs) 2025-08-14T21:42:54.6961180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6961267Z outputs = self.mobilebert( 2025-08-14T21:42:54.6961523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6961593Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6961859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6961924Z layer_outputs = layer_module( 2025-08-14T21:42:54.6962185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6962272Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6962535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6962635Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6962890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6962973Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6962977Z 2025-08-14T21:42:54.6963068Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6963250Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6963307Z return mod(**inputs) 2025-08-14T21:42:54.6963558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6963627Z outputs = self.mobilebert( 2025-08-14T21:42:54.6963877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6963941Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6964199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6964262Z layer_outputs = layer_module( 2025-08-14T21:42:54.6964519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6964606Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6964855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6964962Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6965216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6965323Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6965326Z 2025-08-14T21:42:54.6965418Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6965595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6965660Z return mod(**inputs) 2025-08-14T21:42:54.6965912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6965976Z outputs = self.mobilebert( 2025-08-14T21:42:54.6966237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6966301Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6966577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6966657Z layer_outputs = layer_module( 2025-08-14T21:42:54.6966916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6967024Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6967276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6967398Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6967675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6967755Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6967758Z 2025-08-14T21:42:54.6967872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6968050Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6968109Z return mod(**inputs) 2025-08-14T21:42:54.6968368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6968432Z outputs = self.mobilebert( 2025-08-14T21:42:54.6968693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6968757Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6969010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6969079Z layer_outputs = layer_module( 2025-08-14T21:42:54.6969331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6969422Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6969677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6969788Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6970049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6970157Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6970409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6970498Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6970502Z 2025-08-14T21:42:54.6970593Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6970802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6970862Z return mod(**inputs) 2025-08-14T21:42:54.6971117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6971188Z outputs = self.mobilebert( 2025-08-14T21:42:54.6971439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6971510Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6971762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6971825Z layer_outputs = layer_module( 2025-08-14T21:42:54.6972083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6972192Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6972462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6972568Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6972836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6972917Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6972920Z 2025-08-14T21:42:54.6973011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6973202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6973272Z return mod(**inputs) 2025-08-14T21:42:54.6973529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6973599Z outputs = self.mobilebert( 2025-08-14T21:42:54.6973859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6973925Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6974188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6974251Z layer_outputs = layer_module( 2025-08-14T21:42:54.6974508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6974599Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6974857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6974964Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6975224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6975326Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6975329Z 2025-08-14T21:42:54.6975426Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6975608Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6975672Z return mod(**inputs) 2025-08-14T21:42:54.6975926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6975989Z outputs = self.mobilebert( 2025-08-14T21:42:54.6976250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6976312Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6976566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6976636Z layer_outputs = layer_module( 2025-08-14T21:42:54.6976891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6976980Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6977236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6977345Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6977610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6977684Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6977687Z 2025-08-14T21:42:54.6977801Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6977980Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6978059Z return mod(**inputs) 2025-08-14T21:42:54.6978322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6978401Z outputs = self.mobilebert( 2025-08-14T21:42:54.6978664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6978730Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6979003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6979075Z layer_outputs = layer_module( 2025-08-14T21:42:54.6979334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6979418Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6979684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6979796Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6980061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6980172Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6980429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6980520Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6980524Z 2025-08-14T21:42:54.6980618Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6980808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6980870Z return mod(**inputs) 2025-08-14T21:42:54.6981132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6981207Z outputs = self.mobilebert( 2025-08-14T21:42:54.6981476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6981553Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6981818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6981882Z layer_outputs = layer_module( 2025-08-14T21:42:54.6982149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6982237Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6982504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6982616Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6982883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6982968Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6982971Z 2025-08-14T21:42:54.6983067Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6983254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6983321Z return mod(**inputs) 2025-08-14T21:42:54.6983585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6983666Z outputs = self.mobilebert( 2025-08-14T21:42:54.6983934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6984018Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6984285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6984365Z layer_outputs = layer_module( 2025-08-14T21:42:54.6984807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6984949Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6985213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.6985324Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.6985589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6985692Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6985695Z 2025-08-14T21:42:54.6985798Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6985983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6986044Z return mod(**inputs) 2025-08-14T21:42:54.6986311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6986378Z outputs = self.mobilebert( 2025-08-14T21:42:54.6986645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6986711Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6986970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6987045Z layer_outputs = layer_module( 2025-08-14T21:42:54.6987309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6987406Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6987670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6987785Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6988054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.6988132Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.6988135Z 2025-08-14T21:42:54.6988237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6988421Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6988483Z return mod(**inputs) 2025-08-14T21:42:54.6988752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6988818Z outputs = self.mobilebert( 2025-08-14T21:42:54.6989112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6989185Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6989445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6989516Z layer_outputs = layer_module( 2025-08-14T21:42:54.6989777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.6989888Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.6990174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.6990290Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.6990584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.6990695Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.6990978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6991072Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6991075Z 2025-08-14T21:42:54.6991171Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6991359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6991428Z return mod(**inputs) 2025-08-14T21:42:54.6991689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6991762Z outputs = self.mobilebert( 2025-08-14T21:42:54.6992021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6992086Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6992358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6992423Z layer_outputs = layer_module( 2025-08-14T21:42:54.6992689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6992802Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6993065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.6993146Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.6993149Z 2025-08-14T21:42:54.6993243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6993430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6993495Z return mod(**inputs) 2025-08-14T21:42:54.6993754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6993824Z outputs = self.mobilebert( 2025-08-14T21:42:54.6994083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6994146Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6994416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6994481Z layer_outputs = layer_module( 2025-08-14T21:42:54.6994797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.6994903Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.6995154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.6995259Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.6995262Z 2025-08-14T21:42:54.6995355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6995532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6995614Z return mod(**inputs) 2025-08-14T21:42:54.6995872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6995960Z outputs = self.mobilebert( 2025-08-14T21:42:54.6996217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6996298Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6996564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6996626Z layer_outputs = layer_module( 2025-08-14T21:42:54.6996903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6997050Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6997311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.6997407Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.6997410Z 2025-08-14T21:42:54.6997502Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.6997686Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.6997751Z return mod(**inputs) 2025-08-14T21:42:54.6998010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.6998079Z outputs = self.mobilebert( 2025-08-14T21:42:54.6998341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.6998406Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.6998672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.6998737Z layer_outputs = layer_module( 2025-08-14T21:42:54.6999001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.6999147Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.6999406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.6999524Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.6999782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.6999870Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.6999873Z 2025-08-14T21:42:54.6999970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7000151Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7000219Z return mod(**inputs) 2025-08-14T21:42:54.7000477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7000547Z outputs = self.mobilebert( 2025-08-14T21:42:54.7000812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7000876Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7001140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7001203Z layer_outputs = layer_module( 2025-08-14T21:42:54.7001474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7001627Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7001899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7002034Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7002286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7002362Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7002365Z 2025-08-14T21:42:54.7002477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7002658Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7002718Z return mod(**inputs) 2025-08-14T21:42:54.7002982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7003046Z outputs = self.mobilebert( 2025-08-14T21:42:54.7003306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7003372Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7003624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7003694Z layer_outputs = layer_module( 2025-08-14T21:42:54.7003947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7004095Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7004352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7004462Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7004725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7004834Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7005090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7005178Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7005181Z 2025-08-14T21:42:54.7005274Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7005460Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7005519Z return mod(**inputs) 2025-08-14T21:42:54.7005774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7005845Z outputs = self.mobilebert( 2025-08-14T21:42:54.7006099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7006172Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7006427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7006490Z layer_outputs = layer_module( 2025-08-14T21:42:54.7006749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7006893Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7007151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7007265Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7007533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7007615Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7007632Z 2025-08-14T21:42:54.7007725Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7007902Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7007968Z return mod(**inputs) 2025-08-14T21:42:54.7008236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7034234Z outputs = self.mobilebert( 2025-08-14T21:42:54.7034625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7034722Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7035011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7035082Z layer_outputs = layer_module( 2025-08-14T21:42:54.7035354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7035441Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7035701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7035782Z self_outputs = self.self( 2025-08-14T21:42:54.7036038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7036104Z self.value(value_tensor) 2025-08-14T21:42:54.7036112Z 2025-08-14T21:42:54.7036228Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7036423Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7036494Z return mod(**inputs) 2025-08-14T21:42:54.7036753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7036826Z outputs = self.mobilebert( 2025-08-14T21:42:54.7037089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7037159Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7037417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7037491Z layer_outputs = layer_module( 2025-08-14T21:42:54.7037747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7037910Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7038167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7038274Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7038540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7038617Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7038621Z 2025-08-14T21:42:54.7038726Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7038916Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7038979Z return mod(**inputs) 2025-08-14T21:42:54.7039324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7039422Z outputs = self.mobilebert( 2025-08-14T21:42:54.7039684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7039789Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7040044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7040118Z layer_outputs = layer_module( 2025-08-14T21:42:54.7040399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7040549Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7040813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7040918Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7041184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7041268Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7041523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7041617Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7041621Z 2025-08-14T21:42:54.7041718Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7041910Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7041971Z return mod(**inputs) 2025-08-14T21:42:54.7042227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7042305Z outputs = self.mobilebert( 2025-08-14T21:42:54.7042570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7042637Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7042914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7042979Z layer_outputs = layer_module( 2025-08-14T21:42:54.7043239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7043321Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7043575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7043649Z self_outputs = self.self( 2025-08-14T21:42:54.7043906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7043979Z self.query(query_tensor) 2025-08-14T21:42:54.7043983Z 2025-08-14T21:42:54.7044076Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7044260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7044327Z return mod(**inputs) 2025-08-14T21:42:54.7044647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7044716Z outputs = self.mobilebert( 2025-08-14T21:42:54.7044984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7045052Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7045339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7045419Z layer_outputs = layer_module( 2025-08-14T21:42:54.7045684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7045793Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7046056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7046128Z self_outputs = self.self( 2025-08-14T21:42:54.7046402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7046466Z self.key(key_tensor) 2025-08-14T21:42:54.7046470Z 2025-08-14T21:42:54.7046556Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7046630Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7046728Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7046925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7046984Z return mod(**inputs) 2025-08-14T21:42:54.7047255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7047322Z outputs = self.mobilebert( 2025-08-14T21:42:54.7047587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7047662Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7047927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7047992Z layer_outputs = layer_module( 2025-08-14T21:42:54.7048266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7048345Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7048615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7048730Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7048994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7049079Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7049082Z 2025-08-14T21:42:54.7049177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7049369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7049428Z return mod(**inputs) 2025-08-14T21:42:54.7049693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7049767Z outputs = self.mobilebert( 2025-08-14T21:42:54.7050033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7050102Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7050372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7050438Z layer_outputs = layer_module( 2025-08-14T21:42:54.7050711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7050788Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7051052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7051187Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7051466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7051595Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7051873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7051963Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7051967Z 2025-08-14T21:42:54.7052071Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7052273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7052336Z return mod(**inputs) 2025-08-14T21:42:54.7052612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7052679Z outputs = self.mobilebert( 2025-08-14T21:42:54.7052950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7053019Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7053287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7053361Z layer_outputs = layer_module( 2025-08-14T21:42:54.7053629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7053728Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7053992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7054099Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7054377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7054458Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7054461Z 2025-08-14T21:42:54.7054564Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7054757Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7054817Z return mod(**inputs) 2025-08-14T21:42:54.7055088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7055156Z outputs = self.mobilebert( 2025-08-14T21:42:54.7055422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7055496Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7055763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7055838Z layer_outputs = layer_module( 2025-08-14T21:42:54.7056110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7056196Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7056460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7056561Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7056820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7056929Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7056933Z 2025-08-14T21:42:54.7057038Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7057240Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7057299Z return mod(**inputs) 2025-08-14T21:42:54.7057551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7057643Z outputs = self.mobilebert( 2025-08-14T21:42:54.7057895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7057964Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7058230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7058295Z layer_outputs = layer_module( 2025-08-14T21:42:54.7058555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7058642Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7058903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7059021Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7059273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7059355Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7059358Z 2025-08-14T21:42:54.7059451Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7059631Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7059697Z return mod(**inputs) 2025-08-14T21:42:54.7059953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7060026Z outputs = self.mobilebert( 2025-08-14T21:42:54.7060281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7060346Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7060606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7060670Z layer_outputs = layer_module( 2025-08-14T21:42:54.7060929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7061014Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7061268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7061390Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7061645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7061756Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7062016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7062099Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7062103Z 2025-08-14T21:42:54.7062200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7062383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7062442Z return mod(**inputs) 2025-08-14T21:42:54.7062717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7062784Z outputs = self.mobilebert( 2025-08-14T21:42:54.7063088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7063155Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7063429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7063498Z layer_outputs = layer_module( 2025-08-14T21:42:54.7063750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7063847Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7064107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7064205Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7064465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7064548Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7064552Z 2025-08-14T21:42:54.7064645Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7064925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7064991Z return mod(**inputs) 2025-08-14T21:42:54.7065261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7065330Z outputs = self.mobilebert( 2025-08-14T21:42:54.7065594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7065681Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7065938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7066004Z layer_outputs = layer_module( 2025-08-14T21:42:54.7066267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7066353Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7066618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7066715Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7066971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7067079Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7067082Z 2025-08-14T21:42:54.7067177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7067365Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7067426Z return mod(**inputs) 2025-08-14T21:42:54.7067683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7067758Z outputs = self.mobilebert( 2025-08-14T21:42:54.7068016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7068080Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7068347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7068410Z layer_outputs = layer_module( 2025-08-14T21:42:54.7068690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7068791Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7069052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7069425Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7069680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7069756Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7069760Z 2025-08-14T21:42:54.7069878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7070058Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7070117Z return mod(**inputs) 2025-08-14T21:42:54.7070381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7070447Z outputs = self.mobilebert( 2025-08-14T21:42:54.7070708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7070773Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7071030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7071105Z layer_outputs = layer_module( 2025-08-14T21:42:54.7071362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7071455Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7071708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7071821Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7072083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7072191Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7072446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7072535Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7072539Z 2025-08-14T21:42:54.7072629Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7072814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7072872Z return mod(**inputs) 2025-08-14T21:42:54.7073122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7073193Z outputs = self.mobilebert( 2025-08-14T21:42:54.7073447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7073519Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7073773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7073837Z layer_outputs = layer_module( 2025-08-14T21:42:54.7074096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7074179Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7074432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7074537Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7074803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7074899Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7074902Z 2025-08-14T21:42:54.7074994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7075187Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7075252Z return mod(**inputs) 2025-08-14T21:42:54.7075510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7075592Z outputs = self.mobilebert( 2025-08-14T21:42:54.7075843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7075906Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7076163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7076226Z layer_outputs = layer_module( 2025-08-14T21:42:54.7076475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7076563Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7076812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7076912Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7077164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7077261Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7077264Z 2025-08-14T21:42:54.7077360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7077536Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7077599Z return mod(**inputs) 2025-08-14T21:42:54.7077846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7077910Z outputs = self.mobilebert( 2025-08-14T21:42:54.7078165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7078228Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7078479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7078549Z layer_outputs = layer_module( 2025-08-14T21:42:54.7078801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7078890Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7079142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7079253Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7079511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7079587Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7079591Z 2025-08-14T21:42:54.7079688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7079862Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7079919Z return mod(**inputs) 2025-08-14T21:42:54.7080191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7080268Z outputs = self.mobilebert( 2025-08-14T21:42:54.7080520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7080589Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7080859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7080927Z layer_outputs = layer_module( 2025-08-14T21:42:54.7081193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7081279Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7081538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7081647Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7081903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7082009Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7082263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7082352Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7082355Z 2025-08-14T21:42:54.7082445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7082633Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7082691Z return mod(**inputs) 2025-08-14T21:42:54.7082943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7083015Z outputs = self.mobilebert( 2025-08-14T21:42:54.7083272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7083335Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7083593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7083656Z layer_outputs = layer_module( 2025-08-14T21:42:54.7083916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7084026Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7084278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7084359Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7084364Z 2025-08-14T21:42:54.7084454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7084858Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7084924Z return mod(**inputs) 2025-08-14T21:42:54.7085180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7085256Z outputs = self.mobilebert( 2025-08-14T21:42:54.7085508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7085576Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7085838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7085903Z layer_outputs = layer_module( 2025-08-14T21:42:54.7086230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7086364Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7086619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7086789Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7086793Z 2025-08-14T21:42:54.7086886Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7087074Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7087157Z return mod(**inputs) 2025-08-14T21:42:54.7087414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7087486Z outputs = self.mobilebert( 2025-08-14T21:42:54.7087741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7087806Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7088068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7088129Z layer_outputs = layer_module( 2025-08-14T21:42:54.7088388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7088534Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7088788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7088880Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7088883Z 2025-08-14T21:42:54.7088977Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7089161Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7089219Z return mod(**inputs) 2025-08-14T21:42:54.7089472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7089545Z outputs = self.mobilebert( 2025-08-14T21:42:54.7089795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7089859Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7090118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7090182Z layer_outputs = layer_module( 2025-08-14T21:42:54.7090441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7090585Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7090841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7090959Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7091212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7091302Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7091305Z 2025-08-14T21:42:54.7091397Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7091575Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7091641Z return mod(**inputs) 2025-08-14T21:42:54.7091909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7091994Z outputs = self.mobilebert( 2025-08-14T21:42:54.7092249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7092332Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7092590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7092653Z layer_outputs = layer_module( 2025-08-14T21:42:54.7092921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7093073Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7093326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7093446Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7093700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7093777Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7093782Z 2025-08-14T21:42:54.7093880Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7094058Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7094122Z return mod(**inputs) 2025-08-14T21:42:54.7094375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7094438Z outputs = self.mobilebert( 2025-08-14T21:42:54.7094692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7094768Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7095024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7095091Z layer_outputs = layer_module( 2025-08-14T21:42:54.7095345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7095488Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7095749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7095857Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7096116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7096224Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7096477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7096564Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7096569Z 2025-08-14T21:42:54.7096659Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7096844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7096902Z return mod(**inputs) 2025-08-14T21:42:54.7097155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7097225Z outputs = self.mobilebert( 2025-08-14T21:42:54.7097479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7097556Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7097834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7097897Z layer_outputs = layer_module( 2025-08-14T21:42:54.7098156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7098318Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7098570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7098688Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7098942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7099025Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7099028Z 2025-08-14T21:42:54.7099121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7099296Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7099361Z return mod(**inputs) 2025-08-14T21:42:54.7099614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7099675Z outputs = self.mobilebert( 2025-08-14T21:42:54.7099932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7099996Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7100253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7100314Z layer_outputs = layer_module( 2025-08-14T21:42:54.7100565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7100653Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7100903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7100970Z self_outputs = self.self( 2025-08-14T21:42:54.7101219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7101281Z self.value(value_tensor) 2025-08-14T21:42:54.7101284Z 2025-08-14T21:42:54.7101382Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7101561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7101617Z return mod(**inputs) 2025-08-14T21:42:54.7101879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7101942Z outputs = self.mobilebert( 2025-08-14T21:42:54.7102201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7102264Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7102515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7102582Z layer_outputs = layer_module( 2025-08-14T21:42:54.7102834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7102983Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7103247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7103347Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7103619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7103707Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7103711Z 2025-08-14T21:42:54.7103806Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7103984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7104041Z return mod(**inputs) 2025-08-14T21:42:54.7104327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7104391Z outputs = self.mobilebert( 2025-08-14T21:42:54.7104651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7104801Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7105065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7105134Z layer_outputs = layer_module( 2025-08-14T21:42:54.7105387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7105531Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7105790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7105887Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7106149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7106229Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7106486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7106574Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7106579Z 2025-08-14T21:42:54.7106671Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7106850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7106915Z return mod(**inputs) 2025-08-14T21:42:54.7107168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7107237Z outputs = self.mobilebert( 2025-08-14T21:42:54.7107486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7107552Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7107810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7107873Z layer_outputs = layer_module( 2025-08-14T21:42:54.7108132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7108211Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7108463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7108532Z self_outputs = self.self( 2025-08-14T21:42:54.7108785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7108850Z self.query(query_tensor) 2025-08-14T21:42:54.7108853Z 2025-08-14T21:42:54.7108969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7109151Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7109232Z return mod(**inputs) 2025-08-14T21:42:54.7109490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7109568Z outputs = self.mobilebert( 2025-08-14T21:42:54.7109829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7109892Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7110158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7110227Z layer_outputs = layer_module( 2025-08-14T21:42:54.7110479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7110560Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7110811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7110872Z self_outputs = self.self( 2025-08-14T21:42:54.7111132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7111191Z self.key(key_tensor) 2025-08-14T21:42:54.7111194Z 2025-08-14T21:42:54.7111272Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7111341Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7111433Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7111617Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7111675Z return mod(**inputs) 2025-08-14T21:42:54.7111928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7111997Z outputs = self.mobilebert( 2025-08-14T21:42:54.7112245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7112317Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7112567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7112627Z layer_outputs = layer_module( 2025-08-14T21:42:54.7112885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7112959Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7113213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7113331Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7113582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7113662Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7113667Z 2025-08-14T21:42:54.7113754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7113928Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7113992Z return mod(**inputs) 2025-08-14T21:42:54.7114244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7114313Z outputs = self.mobilebert( 2025-08-14T21:42:54.7114575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7114639Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7114914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7114975Z layer_outputs = layer_module( 2025-08-14T21:42:54.7115245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7115327Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7115577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7115704Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7115958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7116070Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7116332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7116414Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7116418Z 2025-08-14T21:42:54.7116516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7116691Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7116748Z return mod(**inputs) 2025-08-14T21:42:54.7117005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7117066Z outputs = self.mobilebert( 2025-08-14T21:42:54.7117323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7117390Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7117641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7117709Z layer_outputs = layer_module( 2025-08-14T21:42:54.7117962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7118048Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7118305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7118407Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7118665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7118738Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7118741Z 2025-08-14T21:42:54.7118833Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7119017Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7119074Z return mod(**inputs) 2025-08-14T21:42:54.7119330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7119393Z outputs = self.mobilebert( 2025-08-14T21:42:54.7119644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7119714Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7119966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7120030Z layer_outputs = layer_module( 2025-08-14T21:42:54.7120303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7120404Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7120662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7120777Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7121029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7121134Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7121137Z 2025-08-14T21:42:54.7121243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7121430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7121489Z return mod(**inputs) 2025-08-14T21:42:54.7121745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7121817Z outputs = self.mobilebert( 2025-08-14T21:42:54.7122071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7122135Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7122396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7122458Z layer_outputs = layer_module( 2025-08-14T21:42:54.7122717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7122801Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7123055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7123176Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7123432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7123513Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7123518Z 2025-08-14T21:42:54.7123607Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7123786Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7123851Z return mod(**inputs) 2025-08-14T21:42:54.7124105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7124166Z outputs = self.mobilebert( 2025-08-14T21:42:54.7124424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7124489Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7124750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7124812Z layer_outputs = layer_module( 2025-08-14T21:42:54.7125066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7125155Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7125407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7125526Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7125780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7125903Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7126166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7126264Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7126267Z 2025-08-14T21:42:54.7126388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7126568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7126626Z return mod(**inputs) 2025-08-14T21:42:54.7126903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7126968Z outputs = self.mobilebert( 2025-08-14T21:42:54.7127222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7127292Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7127547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7127618Z layer_outputs = layer_module( 2025-08-14T21:42:54.7127871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7127955Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7128214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7128314Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7128577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7128652Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7128655Z 2025-08-14T21:42:54.7128748Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7128934Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7128990Z return mod(**inputs) 2025-08-14T21:42:54.7129242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7129314Z outputs = self.mobilebert( 2025-08-14T21:42:54.7129567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7129637Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7129889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7129950Z layer_outputs = layer_module( 2025-08-14T21:42:54.7130212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7130296Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7130556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7130656Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7130911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7131015Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7131018Z 2025-08-14T21:42:54.7131110Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7131287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7131348Z return mod(**inputs) 2025-08-14T21:42:54.7131615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7131698Z outputs = self.mobilebert( 2025-08-14T21:42:54.7131951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7132035Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7132295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7132356Z layer_outputs = layer_module( 2025-08-14T21:42:54.7132631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7132717Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7132971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7133089Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7133343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7133416Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7133426Z 2025-08-14T21:42:54.7133517Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7133695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7133756Z return mod(**inputs) 2025-08-14T21:42:54.7134008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7134071Z outputs = self.mobilebert( 2025-08-14T21:42:54.7134334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7134400Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7134660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7134723Z layer_outputs = layer_module( 2025-08-14T21:42:54.7134974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7135065Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7135314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7135425Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7135684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7135795Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7136055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7136137Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7136141Z 2025-08-14T21:42:54.7136233Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7136423Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7136480Z return mod(**inputs) 2025-08-14T21:42:54.7136739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7136800Z outputs = self.mobilebert( 2025-08-14T21:42:54.7137052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7137122Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7137387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7137465Z layer_outputs = layer_module( 2025-08-14T21:42:54.7137727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7137829Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7138093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7138190Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7138460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7138544Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7138548Z 2025-08-14T21:42:54.7138640Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7138825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7138882Z return mod(**inputs) 2025-08-14T21:42:54.7139133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7139202Z outputs = self.mobilebert( 2025-08-14T21:42:54.7139452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7139512Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7139771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7139833Z layer_outputs = layer_module( 2025-08-14T21:42:54.7140092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7140175Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7140426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7140530Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7140786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7140887Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7140891Z 2025-08-14T21:42:54.7140983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7141159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7141223Z return mod(**inputs) 2025-08-14T21:42:54.7141479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7141548Z outputs = self.mobilebert( 2025-08-14T21:42:54.7141802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7141866Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7142127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7142188Z layer_outputs = layer_module( 2025-08-14T21:42:54.7142441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7142531Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7142783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7142917Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7143186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7143260Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7143277Z 2025-08-14T21:42:54.7143375Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7143556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7143617Z return mod(**inputs) 2025-08-14T21:42:54.7143899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7143966Z outputs = self.mobilebert( 2025-08-14T21:42:54.7144281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7144350Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7144619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7144762Z layer_outputs = layer_module( 2025-08-14T21:42:54.7145050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7145149Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7145422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7145542Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7145833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7145949Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7146227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7146324Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7146328Z 2025-08-14T21:42:54.7146419Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7146600Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7146656Z return mod(**inputs) 2025-08-14T21:42:54.7146933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7147008Z outputs = self.mobilebert( 2025-08-14T21:42:54.7147280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7147356Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7147633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7147701Z layer_outputs = layer_module( 2025-08-14T21:42:54.7147980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7148099Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7148377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7148457Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7148463Z 2025-08-14T21:42:54.7148559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7148756Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7148817Z return mod(**inputs) 2025-08-14T21:42:54.7149108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7149209Z outputs = self.mobilebert( 2025-08-14T21:42:54.7149484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7149576Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7149857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7149923Z layer_outputs = layer_module( 2025-08-14T21:42:54.7150227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7150342Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7150624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7150728Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7150731Z 2025-08-14T21:42:54.7150828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7151022Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7151086Z return mod(**inputs) 2025-08-14T21:42:54.7151360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7151433Z outputs = self.mobilebert( 2025-08-14T21:42:54.7151708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7151785Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7152060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7152126Z layer_outputs = layer_module( 2025-08-14T21:42:54.7152451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7152607Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7152888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7152978Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7152982Z 2025-08-14T21:42:54.7153078Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7153278Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7153340Z return mod(**inputs) 2025-08-14T21:42:54.7153618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7153687Z outputs = self.mobilebert( 2025-08-14T21:42:54.7153958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7154034Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7154307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7154373Z layer_outputs = layer_module( 2025-08-14T21:42:54.7154655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7154808Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7155087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7155222Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7155511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7155606Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7155623Z 2025-08-14T21:42:54.7155724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7155925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7155987Z return mod(**inputs) 2025-08-14T21:42:54.7156282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7156359Z outputs = self.mobilebert( 2025-08-14T21:42:54.7156637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7156708Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7156996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7157059Z layer_outputs = layer_module( 2025-08-14T21:42:54.7157327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7157469Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7157726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7157843Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7158097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7158182Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7158188Z 2025-08-14T21:42:54.7158278Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7158456Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7158520Z return mod(**inputs) 2025-08-14T21:42:54.7158775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7158838Z outputs = self.mobilebert( 2025-08-14T21:42:54.7159100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7159163Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7159421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7159483Z layer_outputs = layer_module( 2025-08-14T21:42:54.7159738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7159886Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7160137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7160256Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7160510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7160618Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7160877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7160972Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7160975Z 2025-08-14T21:42:54.7161088Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7161265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7161323Z return mod(**inputs) 2025-08-14T21:42:54.7161597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7161659Z outputs = self.mobilebert( 2025-08-14T21:42:54.7161912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7161997Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7162251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7162321Z layer_outputs = layer_module( 2025-08-14T21:42:54.7162578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7162727Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7162991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7163094Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7163354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7163429Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7163433Z 2025-08-14T21:42:54.7163524Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7163710Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7163770Z return mod(**inputs) 2025-08-14T21:42:54.7164020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7164090Z outputs = self.mobilebert( 2025-08-14T21:42:54.7164341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7164411Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7164663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7164725Z layer_outputs = layer_module( 2025-08-14T21:42:54.7164987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7165064Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7165322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7165387Z self_outputs = self.self( 2025-08-14T21:42:54.7165641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7165713Z self.value(value_tensor) 2025-08-14T21:42:54.7165716Z 2025-08-14T21:42:54.7165807Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7165986Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7166049Z return mod(**inputs) 2025-08-14T21:42:54.7166303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7166371Z outputs = self.mobilebert( 2025-08-14T21:42:54.7166638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7166704Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7166977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7167038Z layer_outputs = layer_module( 2025-08-14T21:42:54.7167315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7167459Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7167725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7167833Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7168088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7168163Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7168175Z 2025-08-14T21:42:54.7168268Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7168447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7168515Z return mod(**inputs) 2025-08-14T21:42:54.7168770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7168833Z outputs = self.mobilebert( 2025-08-14T21:42:54.7169092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7169157Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7169417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7169482Z layer_outputs = layer_module( 2025-08-14T21:42:54.7169736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7169886Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7170141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7170240Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7170500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7170578Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7170840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7170922Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7170926Z 2025-08-14T21:42:54.7171016Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7171202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7171261Z return mod(**inputs) 2025-08-14T21:42:54.7171523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7171585Z outputs = self.mobilebert( 2025-08-14T21:42:54.7171839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7171910Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7172167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7172229Z layer_outputs = layer_module( 2025-08-14T21:42:54.7172510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7172603Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7172869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7172947Z self_outputs = self.self( 2025-08-14T21:42:54.7173208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7173277Z self.query(query_tensor) 2025-08-14T21:42:54.7173281Z 2025-08-14T21:42:54.7173385Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7173573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7173633Z return mod(**inputs) 2025-08-14T21:42:54.7173890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7173962Z outputs = self.mobilebert( 2025-08-14T21:42:54.7174217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7174282Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7174544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7174605Z layer_outputs = layer_module( 2025-08-14T21:42:54.7174868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7174943Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7175198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7175269Z self_outputs = self.self( 2025-08-14T21:42:54.7175525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7175593Z self.key(key_tensor) 2025-08-14T21:42:54.7175596Z 2025-08-14T21:42:54.7175671Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7175740Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7175838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7176015Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7176070Z return mod(**inputs) 2025-08-14T21:42:54.7176333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7176394Z outputs = self.mobilebert( 2025-08-14T21:42:54.7176656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7176720Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7176973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7177044Z layer_outputs = layer_module( 2025-08-14T21:42:54.7177303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7177377Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7177637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7177746Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7178008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7178099Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7178144Z 2025-08-14T21:42:54.7178238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7178422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7178497Z return mod(**inputs) 2025-08-14T21:42:54.7178759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7178821Z outputs = self.mobilebert( 2025-08-14T21:42:54.7179089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7179161Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7179419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7179482Z layer_outputs = layer_module( 2025-08-14T21:42:54.7179744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7179820Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7180081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7180193Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7180450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7180568Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7180828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7180918Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7180923Z 2025-08-14T21:42:54.7181015Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7181194Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7181261Z return mod(**inputs) 2025-08-14T21:42:54.7181521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7181593Z outputs = self.mobilebert( 2025-08-14T21:42:54.7181849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7181914Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7182175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7182239Z layer_outputs = layer_module( 2025-08-14T21:42:54.7182493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7182588Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7182843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7182950Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7183205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7183279Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7183282Z 2025-08-14T21:42:54.7183380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7183560Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7183624Z return mod(**inputs) 2025-08-14T21:42:54.7183892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7183969Z outputs = self.mobilebert( 2025-08-14T21:42:54.7184229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7184309Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7184561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7184818Z layer_outputs = layer_module( 2025-08-14T21:42:54.7185124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7185218Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7185481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7185583Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7185852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7185954Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7185960Z 2025-08-14T21:42:54.7186063Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7186253Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7186315Z return mod(**inputs) 2025-08-14T21:42:54.7186653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7186717Z outputs = self.mobilebert( 2025-08-14T21:42:54.7186969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7187043Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7187298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7187369Z layer_outputs = layer_module( 2025-08-14T21:42:54.7187624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7187710Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7187971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7188083Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7188342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7188419Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7188422Z 2025-08-14T21:42:54.7188517Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7188712Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7188774Z return mod(**inputs) 2025-08-14T21:42:54.7189043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7189117Z outputs = self.mobilebert( 2025-08-14T21:42:54.7189383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7189459Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7189728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7189794Z layer_outputs = layer_module( 2025-08-14T21:42:54.7190094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7190209Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7190482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7190623Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7190889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7191034Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7191301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7191385Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7191397Z 2025-08-14T21:42:54.7191492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7191681Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7191747Z return mod(**inputs) 2025-08-14T21:42:54.7192013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7192079Z outputs = self.mobilebert( 2025-08-14T21:42:54.7192356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7192425Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7192700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7192764Z layer_outputs = layer_module( 2025-08-14T21:42:54.7193033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7193127Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7193393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7193497Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7193767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7193846Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7193849Z 2025-08-14T21:42:54.7193952Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7194143Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7194204Z return mod(**inputs) 2025-08-14T21:42:54.7194479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7194546Z outputs = self.mobilebert( 2025-08-14T21:42:54.7194820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7194887Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7195154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7195226Z layer_outputs = layer_module( 2025-08-14T21:42:54.7195497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7195585Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7195875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7195982Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7196273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7196377Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7196395Z 2025-08-14T21:42:54.7196494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7196691Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7196751Z return mod(**inputs) 2025-08-14T21:42:54.7197042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7197110Z outputs = self.mobilebert( 2025-08-14T21:42:54.7197380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7197455Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7197727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7197799Z layer_outputs = layer_module( 2025-08-14T21:42:54.7198072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7198170Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7198431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7198544Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7198798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7198882Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7198885Z 2025-08-14T21:42:54.7198978Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7199163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7199223Z return mod(**inputs) 2025-08-14T21:42:54.7199476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7199547Z outputs = self.mobilebert( 2025-08-14T21:42:54.7199800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7199872Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7200123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7200185Z layer_outputs = layer_module( 2025-08-14T21:42:54.7200442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7200527Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7200778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7200898Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7201152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7201269Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7201525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7201608Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7201611Z 2025-08-14T21:42:54.7201725Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7201919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7201984Z return mod(**inputs) 2025-08-14T21:42:54.7202243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7202321Z outputs = self.mobilebert( 2025-08-14T21:42:54.7202579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7202642Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7202909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7202980Z layer_outputs = layer_module( 2025-08-14T21:42:54.7203234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7203324Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7203574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7203672Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7203930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7204005Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7204008Z 2025-08-14T21:42:54.7204108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7204285Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7204342Z return mod(**inputs) 2025-08-14T21:42:54.7204604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7204669Z outputs = self.mobilebert( 2025-08-14T21:42:54.7204920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7204991Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7205241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7205310Z layer_outputs = layer_module( 2025-08-14T21:42:54.7205563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7205646Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7205902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7206000Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7206259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7206356Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7206360Z 2025-08-14T21:42:54.7206450Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7206632Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7206689Z return mod(**inputs) 2025-08-14T21:42:54.7206948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7207009Z outputs = self.mobilebert( 2025-08-14T21:42:54.7207261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7207342Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7207608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7207669Z layer_outputs = layer_module( 2025-08-14T21:42:54.7207943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7208024Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7208281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7208407Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7208663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7208747Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7208750Z 2025-08-14T21:42:54.7208842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7209028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7209085Z return mod(**inputs) 2025-08-14T21:42:54.7209339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7209409Z outputs = self.mobilebert( 2025-08-14T21:42:54.7209661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7209724Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7209982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7210044Z layer_outputs = layer_module( 2025-08-14T21:42:54.7210302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7210386Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7210640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7210760Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7211011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7211128Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7211379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7211460Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7211463Z 2025-08-14T21:42:54.7211563Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7211741Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7211799Z return mod(**inputs) 2025-08-14T21:42:54.7212058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7212121Z outputs = self.mobilebert( 2025-08-14T21:42:54.7212379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7212442Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7212694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7212762Z layer_outputs = layer_module( 2025-08-14T21:42:54.7213029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7213161Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7213415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7213511Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7213514Z 2025-08-14T21:42:54.7213610Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7213785Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7213846Z return mod(**inputs) 2025-08-14T21:42:54.7214117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7214181Z outputs = self.mobilebert( 2025-08-14T21:42:54.7214441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7214507Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7214758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7214830Z layer_outputs = layer_module( 2025-08-14T21:42:54.7215084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7215195Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7215449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7215550Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7215553Z 2025-08-14T21:42:54.7215651Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7215831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7215890Z return mod(**inputs) 2025-08-14T21:42:54.7216148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7216212Z outputs = self.mobilebert( 2025-08-14T21:42:54.7216471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7216534Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7216788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7216858Z layer_outputs = layer_module( 2025-08-14T21:42:54.7217111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7217261Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7217516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7217600Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7217605Z 2025-08-14T21:42:54.7217705Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7217883Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7217941Z return mod(**inputs) 2025-08-14T21:42:54.7218202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7218264Z outputs = self.mobilebert( 2025-08-14T21:42:54.7218526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7218603Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7218871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7218942Z layer_outputs = layer_module( 2025-08-14T21:42:54.7219193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7219360Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7219619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7219745Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7220009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7220091Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7220095Z 2025-08-14T21:42:54.7220192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7220368Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7220424Z return mod(**inputs) 2025-08-14T21:42:54.7220684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7220746Z outputs = self.mobilebert( 2025-08-14T21:42:54.7220996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7221067Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7221316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7221382Z layer_outputs = layer_module( 2025-08-14T21:42:54.7221632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7221773Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7222029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7222140Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7222398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7222475Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7222478Z 2025-08-14T21:42:54.7222568Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7222750Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7222809Z return mod(**inputs) 2025-08-14T21:42:54.7223061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7223132Z outputs = self.mobilebert( 2025-08-14T21:42:54.7223383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7223454Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7223706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7223768Z layer_outputs = layer_module( 2025-08-14T21:42:54.7224027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7224165Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7224439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7224563Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7224877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7225020Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7225273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7225379Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7225382Z 2025-08-14T21:42:54.7225480Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7225667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7225736Z return mod(**inputs) 2025-08-14T21:42:54.7226010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7226073Z outputs = self.mobilebert( 2025-08-14T21:42:54.7226334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7226399Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7226661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7226725Z layer_outputs = layer_module( 2025-08-14T21:42:54.7226981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7227136Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7227393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7227500Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7227755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7227830Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7227834Z 2025-08-14T21:42:54.7227933Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7228112Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7228172Z return mod(**inputs) 2025-08-14T21:42:54.7228434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7228495Z outputs = self.mobilebert( 2025-08-14T21:42:54.7228758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7228822Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7229077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7229148Z layer_outputs = layer_module( 2025-08-14T21:42:54.7229403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7229487Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7229745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7229809Z self_outputs = self.self( 2025-08-14T21:42:54.7230069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7230152Z self.value(value_tensor) 2025-08-14T21:42:54.7230169Z 2025-08-14T21:42:54.7230262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7230447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7230526Z return mod(**inputs) 2025-08-14T21:42:54.7230787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7230850Z outputs = self.mobilebert( 2025-08-14T21:42:54.7231115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7231186Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7231439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7231508Z layer_outputs = layer_module( 2025-08-14T21:42:54.7231759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7231904Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7232163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7232260Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7232511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7232591Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7232594Z 2025-08-14T21:42:54.7232685Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7232872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7232930Z return mod(**inputs) 2025-08-14T21:42:54.7233181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7233249Z outputs = self.mobilebert( 2025-08-14T21:42:54.7233500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7233572Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7233821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7233883Z layer_outputs = layer_module( 2025-08-14T21:42:54.7234141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7234281Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7234534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7234640Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7234890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7234975Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7235224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7235305Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7235309Z 2025-08-14T21:42:54.7235406Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7235582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7235646Z return mod(**inputs) 2025-08-14T21:42:54.7235914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7235997Z outputs = self.mobilebert( 2025-08-14T21:42:54.7236258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7236337Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7236587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7236657Z layer_outputs = layer_module( 2025-08-14T21:42:54.7236922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7237005Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7237259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7237323Z self_outputs = self.self( 2025-08-14T21:42:54.7237581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7237645Z self.query(query_tensor) 2025-08-14T21:42:54.7237649Z 2025-08-14T21:42:54.7237745Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7237922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7237980Z return mod(**inputs) 2025-08-14T21:42:54.7238237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7238297Z outputs = self.mobilebert( 2025-08-14T21:42:54.7238549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7238622Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7238879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7238948Z layer_outputs = layer_module( 2025-08-14T21:42:54.7239200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7239274Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7239533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7239597Z self_outputs = self.self( 2025-08-14T21:42:54.7239857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7239917Z self.key(key_tensor) 2025-08-14T21:42:54.7239920Z 2025-08-14T21:42:54.7239993Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7240073Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7240166Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7240341Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7240409Z return mod(**inputs) 2025-08-14T21:42:54.7240663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7240732Z outputs = self.mobilebert( 2025-08-14T21:42:54.7240985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7241048Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7241305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7241380Z layer_outputs = layer_module( 2025-08-14T21:42:54.7241647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7241729Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7241981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7242117Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7242373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7242461Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7242465Z 2025-08-14T21:42:54.7242565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7242745Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7242813Z return mod(**inputs) 2025-08-14T21:42:54.7243069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7243133Z outputs = self.mobilebert( 2025-08-14T21:42:54.7243394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7243461Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7243717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7243788Z layer_outputs = layer_module( 2025-08-14T21:42:54.7244047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7244130Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7244387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7244499Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7244768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7244882Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7245147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7245229Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7245234Z 2025-08-14T21:42:54.7245325Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7245512Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7245569Z return mod(**inputs) 2025-08-14T21:42:54.7245835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7245899Z outputs = self.mobilebert( 2025-08-14T21:42:54.7246156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7246228Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7246486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7246547Z layer_outputs = layer_module( 2025-08-14T21:42:54.7246813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7246897Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7247176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7247290Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7247541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7247642Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7247645Z 2025-08-14T21:42:54.7247738Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7247925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7247986Z return mod(**inputs) 2025-08-14T21:42:54.7248253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7248325Z outputs = self.mobilebert( 2025-08-14T21:42:54.7248580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7248643Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7248906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7248968Z layer_outputs = layer_module( 2025-08-14T21:42:54.7249234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7249319Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7249578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7249685Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7249944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7250051Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7250055Z 2025-08-14T21:42:54.7250146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7250326Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7250392Z return mod(**inputs) 2025-08-14T21:42:54.7250649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7250710Z outputs = self.mobilebert( 2025-08-14T21:42:54.7250980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7251043Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7251305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7251369Z layer_outputs = layer_module( 2025-08-14T21:42:54.7251625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7251715Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7251977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7252096Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7252357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7252432Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7252435Z 2025-08-14T21:42:54.7252531Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7252713Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7252785Z return mod(**inputs) 2025-08-14T21:42:54.7253044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7253121Z outputs = self.mobilebert( 2025-08-14T21:42:54.7253385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7253468Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7253729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7253798Z layer_outputs = layer_module( 2025-08-14T21:42:54.7254072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7254168Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7254425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7254542Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7254803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7254918Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7255173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7255266Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7255271Z 2025-08-14T21:42:54.7255364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7255553Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7255615Z return mod(**inputs) 2025-08-14T21:42:54.7255873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7255948Z outputs = self.mobilebert( 2025-08-14T21:42:54.7256206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7256282Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7256539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7256605Z layer_outputs = layer_module( 2025-08-14T21:42:54.7256870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7256956Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7257213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7257324Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7257580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7257665Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7257668Z 2025-08-14T21:42:54.7257762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7257944Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7258013Z return mod(**inputs) 2025-08-14T21:42:54.7258268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7258342Z outputs = self.mobilebert( 2025-08-14T21:42:54.7258621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7258687Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7258959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7259022Z layer_outputs = layer_module( 2025-08-14T21:42:54.7259290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7259381Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7259648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7259755Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7260007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7260107Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7260112Z 2025-08-14T21:42:54.7260209Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7260388Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7260454Z return mod(**inputs) 2025-08-14T21:42:54.7260707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7260769Z outputs = self.mobilebert( 2025-08-14T21:42:54.7261028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7261091Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7261341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7261410Z layer_outputs = layer_module( 2025-08-14T21:42:54.7261662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7261754Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7262007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7262117Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7262376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7262452Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7262455Z 2025-08-14T21:42:54.7262550Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7262726Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7262784Z return mod(**inputs) 2025-08-14T21:42:54.7263041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7263105Z outputs = self.mobilebert( 2025-08-14T21:42:54.7263362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7263427Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7263677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7263745Z layer_outputs = layer_module( 2025-08-14T21:42:54.7263997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7264079Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7264353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7264480Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7264801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7264939Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7265198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7265289Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7265310Z 2025-08-14T21:42:54.7265405Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7265594Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7265651Z return mod(**inputs) 2025-08-14T21:42:54.7265907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7265978Z outputs = self.mobilebert( 2025-08-14T21:42:54.7266232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7266299Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7266562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7266625Z layer_outputs = layer_module( 2025-08-14T21:42:54.7266888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7266971Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7267226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7267334Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7267589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7267671Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7267674Z 2025-08-14T21:42:54.7267766Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7267946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7268012Z return mod(**inputs) 2025-08-14T21:42:54.7268267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7268331Z outputs = self.mobilebert( 2025-08-14T21:42:54.7268592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7268656Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7268920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7268984Z layer_outputs = layer_module( 2025-08-14T21:42:54.7269241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7269332Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7269587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7269694Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7269947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7270086Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7270102Z 2025-08-14T21:42:54.7270203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7270383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7270440Z return mod(**inputs) 2025-08-14T21:42:54.7270715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7270779Z outputs = self.mobilebert( 2025-08-14T21:42:54.7271054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7271120Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7271375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7271446Z layer_outputs = layer_module( 2025-08-14T21:42:54.7271700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7271792Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7272046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7272158Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7272421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7272498Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7272501Z 2025-08-14T21:42:54.7272597Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7272776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7272840Z return mod(**inputs) 2025-08-14T21:42:54.7273106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7273169Z outputs = self.mobilebert( 2025-08-14T21:42:54.7273423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7273496Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7273753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7273820Z layer_outputs = layer_module( 2025-08-14T21:42:54.7274073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7274156Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7274417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7274529Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7274792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7274902Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7275156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7275244Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7275247Z 2025-08-14T21:42:54.7275338Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7275518Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7275583Z return mod(**inputs) 2025-08-14T21:42:54.7275850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7275935Z outputs = self.mobilebert( 2025-08-14T21:42:54.7276195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7276274Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7276538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7276601Z layer_outputs = layer_module( 2025-08-14T21:42:54.7276883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7276992Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7277244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7277328Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7277332Z 2025-08-14T21:42:54.7277425Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7277602Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7277668Z return mod(**inputs) 2025-08-14T21:42:54.7277918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7277987Z outputs = self.mobilebert( 2025-08-14T21:42:54.7278239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7278302Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7278559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7278622Z layer_outputs = layer_module( 2025-08-14T21:42:54.7278880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7278985Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7279237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7279343Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7279346Z 2025-08-14T21:42:54.7279437Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7279615Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7279681Z return mod(**inputs) 2025-08-14T21:42:54.7279932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7280004Z outputs = self.mobilebert( 2025-08-14T21:42:54.7280257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7280320Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7280581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7280644Z layer_outputs = layer_module( 2025-08-14T21:42:54.7280902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7281045Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7281296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7281398Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7281401Z 2025-08-14T21:42:54.7281507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7281685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7281753Z return mod(**inputs) 2025-08-14T21:42:54.7282020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7282091Z outputs = self.mobilebert( 2025-08-14T21:42:54.7282343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7282421Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7282686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7282750Z layer_outputs = layer_module( 2025-08-14T21:42:54.7283012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7283158Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7283412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7283533Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7283786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7283875Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7283879Z 2025-08-14T21:42:54.7283970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7284151Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7284217Z return mod(**inputs) 2025-08-14T21:42:54.7284472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7284537Z outputs = self.mobilebert( 2025-08-14T21:42:54.7284936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7285010Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7285273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7285338Z layer_outputs = layer_module( 2025-08-14T21:42:54.7285597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7285750Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7286020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7286146Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7286416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7286499Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7286503Z 2025-08-14T21:42:54.7286609Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7286797Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7286860Z return mod(**inputs) 2025-08-14T21:42:54.7287143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7287208Z outputs = self.mobilebert( 2025-08-14T21:42:54.7287508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7287598Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7287854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7287952Z layer_outputs = layer_module( 2025-08-14T21:42:54.7288219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7288376Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7288671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7288791Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7289072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7289187Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7289454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7289548Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7289551Z 2025-08-14T21:42:54.7289648Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7289844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7289905Z return mod(**inputs) 2025-08-14T21:42:54.7290176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7290247Z outputs = self.mobilebert( 2025-08-14T21:42:54.7290517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7290594Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7290860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7290928Z layer_outputs = layer_module( 2025-08-14T21:42:54.7291202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7291355Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7291631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7291734Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7292004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7292089Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7292093Z 2025-08-14T21:42:54.7292189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7292376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7292446Z return mod(**inputs) 2025-08-14T21:42:54.7292715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7292788Z outputs = self.mobilebert( 2025-08-14T21:42:54.7293055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7293123Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7293411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7293481Z layer_outputs = layer_module( 2025-08-14T21:42:54.7294436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7294519Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7294805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7294880Z self_outputs = self.self( 2025-08-14T21:42:54.7295164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7295235Z self.value(value_tensor) 2025-08-14T21:42:54.7295239Z 2025-08-14T21:42:54.7295346Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7295534Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7295607Z return mod(**inputs) 2025-08-14T21:42:54.7295874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7295940Z outputs = self.mobilebert( 2025-08-14T21:42:54.7296217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7296283Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7296537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7296609Z layer_outputs = layer_module( 2025-08-14T21:42:54.7296861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7297012Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7297266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7297366Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7297623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7297698Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7297701Z 2025-08-14T21:42:54.7297799Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7297979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7298036Z return mod(**inputs) 2025-08-14T21:42:54.7298294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7298357Z outputs = self.mobilebert( 2025-08-14T21:42:54.7298615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7298679Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7298930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7299001Z layer_outputs = layer_module( 2025-08-14T21:42:54.7299250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7299392Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7299651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7299750Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7300021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7300117Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7300373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7300488Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7300492Z 2025-08-14T21:42:54.7300583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7300770Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7300844Z return mod(**inputs) 2025-08-14T21:42:54.7301099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7301172Z outputs = self.mobilebert( 2025-08-14T21:42:54.7301424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7301490Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7301749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7301815Z layer_outputs = layer_module( 2025-08-14T21:42:54.7302072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7302147Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7302400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7302471Z self_outputs = self.self( 2025-08-14T21:42:54.7302724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7302798Z self.query(query_tensor) 2025-08-14T21:42:54.7302802Z 2025-08-14T21:42:54.7302896Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7303071Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7303137Z return mod(**inputs) 2025-08-14T21:42:54.7303387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7303449Z outputs = self.mobilebert( 2025-08-14T21:42:54.7303707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7303773Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7304031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7304093Z layer_outputs = layer_module( 2025-08-14T21:42:54.7304343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7304427Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7304680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7304818Z self_outputs = self.self( 2025-08-14T21:42:54.7305082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7305144Z self.key(key_tensor) 2025-08-14T21:42:54.7305147Z 2025-08-14T21:42:54.7305233Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7305305Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7305400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7305606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7305668Z return mod(**inputs) 2025-08-14T21:42:54.7305945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7306010Z outputs = self.mobilebert( 2025-08-14T21:42:54.7306278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7306353Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7306604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7306682Z layer_outputs = layer_module( 2025-08-14T21:42:54.7306944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7307021Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7307283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7307395Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7307647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7307733Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7307737Z 2025-08-14T21:42:54.7307827Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7308012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7308070Z return mod(**inputs) 2025-08-14T21:42:54.7308320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7308390Z outputs = self.mobilebert( 2025-08-14T21:42:54.7308642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7308707Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7308968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7309033Z layer_outputs = layer_module( 2025-08-14T21:42:54.7309291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7309367Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7309619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7309736Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7309990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7310109Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7310361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7310442Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7310446Z 2025-08-14T21:42:54.7310541Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7310716Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7310773Z return mod(**inputs) 2025-08-14T21:42:54.7311029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7311092Z outputs = self.mobilebert( 2025-08-14T21:42:54.7311363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7311441Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7311699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7311785Z layer_outputs = layer_module( 2025-08-14T21:42:54.7312042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7312132Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7312402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7312503Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7312762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7312839Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7312844Z 2025-08-14T21:42:54.7312944Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7313125Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7313188Z return mod(**inputs) 2025-08-14T21:42:54.7313448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7313512Z outputs = self.mobilebert( 2025-08-14T21:42:54.7313767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7313842Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7314096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7314170Z layer_outputs = layer_module( 2025-08-14T21:42:54.7314425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7314513Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7314775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7314878Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7315139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7315244Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7315248Z 2025-08-14T21:42:54.7315341Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7315528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7315591Z return mod(**inputs) 2025-08-14T21:42:54.7315851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7315924Z outputs = self.mobilebert( 2025-08-14T21:42:54.7316181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7316257Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7316512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7316579Z layer_outputs = layer_module( 2025-08-14T21:42:54.7316842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7316931Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7317207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7317334Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7317586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7317685Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7317689Z 2025-08-14T21:42:54.7317778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7317952Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7318033Z return mod(**inputs) 2025-08-14T21:42:54.7318288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7318357Z outputs = self.mobilebert( 2025-08-14T21:42:54.7318608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7318673Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7318932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7318995Z layer_outputs = layer_module( 2025-08-14T21:42:54.7319249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7319332Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7319583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7319700Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7319954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7320063Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7320321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7320403Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7320406Z 2025-08-14T21:42:54.7320503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7320679Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7320737Z return mod(**inputs) 2025-08-14T21:42:54.7320995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7321057Z outputs = self.mobilebert( 2025-08-14T21:42:54.7321316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7321380Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7321630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7321701Z layer_outputs = layer_module( 2025-08-14T21:42:54.7321949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7322032Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7322291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7322390Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7322647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7322735Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7322760Z 2025-08-14T21:42:54.7322853Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7323037Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7323115Z return mod(**inputs) 2025-08-14T21:42:54.7323380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7323443Z outputs = self.mobilebert( 2025-08-14T21:42:54.7323722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7323796Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7324053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7324115Z layer_outputs = layer_module( 2025-08-14T21:42:54.7324377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7324459Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7324721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7324819Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7325075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7325182Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7325185Z 2025-08-14T21:42:54.7325276Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7325460Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7325518Z return mod(**inputs) 2025-08-14T21:42:54.7325779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7325848Z outputs = self.mobilebert( 2025-08-14T21:42:54.7326104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7326167Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7326428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7326491Z layer_outputs = layer_module( 2025-08-14T21:42:54.7326753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7326834Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7327091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7327212Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7327466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7327549Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7327552Z 2025-08-14T21:42:54.7327642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7327822Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7327888Z return mod(**inputs) 2025-08-14T21:42:54.7328145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7328215Z outputs = self.mobilebert( 2025-08-14T21:42:54.7328485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7328564Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7328824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7328901Z layer_outputs = layer_module( 2025-08-14T21:42:54.7329153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7329243Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7329512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7329630Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7329884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7329993Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7330252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7330335Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7330339Z 2025-08-14T21:42:54.7330436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7330613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7330671Z return mod(**inputs) 2025-08-14T21:42:54.7330931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7330994Z outputs = self.mobilebert( 2025-08-14T21:42:54.7331247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7331319Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7331572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7331641Z layer_outputs = layer_module( 2025-08-14T21:42:54.7331895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7331978Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7332239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7332335Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7332593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7332669Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7332674Z 2025-08-14T21:42:54.7332764Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7332949Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7333011Z return mod(**inputs) 2025-08-14T21:42:54.7333261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7333331Z outputs = self.mobilebert( 2025-08-14T21:42:54.7333584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7333653Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7333906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7333982Z layer_outputs = layer_module( 2025-08-14T21:42:54.7334244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7334341Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7334604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7334717Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7334969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7335088Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7335091Z 2025-08-14T21:42:54.7335185Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7335362Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7335429Z return mod(**inputs) 2025-08-14T21:42:54.7335681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7335753Z outputs = self.mobilebert( 2025-08-14T21:42:54.7336006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7336070Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7336329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7336393Z layer_outputs = layer_module( 2025-08-14T21:42:54.7336653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7336737Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7336988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7337106Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7337357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7337434Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7337445Z 2025-08-14T21:42:54.7337534Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7337713Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7337776Z return mod(**inputs) 2025-08-14T21:42:54.7338028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7338090Z outputs = self.mobilebert( 2025-08-14T21:42:54.7338349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7338416Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7338675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7338736Z layer_outputs = layer_module( 2025-08-14T21:42:54.7338991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7339079Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7339332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7339441Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7339713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7339836Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7340094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7340192Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7340196Z 2025-08-14T21:42:54.7340288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7340472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7340529Z return mod(**inputs) 2025-08-14T21:42:54.7340803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7340867Z outputs = self.mobilebert( 2025-08-14T21:42:54.7341121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7341192Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7341444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7341514Z layer_outputs = layer_module( 2025-08-14T21:42:54.7341768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7341875Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7342133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7342207Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7342211Z 2025-08-14T21:42:54.7342302Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7342488Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7342547Z return mod(**inputs) 2025-08-14T21:42:54.7342810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7342873Z outputs = self.mobilebert( 2025-08-14T21:42:54.7343129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7343200Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7343455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7343527Z layer_outputs = layer_module( 2025-08-14T21:42:54.7343780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7343886Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7344155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7344255Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7344259Z 2025-08-14T21:42:54.7344352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7344541Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7344599Z return mod(**inputs) 2025-08-14T21:42:54.7344937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7345005Z outputs = self.mobilebert( 2025-08-14T21:42:54.7345265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7345338Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7345620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7345707Z layer_outputs = layer_module( 2025-08-14T21:42:54.7345962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7346125Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7346393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7346494Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7346498Z 2025-08-14T21:42:54.7346598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7346792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7346855Z return mod(**inputs) 2025-08-14T21:42:54.7347120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7347190Z outputs = self.mobilebert( 2025-08-14T21:42:54.7347446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7347524Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7347780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7347853Z layer_outputs = layer_module( 2025-08-14T21:42:54.7348112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7348259Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7348524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7348639Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7348896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7348990Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7348994Z 2025-08-14T21:42:54.7349090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7349279Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7349341Z return mod(**inputs) 2025-08-14T21:42:54.7349598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7349671Z outputs = self.mobilebert( 2025-08-14T21:42:54.7349928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7350004Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7350261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7350328Z layer_outputs = layer_module( 2025-08-14T21:42:54.7350591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7350737Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7351001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7351115Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7351388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7351485Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7351488Z 2025-08-14T21:42:54.7351581Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7351761Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7351842Z return mod(**inputs) 2025-08-14T21:42:54.7352095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7352165Z outputs = self.mobilebert( 2025-08-14T21:42:54.7352432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7352497Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7352758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7352821Z layer_outputs = layer_module( 2025-08-14T21:42:54.7353079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7353223Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7353474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7353592Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7353846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7353954Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7354217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7354302Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7354305Z 2025-08-14T21:42:54.7354404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7354585Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7354644Z return mod(**inputs) 2025-08-14T21:42:54.7354903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7354965Z outputs = self.mobilebert( 2025-08-14T21:42:54.7355223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7355285Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7355538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7355607Z layer_outputs = layer_module( 2025-08-14T21:42:54.7355863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7356009Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7356271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7356371Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7356634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7356707Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7356711Z 2025-08-14T21:42:54.7356802Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7357001Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7357074Z return mod(**inputs) 2025-08-14T21:42:54.7357339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7357417Z outputs = self.mobilebert( 2025-08-14T21:42:54.7357668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7357740Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7358033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7358104Z layer_outputs = layer_module( 2025-08-14T21:42:54.7358355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7358434Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7358694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7358761Z self_outputs = self.self( 2025-08-14T21:42:54.7359013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7359087Z self.value(value_tensor) 2025-08-14T21:42:54.7359090Z 2025-08-14T21:42:54.7359182Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7359368Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7359427Z return mod(**inputs) 2025-08-14T21:42:54.7359680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7359750Z outputs = self.mobilebert( 2025-08-14T21:42:54.7360007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7360074Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7360339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7360406Z layer_outputs = layer_module( 2025-08-14T21:42:54.7360668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7360814Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7361071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7361177Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7361433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7361516Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7361519Z 2025-08-14T21:42:54.7361611Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7361793Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7361861Z return mod(**inputs) 2025-08-14T21:42:54.7362114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7362183Z outputs = self.mobilebert( 2025-08-14T21:42:54.7362438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7362502Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7362777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7362855Z layer_outputs = layer_module( 2025-08-14T21:42:54.7363110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7363276Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7363531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7363634Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7363906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7363986Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7364247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7364328Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7364332Z 2025-08-14T21:42:54.7364431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7364608Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7364667Z return mod(**inputs) 2025-08-14T21:42:54.7364926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7364988Z outputs = self.mobilebert( 2025-08-14T21:42:54.7365241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7365313Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7365566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7365635Z layer_outputs = layer_module( 2025-08-14T21:42:54.7365889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7365966Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7366228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7366290Z self_outputs = self.self( 2025-08-14T21:42:54.7366548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7366614Z self.query(query_tensor) 2025-08-14T21:42:54.7366617Z 2025-08-14T21:42:54.7366708Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7366891Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7366949Z return mod(**inputs) 2025-08-14T21:42:54.7367203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7367272Z outputs = self.mobilebert( 2025-08-14T21:42:54.7367525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7367597Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7367849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7367912Z layer_outputs = layer_module( 2025-08-14T21:42:54.7368171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7368246Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7368520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7368597Z self_outputs = self.self( 2025-08-14T21:42:54.7368850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7368931Z self.key(key_tensor) 2025-08-14T21:42:54.7368934Z 2025-08-14T21:42:54.7369008Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7369079Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7369179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7369370Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7369439Z return mod(**inputs) 2025-08-14T21:42:54.7369693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7369757Z outputs = self.mobilebert( 2025-08-14T21:42:54.7370018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7370082Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7370336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7370407Z layer_outputs = layer_module( 2025-08-14T21:42:54.7370660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7370742Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7370999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7371111Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7371373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7371449Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7371452Z 2025-08-14T21:42:54.7371552Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7371729Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7371788Z return mod(**inputs) 2025-08-14T21:42:54.7372049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7372110Z outputs = self.mobilebert( 2025-08-14T21:42:54.7372364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7372434Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7372689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7372759Z layer_outputs = layer_module( 2025-08-14T21:42:54.7373013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7373089Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7373350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7373461Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7373724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7373839Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7374107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7374200Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7374218Z 2025-08-14T21:42:54.7374311Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7374493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7374567Z return mod(**inputs) 2025-08-14T21:42:54.7374818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7374889Z outputs = self.mobilebert( 2025-08-14T21:42:54.7375157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7375224Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7375487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7375552Z layer_outputs = layer_module( 2025-08-14T21:42:54.7375813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7375901Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7376155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7376263Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7376520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7376595Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7376606Z 2025-08-14T21:42:54.7376697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7376878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7376945Z return mod(**inputs) 2025-08-14T21:42:54.7377199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7377261Z outputs = self.mobilebert( 2025-08-14T21:42:54.7377524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7377587Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7377846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7377909Z layer_outputs = layer_module( 2025-08-14T21:42:54.7378162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7378253Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7378506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7378606Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7378869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7378971Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7378974Z 2025-08-14T21:42:54.7379071Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7379250Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7379307Z return mod(**inputs) 2025-08-14T21:42:54.7379569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7379631Z outputs = self.mobilebert( 2025-08-14T21:42:54.7379908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7379987Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7380245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7380330Z layer_outputs = layer_module( 2025-08-14T21:42:54.7380585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7380669Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7380944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7381059Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7381320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7381398Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7381401Z 2025-08-14T21:42:54.7381494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7381679Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7381739Z return mod(**inputs) 2025-08-14T21:42:54.7381995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7382057Z outputs = self.mobilebert( 2025-08-14T21:42:54.7382307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7382379Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7382630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7382699Z layer_outputs = layer_module( 2025-08-14T21:42:54.7382950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7383031Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7383289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7383399Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7383650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7383764Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7384015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7384100Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7384105Z 2025-08-14T21:42:54.7384194Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7384370Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7384436Z return mod(**inputs) 2025-08-14T21:42:54.7384882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7384962Z outputs = self.mobilebert( 2025-08-14T21:42:54.7385232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7385299Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7385572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7385673Z layer_outputs = layer_module( 2025-08-14T21:42:54.7385984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7386075Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7386368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7386478Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7386747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7386846Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7386850Z 2025-08-14T21:42:54.7386952Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7387133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7387200Z return mod(**inputs) 2025-08-14T21:42:54.7387454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7387516Z outputs = self.mobilebert( 2025-08-14T21:42:54.7387783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7387849Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7388117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7388190Z layer_outputs = layer_module( 2025-08-14T21:42:54.7388455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7388547Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7388814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7388918Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7389190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7389296Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7389300Z 2025-08-14T21:42:54.7389401Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7389590Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7389651Z return mod(**inputs) 2025-08-14T21:42:54.7389925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7389990Z outputs = self.mobilebert( 2025-08-14T21:42:54.7390256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7390331Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7390596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7390670Z layer_outputs = layer_module( 2025-08-14T21:42:54.7390934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7391021Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7391294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7391411Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7391698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7391793Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7391796Z 2025-08-14T21:42:54.7391890Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7392082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7392159Z return mod(**inputs) 2025-08-14T21:42:54.7392429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7392505Z outputs = self.mobilebert( 2025-08-14T21:42:54.7392786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7392863Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7393139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7393206Z layer_outputs = layer_module( 2025-08-14T21:42:54.7393487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7393574Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7393849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7393966Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7394237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7394357Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7394627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7394719Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7394724Z 2025-08-14T21:42:54.7394820Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7395010Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7395080Z return mod(**inputs) 2025-08-14T21:42:54.7395347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7395413Z outputs = self.mobilebert( 2025-08-14T21:42:54.7395690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7395757Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7396032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7396099Z layer_outputs = layer_module( 2025-08-14T21:42:54.7396368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7396462Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7396736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7396841Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7397093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7397168Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7397171Z 2025-08-14T21:42:54.7397270Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7397450Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7397523Z return mod(**inputs) 2025-08-14T21:42:54.7397795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7397858Z outputs = self.mobilebert( 2025-08-14T21:42:54.7398114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7398192Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7398449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7398530Z layer_outputs = layer_module( 2025-08-14T21:42:54.7398781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7398869Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7399120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7399219Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7399477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7399578Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7399581Z 2025-08-14T21:42:54.7399671Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7399852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7399912Z return mod(**inputs) 2025-08-14T21:42:54.7400170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7400232Z outputs = self.mobilebert( 2025-08-14T21:42:54.7400484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7400555Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7400809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7400879Z layer_outputs = layer_module( 2025-08-14T21:42:54.7401131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7401213Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7401470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7401581Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7401831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7401913Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7401916Z 2025-08-14T21:42:54.7402005Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7402186Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7402247Z return mod(**inputs) 2025-08-14T21:42:54.7402498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7402569Z outputs = self.mobilebert( 2025-08-14T21:42:54.7402821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7402892Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7403156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7403219Z layer_outputs = layer_module( 2025-08-14T21:42:54.7403491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7403573Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7403840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7403956Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7404220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7404339Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7404591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7404672Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7404677Z 2025-08-14T21:42:54.7404776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7404953Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7405022Z return mod(**inputs) 2025-08-14T21:42:54.7405275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7405339Z outputs = self.mobilebert( 2025-08-14T21:42:54.7405599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7405664Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7405920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7405985Z layer_outputs = layer_module( 2025-08-14T21:42:54.7406237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7406353Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7406607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7406680Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7406683Z 2025-08-14T21:42:54.7406781Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7406958Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7407023Z return mod(**inputs) 2025-08-14T21:42:54.7407273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7407338Z outputs = self.mobilebert( 2025-08-14T21:42:54.7407598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7407662Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7407920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7407985Z layer_outputs = layer_module( 2025-08-14T21:42:54.7408235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7408348Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7408600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7408697Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7408707Z 2025-08-14T21:42:54.7408812Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7409013Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7409079Z return mod(**inputs) 2025-08-14T21:42:54.7409330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7409408Z outputs = self.mobilebert( 2025-08-14T21:42:54.7409676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7409740Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7410018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7410083Z layer_outputs = layer_module( 2025-08-14T21:42:54.7410336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7410489Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7410739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7410824Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7410834Z 2025-08-14T21:42:54.7410925Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7411102Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7411167Z return mod(**inputs) 2025-08-14T21:42:54.7411420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7411482Z outputs = self.mobilebert( 2025-08-14T21:42:54.7411743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7411807Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7412065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7412128Z layer_outputs = layer_module( 2025-08-14T21:42:54.7412377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7412526Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7412779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7412890Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7413157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7413239Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7413243Z 2025-08-14T21:42:54.7413341Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7413518Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7413576Z return mod(**inputs) 2025-08-14T21:42:54.7413836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7413898Z outputs = self.mobilebert( 2025-08-14T21:42:54.7414159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7414221Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7414487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7414571Z layer_outputs = layer_module( 2025-08-14T21:42:54.7414822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7414964Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7415243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7415353Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7415625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7415701Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7415705Z 2025-08-14T21:42:54.7415796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7415983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7416042Z return mod(**inputs) 2025-08-14T21:42:54.7416299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7416363Z outputs = self.mobilebert( 2025-08-14T21:42:54.7416613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7416683Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7416936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7417004Z layer_outputs = layer_module( 2025-08-14T21:42:54.7417254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7417397Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7417656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7417764Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7418016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7418129Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7418380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7418467Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7418470Z 2025-08-14T21:42:54.7418561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7418742Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7418809Z return mod(**inputs) 2025-08-14T21:42:54.7419059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7419131Z outputs = self.mobilebert( 2025-08-14T21:42:54.7419381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7419444Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7419704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7419767Z layer_outputs = layer_module( 2025-08-14T21:42:54.7420018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7420188Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7420458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7420566Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7420835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7420908Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7420912Z 2025-08-14T21:42:54.7421009Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7421201Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7421269Z return mod(**inputs) 2025-08-14T21:42:54.7421524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7421589Z outputs = self.mobilebert( 2025-08-14T21:42:54.7421851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7421917Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7422179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7422243Z layer_outputs = layer_module( 2025-08-14T21:42:54.7422496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7422579Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7422830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7422894Z self_outputs = self.self( 2025-08-14T21:42:54.7423154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7423219Z self.value(value_tensor) 2025-08-14T21:42:54.7423222Z 2025-08-14T21:42:54.7423320Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7423502Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7423560Z return mod(**inputs) 2025-08-14T21:42:54.7423818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7423878Z outputs = self.mobilebert( 2025-08-14T21:42:54.7424131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7424201Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7424453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7424524Z layer_outputs = layer_module( 2025-08-14T21:42:54.7424838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7424993Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7425258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7425358Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7425623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7425699Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7425703Z 2025-08-14T21:42:54.7425796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7425999Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7426075Z return mod(**inputs) 2025-08-14T21:42:54.7426338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7426418Z outputs = self.mobilebert( 2025-08-14T21:42:54.7426672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7426743Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7427009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7427073Z layer_outputs = layer_module( 2025-08-14T21:42:54.7427333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7427476Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7427739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7427840Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7428092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7428176Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7428430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7428519Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7428524Z 2025-08-14T21:42:54.7428615Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7428794Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7428861Z return mod(**inputs) 2025-08-14T21:42:54.7429110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7429174Z outputs = self.mobilebert( 2025-08-14T21:42:54.7429431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7429494Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7429751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7429813Z layer_outputs = layer_module( 2025-08-14T21:42:54.7430065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7430148Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7430399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7430469Z self_outputs = self.self( 2025-08-14T21:42:54.7430722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7430788Z self.query(query_tensor) 2025-08-14T21:42:54.7430791Z 2025-08-14T21:42:54.7430889Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7431068Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7431128Z return mod(**inputs) 2025-08-14T21:42:54.7431387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7431449Z outputs = self.mobilebert( 2025-08-14T21:42:54.7431731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7431824Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7432079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7432169Z layer_outputs = layer_module( 2025-08-14T21:42:54.7432423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7432504Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7432776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7432842Z self_outputs = self.self( 2025-08-14T21:42:54.7433104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7433165Z self.key(key_tensor) 2025-08-14T21:42:54.7433169Z 2025-08-14T21:42:54.7433243Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7433321Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7433413Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7433599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7433657Z return mod(**inputs) 2025-08-14T21:42:54.7433908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7433979Z outputs = self.mobilebert( 2025-08-14T21:42:54.7434232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7434296Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7434558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7434623Z layer_outputs = layer_module( 2025-08-14T21:42:54.7434881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7434960Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7435209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7435328Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7435580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7435662Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7435665Z 2025-08-14T21:42:54.7435754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7435933Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7436000Z return mod(**inputs) 2025-08-14T21:42:54.7436253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7436318Z outputs = self.mobilebert( 2025-08-14T21:42:54.7436576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7436639Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7436898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7436961Z layer_outputs = layer_module( 2025-08-14T21:42:54.7437211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7437305Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7437572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7437689Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7437956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7438067Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7438340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7438426Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7438429Z 2025-08-14T21:42:54.7438525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7438706Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7438766Z return mod(**inputs) 2025-08-14T21:42:54.7439026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7439088Z outputs = self.mobilebert( 2025-08-14T21:42:54.7439340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7439413Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7439668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7439737Z layer_outputs = layer_module( 2025-08-14T21:42:54.7439989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7440076Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7440337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7440437Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7440689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7440772Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7440775Z 2025-08-14T21:42:54.7440866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7441053Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7441112Z return mod(**inputs) 2025-08-14T21:42:54.7441365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7441434Z outputs = self.mobilebert( 2025-08-14T21:42:54.7441689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7441760Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7442012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7442075Z layer_outputs = layer_module( 2025-08-14T21:42:54.7442331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7442415Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7442669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7442774Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7443041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7443166Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7443169Z 2025-08-14T21:42:54.7443260Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7443450Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7443515Z return mod(**inputs) 2025-08-14T21:42:54.7443767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7443833Z outputs = self.mobilebert( 2025-08-14T21:42:54.7444098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7444162Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7444421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7444483Z layer_outputs = layer_module( 2025-08-14T21:42:54.7444739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7444829Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7445086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7445203Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7445461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7445534Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7445538Z 2025-08-14T21:42:54.7445634Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7445819Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7445883Z return mod(**inputs) 2025-08-14T21:42:54.7446145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7446207Z outputs = self.mobilebert( 2025-08-14T21:42:54.7446473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7446536Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7446805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7446867Z layer_outputs = layer_module( 2025-08-14T21:42:54.7447125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7447217Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7447478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7447590Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7447858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7447967Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7448234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7448316Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7448319Z 2025-08-14T21:42:54.7448410Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7448618Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7448692Z return mod(**inputs) 2025-08-14T21:42:54.7448952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7449017Z outputs = self.mobilebert( 2025-08-14T21:42:54.7449307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7449379Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7449631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7449714Z layer_outputs = layer_module( 2025-08-14T21:42:54.7449978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7450063Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7450324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7450424Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7450677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7450761Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7450764Z 2025-08-14T21:42:54.7450857Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7451046Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7451105Z return mod(**inputs) 2025-08-14T21:42:54.7451359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7451432Z outputs = self.mobilebert( 2025-08-14T21:42:54.7451686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7451753Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7452018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7452084Z layer_outputs = layer_module( 2025-08-14T21:42:54.7452344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7452429Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7452684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7452790Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7453044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7453150Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7453153Z 2025-08-14T21:42:54.7453245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7453424Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7453490Z return mod(**inputs) 2025-08-14T21:42:54.7453744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7453806Z outputs = self.mobilebert( 2025-08-14T21:42:54.7454069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7454134Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7454407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7454487Z layer_outputs = layer_module( 2025-08-14T21:42:54.7454739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7454845Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7455101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7455222Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7455492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7455569Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7455572Z 2025-08-14T21:42:54.7455669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7455847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7455907Z return mod(**inputs) 2025-08-14T21:42:54.7456165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7456230Z outputs = self.mobilebert( 2025-08-14T21:42:54.7456490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7456554Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7456807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7456879Z layer_outputs = layer_module( 2025-08-14T21:42:54.7457129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7457219Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7457472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7457582Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7457845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7457952Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7458215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7458296Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7458299Z 2025-08-14T21:42:54.7458389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7458574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7458632Z return mod(**inputs) 2025-08-14T21:42:54.7458885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7458956Z outputs = self.mobilebert( 2025-08-14T21:42:54.7459209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7459279Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7459529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7459591Z layer_outputs = layer_module( 2025-08-14T21:42:54.7459849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7459931Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7460204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7460316Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7460570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7460665Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7460668Z 2025-08-14T21:42:54.7460757Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7460949Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7461015Z return mod(**inputs) 2025-08-14T21:42:54.7461269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7461338Z outputs = self.mobilebert( 2025-08-14T21:42:54.7461592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7461658Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7461919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7461984Z layer_outputs = layer_module( 2025-08-14T21:42:54.7462244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7462327Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7462580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7462684Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7462938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7463039Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7463049Z 2025-08-14T21:42:54.7463139Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7463318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7463385Z return mod(**inputs) 2025-08-14T21:42:54.7463640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7463702Z outputs = self.mobilebert( 2025-08-14T21:42:54.7463963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7464025Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7464283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7464346Z layer_outputs = layer_module( 2025-08-14T21:42:54.7464599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7464752Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7465018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7465132Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7465398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7465473Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7465476Z 2025-08-14T21:42:54.7465578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7465773Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7465851Z return mod(**inputs) 2025-08-14T21:42:54.7466114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7466193Z outputs = self.mobilebert( 2025-08-14T21:42:54.7466453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7466519Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7466787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7466860Z layer_outputs = layer_module( 2025-08-14T21:42:54.7467114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7467199Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7467464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7467575Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7467838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7467948Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7468203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7468294Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7468297Z 2025-08-14T21:42:54.7468387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7468574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7468633Z return mod(**inputs) 2025-08-14T21:42:54.7468888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7468959Z outputs = self.mobilebert( 2025-08-14T21:42:54.7469214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7469277Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7469537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7469600Z layer_outputs = layer_module( 2025-08-14T21:42:54.7469856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7469963Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7470214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7470298Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7470301Z 2025-08-14T21:42:54.7470391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7470579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7470636Z return mod(**inputs) 2025-08-14T21:42:54.7470887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7470958Z outputs = self.mobilebert( 2025-08-14T21:42:54.7471209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7471279Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7471546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7471626Z layer_outputs = layer_module( 2025-08-14T21:42:54.7471885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7472015Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7472269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7472374Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7472391Z 2025-08-14T21:42:54.7472485Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7472669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7472726Z return mod(**inputs) 2025-08-14T21:42:54.7472982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7473054Z outputs = self.mobilebert( 2025-08-14T21:42:54.7473308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7473380Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7473634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7473697Z layer_outputs = layer_module( 2025-08-14T21:42:54.7473955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7474099Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7474354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7474445Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7474449Z 2025-08-14T21:42:54.7474539Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7474724Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7474784Z return mod(**inputs) 2025-08-14T21:42:54.7475035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7475106Z outputs = self.mobilebert( 2025-08-14T21:42:54.7475360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7475431Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7475686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7475750Z layer_outputs = layer_module( 2025-08-14T21:42:54.7476010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7476154Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7476409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7476527Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7476782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7476873Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7476876Z 2025-08-14T21:42:54.7476969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7477162Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7477243Z return mod(**inputs) 2025-08-14T21:42:54.7477495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7477582Z outputs = self.mobilebert( 2025-08-14T21:42:54.7477835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7477899Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7478172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7478238Z layer_outputs = layer_module( 2025-08-14T21:42:54.7478495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7478646Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7478899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7479018Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7479271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7479345Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7479348Z 2025-08-14T21:42:54.7479449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7479628Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7479694Z return mod(**inputs) 2025-08-14T21:42:54.7479947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7480009Z outputs = self.mobilebert( 2025-08-14T21:42:54.7480269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7480333Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7480594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7480655Z layer_outputs = layer_module( 2025-08-14T21:42:54.7480907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7481054Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7481308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7481418Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7481679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7481787Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7482050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7482133Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7482136Z 2025-08-14T21:42:54.7482226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7482413Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7482471Z return mod(**inputs) 2025-08-14T21:42:54.7482729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7482809Z outputs = self.mobilebert( 2025-08-14T21:42:54.7483076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7483147Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7483415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7483478Z layer_outputs = layer_module( 2025-08-14T21:42:54.7483735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7483892Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7484154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7484256Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7484509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7484729Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7484735Z 2025-08-14T21:42:54.7484836Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7485029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7485090Z return mod(**inputs) 2025-08-14T21:42:54.7485345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7485418Z outputs = self.mobilebert( 2025-08-14T21:42:54.7485680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7485746Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7486026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7486091Z layer_outputs = layer_module( 2025-08-14T21:42:54.7486354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7486433Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7486683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7486755Z self_outputs = self.self( 2025-08-14T21:42:54.7487011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7487082Z self.value(value_tensor) 2025-08-14T21:42:54.7487085Z 2025-08-14T21:42:54.7487179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7487358Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7487427Z return mod(**inputs) 2025-08-14T21:42:54.7487679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7487742Z outputs = self.mobilebert( 2025-08-14T21:42:54.7488002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7488066Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7488326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7488388Z layer_outputs = layer_module( 2025-08-14T21:42:54.7488640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7488826Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7489102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7489233Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7489490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7489562Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7489565Z 2025-08-14T21:42:54.7489686Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7489867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7489935Z return mod(**inputs) 2025-08-14T21:42:54.7490193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7490257Z outputs = self.mobilebert( 2025-08-14T21:42:54.7490517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7490580Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7490837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7490908Z layer_outputs = layer_module( 2025-08-14T21:42:54.7491163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7491316Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7491573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7491675Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7491939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7492017Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7492280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7492362Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7492365Z 2025-08-14T21:42:54.7492457Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7492644Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7492704Z return mod(**inputs) 2025-08-14T21:42:54.7492956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7493027Z outputs = self.mobilebert( 2025-08-14T21:42:54.7493281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7493353Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7493605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7493669Z layer_outputs = layer_module( 2025-08-14T21:42:54.7493928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7494007Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7494267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7494330Z self_outputs = self.self( 2025-08-14T21:42:54.7494601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7494695Z self.query(query_tensor) 2025-08-14T21:42:54.7494698Z 2025-08-14T21:42:54.7494789Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7494984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7495052Z return mod(**inputs) 2025-08-14T21:42:54.7495305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7495376Z outputs = self.mobilebert( 2025-08-14T21:42:54.7495641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7495708Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7495969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7496034Z layer_outputs = layer_module( 2025-08-14T21:42:54.7496293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7496371Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7496624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7496693Z self_outputs = self.self( 2025-08-14T21:42:54.7496945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7497005Z self.key(key_tensor) 2025-08-14T21:42:54.7497008Z 2025-08-14T21:42:54.7497087Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7497156Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7497256Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7497436Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7497495Z return mod(**inputs) 2025-08-14T21:42:54.7497755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7497818Z outputs = self.mobilebert( 2025-08-14T21:42:54.7498070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7498140Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7498393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7498460Z layer_outputs = layer_module( 2025-08-14T21:42:54.7498714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7498790Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7499053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7499164Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7499427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7499501Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7499504Z 2025-08-14T21:42:54.7499594Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7499781Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7499839Z return mod(**inputs) 2025-08-14T21:42:54.7500104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7500189Z outputs = self.mobilebert( 2025-08-14T21:42:54.7500442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7500511Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7500781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7500844Z layer_outputs = layer_module( 2025-08-14T21:42:54.7501117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7501194Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7501455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7501569Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7501821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7501941Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7502194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7502273Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7502283Z 2025-08-14T21:42:54.7502373Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7502550Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7502616Z return mod(**inputs) 2025-08-14T21:42:54.7502870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7502933Z outputs = self.mobilebert( 2025-08-14T21:42:54.7503191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7503257Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7503517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7503582Z layer_outputs = layer_module( 2025-08-14T21:42:54.7503834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7503927Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7504179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7504278Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7504537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7504613Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7504616Z 2025-08-14T21:42:54.7505407Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7505592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7505651Z return mod(**inputs) 2025-08-14T21:42:54.7505915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7505981Z outputs = self.mobilebert( 2025-08-14T21:42:54.7506242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7506307Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7506584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7506668Z layer_outputs = layer_module( 2025-08-14T21:42:54.7506924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7507034Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7507288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7507387Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7507662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7507766Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7507770Z 2025-08-14T21:42:54.7507861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7508049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7508108Z return mod(**inputs) 2025-08-14T21:42:54.7508372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7508436Z outputs = self.mobilebert( 2025-08-14T21:42:54.7508688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7508759Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7509014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7509082Z layer_outputs = layer_module( 2025-08-14T21:42:54.7509337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7509427Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7509694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7509807Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7510064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7510147Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7510150Z 2025-08-14T21:42:54.7510239Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7510426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7510484Z return mod(**inputs) 2025-08-14T21:42:54.7510739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7510802Z outputs = self.mobilebert( 2025-08-14T21:42:54.7511055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7511122Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7511378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7511436Z layer_outputs = layer_module( 2025-08-14T21:42:54.7511697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7511781Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7512034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7512166Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7512420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7512554Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7512817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7512900Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7512904Z 2025-08-14T21:42:54.7513003Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7513220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7513288Z return mod(**inputs) 2025-08-14T21:42:54.7513543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7513606Z outputs = self.mobilebert( 2025-08-14T21:42:54.7513863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7513926Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7514180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7514248Z layer_outputs = layer_module( 2025-08-14T21:42:54.7514500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7514589Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7514837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7514935Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7515195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7515270Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7515273Z 2025-08-14T21:42:54.7515371Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7515548Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7515606Z return mod(**inputs) 2025-08-14T21:42:54.7515863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7515928Z outputs = self.mobilebert( 2025-08-14T21:42:54.7516181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7516251Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7516504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7516576Z layer_outputs = layer_module( 2025-08-14T21:42:54.7516826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7516910Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7517169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7517268Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7517527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7517625Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7517628Z 2025-08-14T21:42:54.7517719Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7518141Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7518220Z return mod(**inputs) 2025-08-14T21:42:54.7518475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7518563Z outputs = self.mobilebert( 2025-08-14T21:42:54.7518822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7518893Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7519168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7519233Z layer_outputs = layer_module( 2025-08-14T21:42:54.7519491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7519573Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7519835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7519950Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7520205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7520291Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7520294Z 2025-08-14T21:42:54.7520387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7520576Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7520634Z return mod(**inputs) 2025-08-14T21:42:54.7520893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7520963Z outputs = self.mobilebert( 2025-08-14T21:42:54.7521217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7521281Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7521543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7521606Z layer_outputs = layer_module( 2025-08-14T21:42:54.7521872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7521957Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7522214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7522336Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7522593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7522707Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7522964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7523049Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7523052Z 2025-08-14T21:42:54.7523146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7523324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7523382Z return mod(**inputs) 2025-08-14T21:42:54.7523643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7523718Z outputs = self.mobilebert( 2025-08-14T21:42:54.7523981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7524061Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7524316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7524404Z layer_outputs = layer_module( 2025-08-14T21:42:54.7524659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7524747Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7525017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7525119Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7525379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7525456Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7525460Z 2025-08-14T21:42:54.7525551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7525736Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7525794Z return mod(**inputs) 2025-08-14T21:42:54.7526053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7526115Z outputs = self.mobilebert( 2025-08-14T21:42:54.7526369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7526435Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7526684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7526750Z layer_outputs = layer_module( 2025-08-14T21:42:54.7526998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7527078Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7527335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7527431Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7527685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7527790Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7527793Z 2025-08-14T21:42:54.7527883Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7528069Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7528129Z return mod(**inputs) 2025-08-14T21:42:54.7528384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7528456Z outputs = self.mobilebert( 2025-08-14T21:42:54.7528707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7528777Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7529031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7529094Z layer_outputs = layer_module( 2025-08-14T21:42:54.7529349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7529444Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7529714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7529832Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7530101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7530185Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7530188Z 2025-08-14T21:42:54.7530278Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7530469Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7530535Z return mod(**inputs) 2025-08-14T21:42:54.7530788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7530859Z outputs = self.mobilebert( 2025-08-14T21:42:54.7531114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7531178Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7531443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7531503Z layer_outputs = layer_module( 2025-08-14T21:42:54.7531758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7531847Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7532103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7532220Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7532475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7532583Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7532837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7532918Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7532922Z 2025-08-14T21:42:54.7533014Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7533192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7533249Z return mod(**inputs) 2025-08-14T21:42:54.7533514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7533575Z outputs = self.mobilebert( 2025-08-14T21:42:54.7533839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7533904Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7534161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7534233Z layer_outputs = layer_module( 2025-08-14T21:42:54.7534490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7534599Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7534865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7534939Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7534942Z 2025-08-14T21:42:54.7535056Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7535252Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7535310Z return mod(**inputs) 2025-08-14T21:42:54.7535575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7535655Z outputs = self.mobilebert( 2025-08-14T21:42:54.7535914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7535979Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7536252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7536324Z layer_outputs = layer_module( 2025-08-14T21:42:54.7536580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7536685Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7536941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7537042Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7537046Z 2025-08-14T21:42:54.7537141Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7537316Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7537371Z return mod(**inputs) 2025-08-14T21:42:54.7537626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7537690Z outputs = self.mobilebert( 2025-08-14T21:42:54.7537944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7538009Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7538263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7538333Z layer_outputs = layer_module( 2025-08-14T21:42:54.7538582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7538726Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7538987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7539074Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7539077Z 2025-08-14T21:42:54.7539175Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7539350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7539410Z return mod(**inputs) 2025-08-14T21:42:54.7539669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7539731Z outputs = self.mobilebert( 2025-08-14T21:42:54.7539992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7540055Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7540305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7540376Z layer_outputs = layer_module( 2025-08-14T21:42:54.7540624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7540781Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7541082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7541192Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7541468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7541549Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7541553Z 2025-08-14T21:42:54.7541643Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7541840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7541900Z return mod(**inputs) 2025-08-14T21:42:54.7542159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7542218Z outputs = self.mobilebert( 2025-08-14T21:42:54.7542472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7542543Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7542799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7542861Z layer_outputs = layer_module( 2025-08-14T21:42:54.7543121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7543264Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7543519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7543633Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7543888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7543970Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7543974Z 2025-08-14T21:42:54.7544066Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7544254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7544311Z return mod(**inputs) 2025-08-14T21:42:54.7544563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7544633Z outputs = self.mobilebert( 2025-08-14T21:42:54.7544966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7545043Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7545298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7545364Z layer_outputs = layer_module( 2025-08-14T21:42:54.7545631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7545775Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7546040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7546154Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7546403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7546534Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7546790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7546888Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7546892Z 2025-08-14T21:42:54.7547006Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7547181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7547247Z return mod(**inputs) 2025-08-14T21:42:54.7547514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7547578Z outputs = self.mobilebert( 2025-08-14T21:42:54.7547836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7547900Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7548151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7548218Z layer_outputs = layer_module( 2025-08-14T21:42:54.7548469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7548620Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7548874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7548973Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7549227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7549299Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7549303Z 2025-08-14T21:42:54.7549397Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7549573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7549630Z return mod(**inputs) 2025-08-14T21:42:54.7549883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7549947Z outputs = self.mobilebert( 2025-08-14T21:42:54.7550205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7550269Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7550520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7550585Z layer_outputs = layer_module( 2025-08-14T21:42:54.7550838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7550917Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7551176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7551242Z self_outputs = self.self( 2025-08-14T21:42:54.7551501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7551563Z self.value(value_tensor) 2025-08-14T21:42:54.7551566Z 2025-08-14T21:42:54.7551659Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7551838Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7551896Z return mod(**inputs) 2025-08-14T21:42:54.7552162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7552241Z outputs = self.mobilebert( 2025-08-14T21:42:54.7552500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7552569Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7552836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7552895Z layer_outputs = layer_module( 2025-08-14T21:42:54.7553172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7553318Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7553576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7553672Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7553925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7554006Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7554011Z 2025-08-14T21:42:54.7554101Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7554281Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7554338Z return mod(**inputs) 2025-08-14T21:42:54.7554590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7554655Z outputs = self.mobilebert( 2025-08-14T21:42:54.7554905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7554968Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7555226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7555290Z layer_outputs = layer_module( 2025-08-14T21:42:54.7555548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7555690Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7555941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7556045Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7556296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7556381Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7556632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7556712Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7556715Z 2025-08-14T21:42:54.7556815Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7556992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7557047Z return mod(**inputs) 2025-08-14T21:42:54.7557306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7557368Z outputs = self.mobilebert( 2025-08-14T21:42:54.7557622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7557686Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7557950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7558035Z layer_outputs = layer_module( 2025-08-14T21:42:54.7558287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7558395Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7558644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7558706Z self_outputs = self.self( 2025-08-14T21:42:54.7558979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7559044Z self.query(query_tensor) 2025-08-14T21:42:54.7559048Z 2025-08-14T21:42:54.7559141Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7559329Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7559388Z return mod(**inputs) 2025-08-14T21:42:54.7559649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7559713Z outputs = self.mobilebert( 2025-08-14T21:42:54.7559968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7560035Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7560292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7560357Z layer_outputs = layer_module( 2025-08-14T21:42:54.7560613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7560692Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7560955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7561019Z self_outputs = self.self( 2025-08-14T21:42:54.7561276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7561342Z self.key(key_tensor) 2025-08-14T21:42:54.7561345Z 2025-08-14T21:42:54.7561417Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7561495Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7561590Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7561769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7561833Z return mod(**inputs) 2025-08-14T21:42:54.7562088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7562153Z outputs = self.mobilebert( 2025-08-14T21:42:54.7562412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7562480Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7562740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7562803Z layer_outputs = layer_module( 2025-08-14T21:42:54.7563054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7563134Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7563387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7563524Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7563794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7563869Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7563887Z 2025-08-14T21:42:54.7563987Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7564165Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7564222Z return mod(**inputs) 2025-08-14T21:42:54.7564496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7564561Z outputs = self.mobilebert( 2025-08-14T21:42:54.7564822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7564886Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7565138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7565212Z layer_outputs = layer_module( 2025-08-14T21:42:54.7565466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7565548Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7565801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7565913Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7566174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7566288Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7566541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7566634Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7566637Z 2025-08-14T21:42:54.7566728Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7566915Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7566973Z return mod(**inputs) 2025-08-14T21:42:54.7567223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7567295Z outputs = self.mobilebert( 2025-08-14T21:42:54.7567547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7567614Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7567869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7567932Z layer_outputs = layer_module( 2025-08-14T21:42:54.7568192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7568284Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7568538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7568645Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7568902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7568982Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7568985Z 2025-08-14T21:42:54.7569089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7569266Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7569349Z return mod(**inputs) 2025-08-14T21:42:54.7569608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7569695Z outputs = self.mobilebert( 2025-08-14T21:42:54.7569951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7570013Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7570289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7570356Z layer_outputs = layer_module( 2025-08-14T21:42:54.7570613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7570705Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7570962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7571068Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7571332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7571435Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7571439Z 2025-08-14T21:42:54.7571538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7571721Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7571788Z return mod(**inputs) 2025-08-14T21:42:54.7572049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7572113Z outputs = self.mobilebert( 2025-08-14T21:42:54.7572381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7572446Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7572707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7572778Z layer_outputs = layer_module( 2025-08-14T21:42:54.7573041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7573135Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7573396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7573511Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7573781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7573856Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7573861Z 2025-08-14T21:42:54.7573960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7574142Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7574200Z return mod(**inputs) 2025-08-14T21:42:54.7574464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7574527Z outputs = self.mobilebert( 2025-08-14T21:42:54.7574786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7574864Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7575124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7575207Z layer_outputs = layer_module( 2025-08-14T21:42:54.7575472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7575576Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7575841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7575967Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7576235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7576347Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7576610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7576703Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7576706Z 2025-08-14T21:42:54.7576810Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7576997Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7577056Z return mod(**inputs) 2025-08-14T21:42:54.7577337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7577411Z outputs = self.mobilebert( 2025-08-14T21:42:54.7577672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7577737Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7578008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7578073Z layer_outputs = layer_module( 2025-08-14T21:42:54.7578340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7578427Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7578690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7578800Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7579064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7579144Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7579148Z 2025-08-14T21:42:54.7579242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7579429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7579495Z return mod(**inputs) 2025-08-14T21:42:54.7579757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7579824Z outputs = self.mobilebert( 2025-08-14T21:42:54.7580085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7580151Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7580421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7580485Z layer_outputs = layer_module( 2025-08-14T21:42:54.7580762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7580852Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7581133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7581237Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7581516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7581615Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7581618Z 2025-08-14T21:42:54.7581713Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7581914Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7581977Z return mod(**inputs) 2025-08-14T21:42:54.7582252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7582317Z outputs = self.mobilebert( 2025-08-14T21:42:54.7582588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7582654Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7582918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7582991Z layer_outputs = layer_module( 2025-08-14T21:42:54.7583253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7583349Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7583611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7583726Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7583997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7584075Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7584079Z 2025-08-14T21:42:54.7584181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7584366Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7584425Z return mod(**inputs) 2025-08-14T21:42:54.7584989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7585060Z outputs = self.mobilebert( 2025-08-14T21:42:54.7585323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7585400Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7585681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7585759Z layer_outputs = layer_module( 2025-08-14T21:42:54.7586033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7586127Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7586415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7586530Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7586792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7586912Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7587208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7587323Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7587326Z 2025-08-14T21:42:54.7587420Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7587630Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7587697Z return mod(**inputs) 2025-08-14T21:42:54.7587961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7588056Z outputs = self.mobilebert( 2025-08-14T21:42:54.7588319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7588385Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7588655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7588732Z layer_outputs = layer_module( 2025-08-14T21:42:54.7588990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7589073Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7589324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7589427Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7589677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7589751Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7589761Z 2025-08-14T21:42:54.7589851Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7590028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7590094Z return mod(**inputs) 2025-08-14T21:42:54.7590346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7590408Z outputs = self.mobilebert( 2025-08-14T21:42:54.7590666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7590728Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7590988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7591050Z layer_outputs = layer_module( 2025-08-14T21:42:54.7591297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7591388Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7591639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7591736Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7591996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7592096Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7592099Z 2025-08-14T21:42:54.7592196Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7592372Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7592429Z return mod(**inputs) 2025-08-14T21:42:54.7592697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7592764Z outputs = self.mobilebert( 2025-08-14T21:42:54.7593038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7593101Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7593368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7593436Z layer_outputs = layer_module( 2025-08-14T21:42:54.7593688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7593787Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7594046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7594157Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7594413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7594490Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7594493Z 2025-08-14T21:42:54.7594585Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7594767Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7594824Z return mod(**inputs) 2025-08-14T21:42:54.7595085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7595149Z outputs = self.mobilebert( 2025-08-14T21:42:54.7595400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7595471Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7595717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7595780Z layer_outputs = layer_module( 2025-08-14T21:42:54.7596036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7596116Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7596375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7596485Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7596734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7596849Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7597104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7597193Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7597197Z 2025-08-14T21:42:54.7597288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7597468Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7597534Z return mod(**inputs) 2025-08-14T21:42:54.7597785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7597848Z outputs = self.mobilebert( 2025-08-14T21:42:54.7598107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7598170Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7598468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7598550Z layer_outputs = layer_module( 2025-08-14T21:42:54.7598809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7598945Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7599199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7599283Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7599286Z 2025-08-14T21:42:54.7599393Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7599572Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7599638Z return mod(**inputs) 2025-08-14T21:42:54.7599891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7599955Z outputs = self.mobilebert( 2025-08-14T21:42:54.7600221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7600287Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7600549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7600610Z layer_outputs = layer_module( 2025-08-14T21:42:54.7600865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7600977Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7601237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7601351Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7601356Z 2025-08-14T21:42:54.7601449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7601624Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7601691Z return mod(**inputs) 2025-08-14T21:42:54.7601944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7602012Z outputs = self.mobilebert( 2025-08-14T21:42:54.7602267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7602331Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7602590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7602654Z layer_outputs = layer_module( 2025-08-14T21:42:54.7602905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7603057Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7603308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7603399Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7603402Z 2025-08-14T21:42:54.7603492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7603670Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7603736Z return mod(**inputs) 2025-08-14T21:42:54.7604012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7604085Z outputs = self.mobilebert( 2025-08-14T21:42:54.7604352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7604414Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7604692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7604754Z layer_outputs = layer_module( 2025-08-14T21:42:54.7605005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7605171Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7605425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7605541Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7605795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7605877Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7605882Z 2025-08-14T21:42:54.7605980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7606155Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7606214Z return mod(**inputs) 2025-08-14T21:42:54.7606463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7606521Z outputs = self.mobilebert( 2025-08-14T21:42:54.7606773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7606834Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7607086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7607156Z layer_outputs = layer_module( 2025-08-14T21:42:54.7607407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7607555Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7607806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7607918Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7608177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7608251Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7608255Z 2025-08-14T21:42:54.7608352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7608532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7608591Z return mod(**inputs) 2025-08-14T21:42:54.7608851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7608913Z outputs = self.mobilebert( 2025-08-14T21:42:54.7609169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7609235Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7609489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7609559Z layer_outputs = layer_module( 2025-08-14T21:42:54.7609833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7609991Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7610253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7610382Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7610645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7610766Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7611018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7611108Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7611111Z 2025-08-14T21:42:54.7611203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7611390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7611449Z return mod(**inputs) 2025-08-14T21:42:54.7611699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7611770Z outputs = self.mobilebert( 2025-08-14T21:42:54.7612022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7612086Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7612338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7612399Z layer_outputs = layer_module( 2025-08-14T21:42:54.7612659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7612804Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7613058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7613166Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7613416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7613495Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7613500Z 2025-08-14T21:42:54.7613591Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7613769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7613834Z return mod(**inputs) 2025-08-14T21:42:54.7614085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7614147Z outputs = self.mobilebert( 2025-08-14T21:42:54.7614405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7614471Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7614730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7614792Z layer_outputs = layer_module( 2025-08-14T21:42:54.7615044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7615128Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7615398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7615473Z self_outputs = self.self( 2025-08-14T21:42:54.7615743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7615807Z self.value(value_tensor) 2025-08-14T21:42:54.7615828Z 2025-08-14T21:42:54.7615929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7616109Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7616166Z return mod(**inputs) 2025-08-14T21:42:54.7616444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7616509Z outputs = self.mobilebert( 2025-08-14T21:42:54.7616771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7616836Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7617091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7617156Z layer_outputs = layer_module( 2025-08-14T21:42:54.7617407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7617557Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7617811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7617906Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7618167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7618242Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7618245Z 2025-08-14T21:42:54.7618345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7618524Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7618582Z return mod(**inputs) 2025-08-14T21:42:54.7618847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7618908Z outputs = self.mobilebert( 2025-08-14T21:42:54.7619164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7619234Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7619488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7619556Z layer_outputs = layer_module( 2025-08-14T21:42:54.7619813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7619957Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7622269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7623500Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7623780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7623868Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7624131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7624216Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7624220Z 2025-08-14T21:42:54.7624321Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7624532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7624593Z return mod(**inputs) 2025-08-14T21:42:54.7624986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7625093Z outputs = self.mobilebert( 2025-08-14T21:42:54.7625371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7625450Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7625716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7625786Z layer_outputs = layer_module( 2025-08-14T21:42:54.7626056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7626134Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7626397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7626461Z self_outputs = self.self( 2025-08-14T21:42:54.7626723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7626790Z self.query(query_tensor) 2025-08-14T21:42:54.7626793Z 2025-08-14T21:42:54.7626888Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7627075Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7627135Z return mod(**inputs) 2025-08-14T21:42:54.7627391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7627459Z outputs = self.mobilebert( 2025-08-14T21:42:54.7627720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7627795Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7628058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7628123Z layer_outputs = layer_module( 2025-08-14T21:42:54.7628384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7628459Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7628719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7628792Z self_outputs = self.self( 2025-08-14T21:42:54.7629049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7629115Z self.key(key_tensor) 2025-08-14T21:42:54.7629118Z 2025-08-14T21:42:54.7629194Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7629269Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7629421Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7629626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7629688Z return mod(**inputs) 2025-08-14T21:42:54.7629957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7630024Z outputs = self.mobilebert( 2025-08-14T21:42:54.7630294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7630358Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7630636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7630711Z layer_outputs = layer_module( 2025-08-14T21:42:54.7630991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7631077Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7631336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7631450Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7631717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7631794Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7631797Z 2025-08-14T21:42:54.7631893Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7632082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7632141Z return mod(**inputs) 2025-08-14T21:42:54.7632410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7632475Z outputs = self.mobilebert( 2025-08-14T21:42:54.7632735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7632809Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7633069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7633139Z layer_outputs = layer_module( 2025-08-14T21:42:54.7633398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7633476Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7633741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7633855Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7634115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7634239Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7634501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7634588Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7634591Z 2025-08-14T21:42:54.7634681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7634865Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7634931Z return mod(**inputs) 2025-08-14T21:42:54.7635190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7635310Z outputs = self.mobilebert( 2025-08-14T21:42:54.7635587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7635654Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7635924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7635988Z layer_outputs = layer_module( 2025-08-14T21:42:54.7636257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7636364Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7636615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7636738Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7636994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7637070Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7637073Z 2025-08-14T21:42:54.7637170Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7637345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7637409Z return mod(**inputs) 2025-08-14T21:42:54.7637660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7637720Z outputs = self.mobilebert( 2025-08-14T21:42:54.7637975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7638036Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7638289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7638358Z layer_outputs = layer_module( 2025-08-14T21:42:54.7638610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7638702Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7638953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7639052Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7639312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7639411Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7639416Z 2025-08-14T21:42:54.7639515Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7639695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7639752Z return mod(**inputs) 2025-08-14T21:42:54.7640012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7640074Z outputs = self.mobilebert( 2025-08-14T21:42:54.7640331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7640393Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7640645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7640714Z layer_outputs = layer_module( 2025-08-14T21:42:54.7640979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7641080Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7641340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7641452Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7641709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7641781Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7641784Z 2025-08-14T21:42:54.7641875Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7642070Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7642126Z return mod(**inputs) 2025-08-14T21:42:54.7642393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7642456Z outputs = self.mobilebert( 2025-08-14T21:42:54.7642708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7642774Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7643026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7643088Z layer_outputs = layer_module( 2025-08-14T21:42:54.7643345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7643431Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7643689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7643804Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7644057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7644172Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7644422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7644509Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7644513Z 2025-08-14T21:42:54.7644602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7644780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7644845Z return mod(**inputs) 2025-08-14T21:42:54.7645096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7645160Z outputs = self.mobilebert( 2025-08-14T21:42:54.7645422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7645486Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7645743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7645804Z layer_outputs = layer_module( 2025-08-14T21:42:54.7646050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7646140Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7646394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7646496Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7646775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7646850Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7646853Z 2025-08-14T21:42:54.7646948Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7647126Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7647184Z return mod(**inputs) 2025-08-14T21:42:54.7647452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7647539Z outputs = self.mobilebert( 2025-08-14T21:42:54.7647804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7647867Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7648137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7648209Z layer_outputs = layer_module( 2025-08-14T21:42:54.7648461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7648553Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7648807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7648908Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7649169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7649267Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7649272Z 2025-08-14T21:42:54.7649373Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7649555Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7649615Z return mod(**inputs) 2025-08-14T21:42:54.7649867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7649930Z outputs = self.mobilebert( 2025-08-14T21:42:54.7650181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7650253Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7650505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7650576Z layer_outputs = layer_module( 2025-08-14T21:42:54.7650829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7650914Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7651173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7651284Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7651536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7651617Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7651620Z 2025-08-14T21:42:54.7651709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7651892Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7651950Z return mod(**inputs) 2025-08-14T21:42:54.7652217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7652289Z outputs = self.mobilebert( 2025-08-14T21:42:54.7652558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7652632Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7652889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7652951Z layer_outputs = layer_module( 2025-08-14T21:42:54.7653209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7653310Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7653562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7653693Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7653947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7654055Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7654305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7654382Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7654390Z 2025-08-14T21:42:54.7654478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7654650Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7654716Z return mod(**inputs) 2025-08-14T21:42:54.7654967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7655031Z outputs = self.mobilebert( 2025-08-14T21:42:54.7655295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7655359Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7655617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7655680Z layer_outputs = layer_module( 2025-08-14T21:42:54.7655930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7656021Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7656272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7656372Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7656633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7656707Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7656710Z 2025-08-14T21:42:54.7656806Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7656982Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7657040Z return mod(**inputs) 2025-08-14T21:42:54.7657299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7657363Z outputs = self.mobilebert( 2025-08-14T21:42:54.7657622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7657681Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7657966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7658034Z layer_outputs = layer_module( 2025-08-14T21:42:54.7658287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7658371Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7658631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7658729Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7659004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7659102Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7659119Z 2025-08-14T21:42:54.7659213Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7659404Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7659463Z return mod(**inputs) 2025-08-14T21:42:54.7659723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7659785Z outputs = self.mobilebert( 2025-08-14T21:42:54.7660038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7660108Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7660362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7660427Z layer_outputs = layer_module( 2025-08-14T21:42:54.7660688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7660770Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7661027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7661137Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7661392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7661474Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7661477Z 2025-08-14T21:42:54.7661569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7661757Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7661815Z return mod(**inputs) 2025-08-14T21:42:54.7662069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7662139Z outputs = self.mobilebert( 2025-08-14T21:42:54.7662394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7662458Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7662719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7662781Z layer_outputs = layer_module( 2025-08-14T21:42:54.7663044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7663129Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7663382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7663519Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7663791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7663910Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7664166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7664246Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7664249Z 2025-08-14T21:42:54.7664347Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7664525Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7664603Z return mod(**inputs) 2025-08-14T21:42:54.7664934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7665024Z outputs = self.mobilebert( 2025-08-14T21:42:54.7665289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7665354Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7665607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7665679Z layer_outputs = layer_module( 2025-08-14T21:42:54.7665938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7666057Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7666325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7666399Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7666402Z 2025-08-14T21:42:54.7666501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7666682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7666747Z return mod(**inputs) 2025-08-14T21:42:54.7666999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7667059Z outputs = self.mobilebert( 2025-08-14T21:42:54.7667317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7667382Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7667633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7667700Z layer_outputs = layer_module( 2025-08-14T21:42:54.7667952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7668059Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7668307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7668404Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7668407Z 2025-08-14T21:42:54.7668503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7668678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7668736Z return mod(**inputs) 2025-08-14T21:42:54.7668989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7669050Z outputs = self.mobilebert( 2025-08-14T21:42:54.7669329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7669419Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7669675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7669745Z layer_outputs = layer_module( 2025-08-14T21:42:54.7669996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7670145Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7670399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7670501Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7670505Z 2025-08-14T21:42:54.7670619Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7670800Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7670865Z return mod(**inputs) 2025-08-14T21:42:54.7671119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7671181Z outputs = self.mobilebert( 2025-08-14T21:42:54.7671438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7671500Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7671750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7671822Z layer_outputs = layer_module( 2025-08-14T21:42:54.7672074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7672227Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7672482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7672593Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7672855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7672937Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7672941Z 2025-08-14T21:42:54.7673038Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7673221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7673280Z return mod(**inputs) 2025-08-14T21:42:54.7673545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7673610Z outputs = self.mobilebert( 2025-08-14T21:42:54.7673861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7673931Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7674185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7674253Z layer_outputs = layer_module( 2025-08-14T21:42:54.7674504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7674647Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7674907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7675037Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7675315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7675389Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7675393Z 2025-08-14T21:42:54.7675481Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7675664Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7675718Z return mod(**inputs) 2025-08-14T21:42:54.7675973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7676051Z outputs = self.mobilebert( 2025-08-14T21:42:54.7676305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7676390Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7676648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7676710Z layer_outputs = layer_module( 2025-08-14T21:42:54.7676971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7677113Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7677372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7677485Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7677737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7677856Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7678112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7678200Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7678203Z 2025-08-14T21:42:54.7678294Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7678475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7678540Z return mod(**inputs) 2025-08-14T21:42:54.7678794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7678858Z outputs = self.mobilebert( 2025-08-14T21:42:54.7679112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7679181Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7679438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7679500Z layer_outputs = layer_module( 2025-08-14T21:42:54.7679750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7679901Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7680153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7680254Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7680501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7680576Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7680596Z 2025-08-14T21:42:54.7680713Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7680893Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7680958Z return mod(**inputs) 2025-08-14T21:42:54.7681210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7681273Z outputs = self.mobilebert( 2025-08-14T21:42:54.7681531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7681611Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7681870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7681956Z layer_outputs = layer_module( 2025-08-14T21:42:54.7682212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7682294Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7682549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7682612Z self_outputs = self.self( 2025-08-14T21:42:54.7682873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7682936Z self.value(value_tensor) 2025-08-14T21:42:54.7682939Z 2025-08-14T21:42:54.7683035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7683215Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7683273Z return mod(**inputs) 2025-08-14T21:42:54.7683537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7683601Z outputs = self.mobilebert( 2025-08-14T21:42:54.7683854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7683927Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7684179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7684249Z layer_outputs = layer_module( 2025-08-14T21:42:54.7684507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7684808Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7685086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7685195Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7685461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7685539Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7685544Z 2025-08-14T21:42:54.7685637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7685827Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7685885Z return mod(**inputs) 2025-08-14T21:42:54.7686139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7686211Z outputs = self.mobilebert( 2025-08-14T21:42:54.7686480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7686587Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7686865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7686928Z layer_outputs = layer_module( 2025-08-14T21:42:54.7687187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7687331Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7687599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7687729Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7687991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7688106Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7688371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7688457Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7688468Z 2025-08-14T21:42:54.7688561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7688745Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7688813Z return mod(**inputs) 2025-08-14T21:42:54.7689074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7689141Z outputs = self.mobilebert( 2025-08-14T21:42:54.7689408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7689475Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7689741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7689804Z layer_outputs = layer_module( 2025-08-14T21:42:54.7690064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7690153Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7690413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7690476Z self_outputs = self.self( 2025-08-14T21:42:54.7690746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7690810Z self.query(query_tensor) 2025-08-14T21:42:54.7690815Z 2025-08-14T21:42:54.7690916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7691101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7691159Z return mod(**inputs) 2025-08-14T21:42:54.7691427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7691490Z outputs = self.mobilebert( 2025-08-14T21:42:54.7691757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7691821Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7692080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7692153Z layer_outputs = layer_module( 2025-08-14T21:42:54.7692441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7692521Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7692810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7692876Z self_outputs = self.self( 2025-08-14T21:42:54.7693145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7693206Z self.key(key_tensor) 2025-08-14T21:42:54.7693210Z 2025-08-14T21:42:54.7693283Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7693362Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7693471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7693659Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7693741Z return mod(**inputs) 2025-08-14T21:42:54.7694009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7694078Z outputs = self.mobilebert( 2025-08-14T21:42:54.7694339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7694402Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7694670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7694733Z layer_outputs = layer_module( 2025-08-14T21:42:54.7695000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7695078Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7695338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7695461Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7695723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7695799Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7695808Z 2025-08-14T21:42:54.7695900Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7696083Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7696149Z return mod(**inputs) 2025-08-14T21:42:54.7696408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7696473Z outputs = self.mobilebert( 2025-08-14T21:42:54.7696739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7696806Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7697072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7697135Z layer_outputs = layer_module( 2025-08-14T21:42:54.7697395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7697475Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7697734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7697847Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7698113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7698244Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7698529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7698616Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7698620Z 2025-08-14T21:42:54.7698716Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7698905Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7698964Z return mod(**inputs) 2025-08-14T21:42:54.7699229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7699309Z outputs = self.mobilebert( 2025-08-14T21:42:54.7699579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7699666Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7699921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7699986Z layer_outputs = layer_module( 2025-08-14T21:42:54.7700245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7700329Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7700587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7700689Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7700942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7701020Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7701025Z 2025-08-14T21:42:54.7701113Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7701294Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7701352Z return mod(**inputs) 2025-08-14T21:42:54.7701601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7701666Z outputs = self.mobilebert( 2025-08-14T21:42:54.7701916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7701978Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7702236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7702295Z layer_outputs = layer_module( 2025-08-14T21:42:54.7702554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7702636Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7702889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7702996Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7703249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7703358Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7703361Z 2025-08-14T21:42:54.7703452Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7703629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7703696Z return mod(**inputs) 2025-08-14T21:42:54.7703964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7704045Z outputs = self.mobilebert( 2025-08-14T21:42:54.7704306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7704370Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7704632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7704749Z layer_outputs = layer_module( 2025-08-14T21:42:54.7705008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7705125Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7705376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7705518Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7705774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7705849Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7705853Z 2025-08-14T21:42:54.7705950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7706129Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7706197Z return mod(**inputs) 2025-08-14T21:42:54.7706448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7706513Z outputs = self.mobilebert( 2025-08-14T21:42:54.7706771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7706840Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7707094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7707164Z layer_outputs = layer_module( 2025-08-14T21:42:54.7707416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7707506Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7707757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7707872Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7708133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7708244Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7708502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7708584Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7708587Z 2025-08-14T21:42:54.7708675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7708859Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7708914Z return mod(**inputs) 2025-08-14T21:42:54.7709161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7709225Z outputs = self.mobilebert( 2025-08-14T21:42:54.7709476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7709542Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7709824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7709886Z layer_outputs = layer_module( 2025-08-14T21:42:54.7710140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7710221Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7710476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7710574Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7710839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7710919Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7710943Z 2025-08-14T21:42:54.7711034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7711214Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7711276Z return mod(**inputs) 2025-08-14T21:42:54.7711526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7711589Z outputs = self.mobilebert( 2025-08-14T21:42:54.7711836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7711897Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7712153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7712214Z layer_outputs = layer_module( 2025-08-14T21:42:54.7712473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7712558Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7712808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7712913Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7713162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7713262Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7713271Z 2025-08-14T21:42:54.7713360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7713539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7713602Z return mod(**inputs) 2025-08-14T21:42:54.7713856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7713918Z outputs = self.mobilebert( 2025-08-14T21:42:54.7714178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7714240Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7714500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7714562Z layer_outputs = layer_module( 2025-08-14T21:42:54.7714812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7714901Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7715153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7715277Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7715551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7715626Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7715629Z 2025-08-14T21:42:54.7715722Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7715896Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7715950Z return mod(**inputs) 2025-08-14T21:42:54.7716206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7716281Z outputs = self.mobilebert( 2025-08-14T21:42:54.7716542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7716625Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7716876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7716943Z layer_outputs = layer_module( 2025-08-14T21:42:54.7717195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7717276Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7717528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7717637Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7717891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7717997Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7718247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7718332Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7718335Z 2025-08-14T21:42:54.7718424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7718603Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7718658Z return mod(**inputs) 2025-08-14T21:42:54.7718910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7718982Z outputs = self.mobilebert( 2025-08-14T21:42:54.7719235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7719306Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7719558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7719617Z layer_outputs = layer_module( 2025-08-14T21:42:54.7719869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7719952Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7720202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7720308Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7720559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7720639Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7720644Z 2025-08-14T21:42:54.7720771Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7720983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7721053Z return mod(**inputs) 2025-08-14T21:42:54.7721307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7721377Z outputs = self.mobilebert( 2025-08-14T21:42:54.7721630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7721693Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7721973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7722037Z layer_outputs = layer_module( 2025-08-14T21:42:54.7722295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7722435Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7722688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7722795Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7723048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7723147Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7723151Z 2025-08-14T21:42:54.7723249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7723427Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7723493Z return mod(**inputs) 2025-08-14T21:42:54.7723750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7723815Z outputs = self.mobilebert( 2025-08-14T21:42:54.7724073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7724138Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7724389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7724458Z layer_outputs = layer_module( 2025-08-14T21:42:54.7724708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7724799Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7725051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7725165Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7725423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7725499Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7725502Z 2025-08-14T21:42:54.7725600Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7725775Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7725834Z return mod(**inputs) 2025-08-14T21:42:54.7726092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7726155Z outputs = self.mobilebert( 2025-08-14T21:42:54.7726405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7726490Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7726759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7726830Z layer_outputs = layer_module( 2025-08-14T21:42:54.7727086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7727171Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7727433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7727541Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7727814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7727938Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7728190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7728278Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7728282Z 2025-08-14T21:42:54.7728371Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7728557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7728615Z return mod(**inputs) 2025-08-14T21:42:54.7728865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7728933Z outputs = self.mobilebert( 2025-08-14T21:42:54.7729185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7729250Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7729512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7729574Z layer_outputs = layer_module( 2025-08-14T21:42:54.7729834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7729942Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7730195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7730279Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7730283Z 2025-08-14T21:42:54.7730375Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7730561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7730620Z return mod(**inputs) 2025-08-14T21:42:54.7730877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7730948Z outputs = self.mobilebert( 2025-08-14T21:42:54.7731201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7731264Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7731524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7731583Z layer_outputs = layer_module( 2025-08-14T21:42:54.7731839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7731948Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7732219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7732344Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7732348Z 2025-08-14T21:42:54.7732442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7732629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7732689Z return mod(**inputs) 2025-08-14T21:42:54.7732943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7733016Z outputs = self.mobilebert( 2025-08-14T21:42:54.7733268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7733356Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7733622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7733702Z layer_outputs = layer_module( 2025-08-14T21:42:54.7733966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7734110Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7734364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7734455Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7734459Z 2025-08-14T21:42:54.7734550Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7734736Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7734794Z return mod(**inputs) 2025-08-14T21:42:54.7735046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7735117Z outputs = self.mobilebert( 2025-08-14T21:42:54.7735371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7735433Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7735691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7735752Z layer_outputs = layer_module( 2025-08-14T21:42:54.7736010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7736156Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7736410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7736533Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7736789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7736879Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7736883Z 2025-08-14T21:42:54.7736974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7737150Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7737217Z return mod(**inputs) 2025-08-14T21:42:54.7737472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7737536Z outputs = self.mobilebert( 2025-08-14T21:42:54.7737795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7737875Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7738153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7738218Z layer_outputs = layer_module( 2025-08-14T21:42:54.7738477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7738627Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7738879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7739011Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7739265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7740069Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7740072Z 2025-08-14T21:42:54.7740172Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7740354Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7740419Z return mod(**inputs) 2025-08-14T21:42:54.7740673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7740735Z outputs = self.mobilebert( 2025-08-14T21:42:54.7740995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7741060Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7741315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7741385Z layer_outputs = layer_module( 2025-08-14T21:42:54.7741642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7741784Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7742036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7742142Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7742396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7742504Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7742768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7742850Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7742854Z 2025-08-14T21:42:54.7742947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7743136Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7743195Z return mod(**inputs) 2025-08-14T21:42:54.7743449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7743517Z outputs = self.mobilebert( 2025-08-14T21:42:54.7743769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7743842Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7744094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7744160Z layer_outputs = layer_module( 2025-08-14T21:42:54.7744455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7744603Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7744959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7745066Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7745322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7745405Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7745427Z 2025-08-14T21:42:54.7745522Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7745709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7745787Z return mod(**inputs) 2025-08-14T21:42:54.7746041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7746113Z outputs = self.mobilebert( 2025-08-14T21:42:54.7746365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7746430Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7746689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7746752Z layer_outputs = layer_module( 2025-08-14T21:42:54.7747011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7747088Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7747342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7747414Z self_outputs = self.self( 2025-08-14T21:42:54.7747669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7747734Z self.value(value_tensor) 2025-08-14T21:42:54.7747745Z 2025-08-14T21:42:54.7747837Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7748013Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7748078Z return mod(**inputs) 2025-08-14T21:42:54.7748325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7748387Z outputs = self.mobilebert( 2025-08-14T21:42:54.7748639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7748705Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7748958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7749019Z layer_outputs = layer_module( 2025-08-14T21:42:54.7749270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7749419Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7749672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7749770Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7750035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7750123Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7750128Z 2025-08-14T21:42:54.7750245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7750427Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7750484Z return mod(**inputs) 2025-08-14T21:42:54.7750746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7750808Z outputs = self.mobilebert( 2025-08-14T21:42:54.7751070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7751151Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7751402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7751488Z layer_outputs = layer_module( 2025-08-14T21:42:54.7751742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7751891Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7752146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7752242Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7752501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7752580Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7752832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7752922Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7752926Z 2025-08-14T21:42:54.7753019Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7753206Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7753266Z return mod(**inputs) 2025-08-14T21:42:54.7753516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7753585Z outputs = self.mobilebert( 2025-08-14T21:42:54.7753837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7753908Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7754161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7754222Z layer_outputs = layer_module( 2025-08-14T21:42:54.7754483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7754559Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7754810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7754878Z self_outputs = self.self( 2025-08-14T21:42:54.7755130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7755198Z self.query(query_tensor) 2025-08-14T21:42:54.7755201Z 2025-08-14T21:42:54.7755291Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7755470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7755534Z return mod(**inputs) 2025-08-14T21:42:54.7755800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7755892Z outputs = self.mobilebert( 2025-08-14T21:42:54.7756143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7756207Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7756462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7756524Z layer_outputs = layer_module( 2025-08-14T21:42:54.7756776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7756876Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7757130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7757215Z self_outputs = self.self( 2025-08-14T21:42:54.7757470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7757531Z self.key(key_tensor) 2025-08-14T21:42:54.7757534Z 2025-08-14T21:42:54.7757615Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7757686Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7757778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7757963Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7758021Z return mod(**inputs) 2025-08-14T21:42:54.7758281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7758341Z outputs = self.mobilebert( 2025-08-14T21:42:54.7758596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7758668Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7758922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7758991Z layer_outputs = layer_module( 2025-08-14T21:42:54.7759244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7759320Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7759578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7759690Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7759943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7760029Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7760032Z 2025-08-14T21:42:54.7760123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7760308Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7760365Z return mod(**inputs) 2025-08-14T21:42:54.7760617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7760687Z outputs = self.mobilebert( 2025-08-14T21:42:54.7760939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7761012Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7761265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7761329Z layer_outputs = layer_module( 2025-08-14T21:42:54.7761616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7761694Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7761947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7762064Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7762319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7762438Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7762706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7762806Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7762809Z 2025-08-14T21:42:54.7762910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7763089Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7763156Z return mod(**inputs) 2025-08-14T21:42:54.7763409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7763476Z outputs = self.mobilebert( 2025-08-14T21:42:54.7763737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7763802Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7764058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7764129Z layer_outputs = layer_module( 2025-08-14T21:42:54.7764384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7764477Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7764728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7764829Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7765090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7765164Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7765167Z 2025-08-14T21:42:54.7765267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7765445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7765501Z return mod(**inputs) 2025-08-14T21:42:54.7765767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7765832Z outputs = self.mobilebert( 2025-08-14T21:42:54.7766084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7766154Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7766407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7766476Z layer_outputs = layer_module( 2025-08-14T21:42:54.7766728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7766814Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7767075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7767190Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7767466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7767568Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7767571Z 2025-08-14T21:42:54.7767663Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7767847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7767904Z return mod(**inputs) 2025-08-14T21:42:54.7768156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7768243Z outputs = self.mobilebert( 2025-08-14T21:42:54.7768501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7768590Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7768841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7768902Z layer_outputs = layer_module( 2025-08-14T21:42:54.7769159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7769243Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7769497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7769610Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7769859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7769943Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7769946Z 2025-08-14T21:42:54.7770036Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7770218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7770275Z return mod(**inputs) 2025-08-14T21:42:54.7770523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7770590Z outputs = self.mobilebert( 2025-08-14T21:42:54.7770841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7770905Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7771164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7771227Z layer_outputs = layer_module( 2025-08-14T21:42:54.7771486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7771568Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7771816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7771935Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7772184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7772298Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7772553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7772633Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7772638Z 2025-08-14T21:42:54.7772750Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7772945Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7773005Z return mod(**inputs) 2025-08-14T21:42:54.7773268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7773329Z outputs = self.mobilebert( 2025-08-14T21:42:54.7773591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7773655Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7773926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7773997Z layer_outputs = layer_module( 2025-08-14T21:42:54.7774266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7774357Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7774610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7774709Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7774970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7775043Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7775046Z 2025-08-14T21:42:54.7775137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7775321Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7775377Z return mod(**inputs) 2025-08-14T21:42:54.7775638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7775703Z outputs = self.mobilebert( 2025-08-14T21:42:54.7775954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7776026Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7776278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7776349Z layer_outputs = layer_module( 2025-08-14T21:42:54.7776599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7776683Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7776942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7777042Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7777296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7777404Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7777408Z 2025-08-14T21:42:54.7777498Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7777680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7777737Z return mod(**inputs) 2025-08-14T21:42:54.7777988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7778059Z outputs = self.mobilebert( 2025-08-14T21:42:54.7778310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7778404Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7778672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7778771Z layer_outputs = layer_module( 2025-08-14T21:42:54.7779190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7779330Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7779604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7779764Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7780064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7780166Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7780170Z 2025-08-14T21:42:54.7780372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7780569Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7780655Z return mod(**inputs) 2025-08-14T21:42:54.7780952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7781037Z outputs = self.mobilebert( 2025-08-14T21:42:54.7781343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7781442Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7781714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7781830Z layer_outputs = layer_module( 2025-08-14T21:42:54.7782106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7782210Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7782512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7782654Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7782957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7783088Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7783364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7783482Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7783487Z 2025-08-14T21:42:54.7783618Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7783860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7783938Z return mod(**inputs) 2025-08-14T21:42:54.7784209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7784318Z outputs = self.mobilebert( 2025-08-14T21:42:54.7784757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7784945Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7785231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7785319Z layer_outputs = layer_module( 2025-08-14T21:42:54.7785676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7785807Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7786141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7786277Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7786564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7786687Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7786691Z 2025-08-14T21:42:54.7786829Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7787027Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7787151Z return mod(**inputs) 2025-08-14T21:42:54.7787467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7787583Z outputs = self.mobilebert( 2025-08-14T21:42:54.7787868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7787959Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7788268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7788380Z layer_outputs = layer_module( 2025-08-14T21:42:54.7788703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7788814Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7789102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7789289Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7789570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7789750Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7789754Z 2025-08-14T21:42:54.7789872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7790081Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7790200Z return mod(**inputs) 2025-08-14T21:42:54.7790487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7790604Z outputs = self.mobilebert( 2025-08-14T21:42:54.7790899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7790991Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7791309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7791397Z layer_outputs = layer_module( 2025-08-14T21:42:54.7791686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7791824Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7792122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7792286Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7792587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7792689Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7792708Z 2025-08-14T21:42:54.7792857Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7793086Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7793209Z return mod(**inputs) 2025-08-14T21:42:54.7793496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7793584Z outputs = self.mobilebert( 2025-08-14T21:42:54.7793917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7794009Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7794355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7794453Z layer_outputs = layer_module( 2025-08-14T21:42:54.7794747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7794882Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7795169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7795297Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7795649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7795792Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7796091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7796194Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7796200Z 2025-08-14T21:42:54.7796313Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7796557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7796645Z return mod(**inputs) 2025-08-14T21:42:54.7796944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7797029Z outputs = self.mobilebert( 2025-08-14T21:42:54.7797304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7797403Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7797706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7797820Z layer_outputs = layer_module( 2025-08-14T21:42:54.7798114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7798244Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7798541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7798633Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7798637Z 2025-08-14T21:42:54.7798806Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7799003Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7799082Z return mod(**inputs) 2025-08-14T21:42:54.7799387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7799477Z outputs = self.mobilebert( 2025-08-14T21:42:54.7799755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7799915Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7800187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7800295Z layer_outputs = layer_module( 2025-08-14T21:42:54.7800572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7800700Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7801010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7801154Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7801157Z 2025-08-14T21:42:54.7801292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7801515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7801594Z return mod(**inputs) 2025-08-14T21:42:54.7801891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7802000Z outputs = self.mobilebert( 2025-08-14T21:42:54.7802326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7802416Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7802687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7802798Z layer_outputs = layer_module( 2025-08-14T21:42:54.7803061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7803293Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7803572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7803678Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7803681Z 2025-08-14T21:42:54.7803819Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7804021Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7804089Z return mod(**inputs) 2025-08-14T21:42:54.7804415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7804504Z outputs = self.mobilebert( 2025-08-14T21:42:54.7804804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7804891Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7805168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7805282Z layer_outputs = layer_module( 2025-08-14T21:42:54.7805571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7805761Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7806034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7806166Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7806458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7806613Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7806617Z 2025-08-14T21:42:54.7806784Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7806981Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7807061Z return mod(**inputs) 2025-08-14T21:42:54.7807361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7807431Z outputs = self.mobilebert( 2025-08-14T21:42:54.7807764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7807876Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7808149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7808280Z layer_outputs = layer_module( 2025-08-14T21:42:54.7808564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7808716Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7809042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7809172Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7809474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7809576Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7809580Z 2025-08-14T21:42:54.7809693Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7809925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7810018Z return mod(**inputs) 2025-08-14T21:42:54.7810318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7810405Z outputs = self.mobilebert( 2025-08-14T21:42:54.7810681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7810804Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7811094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7811209Z layer_outputs = layer_module( 2025-08-14T21:42:54.7811484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7811651Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7811953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7812072Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7812400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7812529Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7812801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7812939Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7812944Z 2025-08-14T21:42:54.7813056Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7813288Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7813378Z return mod(**inputs) 2025-08-14T21:42:54.7813684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7813793Z outputs = self.mobilebert( 2025-08-14T21:42:54.7814068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7814154Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7814470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7814564Z layer_outputs = layer_module( 2025-08-14T21:42:54.7814883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7815074Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7815365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7815490Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7815823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7815958Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7815962Z 2025-08-14T21:42:54.7816072Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7816270Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7816378Z return mod(**inputs) 2025-08-14T21:42:54.7816639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7816790Z outputs = self.mobilebert( 2025-08-14T21:42:54.7817066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7817154Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7817453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7817541Z layer_outputs = layer_module( 2025-08-14T21:42:54.7817854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7817962Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7818237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7818348Z self_outputs = self.self( 2025-08-14T21:42:54.7818622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7818709Z self.value(value_tensor) 2025-08-14T21:42:54.7818712Z 2025-08-14T21:42:54.7818868Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7819076Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7819199Z return mod(**inputs) 2025-08-14T21:42:54.7819473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7819556Z outputs = self.mobilebert( 2025-08-14T21:42:54.7819850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7819957Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7820264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7820348Z layer_outputs = layer_module( 2025-08-14T21:42:54.7820652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7820849Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7821115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7821309Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7821585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7821691Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7821695Z 2025-08-14T21:42:54.7821841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7822040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7822166Z return mod(**inputs) 2025-08-14T21:42:54.7822451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7822533Z outputs = self.mobilebert( 2025-08-14T21:42:54.7822840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7822923Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7823198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7823309Z layer_outputs = layer_module( 2025-08-14T21:42:54.7823614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7823807Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7824088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7824210Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7824501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7834908Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7835296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7835403Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7835421Z 2025-08-14T21:42:54.7835527Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7835729Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7835800Z return mod(**inputs) 2025-08-14T21:42:54.7836079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7836161Z outputs = self.mobilebert( 2025-08-14T21:42:54.7836425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7836504Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7836764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7836830Z layer_outputs = layer_module( 2025-08-14T21:42:54.7837097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7837179Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7837507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7837613Z self_outputs = self.self( 2025-08-14T21:42:54.7837869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7837941Z self.query(query_tensor) 2025-08-14T21:42:54.7837945Z 2025-08-14T21:42:54.7838046Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7838232Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7838303Z return mod(**inputs) 2025-08-14T21:42:54.7838558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7838671Z outputs = self.mobilebert( 2025-08-14T21:42:54.7838928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7839021Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7839284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7839349Z layer_outputs = layer_module( 2025-08-14T21:42:54.7839608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7839692Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7839947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7840019Z self_outputs = self.self( 2025-08-14T21:42:54.7840277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7840336Z self.key(key_tensor) 2025-08-14T21:42:54.7840341Z 2025-08-14T21:42:54.7840425Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7840497Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7840600Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7840784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7840844Z return mod(**inputs) 2025-08-14T21:42:54.7841107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7841171Z outputs = self.mobilebert( 2025-08-14T21:42:54.7841427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7841501Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7841754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7841829Z layer_outputs = layer_module( 2025-08-14T21:42:54.7842084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7842159Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7842422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7842537Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7842791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7842876Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7842880Z 2025-08-14T21:42:54.7842974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7843163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7843237Z return mod(**inputs) 2025-08-14T21:42:54.7843505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7843580Z outputs = self.mobilebert( 2025-08-14T21:42:54.7843839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7843913Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7844167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7844230Z layer_outputs = layer_module( 2025-08-14T21:42:54.7844506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7844579Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7844849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7844972Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7845224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7845347Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7845600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7845688Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7845693Z 2025-08-14T21:42:54.7845792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7845970Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7846037Z return mod(**inputs) 2025-08-14T21:42:54.7846293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7846357Z outputs = self.mobilebert( 2025-08-14T21:42:54.7846615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7846680Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7846928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7846997Z layer_outputs = layer_module( 2025-08-14T21:42:54.7847249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7847343Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7847596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7847702Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7847964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7848041Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7848044Z 2025-08-14T21:42:54.7848145Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7848323Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7848382Z return mod(**inputs) 2025-08-14T21:42:54.7848641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7848706Z outputs = self.mobilebert( 2025-08-14T21:42:54.7848979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7849051Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7849320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7849393Z layer_outputs = layer_module( 2025-08-14T21:42:54.7849644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7849729Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7849987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7850108Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7850371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7850491Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7850495Z 2025-08-14T21:42:54.7850589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7850776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7850834Z return mod(**inputs) 2025-08-14T21:42:54.7851094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7851157Z outputs = self.mobilebert( 2025-08-14T21:42:54.7851406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7851479Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7851731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7851795Z layer_outputs = layer_module( 2025-08-14T21:42:54.7852058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7852140Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7852397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7852513Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7852761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7852846Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7852849Z 2025-08-14T21:42:54.7852938Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7853120Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7853180Z return mod(**inputs) 2025-08-14T21:42:54.7853438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7853507Z outputs = self.mobilebert( 2025-08-14T21:42:54.7853759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7853823Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7854082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7854145Z layer_outputs = layer_module( 2025-08-14T21:42:54.7854405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7854488Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7854754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7854890Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7855145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7855261Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7855512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7855596Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7855614Z 2025-08-14T21:42:54.7855715Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7855894Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7855968Z return mod(**inputs) 2025-08-14T21:42:54.7856232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7856296Z outputs = self.mobilebert( 2025-08-14T21:42:54.7856556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7856624Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7856878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7856951Z layer_outputs = layer_module( 2025-08-14T21:42:54.7857204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7857300Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7857556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7857660Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7857923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7858000Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7858004Z 2025-08-14T21:42:54.7858104Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7858282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7858340Z return mod(**inputs) 2025-08-14T21:42:54.7858603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7858668Z outputs = self.mobilebert( 2025-08-14T21:42:54.7858923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7858999Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7859256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7859327Z layer_outputs = layer_module( 2025-08-14T21:42:54.7859579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7859662Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7859923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7860025Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7860279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7860404Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7860408Z 2025-08-14T21:42:54.7860521Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7860708Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7860767Z return mod(**inputs) 2025-08-14T21:42:54.7861022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7861092Z outputs = self.mobilebert( 2025-08-14T21:42:54.7861344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7861432Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7861685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7861766Z layer_outputs = layer_module( 2025-08-14T21:42:54.7862034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7862120Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7862387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7862501Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7862757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7862841Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7862846Z 2025-08-14T21:42:54.7862936Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7863117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7863185Z return mod(**inputs) 2025-08-14T21:42:54.7863444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7863514Z outputs = self.mobilebert( 2025-08-14T21:42:54.7863773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7863837Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7864102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7864164Z layer_outputs = layer_module( 2025-08-14T21:42:54.7864430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7864513Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7864878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7865005Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7865260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7865370Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7865630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7865714Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7865719Z 2025-08-14T21:42:54.7865820Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7865999Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7866060Z return mod(**inputs) 2025-08-14T21:42:54.7866358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7866425Z outputs = self.mobilebert( 2025-08-14T21:42:54.7866690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7866754Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7867011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7867082Z layer_outputs = layer_module( 2025-08-14T21:42:54.7867336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7867436Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7867692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7867818Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7868075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7868152Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7868155Z 2025-08-14T21:42:54.7868254Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7868432Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7868491Z return mod(**inputs) 2025-08-14T21:42:54.7868751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7868814Z outputs = self.mobilebert( 2025-08-14T21:42:54.7869073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7869140Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7869393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7869463Z layer_outputs = layer_module( 2025-08-14T21:42:54.7869714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7869804Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7870057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7870157Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7870416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7870517Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7870520Z 2025-08-14T21:42:54.7870620Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7870798Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7870856Z return mod(**inputs) 2025-08-14T21:42:54.7871114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7871177Z outputs = self.mobilebert( 2025-08-14T21:42:54.7871426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7871498Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7871747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7871817Z layer_outputs = layer_module( 2025-08-14T21:42:54.7872093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7872178Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7872439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7872550Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7872809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7872883Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7872900Z 2025-08-14T21:42:54.7872992Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7873181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7873255Z return mod(**inputs) 2025-08-14T21:42:54.7873517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7873588Z outputs = self.mobilebert( 2025-08-14T21:42:54.7873846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7873915Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7874175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7874238Z layer_outputs = layer_module( 2025-08-14T21:42:54.7874503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7874585Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7874852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7874965Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7875221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7875335Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7875596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7875679Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7875690Z 2025-08-14T21:42:54.7875782Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7875962Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7876025Z return mod(**inputs) 2025-08-14T21:42:54.7876286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7876349Z outputs = self.mobilebert( 2025-08-14T21:42:54.7876614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7876678Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7876939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7877003Z layer_outputs = layer_module( 2025-08-14T21:42:54.7877259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7877379Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7877638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7877729Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7877753Z 2025-08-14T21:42:54.7877847Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7878028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7878095Z return mod(**inputs) 2025-08-14T21:42:54.7878351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7878413Z outputs = self.mobilebert( 2025-08-14T21:42:54.7878675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7878754Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7879016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7879095Z layer_outputs = layer_module( 2025-08-14T21:42:54.7879350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7879462Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7879717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7879816Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7879826Z 2025-08-14T21:42:54.7879916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7880098Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7880160Z return mod(**inputs) 2025-08-14T21:42:54.7880413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7880478Z outputs = self.mobilebert( 2025-08-14T21:42:54.7880744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7880808Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7881069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7881132Z layer_outputs = layer_module( 2025-08-14T21:42:54.7881386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7881536Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7881791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7881877Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7881888Z 2025-08-14T21:42:54.7881981Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7882160Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7882224Z return mod(**inputs) 2025-08-14T21:42:54.7882477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7882539Z outputs = self.mobilebert( 2025-08-14T21:42:54.7882801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7882866Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7883125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7883188Z layer_outputs = layer_module( 2025-08-14T21:42:54.7883483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7883634Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7883890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7884007Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7884265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7884346Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7884364Z 2025-08-14T21:42:54.7884466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7884802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7884919Z return mod(**inputs) 2025-08-14T21:42:54.7885196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7885261Z outputs = self.mobilebert( 2025-08-14T21:42:54.7885526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7885590Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7885849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7885921Z layer_outputs = layer_module( 2025-08-14T21:42:54.7886184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7886339Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7886604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7886721Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7886994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7887070Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7887074Z 2025-08-14T21:42:54.7887166Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7887353Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7887415Z return mod(**inputs) 2025-08-14T21:42:54.7887683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7887745Z outputs = self.mobilebert( 2025-08-14T21:42:54.7888010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7888084Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7888342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7888413Z layer_outputs = layer_module( 2025-08-14T21:42:54.7888671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7888814Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7889085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7889198Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7889484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7889629Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7889889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7889978Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7889981Z 2025-08-14T21:42:54.7890075Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7890256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7890324Z return mod(**inputs) 2025-08-14T21:42:54.7890603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7890672Z outputs = self.mobilebert( 2025-08-14T21:42:54.7890948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7891015Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7891280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7891345Z layer_outputs = layer_module( 2025-08-14T21:42:54.7891610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7891759Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7892016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7892126Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7892388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7892466Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7892476Z 2025-08-14T21:42:54.7892571Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7892752Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7892817Z return mod(**inputs) 2025-08-14T21:42:54.7893076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7893139Z outputs = self.mobilebert( 2025-08-14T21:42:54.7893402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7893473Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7893733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7893805Z layer_outputs = layer_module( 2025-08-14T21:42:54.7894065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7894141Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7894405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7894470Z self_outputs = self.self( 2025-08-14T21:42:54.7894733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7894800Z self.value(value_tensor) 2025-08-14T21:42:54.7894803Z 2025-08-14T21:42:54.7894897Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7895084Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7895159Z return mod(**inputs) 2025-08-14T21:42:54.7895440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7895506Z outputs = self.mobilebert( 2025-08-14T21:42:54.7895772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7895845Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7896112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7896175Z layer_outputs = layer_module( 2025-08-14T21:42:54.7896460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7896610Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7896920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7897056Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7897334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7897416Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7897420Z 2025-08-14T21:42:54.7897513Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7897702Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7897763Z return mod(**inputs) 2025-08-14T21:42:54.7898025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7898096Z outputs = self.mobilebert( 2025-08-14T21:42:54.7898357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7898422Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7898691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7898754Z layer_outputs = layer_module( 2025-08-14T21:42:54.7899020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7899178Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7899433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7899536Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7899790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7899872Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7900128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7900209Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7900212Z 2025-08-14T21:42:54.7900310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7900485Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7900548Z return mod(**inputs) 2025-08-14T21:42:54.7900799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7900861Z outputs = self.mobilebert( 2025-08-14T21:42:54.7901138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7901216Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7901469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7901537Z layer_outputs = layer_module( 2025-08-14T21:42:54.7901787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7901867Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7902117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7902220Z self_outputs = self.self( 2025-08-14T21:42:54.7902480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7902566Z self.query(query_tensor) 2025-08-14T21:42:54.7902569Z 2025-08-14T21:42:54.7902669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7902847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7902905Z return mod(**inputs) 2025-08-14T21:42:54.7903164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7903224Z outputs = self.mobilebert( 2025-08-14T21:42:54.7903474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7903546Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7903797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7903867Z layer_outputs = layer_module( 2025-08-14T21:42:54.7904120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7904197Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7904454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7904517Z self_outputs = self.self( 2025-08-14T21:42:54.7904828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7904899Z self.key(key_tensor) 2025-08-14T21:42:54.7904902Z 2025-08-14T21:42:54.7904977Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7905058Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7905149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7905326Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7905392Z return mod(**inputs) 2025-08-14T21:42:54.7905645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7905715Z outputs = self.mobilebert( 2025-08-14T21:42:54.7905968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7906032Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7906288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7906352Z layer_outputs = layer_module( 2025-08-14T21:42:54.7906602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7906687Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7906965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7907086Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7907339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7907414Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7907417Z 2025-08-14T21:42:54.7907517Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7907692Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7907774Z return mod(**inputs) 2025-08-14T21:42:54.7908035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7908115Z outputs = self.mobilebert( 2025-08-14T21:42:54.7908376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7908440Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7908692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7908761Z layer_outputs = layer_module( 2025-08-14T21:42:54.7909014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7909093Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7909344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7909455Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7909719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7909833Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7910093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7910175Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7910178Z 2025-08-14T21:42:54.7910268Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7910455Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7910513Z return mod(**inputs) 2025-08-14T21:42:54.7910767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7910836Z outputs = self.mobilebert( 2025-08-14T21:42:54.7911093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7911165Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7911418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7911480Z layer_outputs = layer_module( 2025-08-14T21:42:54.7911746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7911830Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7912091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7912191Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7912444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7912543Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7912547Z 2025-08-14T21:42:54.7912655Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7912832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7912899Z return mod(**inputs) 2025-08-14T21:42:54.7913150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7913216Z outputs = self.mobilebert( 2025-08-14T21:42:54.7913468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7913546Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7913807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7913885Z layer_outputs = layer_module( 2025-08-14T21:42:54.7914145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7914229Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7914481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7914584Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7914835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7914937Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7914946Z 2025-08-14T21:42:54.7915035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7915211Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7915278Z return mod(**inputs) 2025-08-14T21:42:54.7915530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7915592Z outputs = self.mobilebert( 2025-08-14T21:42:54.7915850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7915913Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7916170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7916231Z layer_outputs = layer_module( 2025-08-14T21:42:54.7916485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7916574Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7916830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7916942Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7917197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7917271Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7917275Z 2025-08-14T21:42:54.7917370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7917546Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7917605Z return mod(**inputs) 2025-08-14T21:42:54.7917863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7917924Z outputs = self.mobilebert( 2025-08-14T21:42:54.7918209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7918274Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7918529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7918599Z layer_outputs = layer_module( 2025-08-14T21:42:54.7918850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7918933Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7919193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7919322Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7919583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7919708Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7919960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7920048Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7920051Z 2025-08-14T21:42:54.7920142Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7920326Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7920383Z return mod(**inputs) 2025-08-14T21:42:54.7920636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7920704Z outputs = self.mobilebert( 2025-08-14T21:42:54.7920957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7921028Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7921282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7921344Z layer_outputs = layer_module( 2025-08-14T21:42:54.7921602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7921684Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7921936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7922042Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7922296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7922379Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7922383Z 2025-08-14T21:42:54.7922474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7922651Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7922716Z return mod(**inputs) 2025-08-14T21:42:54.7922969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7923036Z outputs = self.mobilebert( 2025-08-14T21:42:54.7923288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7923353Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7923609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7923672Z layer_outputs = layer_module( 2025-08-14T21:42:54.7923962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7924055Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7924308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7924411Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7924664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7924786Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7924790Z 2025-08-14T21:42:54.7924888Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7925066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7925150Z return mod(**inputs) 2025-08-14T21:42:54.7925406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7925468Z outputs = self.mobilebert( 2025-08-14T21:42:54.7925727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7925789Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7926038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7926105Z layer_outputs = layer_module( 2025-08-14T21:42:54.7926357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7926446Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7926698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7926810Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7927069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7927143Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7927147Z 2025-08-14T21:42:54.7927244Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7927420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7927478Z return mod(**inputs) 2025-08-14T21:42:54.7927735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7927796Z outputs = self.mobilebert( 2025-08-14T21:42:54.7928050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7928121Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7928372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7928441Z layer_outputs = layer_module( 2025-08-14T21:42:54.7928690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7928770Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7929028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7929138Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7929408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7929534Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7929786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7929874Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7929878Z 2025-08-14T21:42:54.7929968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7930154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7930210Z return mod(**inputs) 2025-08-14T21:42:54.7930475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7930543Z outputs = self.mobilebert( 2025-08-14T21:42:54.7930795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7930873Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7931135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7931198Z layer_outputs = layer_module( 2025-08-14T21:42:54.7931459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7931542Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7931796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7931902Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7932159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7932243Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7932246Z 2025-08-14T21:42:54.7932338Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7932514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7932581Z return mod(**inputs) 2025-08-14T21:42:54.7932833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7932894Z outputs = self.mobilebert( 2025-08-14T21:42:54.7933155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7933219Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7933477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7933541Z layer_outputs = layer_module( 2025-08-14T21:42:54.7933799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7933887Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7934142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7934246Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7934501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7934600Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7934604Z 2025-08-14T21:42:54.7934701Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7934880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7934954Z return mod(**inputs) 2025-08-14T21:42:54.7935230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7935294Z outputs = self.mobilebert( 2025-08-14T21:42:54.7935553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7935615Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7935865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7935934Z layer_outputs = layer_module( 2025-08-14T21:42:54.7936207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7936296Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7936563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7936675Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7936934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7937009Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7937012Z 2025-08-14T21:42:54.7937102Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7937287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7937344Z return mod(**inputs) 2025-08-14T21:42:54.7937604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7937664Z outputs = self.mobilebert( 2025-08-14T21:42:54.7937918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7937990Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7938243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7938310Z layer_outputs = layer_module( 2025-08-14T21:42:54.7938560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7938642Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7938898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7939009Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7939260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7939376Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7939628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7939715Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7939718Z 2025-08-14T21:42:54.7939808Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7939987Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7940050Z return mod(**inputs) 2025-08-14T21:42:54.7940302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7940370Z outputs = self.mobilebert( 2025-08-14T21:42:54.7940633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7940701Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7940975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7941039Z layer_outputs = layer_module( 2025-08-14T21:42:54.7941293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7941408Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7941662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7941759Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7941762Z 2025-08-14T21:42:54.7941853Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7942048Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7942111Z return mod(**inputs) 2025-08-14T21:42:54.7942364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7942431Z outputs = self.mobilebert( 2025-08-14T21:42:54.7942680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7942742Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7943001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7943064Z layer_outputs = layer_module( 2025-08-14T21:42:54.7943315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.7943428Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.7943681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7943786Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7943789Z 2025-08-14T21:42:54.7943878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7944054Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7944116Z return mod(**inputs) 2025-08-14T21:42:54.7944367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7944434Z outputs = self.mobilebert( 2025-08-14T21:42:54.7944683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7944827Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7945090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7945153Z layer_outputs = layer_module( 2025-08-14T21:42:54.7945403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7945553Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7945805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.7945896Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.7945901Z 2025-08-14T21:42:54.7945990Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7946166Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7946233Z return mod(**inputs) 2025-08-14T21:42:54.7946524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7946596Z outputs = self.mobilebert( 2025-08-14T21:42:54.7946847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7946910Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7947170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7947232Z layer_outputs = layer_module( 2025-08-14T21:42:54.7947505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7947648Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7947916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.7948034Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.7948288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7948368Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7948378Z 2025-08-14T21:42:54.7948469Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7948646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7948712Z return mod(**inputs) 2025-08-14T21:42:54.7948964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7949028Z outputs = self.mobilebert( 2025-08-14T21:42:54.7949290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7949354Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7949612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7949673Z layer_outputs = layer_module( 2025-08-14T21:42:54.7949927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7950074Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7950330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7950442Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7950704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.7950780Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7950784Z 2025-08-14T21:42:54.7950883Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7951059Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7951118Z return mod(**inputs) 2025-08-14T21:42:54.7951376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7951438Z outputs = self.mobilebert( 2025-08-14T21:42:54.7951697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7951761Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7952024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7952102Z layer_outputs = layer_module( 2025-08-14T21:42:54.7952367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.7952510Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.7952768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.7952879Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.7953136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.7953258Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7953508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7953614Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7953617Z 2025-08-14T21:42:54.7953709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7953896Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7953954Z return mod(**inputs) 2025-08-14T21:42:54.7954205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7954274Z outputs = self.mobilebert( 2025-08-14T21:42:54.7954525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7954595Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7954848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7954911Z layer_outputs = layer_module( 2025-08-14T21:42:54.7955173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7955318Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7955570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7955674Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7955928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7956013Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7956017Z 2025-08-14T21:42:54.7956108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7956286Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7956349Z return mod(**inputs) 2025-08-14T21:42:54.7956603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7956668Z outputs = self.mobilebert( 2025-08-14T21:42:54.7956920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7956983Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7957242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7957303Z layer_outputs = layer_module( 2025-08-14T21:42:54.7957555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7957651Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7957920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7957989Z self_outputs = self.self( 2025-08-14T21:42:54.7958240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.7958301Z self.value(value_tensor) 2025-08-14T21:42:54.7958304Z 2025-08-14T21:42:54.7958402Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7958579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7958659Z return mod(**inputs) 2025-08-14T21:42:54.7958910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7958986Z outputs = self.mobilebert( 2025-08-14T21:42:54.7959246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7959309Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7959562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7959631Z layer_outputs = layer_module( 2025-08-14T21:42:54.7959883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7960033Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7960289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.7960387Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.7960649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.7960725Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.7960728Z 2025-08-14T21:42:54.7960824Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7961001Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7961058Z return mod(**inputs) 2025-08-14T21:42:54.7961319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7961379Z outputs = self.mobilebert( 2025-08-14T21:42:54.7961634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7961703Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7961962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7962034Z layer_outputs = layer_module( 2025-08-14T21:42:54.7962286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.7962430Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.7962692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.7962791Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.7963050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.7963128Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.7963392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7963493Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7963497Z 2025-08-14T21:42:54.7963590Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7963773Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7963828Z return mod(**inputs) 2025-08-14T21:42:54.7964079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7964143Z outputs = self.mobilebert( 2025-08-14T21:42:54.7964396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7964475Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7964731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7964807Z layer_outputs = layer_module( 2025-08-14T21:42:54.7965070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7965146Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7965397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7965468Z self_outputs = self.self( 2025-08-14T21:42:54.7965721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.7965787Z self.query(query_tensor) 2025-08-14T21:42:54.7965795Z 2025-08-14T21:42:54.7965887Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7966065Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7966133Z return mod(**inputs) 2025-08-14T21:42:54.7966387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7966447Z outputs = self.mobilebert( 2025-08-14T21:42:54.7966707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7966770Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7967031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7967092Z layer_outputs = layer_module( 2025-08-14T21:42:54.7967349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7967429Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7967685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.7967747Z self_outputs = self.self( 2025-08-14T21:42:54.7968007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.7968064Z self.key(key_tensor) 2025-08-14T21:42:54.7968067Z 2025-08-14T21:42:54.7968144Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7968215Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.7968305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7968491Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7968549Z return mod(**inputs) 2025-08-14T21:42:54.7968810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7968871Z outputs = self.mobilebert( 2025-08-14T21:42:54.7969162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7969235Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7969489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7969551Z layer_outputs = layer_module( 2025-08-14T21:42:54.7969809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7969884Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7970157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7970267Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7970533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.7970612Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7970616Z 2025-08-14T21:42:54.7970705Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7970884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7970938Z return mod(**inputs) 2025-08-14T21:42:54.7971188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7971251Z outputs = self.mobilebert( 2025-08-14T21:42:54.7971503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7971563Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7971818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7971878Z layer_outputs = layer_module( 2025-08-14T21:42:54.7972133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.7972202Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.7972450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.7972561Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.7972810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.7972926Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7973175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7973258Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7973263Z 2025-08-14T21:42:54.7973353Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7973527Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7973582Z return mod(**inputs) 2025-08-14T21:42:54.7973836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7973897Z outputs = self.mobilebert( 2025-08-14T21:42:54.7974152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7974216Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7974466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7974548Z layer_outputs = layer_module( 2025-08-14T21:42:54.7974823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7974916Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7975170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7975268Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7975528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7975616Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7975619Z 2025-08-14T21:42:54.7975709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7975891Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7975967Z return mod(**inputs) 2025-08-14T21:42:54.7976227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7976289Z outputs = self.mobilebert( 2025-08-14T21:42:54.7976542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7976609Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7976864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7976932Z layer_outputs = layer_module( 2025-08-14T21:42:54.7977180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7977259Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7977518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7977613Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7977864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7977966Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7977969Z 2025-08-14T21:42:54.7978057Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7978238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7978297Z return mod(**inputs) 2025-08-14T21:42:54.7978547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7978614Z outputs = self.mobilebert( 2025-08-14T21:42:54.7978867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7978933Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7979182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7979241Z layer_outputs = layer_module( 2025-08-14T21:42:54.7979492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7979571Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7979821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7979935Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7980200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7980292Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7980295Z 2025-08-14T21:42:54.7980385Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7980560Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7980624Z return mod(**inputs) 2025-08-14T21:42:54.7980878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7980947Z outputs = self.mobilebert( 2025-08-14T21:42:54.7981203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7981723Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7981984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7982064Z layer_outputs = layer_module( 2025-08-14T21:42:54.7982326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7982420Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7982674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7982797Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7983052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7983159Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7983421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7983505Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7983509Z 2025-08-14T21:42:54.7983611Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7983794Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7983852Z return mod(**inputs) 2025-08-14T21:42:54.7984112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7984175Z outputs = self.mobilebert( 2025-08-14T21:42:54.7984434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7984499Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7984927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7985005Z layer_outputs = layer_module( 2025-08-14T21:42:54.7985270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7985357Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7985631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7985741Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7986018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7986094Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7986098Z 2025-08-14T21:42:54.7986190Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7986375Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7986470Z return mod(**inputs) 2025-08-14T21:42:54.7986750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7986815Z outputs = self.mobilebert( 2025-08-14T21:42:54.7987067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7987138Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7987389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7987476Z layer_outputs = layer_module( 2025-08-14T21:42:54.7987748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7987836Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7988143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7988247Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7988511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7988620Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7988625Z 2025-08-14T21:42:54.7988720Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7988913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7988975Z return mod(**inputs) 2025-08-14T21:42:54.7989240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7989312Z outputs = self.mobilebert( 2025-08-14T21:42:54.7989579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7989646Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7989915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7989979Z layer_outputs = layer_module( 2025-08-14T21:42:54.7990250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7990335Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7990601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7990724Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7990993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.7991078Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.7991081Z 2025-08-14T21:42:54.7991176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7991358Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7991421Z return mod(**inputs) 2025-08-14T21:42:54.7991688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7991755Z outputs = self.mobilebert( 2025-08-14T21:42:54.7992029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7992096Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7992381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7992450Z layer_outputs = layer_module( 2025-08-14T21:42:54.7992761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7992856Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7993125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.7993248Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.7993518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.7993649Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.7993923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.7994029Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.7994035Z 2025-08-14T21:42:54.7994139Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7994327Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7994386Z return mod(**inputs) 2025-08-14T21:42:54.7994661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7994729Z outputs = self.mobilebert( 2025-08-14T21:42:54.7994999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7995077Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7995348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7995429Z layer_outputs = layer_module( 2025-08-14T21:42:54.7995701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7995793Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7996069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7996175Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7996445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.7996534Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.7996537Z 2025-08-14T21:42:54.7996637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7996834Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7996901Z return mod(**inputs) 2025-08-14T21:42:54.7997174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7997249Z outputs = self.mobilebert( 2025-08-14T21:42:54.7997523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.7997601Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.7997869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.7997933Z layer_outputs = layer_module( 2025-08-14T21:42:54.7998215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.7998296Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.7998575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.7998678Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.7998930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.7999034Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.7999037Z 2025-08-14T21:42:54.7999131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.7999312Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.7999393Z return mod(**inputs) 2025-08-14T21:42:54.7999655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.7999723Z outputs = self.mobilebert( 2025-08-14T21:42:54.7999994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8000058Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8000312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8000374Z layer_outputs = layer_module( 2025-08-14T21:42:54.8000632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8000714Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8000964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8001081Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8001335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.8001410Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8001414Z 2025-08-14T21:42:54.8001511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8001688Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8001751Z return mod(**inputs) 2025-08-14T21:42:54.8002002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8002064Z outputs = self.mobilebert( 2025-08-14T21:42:54.8002321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8002386Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8002642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8002707Z layer_outputs = layer_module( 2025-08-14T21:42:54.8002959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8003046Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8003298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8003407Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8003662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.8003771Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8004026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8004120Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8004124Z 2025-08-14T21:42:54.8004230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8004411Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8004466Z return mod(**inputs) 2025-08-14T21:42:54.8004721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8004779Z outputs = self.mobilebert( 2025-08-14T21:42:54.8005030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8005117Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8005376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8005456Z layer_outputs = layer_module( 2025-08-14T21:42:54.8005713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.8005817Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.8006072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8006145Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8006148Z 2025-08-14T21:42:54.8006239Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8006420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8006479Z return mod(**inputs) 2025-08-14T21:42:54.8006735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8006797Z outputs = self.mobilebert( 2025-08-14T21:42:54.8007052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8007121Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8007373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8007435Z layer_outputs = layer_module( 2025-08-14T21:42:54.8007695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.8007800Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.8008059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8008156Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8008161Z 2025-08-14T21:42:54.8008254Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8008440Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8008497Z return mod(**inputs) 2025-08-14T21:42:54.8008754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8008815Z outputs = self.mobilebert( 2025-08-14T21:42:54.8009065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8009135Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8009386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8009448Z layer_outputs = layer_module( 2025-08-14T21:42:54.8009723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8009883Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8010143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.8010225Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.8010228Z 2025-08-14T21:42:54.8010321Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8010499Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8010583Z return mod(**inputs) 2025-08-14T21:42:54.8010835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8010896Z outputs = self.mobilebert( 2025-08-14T21:42:54.8011162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8011235Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8011485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8011545Z layer_outputs = layer_module( 2025-08-14T21:42:54.8011806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8011949Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8012205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.8012315Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.8012568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8012658Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8012662Z 2025-08-14T21:42:54.8012752Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8012936Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8012993Z return mod(**inputs) 2025-08-14T21:42:54.8013246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8013315Z outputs = self.mobilebert( 2025-08-14T21:42:54.8013565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8013638Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8013893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8013955Z layer_outputs = layer_module( 2025-08-14T21:42:54.8014214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8014359Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8014609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.8014728Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.8014981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.8015064Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8015068Z 2025-08-14T21:42:54.8015156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8015366Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8015451Z return mod(**inputs) 2025-08-14T21:42:54.8015703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8015770Z outputs = self.mobilebert( 2025-08-14T21:42:54.8016017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8016080Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8016338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8016418Z layer_outputs = layer_module( 2025-08-14T21:42:54.8016673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8016839Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8017094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.8017209Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.8017463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.8017569Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8017834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8017915Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8017918Z 2025-08-14T21:42:54.8018015Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8018200Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8018261Z return mod(**inputs) 2025-08-14T21:42:54.8018520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8018581Z outputs = self.mobilebert( 2025-08-14T21:42:54.8018837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8018899Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8019153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8019221Z layer_outputs = layer_module( 2025-08-14T21:42:54.8019474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.8019623Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.8019885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.8019982Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.8020241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.8020315Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.8020318Z 2025-08-14T21:42:54.8020408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8020594Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8020652Z return mod(**inputs) 2025-08-14T21:42:54.8020911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8020988Z outputs = self.mobilebert( 2025-08-14T21:42:54.8021257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8021331Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8021588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8021650Z layer_outputs = layer_module( 2025-08-14T21:42:54.8021917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8022010Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8022267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.8022332Z self_outputs = self.self( 2025-08-14T21:42:54.8022600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.8022669Z self.value(value_tensor) 2025-08-14T21:42:54.8022672Z 2025-08-14T21:42:54.8022762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8022942Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8022996Z return mod(**inputs) 2025-08-14T21:42:54.8023246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8023313Z outputs = self.mobilebert( 2025-08-14T21:42:54.8023562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8023626Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8023888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8023951Z layer_outputs = layer_module( 2025-08-14T21:42:54.8024209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.8024353Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.8024605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.8024778Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.8025060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.8025148Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.8025152Z 2025-08-14T21:42:54.8025253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8025448Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8025519Z return mod(**inputs) 2025-08-14T21:42:54.8025788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8025853Z outputs = self.mobilebert( 2025-08-14T21:42:54.8026133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8026201Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8026487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8026554Z layer_outputs = layer_module( 2025-08-14T21:42:54.8026809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.8026990Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.8027245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.8027349Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.8027598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.8027673Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.8027927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8028024Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8028027Z 2025-08-14T21:42:54.8028125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8028319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8028378Z return mod(**inputs) 2025-08-14T21:42:54.8028637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8028700Z outputs = self.mobilebert( 2025-08-14T21:42:54.8028950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8029021Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8029271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8029341Z layer_outputs = layer_module( 2025-08-14T21:42:54.8029592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8029669Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8029929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.8029991Z self_outputs = self.self( 2025-08-14T21:42:54.8030245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.8030316Z self.query(query_tensor) 2025-08-14T21:42:54.8030319Z 2025-08-14T21:42:54.8030411Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8030593Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8030648Z return mod(**inputs) 2025-08-14T21:42:54.8030893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8030959Z outputs = self.mobilebert( 2025-08-14T21:42:54.8031212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8031282Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8031531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8031592Z layer_outputs = layer_module( 2025-08-14T21:42:54.8031849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8031924Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8032174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.8032243Z self_outputs = self.self( 2025-08-14T21:42:54.8032494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.8032577Z self.key(key_tensor) 2025-08-14T21:42:54.8032581Z 2025-08-14T21:42:54.8032679Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.8032753Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.8032851Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8033031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8033094Z return mod(**inputs) 2025-08-14T21:42:54.8033345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8033407Z outputs = self.mobilebert( 2025-08-14T21:42:54.8033678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8033741Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8034006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8034076Z layer_outputs = layer_module( 2025-08-14T21:42:54.8034327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8034406Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8034654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.8034761Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.8035013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.8035086Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8035089Z 2025-08-14T21:42:54.8035184Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8035362Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8035420Z return mod(**inputs) 2025-08-14T21:42:54.8035677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8035738Z outputs = self.mobilebert( 2025-08-14T21:42:54.8035988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8036059Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8036308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8036378Z layer_outputs = layer_module( 2025-08-14T21:42:54.8036637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8036716Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8036983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.8037095Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.8037362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.8037476Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8037762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8037853Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8037857Z 2025-08-14T21:42:54.8037947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8038140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8038207Z return mod(**inputs) 2025-08-14T21:42:54.8038474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8038544Z outputs = self.mobilebert( 2025-08-14T21:42:54.8038801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8038863Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8039134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8039215Z layer_outputs = layer_module( 2025-08-14T21:42:54.8039483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8039587Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8039851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8039960Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8040223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8040298Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8040308Z 2025-08-14T21:42:54.8040401Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8040582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8040649Z return mod(**inputs) 2025-08-14T21:42:54.8040910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8040975Z outputs = self.mobilebert( 2025-08-14T21:42:54.8041243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8041307Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8041572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8041632Z layer_outputs = layer_module( 2025-08-14T21:42:54.8041893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8041983Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8042244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8042345Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8042613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8042718Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8042721Z 2025-08-14T21:42:54.8042822Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8043005Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8043064Z return mod(**inputs) 2025-08-14T21:42:54.8043331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8043393Z outputs = self.mobilebert( 2025-08-14T21:42:54.8043660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8043724Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8043999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8044083Z layer_outputs = layer_module( 2025-08-14T21:42:54.8044345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8044431Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8044696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8044810Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8045077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.8045174Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8045178Z 2025-08-14T21:42:54.8045270Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8045479Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8045540Z return mod(**inputs) 2025-08-14T21:42:54.8045811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8045873Z outputs = self.mobilebert( 2025-08-14T21:42:54.8046133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8046206Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8046467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8046532Z layer_outputs = layer_module( 2025-08-14T21:42:54.8046798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8046887Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8047153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8047267Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8047525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.8047644Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8047902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8047993Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8047997Z 2025-08-14T21:42:54.8048090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8048275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8048340Z return mod(**inputs) 2025-08-14T21:42:54.8048599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8048669Z outputs = self.mobilebert( 2025-08-14T21:42:54.8048922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8048985Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8049248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8049312Z layer_outputs = layer_module( 2025-08-14T21:42:54.8049570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8049660Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8049943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8050052Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8050309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8050384Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8050388Z 2025-08-14T21:42:54.8050485Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8050669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8050761Z return mod(**inputs) 2025-08-14T21:42:54.8051022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8051099Z outputs = self.mobilebert( 2025-08-14T21:42:54.8051366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8051431Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8051684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8051755Z layer_outputs = layer_module( 2025-08-14T21:42:54.8052009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8052099Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8052354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8052448Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8052708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8052807Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8052811Z 2025-08-14T21:42:54.8052909Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8053088Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8053146Z return mod(**inputs) 2025-08-14T21:42:54.8053407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8053469Z outputs = self.mobilebert( 2025-08-14T21:42:54.8053724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8053795Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8054048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8054121Z layer_outputs = layer_module( 2025-08-14T21:42:54.8054375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8054457Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8054717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8054826Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8055086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.8055162Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8055165Z 2025-08-14T21:42:54.8055257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8055466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8055539Z return mod(**inputs) 2025-08-14T21:42:54.8055792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8055861Z outputs = self.mobilebert( 2025-08-14T21:42:54.8056110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8056173Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8056421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8056500Z layer_outputs = layer_module( 2025-08-14T21:42:54.8056753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8056851Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8057106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8057217Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8057472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.8057585Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8057839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8057920Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8057930Z 2025-08-14T21:42:54.8058021Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8058202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8058265Z return mod(**inputs) 2025-08-14T21:42:54.8058519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8058581Z outputs = self.mobilebert( 2025-08-14T21:42:54.8058839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8058901Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8059158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8059222Z layer_outputs = layer_module( 2025-08-14T21:42:54.8059475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8059566Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8059818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8059914Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8060171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8060246Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8060249Z 2025-08-14T21:42:54.8060345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8060523Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8060581Z return mod(**inputs) 2025-08-14T21:42:54.8060837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8060897Z outputs = self.mobilebert( 2025-08-14T21:42:54.8061182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8061247Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8061499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8061567Z layer_outputs = layer_module( 2025-08-14T21:42:54.8061819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8061901Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8062182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8062280Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8062554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8062654Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8062657Z 2025-08-14T21:42:54.8062748Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8062933Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8062991Z return mod(**inputs) 2025-08-14T21:42:54.8063249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8063311Z outputs = self.mobilebert( 2025-08-14T21:42:54.8063565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8063635Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8063884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8063948Z layer_outputs = layer_module( 2025-08-14T21:42:54.8064206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8064287Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8064543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8064653Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8064968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.8065067Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8065070Z 2025-08-14T21:42:54.8065171Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8065379Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8065444Z return mod(**inputs) 2025-08-14T21:42:54.8065737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8065810Z outputs = self.mobilebert( 2025-08-14T21:42:54.8066084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8066159Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8066437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8066505Z layer_outputs = layer_module( 2025-08-14T21:42:54.8066781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8066898Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8067177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8067301Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8067565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.8067681Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8067950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8068057Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8068060Z 2025-08-14T21:42:54.8068164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8068350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8068438Z return mod(**inputs) 2025-08-14T21:42:54.8068704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8068769Z outputs = self.mobilebert( 2025-08-14T21:42:54.8069043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8069111Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8069378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8069450Z layer_outputs = layer_module( 2025-08-14T21:42:54.8069716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.8069838Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.8070107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8070185Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8070189Z 2025-08-14T21:42:54.8070291Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8070472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8070535Z return mod(**inputs) 2025-08-14T21:42:54.8070804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8070874Z outputs = self.mobilebert( 2025-08-14T21:42:54.8071146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8071212Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8071482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8071554Z layer_outputs = layer_module( 2025-08-14T21:42:54.8071819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.8071935Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.8072198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8072301Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8072306Z 2025-08-14T21:42:54.8072409Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8072595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8072664Z return mod(**inputs) 2025-08-14T21:42:54.8072957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8073025Z outputs = self.mobilebert( 2025-08-14T21:42:54.8073301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8073368Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8073637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8073708Z layer_outputs = layer_module( 2025-08-14T21:42:54.8073975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8074153Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8074442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.8074531Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.8074535Z 2025-08-14T21:42:54.8074639Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8074826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8074892Z return mod(**inputs) 2025-08-14T21:42:54.8075155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8075219Z outputs = self.mobilebert( 2025-08-14T21:42:54.8075494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8075562Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8075827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8075901Z layer_outputs = layer_module( 2025-08-14T21:42:54.8076164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8076319Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8076582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.8076697Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.8076971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8077057Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8077061Z 2025-08-14T21:42:54.8077159Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8077346Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8077407Z return mod(**inputs) 2025-08-14T21:42:54.8077679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8077739Z outputs = self.mobilebert( 2025-08-14T21:42:54.8077995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8078058Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8078305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8078373Z layer_outputs = layer_module( 2025-08-14T21:42:54.8078620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8078777Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8079049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.8079161Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.8079419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.8079493Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8079496Z 2025-08-14T21:42:54.8079587Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8079788Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8079846Z return mod(**inputs) 2025-08-14T21:42:54.8080104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8080184Z outputs = self.mobilebert( 2025-08-14T21:42:54.8080436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8080505Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8080755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8080817Z layer_outputs = layer_module( 2025-08-14T21:42:54.8081074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8081216Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8081473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.8081584Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.8081836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.8081947Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8082197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8082286Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8082290Z 2025-08-14T21:42:54.8082378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8082556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8082618Z return mod(**inputs) 2025-08-14T21:42:54.8082863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8082934Z outputs = self.mobilebert( 2025-08-14T21:42:54.8083186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8083252Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8083504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8083563Z layer_outputs = layer_module( 2025-08-14T21:42:54.8083811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.8083960Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.8084213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.8084355Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.8084803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.8084890Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.8084895Z 2025-08-14T21:42:54.8085000Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8085185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8085254Z return mod(**inputs) 2025-08-14T21:42:54.8085516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8085604Z outputs = self.mobilebert( 2025-08-14T21:42:54.8085873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8085960Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8086231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8086304Z layer_outputs = layer_module( 2025-08-14T21:42:54.8086560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8086645Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8086901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.8086964Z self_outputs = self.self( 2025-08-14T21:42:54.8087224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:42:54.8087286Z self.value(value_tensor) 2025-08-14T21:42:54.8087290Z 2025-08-14T21:42:54.8087389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8087569Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8087627Z return mod(**inputs) 2025-08-14T21:42:54.8087887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8087949Z outputs = self.mobilebert( 2025-08-14T21:42:54.8088202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8088273Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8088524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8088593Z layer_outputs = layer_module( 2025-08-14T21:42:54.8088843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.8088991Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.8089253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:42:54.8089350Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:42:54.8089610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:42:54.8089683Z layer_input = self.dense(hidden_states) 2025-08-14T21:42:54.8089686Z 2025-08-14T21:42:54.8089777Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8089962Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8090020Z return mod(**inputs) 2025-08-14T21:42:54.8090294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8090365Z outputs = self.mobilebert( 2025-08-14T21:42:54.8090633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8090706Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8090959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8091020Z layer_outputs = layer_module( 2025-08-14T21:42:54.8091277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:42:54.8091434Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:42:54.8091689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:42:54.8091803Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:42:54.8092052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:42:54.8092134Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:42:54.8092386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8092467Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8092476Z 2025-08-14T21:42:54.8092566Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8092740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8092806Z return mod(**inputs) 2025-08-14T21:42:54.8093056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8093120Z outputs = self.mobilebert( 2025-08-14T21:42:54.8093379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8093442Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8093696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8093757Z layer_outputs = layer_module( 2025-08-14T21:42:54.8094009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8094092Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8094344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.8094407Z self_outputs = self.self( 2025-08-14T21:42:54.8094668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:42:54.8094731Z self.query(query_tensor) 2025-08-14T21:42:54.8094734Z 2025-08-14T21:42:54.8094829Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8095006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8095063Z return mod(**inputs) 2025-08-14T21:42:54.8095320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8095381Z outputs = self.mobilebert( 2025-08-14T21:42:54.8095638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8095701Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8095966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8096050Z layer_outputs = layer_module( 2025-08-14T21:42:54.8096303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8096377Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8096633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:42:54.8096694Z self_outputs = self.self( 2025-08-14T21:42:54.8096951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:42:54.8097032Z self.key(key_tensor) 2025-08-14T21:42:54.8097036Z 2025-08-14T21:42:54.8097107Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.8097186Z cudagraph partition due to non gpu ops 2025-08-14T21:42:54.8097295Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8097481Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8097538Z return mod(**inputs) 2025-08-14T21:42:54.8097788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8097856Z outputs = self.mobilebert( 2025-08-14T21:42:54.8098112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8098173Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8098433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8098496Z layer_outputs = layer_module( 2025-08-14T21:42:54.8098756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8098835Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8099091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.8099208Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.8099462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:42:54.8099538Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8099549Z 2025-08-14T21:42:54.8099639Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8099817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8099881Z return mod(**inputs) 2025-08-14T21:42:54.8100136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8100199Z outputs = self.mobilebert( 2025-08-14T21:42:54.8100460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8100525Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8100784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8100846Z layer_outputs = layer_module( 2025-08-14T21:42:54.8101100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:42:54.8101183Z self_attention_outputs = self.attention( 2025-08-14T21:42:54.8101437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:42:54.8101563Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:42:54.8101840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:42:54.8101953Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8102211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8102293Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8102296Z 2025-08-14T21:42:54.8102386Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8102567Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8102639Z return mod(**inputs) 2025-08-14T21:42:54.8102892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8102968Z outputs = self.mobilebert( 2025-08-14T21:42:54.8103219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8103286Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8103538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8103608Z layer_outputs = layer_module( 2025-08-14T21:42:54.8103862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8103947Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8104205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8104306Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8104565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8104648Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8104651Z 2025-08-14T21:42:54.8104797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8104989Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8105047Z return mod(**inputs) 2025-08-14T21:42:54.8105300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8105372Z outputs = self.mobilebert( 2025-08-14T21:42:54.8105629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8105701Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8105960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8106026Z layer_outputs = layer_module( 2025-08-14T21:42:54.8106289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8106373Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8106622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8106728Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8106982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8107091Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8107094Z 2025-08-14T21:42:54.8107189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8107405Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8107474Z return mod(**inputs) 2025-08-14T21:42:54.8107728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8107799Z outputs = self.mobilebert( 2025-08-14T21:42:54.8108050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8108110Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8108362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8108439Z layer_outputs = layer_module( 2025-08-14T21:42:54.8108689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8108791Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8109040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8109159Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8109409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.8109484Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8109487Z 2025-08-14T21:42:54.8109583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8109759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8109821Z return mod(**inputs) 2025-08-14T21:42:54.8110072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8110137Z outputs = self.mobilebert( 2025-08-14T21:42:54.8110395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8110458Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8110709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8110777Z layer_outputs = layer_module( 2025-08-14T21:42:54.8111025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8111117Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8111368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8111479Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8111739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.8111848Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8112106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8112187Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8112190Z 2025-08-14T21:42:54.8112282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8112465Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8112526Z return mod(**inputs) 2025-08-14T21:42:54.8112776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8112847Z outputs = self.mobilebert( 2025-08-14T21:42:54.8113124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8113198Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8113452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8113514Z layer_outputs = layer_module( 2025-08-14T21:42:54.8113774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8113858Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8114127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8114224Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8114494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8114574Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8114578Z 2025-08-14T21:42:54.8114668Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8114851Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8114909Z return mod(**inputs) 2025-08-14T21:42:54.8115161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8115229Z outputs = self.mobilebert( 2025-08-14T21:42:54.8115484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8115548Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8115809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8115873Z layer_outputs = layer_module( 2025-08-14T21:42:54.8116132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8116215Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8116467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8116572Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8116823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8116930Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8116933Z 2025-08-14T21:42:54.8117026Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8117203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8117270Z return mod(**inputs) 2025-08-14T21:42:54.8117522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8117584Z outputs = self.mobilebert( 2025-08-14T21:42:54.8117844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8117908Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8118166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8118232Z layer_outputs = layer_module( 2025-08-14T21:42:54.8118486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8118592Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8118869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8118985Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8119233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.8119304Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8119307Z 2025-08-14T21:42:54.8119404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8119595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8119649Z return mod(**inputs) 2025-08-14T21:42:54.8119907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8119988Z outputs = self.mobilebert( 2025-08-14T21:42:54.8120254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8120318Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8120572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8120642Z layer_outputs = layer_module( 2025-08-14T21:42:54.8120897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8120988Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8121242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8121355Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8121620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.8121729Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8121986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8122072Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8122075Z 2025-08-14T21:42:54.8122165Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8122348Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8122406Z return mod(**inputs) 2025-08-14T21:42:54.8122663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8122733Z outputs = self.mobilebert( 2025-08-14T21:42:54.8122992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8123062Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8123318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8123380Z layer_outputs = layer_module( 2025-08-14T21:42:54.8123641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8123722Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8123979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8124085Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8124356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8124455Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8124459Z 2025-08-14T21:42:54.8124551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8124729Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8124790Z return mod(**inputs) 2025-08-14T21:42:54.8125037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8125102Z outputs = self.mobilebert( 2025-08-14T21:42:54.8125369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8125429Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8125706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8125766Z layer_outputs = layer_module( 2025-08-14T21:42:54.8126014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8126100Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8126353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:42:54.8126458Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:42:54.8126707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8126806Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8126810Z 2025-08-14T21:42:54.8126910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8127091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8127154Z return mod(**inputs) 2025-08-14T21:42:54.8127407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8127467Z outputs = self.mobilebert( 2025-08-14T21:42:54.8127726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8127789Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8128041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8128110Z layer_outputs = layer_module( 2025-08-14T21:42:54.8128363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8128458Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8128709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8128821Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8129080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:42:54.8129155Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8129158Z 2025-08-14T21:42:54.8129255Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8129431Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8129489Z return mod(**inputs) 2025-08-14T21:42:54.8129745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8129826Z outputs = self.mobilebert( 2025-08-14T21:42:54.8130098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8130163Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8130415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8130483Z layer_outputs = layer_module( 2025-08-14T21:42:54.8130734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:42:54.8130834Z attention_output = ffn_module(attention_output) 2025-08-14T21:42:54.8131097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:42:54.8131222Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:42:54.8131478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:42:54.8131583Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8131835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8131920Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8131924Z 2025-08-14T21:42:54.8132012Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8132195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8132254Z return mod(**inputs) 2025-08-14T21:42:54.8132507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8132577Z outputs = self.mobilebert( 2025-08-14T21:42:54.8132831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8132896Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8133153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8133217Z layer_outputs = layer_module( 2025-08-14T21:42:54.8133474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.8133580Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.8133835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:42:54.8133917Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8133922Z 2025-08-14T21:42:54.8134015Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8134204Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8134261Z return mod(**inputs) 2025-08-14T21:42:54.8134511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8134578Z outputs = self.mobilebert( 2025-08-14T21:42:54.8134829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8134892Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8135144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8135205Z layer_outputs = layer_module( 2025-08-14T21:42:54.8135477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:42:54.8135602Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:42:54.8135853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:42:54.8135956Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:42:54.8135960Z 2025-08-14T21:42:54.8136050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8136232Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8136290Z return mod(**inputs) 2025-08-14T21:42:54.8136558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8136626Z outputs = self.mobilebert( 2025-08-14T21:42:54.8136880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8136961Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8137219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8137281Z layer_outputs = layer_module( 2025-08-14T21:42:54.8137538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8137679Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8137927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:42:54.8138018Z layer_output = self.dense(intermediate_states) 2025-08-14T21:42:54.8138021Z 2025-08-14T21:42:54.8138112Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8138297Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8138356Z return mod(**inputs) 2025-08-14T21:42:54.8138608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8138677Z outputs = self.mobilebert( 2025-08-14T21:42:54.8138928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8138991Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8139249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8139313Z layer_outputs = layer_module( 2025-08-14T21:42:54.8139564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8139704Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8139957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:42:54.8140071Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:42:54.8140321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8140407Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8140411Z 2025-08-14T21:42:54.8140501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8140680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8140747Z return mod(**inputs) 2025-08-14T21:42:54.8140998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8141092Z outputs = self.mobilebert( 2025-08-14T21:42:54.8141359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8141426Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8141688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8141752Z layer_outputs = layer_module( 2025-08-14T21:42:54.8142003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8142167Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8142420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.8142554Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.8142811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:42:54.8142886Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:42:54.8142890Z 2025-08-14T21:42:54.8142985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8143165Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8143225Z return mod(**inputs) 2025-08-14T21:42:54.8143479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 976, in forward 2025-08-14T21:42:54.8143541Z outputs = self.mobilebert( 2025-08-14T21:42:54.8143800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:42:54.8143862Z encoder_outputs = self.encoder( 2025-08-14T21:42:54.8144123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:42:54.8144192Z layer_outputs = layer_module( 2025-08-14T21:42:54.8144447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:42:54.8144592Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:42:54.8144907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:42:54.8145026Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:42:54.8145289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:42:54.8145399Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:42:54.8145663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:42:54.8145744Z return input_tensor * self.weight + self.bias 2025-08-14T21:42:54.8145748Z 2025-08-14T21:42:54.8145842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8146028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8146086Z return mod(**inputs) 2025-08-14T21:42:54.8146348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 989, in forward 2025-08-14T21:42:54.8146435Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:42:54.8146688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 643, in forward 2025-08-14T21:42:54.8146815Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:42:54.8147081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 631, in forward 2025-08-14T21:42:54.8147164Z hidden_states = self.transform(hidden_states) 2025-08-14T21:42:54.8147420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 609, in forward 2025-08-14T21:42:54.8147491Z hidden_states = self.dense(hidden_states) 2025-08-14T21:42:54.8147494Z 2025-08-14T21:42:54.8147593Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8147770Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8147843Z return mod(**inputs) 2025-08-14T21:42:54.8148109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 989, in forward 2025-08-14T21:42:54.8148208Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:42:54.8148471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 643, in forward 2025-08-14T21:42:54.8148571Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:42:54.8148824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 632, in forward 2025-08-14T21:42:54.8149019Z hidden_states = hidden_states.matmul(torch.cat([self.decoder.weight.t(), self.dense.weight], dim=0)) 2025-08-14T21:42:54.8149023Z 2025-08-14T21:42:54.8149114Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8149302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8149361Z return mod(**inputs) 2025-08-14T21:42:54.8149612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 989, in forward 2025-08-14T21:42:54.8149705Z prediction_scores = self.cls(sequence_output) 2025-08-14T21:42:54.8149959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 643, in forward 2025-08-14T21:42:54.8150055Z prediction_scores = self.predictions(sequence_output) 2025-08-14T21:42:54.8150311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 633, in forward 2025-08-14T21:42:54.8150382Z hidden_states += self.decoder.bias 2025-08-14T21:42:54.8150385Z 2025-08-14T21:42:54.8150482Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:42:54.8150660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:42:54.8150717Z return mod(**inputs) 2025-08-14T21:42:54.8150974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 994, in forward 2025-08-14T21:42:54.8151151Z masked_lm_loss = loss_fct(prediction_scores.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:42:54.8151154Z 2025-08-14T21:43:06.2568506Z Compilation time (from dynamo_timed): 34.840613507 2025-08-14T21:43:06.2573094Z pass 2025-08-14T21:43:06.2576736Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:43:06.2581022Z TIMING: _recursive_pre_grad_passes:0.01941 _recursive_joint_graph_passes:1.21659 _recursive_post_grad_passes:0.19497 async_compile.wait:0.67532 code_gen:8.6363 inductor_compile:12.74314 backend_compile:24.23529 gc:0.0004 entire_frame_compile:34.84061 total_wall_time:34.84061 2025-08-14T21:43:06.2582550Z STATS: call_* op count: 1449 | FakeTensorMode.__torch_dispatch__:56776 | FakeTensor.__torch_dispatch__:16414 | ProxyTorchDispatchMode.__torch_dispatch__:21632 2025-08-14T21:43:06.2583240Z Dynamo produced 1 graphs covering 1449 ops with 0 graph breaks (0 unique) 2025-08-14T21:43:11.1427456Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:43:11.1428374Z from pkg_resources import resource_filename 2025-08-14T21:43:11.7357324Z 2025-08-14T21:43:12.2272986Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:43:12.2276848Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:43:12.2333433Z cpu eval MobileBertForQuestionAnswering 2025-08-14T21:43:12.4191008Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:43:12.5396077Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:43:12.6449673Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:43:36.5157440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5158054Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5164761Z return mod(**inputs) 2025-08-14T21:43:36.5165239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5165660Z outputs = self.mobilebert( 2025-08-14T21:43:36.5166041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 791, in forward 2025-08-14T21:43:36.5171498Z embedding_output = self.embeddings( 2025-08-14T21:43:36.5172129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 199, in forward 2025-08-14T21:43:36.5172660Z inputs_embeds = torch.cat( 2025-08-14T21:43:36.5173065Z 2025-08-14T21:43:36.5173214Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5173581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5173906Z return mod(**inputs) 2025-08-14T21:43:36.5174409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5174803Z outputs = self.mobilebert( 2025-08-14T21:43:36.5175178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 791, in forward 2025-08-14T21:43:36.5175564Z embedding_output = self.embeddings( 2025-08-14T21:43:36.5175944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 208, in forward 2025-08-14T21:43:36.5176374Z inputs_embeds = self.embedding_transformation(inputs_embeds) 2025-08-14T21:43:36.5176550Z 2025-08-14T21:43:36.5176653Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5176988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5177289Z return mod(**inputs) 2025-08-14T21:43:36.5177654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5178047Z outputs = self.mobilebert( 2025-08-14T21:43:36.5178420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 791, in forward 2025-08-14T21:43:36.5178818Z embedding_output = self.embeddings( 2025-08-14T21:43:36.5179202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 215, in forward 2025-08-14T21:43:36.5179593Z embeddings = self.LayerNorm(embeddings) 2025-08-14T21:43:36.5180207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5180668Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5180822Z 2025-08-14T21:43:36.5180951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5181288Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5181583Z return mod(**inputs) 2025-08-14T21:43:36.5181947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5182326Z outputs = self.mobilebert( 2025-08-14T21:43:36.5182693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5183127Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5183506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5183931Z layer_outputs = layer_module( 2025-08-14T21:43:36.5184310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5185017Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5185486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5185904Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5186307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5186700Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5186837Z 2025-08-14T21:43:36.5186934Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5187270Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5187566Z return mod(**inputs) 2025-08-14T21:43:36.5187931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5188315Z outputs = self.mobilebert( 2025-08-14T21:43:36.5188674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5189054Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5189428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5189811Z layer_outputs = layer_module( 2025-08-14T21:43:36.5190176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5190637Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5191129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5191605Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5192006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5192398Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5192790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5193184Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5193320Z 2025-08-14T21:43:36.5193415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5193746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5194079Z return mod(**inputs) 2025-08-14T21:43:36.5194457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5194841Z outputs = self.mobilebert( 2025-08-14T21:43:36.5195201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5195582Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5195947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5196377Z layer_outputs = layer_module( 2025-08-14T21:43:36.5196751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5197134Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5197541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5197923Z self_outputs = self.self( 2025-08-14T21:43:36.5198291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5198663Z self.query(query_tensor) 2025-08-14T21:43:36.5198777Z 2025-08-14T21:43:36.5198873Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5199200Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5199496Z return mod(**inputs) 2025-08-14T21:43:36.5199851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5200224Z outputs = self.mobilebert( 2025-08-14T21:43:36.5200590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5200962Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5201337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5201716Z layer_outputs = layer_module( 2025-08-14T21:43:36.5202082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5202460Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5202844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5203220Z self_outputs = self.self( 2025-08-14T21:43:36.5203583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5203948Z self.key(key_tensor) 2025-08-14T21:43:36.5204051Z 2025-08-14T21:43:36.5204147Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5204471Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5204761Z return mod(**inputs) 2025-08-14T21:43:36.5205119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5205492Z outputs = self.mobilebert( 2025-08-14T21:43:36.5205854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5206225Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5206594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5206969Z layer_outputs = layer_module( 2025-08-14T21:43:36.5207360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5207749Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5208134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5208512Z self_outputs = self.self( 2025-08-14T21:43:36.5208867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5209245Z self.value(value_tensor) 2025-08-14T21:43:36.5209348Z 2025-08-14T21:43:36.5209447Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5209648Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5209861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5210191Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5210533Z return mod(**inputs) 2025-08-14T21:43:36.5210886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5211263Z outputs = self.mobilebert( 2025-08-14T21:43:36.5211623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5212000Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5212363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5212746Z layer_outputs = layer_module( 2025-08-14T21:43:36.5213113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5213497Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5213893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5214315Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5214736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5215114Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5215250Z 2025-08-14T21:43:36.5215344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5215671Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5215973Z return mod(**inputs) 2025-08-14T21:43:36.5216326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5216708Z outputs = self.mobilebert( 2025-08-14T21:43:36.5217072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5217444Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5217816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5218194Z layer_outputs = layer_module( 2025-08-14T21:43:36.5218561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5219010Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5219468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5219876Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5220314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5220723Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5220858Z 2025-08-14T21:43:36.5220951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5221275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5221566Z return mod(**inputs) 2025-08-14T21:43:36.5221920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5222294Z outputs = self.mobilebert( 2025-08-14T21:43:36.5222678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5223056Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5223454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5223830Z layer_outputs = layer_module( 2025-08-14T21:43:36.5224201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5224581Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5225066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5225496Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5225908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5226336Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5226764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5227167Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5227306Z 2025-08-14T21:43:36.5227403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5227735Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5228040Z return mod(**inputs) 2025-08-14T21:43:36.5228401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5228774Z outputs = self.mobilebert( 2025-08-14T21:43:36.5229138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5229518Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5229884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5230267Z layer_outputs = layer_module( 2025-08-14T21:43:36.5230636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5231034Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5231423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5231836Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5232247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5232636Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5232763Z 2025-08-14T21:43:36.5232857Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5233182Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5233500Z return mod(**inputs) 2025-08-14T21:43:36.5233875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5234254Z outputs = self.mobilebert( 2025-08-14T21:43:36.5234616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5234988Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5235350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5235748Z layer_outputs = layer_module( 2025-08-14T21:43:36.5236117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5236515Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5236928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5237346Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5237755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5238168Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5238323Z 2025-08-14T21:43:36.5238418Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5238749Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5239048Z return mod(**inputs) 2025-08-14T21:43:36.5239400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5239781Z outputs = self.mobilebert( 2025-08-14T21:43:36.5240145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5240524Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5240893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5241288Z layer_outputs = layer_module( 2025-08-14T21:43:36.5241660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5242052Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5242451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5242878Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5243311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5243699Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5243851Z 2025-08-14T21:43:36.5243946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5244276Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5244574Z return mod(**inputs) 2025-08-14T21:43:36.5244931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5245313Z outputs = self.mobilebert( 2025-08-14T21:43:36.5245678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5246050Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5247056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5247450Z layer_outputs = layer_module( 2025-08-14T21:43:36.5247843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5248238Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5248634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5249058Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5249481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5249942Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5250362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5250783Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5250920Z 2025-08-14T21:43:36.5251022Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5251345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5251639Z return mod(**inputs) 2025-08-14T21:43:36.5251996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5252366Z outputs = self.mobilebert( 2025-08-14T21:43:36.5252734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5253115Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5253485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5253859Z layer_outputs = layer_module( 2025-08-14T21:43:36.5254228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5254624Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5255017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5255423Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5255832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5256221Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5256349Z 2025-08-14T21:43:36.5256444Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5256776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5257078Z return mod(**inputs) 2025-08-14T21:43:36.5257443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5257813Z outputs = self.mobilebert( 2025-08-14T21:43:36.5258176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5258555Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5258918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5259295Z layer_outputs = layer_module( 2025-08-14T21:43:36.5259665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5260063Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5260487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5260899Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5261305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5261717Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5261867Z 2025-08-14T21:43:36.5261962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5262293Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5262620Z return mod(**inputs) 2025-08-14T21:43:36.5262983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5263371Z outputs = self.mobilebert( 2025-08-14T21:43:36.5263736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5264115Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5264480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5264941Z layer_outputs = layer_module( 2025-08-14T21:43:36.5265319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5265716Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5266108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5266549Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5266981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5267377Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5267505Z 2025-08-14T21:43:36.5267602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5267933Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5268235Z return mod(**inputs) 2025-08-14T21:43:36.5268589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5268971Z outputs = self.mobilebert( 2025-08-14T21:43:36.5269336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5269717Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5270085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5270485Z layer_outputs = layer_module( 2025-08-14T21:43:36.5270863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5271264Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5271656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5272078Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5272502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5272921Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5273339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5273785Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5273942Z 2025-08-14T21:43:36.5274052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5274380Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5274682Z return mod(**inputs) 2025-08-14T21:43:36.5275042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5275423Z outputs = self.mobilebert( 2025-08-14T21:43:36.5275782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5276184Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5276557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5276947Z layer_outputs = layer_module( 2025-08-14T21:43:36.5277317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5277720Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5278123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5278529Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5278939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5279328Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5279457Z 2025-08-14T21:43:36.5279557Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5279879Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5280177Z return mod(**inputs) 2025-08-14T21:43:36.5280536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5280904Z outputs = self.mobilebert( 2025-08-14T21:43:36.5281271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5281656Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5282023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5282391Z layer_outputs = layer_module( 2025-08-14T21:43:36.5282761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5283155Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5283554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5283959Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5284369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5284920Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5285079Z 2025-08-14T21:43:36.5285187Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5285519Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5285825Z return mod(**inputs) 2025-08-14T21:43:36.5286189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5286574Z outputs = self.mobilebert( 2025-08-14T21:43:36.5287020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5287406Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5287781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5288152Z layer_outputs = layer_module( 2025-08-14T21:43:36.5288521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5288921Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5289313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5289768Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5290223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5290619Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5290746Z 2025-08-14T21:43:36.5290840Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5291165Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5291461Z return mod(**inputs) 2025-08-14T21:43:36.5291818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5292189Z outputs = self.mobilebert( 2025-08-14T21:43:36.5292554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5292933Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5293298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5293677Z layer_outputs = layer_module( 2025-08-14T21:43:36.5294046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5294441Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5294831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5295252Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5295671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5296098Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5296508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5296906Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5297039Z 2025-08-14T21:43:36.5297142Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5297470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5297762Z return mod(**inputs) 2025-08-14T21:43:36.5298123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5298501Z outputs = self.mobilebert( 2025-08-14T21:43:36.5298857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5299239Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5299618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5300017Z layer_outputs = layer_module( 2025-08-14T21:43:36.5300400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5300825Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5301245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5301634Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5301762Z 2025-08-14T21:43:36.5301856Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5302182Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5302501Z return mod(**inputs) 2025-08-14T21:43:36.5302864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5303263Z outputs = self.mobilebert( 2025-08-14T21:43:36.5303629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5304007Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5304372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5304814Z layer_outputs = layer_module( 2025-08-14T21:43:36.5305185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5305603Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5306019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5306432Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5306586Z 2025-08-14T21:43:36.5306692Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5307018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5307322Z return mod(**inputs) 2025-08-14T21:43:36.5307678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5308057Z outputs = self.mobilebert( 2025-08-14T21:43:36.5308413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5308791Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5309164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5309529Z layer_outputs = layer_module( 2025-08-14T21:43:36.5309895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5310350Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5310805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5311192Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5311335Z 2025-08-14T21:43:36.5311428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5311752Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5312049Z return mod(**inputs) 2025-08-14T21:43:36.5312400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5312781Z outputs = self.mobilebert( 2025-08-14T21:43:36.5313174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5313581Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5313948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5314320Z layer_outputs = layer_module( 2025-08-14T21:43:36.5314684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5315131Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5315606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5316029Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5316471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5316874Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5317019Z 2025-08-14T21:43:36.5317117Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5317454Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5317763Z return mod(**inputs) 2025-08-14T21:43:36.5318120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5318508Z outputs = self.mobilebert( 2025-08-14T21:43:36.5318879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5319260Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5319640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5320033Z layer_outputs = layer_module( 2025-08-14T21:43:36.5320409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5320864Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5321329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5321760Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5322192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5322585Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5322723Z 2025-08-14T21:43:36.5322824Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5323163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5323462Z return mod(**inputs) 2025-08-14T21:43:36.5323828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5324221Z outputs = self.mobilebert( 2025-08-14T21:43:36.5324598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5324973Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5325353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5325735Z layer_outputs = layer_module( 2025-08-14T21:43:36.5326108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5326581Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5327050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5327484Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5327905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5328320Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5328742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5329159Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5329291Z 2025-08-14T21:43:36.5329387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5329733Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5330031Z return mod(**inputs) 2025-08-14T21:43:36.5330388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5330762Z outputs = self.mobilebert( 2025-08-14T21:43:36.5331124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5331502Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5331872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5332239Z layer_outputs = layer_module( 2025-08-14T21:43:36.5332608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5333070Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5333522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5333933Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5334339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5334739Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5334866Z 2025-08-14T21:43:36.5334962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5335301Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5335596Z return mod(**inputs) 2025-08-14T21:43:36.5335956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5336330Z outputs = self.mobilebert( 2025-08-14T21:43:36.5336693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5337071Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5337434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5337811Z layer_outputs = layer_module( 2025-08-14T21:43:36.5338178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5338632Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5339081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5339522Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5339946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5340344Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5340730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5341132Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5341264Z 2025-08-14T21:43:36.5341367Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5341698Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5342008Z return mod(**inputs) 2025-08-14T21:43:36.5342363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5342758Z outputs = self.mobilebert( 2025-08-14T21:43:36.5343118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5343494Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5343878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5344253Z layer_outputs = layer_module( 2025-08-14T21:43:36.5344618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5345083Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5345481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5345851Z self_outputs = self.self( 2025-08-14T21:43:36.5346227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5346608Z self.query(query_tensor) 2025-08-14T21:43:36.5346715Z 2025-08-14T21:43:36.5346821Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5347145Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5347449Z return mod(**inputs) 2025-08-14T21:43:36.5347811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5348190Z outputs = self.mobilebert( 2025-08-14T21:43:36.5348548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5348927Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5349299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5349670Z layer_outputs = layer_module( 2025-08-14T21:43:36.5350041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5350430Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5350816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5351188Z self_outputs = self.self( 2025-08-14T21:43:36.5351554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5351927Z self.key(key_tensor) 2025-08-14T21:43:36.5352025Z 2025-08-14T21:43:36.5352124Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5352443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5352759Z return mod(**inputs) 2025-08-14T21:43:36.5353137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5353511Z outputs = self.mobilebert( 2025-08-14T21:43:36.5353875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5354251Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5354627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5354999Z layer_outputs = layer_module( 2025-08-14T21:43:36.5355391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5355779Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5356177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5356551Z self_outputs = self.self( 2025-08-14T21:43:36.5356912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5357285Z self.value(value_tensor) 2025-08-14T21:43:36.5357389Z 2025-08-14T21:43:36.5357464Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5357661Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5357891Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5358215Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5358514Z return mod(**inputs) 2025-08-14T21:43:36.5358880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5359264Z outputs = self.mobilebert( 2025-08-14T21:43:36.5359624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5360002Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5360374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5360761Z layer_outputs = layer_module( 2025-08-14T21:43:36.5361125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5361524Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5361910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5362324Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5362749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5363139Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5363265Z 2025-08-14T21:43:36.5363366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5363683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5363980Z return mod(**inputs) 2025-08-14T21:43:36.5364338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5364721Z outputs = self.mobilebert( 2025-08-14T21:43:36.5365081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5365460Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5365860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5366253Z layer_outputs = layer_module( 2025-08-14T21:43:36.5366630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5367089Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5367547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5367955Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5368387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5368778Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5368920Z 2025-08-14T21:43:36.5369022Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5369343Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5369638Z return mod(**inputs) 2025-08-14T21:43:36.5369995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5370366Z outputs = self.mobilebert( 2025-08-14T21:43:36.5370730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5371104Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5371476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5371847Z layer_outputs = layer_module( 2025-08-14T21:43:36.5372217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5372613Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5373004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5373421Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5373838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5375293Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5375714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5376107Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5376250Z 2025-08-14T21:43:36.5376344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5376674Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5376966Z return mod(**inputs) 2025-08-14T21:43:36.5377328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5377706Z outputs = self.mobilebert( 2025-08-14T21:43:36.5378070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5378439Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5378809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5379185Z layer_outputs = layer_module( 2025-08-14T21:43:36.5379550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5379972Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5380396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5380822Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5381227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5381622Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5381757Z 2025-08-14T21:43:36.5381853Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5382182Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5382498Z return mod(**inputs) 2025-08-14T21:43:36.5382856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5383265Z outputs = self.mobilebert( 2025-08-14T21:43:36.5383632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5384008Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5384380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5384921Z layer_outputs = layer_module( 2025-08-14T21:43:36.5385284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5385687Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5386094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5386506Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5386916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5387332Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5387486Z 2025-08-14T21:43:36.5387591Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5387923Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5388214Z return mod(**inputs) 2025-08-14T21:43:36.5388573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5388951Z outputs = self.mobilebert( 2025-08-14T21:43:36.5389309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5389689Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5390060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5390436Z layer_outputs = layer_module( 2025-08-14T21:43:36.5390796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5391193Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5391589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5392011Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5392424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5392810Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5392937Z 2025-08-14T21:43:36.5393042Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5393432Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5393733Z return mod(**inputs) 2025-08-14T21:43:36.5394091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5394477Z outputs = self.mobilebert( 2025-08-14T21:43:36.5394834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5395217Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5395589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5396002Z layer_outputs = layer_module( 2025-08-14T21:43:36.5396377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5396802Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5397199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5397617Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5398043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5398466Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5398890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5399286Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5399433Z 2025-08-14T21:43:36.5399529Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5399864Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5400164Z return mod(**inputs) 2025-08-14T21:43:36.5400519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5400902Z outputs = self.mobilebert( 2025-08-14T21:43:36.5401272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5401646Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5402022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5402405Z layer_outputs = layer_module( 2025-08-14T21:43:36.5402786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5403183Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5403582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5403998Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5404410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5404793Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5404927Z 2025-08-14T21:43:36.5405021Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5405353Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5405647Z return mod(**inputs) 2025-08-14T21:43:36.5406007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5406388Z outputs = self.mobilebert( 2025-08-14T21:43:36.5406814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5407189Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5407561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5407935Z layer_outputs = layer_module( 2025-08-14T21:43:36.5408306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5408694Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5409107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5409531Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5409969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5410389Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5410548Z 2025-08-14T21:43:36.5410644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5410980Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5411275Z return mod(**inputs) 2025-08-14T21:43:36.5411636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5412019Z outputs = self.mobilebert( 2025-08-14T21:43:36.5412388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5412762Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5413141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5413525Z layer_outputs = layer_module( 2025-08-14T21:43:36.5413894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5414298Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5414700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5415133Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5415555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5415954Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5416091Z 2025-08-14T21:43:36.5416189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5416527Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5416822Z return mod(**inputs) 2025-08-14T21:43:36.5417186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5417570Z outputs = self.mobilebert( 2025-08-14T21:43:36.5417940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5418328Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5418703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5419082Z layer_outputs = layer_module( 2025-08-14T21:43:36.5419448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5419867Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5420280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5420706Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5421123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5421544Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5421965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5422370Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5422511Z 2025-08-14T21:43:36.5422604Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5422951Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5423247Z return mod(**inputs) 2025-08-14T21:43:36.5423598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5423978Z outputs = self.mobilebert( 2025-08-14T21:43:36.5424341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5424798Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5425174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5425559Z layer_outputs = layer_module( 2025-08-14T21:43:36.5425934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5426329Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5426729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5427147Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5427561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5427947Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5428085Z 2025-08-14T21:43:36.5428180Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5428512Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5428810Z return mod(**inputs) 2025-08-14T21:43:36.5429161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5429542Z outputs = self.mobilebert( 2025-08-14T21:43:36.5429914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5430292Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5430667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5431047Z layer_outputs = layer_module( 2025-08-14T21:43:36.5431422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5431814Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5432212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5432630Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5433063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5433480Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5433638Z 2025-08-14T21:43:36.5433733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5434056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5434341Z return mod(**inputs) 2025-08-14T21:43:36.5434696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5435074Z outputs = self.mobilebert( 2025-08-14T21:43:36.5435454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5435825Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5436222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5436600Z layer_outputs = layer_module( 2025-08-14T21:43:36.5436962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5437362Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5437758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5438184Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5438599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5438990Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5439123Z 2025-08-14T21:43:36.5439219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5439551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5439843Z return mod(**inputs) 2025-08-14T21:43:36.5440203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5440583Z outputs = self.mobilebert( 2025-08-14T21:43:36.5440941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5441321Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5441697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5442074Z layer_outputs = layer_module( 2025-08-14T21:43:36.5442435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5442835Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5443234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5443658Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5444075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5444496Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5444918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5445318Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5445452Z 2025-08-14T21:43:36.5445548Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5445897Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5446196Z return mod(**inputs) 2025-08-14T21:43:36.5446560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5446941Z outputs = self.mobilebert( 2025-08-14T21:43:36.5447308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5447687Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5448049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5448443Z layer_outputs = layer_module( 2025-08-14T21:43:36.5448811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5449248Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5449663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5450061Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5450187Z 2025-08-14T21:43:36.5450289Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5450612Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5450916Z return mod(**inputs) 2025-08-14T21:43:36.5451275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5451655Z outputs = self.mobilebert( 2025-08-14T21:43:36.5452011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5452393Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5452765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5453132Z layer_outputs = layer_module( 2025-08-14T21:43:36.5453500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5453918Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5454334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5454737Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5454894Z 2025-08-14T21:43:36.5454986Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5455315Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5455614Z return mod(**inputs) 2025-08-14T21:43:36.5455968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5456346Z outputs = self.mobilebert( 2025-08-14T21:43:36.5456712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5457082Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5457452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5457827Z layer_outputs = layer_module( 2025-08-14T21:43:36.5458197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5458645Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5459118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5459537Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5459674Z 2025-08-14T21:43:36.5459776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5460101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5460397Z return mod(**inputs) 2025-08-14T21:43:36.5460755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5461125Z outputs = self.mobilebert( 2025-08-14T21:43:36.5461516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5461891Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5462280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5462651Z layer_outputs = layer_module( 2025-08-14T21:43:36.5463018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5463470Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5463925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5464341Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5464842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5465253Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5465389Z 2025-08-14T21:43:36.5465494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5465819Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5466119Z return mod(**inputs) 2025-08-14T21:43:36.5466478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5466852Z outputs = self.mobilebert( 2025-08-14T21:43:36.5467223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5467603Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5467974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5468346Z layer_outputs = layer_module( 2025-08-14T21:43:36.5468713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5469166Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5469617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5470033Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5470455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5470845Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5470971Z 2025-08-14T21:43:36.5471064Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5471394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5471690Z return mod(**inputs) 2025-08-14T21:43:36.5472070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5472466Z outputs = self.mobilebert( 2025-08-14T21:43:36.5472836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5473214Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5473579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5473961Z layer_outputs = layer_module( 2025-08-14T21:43:36.5474330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5474807Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5475255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5475701Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5476127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5476551Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5476966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5477364Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5477505Z 2025-08-14T21:43:36.5477600Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5477931Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5478224Z return mod(**inputs) 2025-08-14T21:43:36.5478586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5478969Z outputs = self.mobilebert( 2025-08-14T21:43:36.5479328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5479707Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5480078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5480457Z layer_outputs = layer_module( 2025-08-14T21:43:36.5480822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5481283Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5481744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5482162Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5482568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5482957Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5483082Z 2025-08-14T21:43:36.5483184Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5483510Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5483811Z return mod(**inputs) 2025-08-14T21:43:36.5484171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5484553Z outputs = self.mobilebert( 2025-08-14T21:43:36.5485064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5485502Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5485901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5486278Z layer_outputs = layer_module( 2025-08-14T21:43:36.5486640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5487099Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5487563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5488004Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5488412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5488829Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5489221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5489611Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5489751Z 2025-08-14T21:43:36.5489845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5490173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5490469Z return mod(**inputs) 2025-08-14T21:43:36.5490818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5491198Z outputs = self.mobilebert( 2025-08-14T21:43:36.5491559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5491928Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5492299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5492673Z layer_outputs = layer_module( 2025-08-14T21:43:36.5493040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5493422Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5493809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5494180Z self_outputs = self.self( 2025-08-14T21:43:36.5494544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5494913Z self.query(query_tensor) 2025-08-14T21:43:36.5495026Z 2025-08-14T21:43:36.5495120Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5495447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5495738Z return mod(**inputs) 2025-08-14T21:43:36.5496093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5496468Z outputs = self.mobilebert( 2025-08-14T21:43:36.5496830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5497199Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5497564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5497939Z layer_outputs = layer_module( 2025-08-14T21:43:36.5498307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5498706Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5499110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5499495Z self_outputs = self.self( 2025-08-14T21:43:36.5499853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5500223Z self.key(key_tensor) 2025-08-14T21:43:36.5500328Z 2025-08-14T21:43:36.5500424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5500751Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5501060Z return mod(**inputs) 2025-08-14T21:43:36.5501419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5501817Z outputs = self.mobilebert( 2025-08-14T21:43:36.5502174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5502553Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5502925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5503301Z layer_outputs = layer_module( 2025-08-14T21:43:36.5503664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5504051Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5504440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5504873Z self_outputs = self.self( 2025-08-14T21:43:36.5505239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5505614Z self.value(value_tensor) 2025-08-14T21:43:36.5505718Z 2025-08-14T21:43:36.5505800Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5505992Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5506212Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5506546Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5506846Z return mod(**inputs) 2025-08-14T21:43:36.5507202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5507585Z outputs = self.mobilebert( 2025-08-14T21:43:36.5507954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5508329Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5508704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5509078Z layer_outputs = layer_module( 2025-08-14T21:43:36.5509447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5509827Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5510211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5510633Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5511054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5511434Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5511570Z 2025-08-14T21:43:36.5511683Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5512037Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5512331Z return mod(**inputs) 2025-08-14T21:43:36.5512690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5513070Z outputs = self.mobilebert( 2025-08-14T21:43:36.5513433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5513805Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5514177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5514573Z layer_outputs = layer_module( 2025-08-14T21:43:36.5514939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5515413Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5515875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5516287Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5516689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5517077Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5517208Z 2025-08-14T21:43:36.5517304Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5517631Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5517921Z return mod(**inputs) 2025-08-14T21:43:36.5518280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5518662Z outputs = self.mobilebert( 2025-08-14T21:43:36.5519024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5519390Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5519759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5520138Z layer_outputs = layer_module( 2025-08-14T21:43:36.5520500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5520891Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5521279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5521700Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5522112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5522533Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5522955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5523350Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5523485Z 2025-08-14T21:43:36.5523578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5523907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5524205Z return mod(**inputs) 2025-08-14T21:43:36.5524560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5524957Z outputs = self.mobilebert( 2025-08-14T21:43:36.5525341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5525722Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5526092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5526465Z layer_outputs = layer_module( 2025-08-14T21:43:36.5526836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5527255Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5527644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5528073Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5528485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5528866Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5529000Z 2025-08-14T21:43:36.5529094Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5529418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5529711Z return mod(**inputs) 2025-08-14T21:43:36.5530063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5530439Z outputs = self.mobilebert( 2025-08-14T21:43:36.5530798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5531176Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5531542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5531916Z layer_outputs = layer_module( 2025-08-14T21:43:36.5532283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5532670Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5533067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5533476Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5533881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5534282Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5534440Z 2025-08-14T21:43:36.5534536Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5534864Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5535161Z return mod(**inputs) 2025-08-14T21:43:36.5535511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5535889Z outputs = self.mobilebert( 2025-08-14T21:43:36.5536256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5536627Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5536998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5537376Z layer_outputs = layer_module( 2025-08-14T21:43:36.5537758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5538171Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5538572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5538998Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5539426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5539809Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5539943Z 2025-08-14T21:43:36.5540037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5540380Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5540671Z return mod(**inputs) 2025-08-14T21:43:36.5541048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5541428Z outputs = self.mobilebert( 2025-08-14T21:43:36.5541788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5542156Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5542526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5542898Z layer_outputs = layer_module( 2025-08-14T21:43:36.5543257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5543656Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5544051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5544474Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5544974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5545406Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5545831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5546229Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5546367Z 2025-08-14T21:43:36.5546464Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5546797Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5547096Z return mod(**inputs) 2025-08-14T21:43:36.5547447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5547833Z outputs = self.mobilebert( 2025-08-14T21:43:36.5548199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5548580Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5548943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5549327Z layer_outputs = layer_module( 2025-08-14T21:43:36.5549696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5550094Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5550483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5550594Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5550891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5550972Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5550982Z 2025-08-14T21:43:36.5551078Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5551261Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5551330Z return mod(**inputs) 2025-08-14T21:43:36.5551592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5551677Z outputs = self.mobilebert( 2025-08-14T21:43:36.5551942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5552027Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5552296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5552364Z layer_outputs = layer_module( 2025-08-14T21:43:36.5552625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5552723Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5552983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5553086Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5553359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5553461Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5553466Z 2025-08-14T21:43:36.5553569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5553753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5553813Z return mod(**inputs) 2025-08-14T21:43:36.5554085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5554151Z outputs = self.mobilebert( 2025-08-14T21:43:36.5554416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5554481Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5554740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5554812Z layer_outputs = layer_module( 2025-08-14T21:43:36.5555073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5555166Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5555430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5555543Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5555807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5555884Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5555887Z 2025-08-14T21:43:36.5555979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5556170Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5556230Z return mod(**inputs) 2025-08-14T21:43:36.5556523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5556605Z outputs = self.mobilebert( 2025-08-14T21:43:36.5556864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5556939Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5557192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5557257Z layer_outputs = layer_module( 2025-08-14T21:43:36.5557516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5557620Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5557881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5558013Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5558270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5558387Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5558640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5558730Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5558733Z 2025-08-14T21:43:36.5558826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5559006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5559074Z return mod(**inputs) 2025-08-14T21:43:36.5559329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5559403Z outputs = self.mobilebert( 2025-08-14T21:43:36.5559656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5559721Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5559983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5560047Z layer_outputs = layer_module( 2025-08-14T21:43:36.5560299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5560392Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5560647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5560754Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5561010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5561086Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5561089Z 2025-08-14T21:43:36.5561187Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5561368Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5561432Z return mod(**inputs) 2025-08-14T21:43:36.5561688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5561753Z outputs = self.mobilebert( 2025-08-14T21:43:36.5562012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5562079Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5562361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5562433Z layer_outputs = layer_module( 2025-08-14T21:43:36.5562684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5562773Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5563027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5563124Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5563402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5563502Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5563523Z 2025-08-14T21:43:36.5563624Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5563805Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5563864Z return mod(**inputs) 2025-08-14T21:43:36.5564127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5564190Z outputs = self.mobilebert( 2025-08-14T21:43:36.5564444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5564515Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5564767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5564840Z layer_outputs = layer_module( 2025-08-14T21:43:36.5565095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5565182Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5565448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5565559Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5565817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5565893Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5565896Z 2025-08-14T21:43:36.5565987Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5566174Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5566232Z return mod(**inputs) 2025-08-14T21:43:36.5566488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5566564Z outputs = self.mobilebert( 2025-08-14T21:43:36.5566816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5566888Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5567141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5567206Z layer_outputs = layer_module( 2025-08-14T21:43:36.5567466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5567553Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5567813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5567942Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5568212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5568333Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5568586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5568674Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5568678Z 2025-08-14T21:43:36.5568769Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5568966Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5569034Z return mod(**inputs) 2025-08-14T21:43:36.5569288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5569376Z outputs = self.mobilebert( 2025-08-14T21:43:36.5569640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5569705Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5569967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5570031Z layer_outputs = layer_module( 2025-08-14T21:43:36.5570285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5570399Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5570654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5570738Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5570741Z 2025-08-14T21:43:36.5570833Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5571012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5571076Z return mod(**inputs) 2025-08-14T21:43:36.5571332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5571395Z outputs = self.mobilebert( 2025-08-14T21:43:36.5571655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5571719Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5571983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5572048Z layer_outputs = layer_module( 2025-08-14T21:43:36.5572305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5572420Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5572673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5572779Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5572782Z 2025-08-14T21:43:36.5572874Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5573051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5573119Z return mod(**inputs) 2025-08-14T21:43:36.5573376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5573439Z outputs = self.mobilebert( 2025-08-14T21:43:36.5573717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5573800Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5574065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5574129Z layer_outputs = layer_module( 2025-08-14T21:43:36.5574385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5574539Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5574809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5574901Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5574928Z 2025-08-14T21:43:36.5575022Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5575204Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5575273Z return mod(**inputs) 2025-08-14T21:43:36.5575530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5575596Z outputs = self.mobilebert( 2025-08-14T21:43:36.5575859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5575927Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5576190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5576258Z layer_outputs = layer_module( 2025-08-14T21:43:36.5576515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5576669Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5576926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5577047Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5577304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5577389Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5577392Z 2025-08-14T21:43:36.5577494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5577677Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5577738Z return mod(**inputs) 2025-08-14T21:43:36.5578007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5578076Z outputs = self.mobilebert( 2025-08-14T21:43:36.5578343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5578411Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5578666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5578740Z layer_outputs = layer_module( 2025-08-14T21:43:36.5578995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5579148Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5579406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5579573Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5579835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5579911Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5579914Z 2025-08-14T21:43:36.5580013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5580190Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5580249Z return mod(**inputs) 2025-08-14T21:43:36.5580513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5580594Z outputs = self.mobilebert( 2025-08-14T21:43:36.5580850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5580943Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5581206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5581279Z layer_outputs = layer_module( 2025-08-14T21:43:36.5581543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5581685Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5581953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5582069Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5582336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5582452Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5582714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5582808Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5582811Z 2025-08-14T21:43:36.5582908Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5583100Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5583163Z return mod(**inputs) 2025-08-14T21:43:36.5583425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5583501Z outputs = self.mobilebert( 2025-08-14T21:43:36.5583761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5583831Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5584100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5584165Z layer_outputs = layer_module( 2025-08-14T21:43:36.5584435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5584753Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5585021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5585135Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5585391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5585519Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5585523Z 2025-08-14T21:43:36.5585642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5585825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5585890Z return mod(**inputs) 2025-08-14T21:43:36.5586149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5586212Z outputs = self.mobilebert( 2025-08-14T21:43:36.5586472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5586562Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5586821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5586910Z layer_outputs = layer_module( 2025-08-14T21:43:36.5587175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5587330Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5587594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5587702Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5587963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5588045Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5588312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5588396Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5588399Z 2025-08-14T21:43:36.5588492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5588682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5588741Z return mod(**inputs) 2025-08-14T21:43:36.5589011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5589075Z outputs = self.mobilebert( 2025-08-14T21:43:36.5589335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5589407Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5589669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5589738Z layer_outputs = layer_module( 2025-08-14T21:43:36.5590002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5590081Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5590348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5590413Z self_outputs = self.self( 2025-08-14T21:43:36.5590674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5590745Z self.query(query_tensor) 2025-08-14T21:43:36.5590749Z 2025-08-14T21:43:36.5590841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5591031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5591088Z return mod(**inputs) 2025-08-14T21:43:36.5591366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5591454Z outputs = self.mobilebert( 2025-08-14T21:43:36.5591710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5591781Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5592033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5592094Z layer_outputs = layer_module( 2025-08-14T21:43:36.5592351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5592445Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5592698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5592785Z self_outputs = self.self( 2025-08-14T21:43:36.5593040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5593105Z self.key(key_tensor) 2025-08-14T21:43:36.5593108Z 2025-08-14T21:43:36.5593200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5593376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5593442Z return mod(**inputs) 2025-08-14T21:43:36.5593700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5593764Z outputs = self.mobilebert( 2025-08-14T21:43:36.5594023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5594088Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5594351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5594414Z layer_outputs = layer_module( 2025-08-14T21:43:36.5594667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5594750Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5595003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5595072Z self_outputs = self.self( 2025-08-14T21:43:36.5595324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5595389Z self.value(value_tensor) 2025-08-14T21:43:36.5595392Z 2025-08-14T21:43:36.5595473Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5595546Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5595640Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5595825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5595884Z return mod(**inputs) 2025-08-14T21:43:36.5596149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5596211Z outputs = self.mobilebert( 2025-08-14T21:43:36.5596466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5596538Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5596795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5596859Z layer_outputs = layer_module( 2025-08-14T21:43:36.5597139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5597230Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5597498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5597610Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5597869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5597953Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5597957Z 2025-08-14T21:43:36.5598072Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5598259Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5598319Z return mod(**inputs) 2025-08-14T21:43:36.5598598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5598670Z outputs = self.mobilebert( 2025-08-14T21:43:36.5598920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5598992Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5599247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5599311Z layer_outputs = layer_module( 2025-08-14T21:43:36.5599570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5599716Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5599974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5600082Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5600336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5600417Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5600420Z 2025-08-14T21:43:36.5600511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5600689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5600754Z return mod(**inputs) 2025-08-14T21:43:36.5601011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5601083Z outputs = self.mobilebert( 2025-08-14T21:43:36.5601338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5601405Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5601666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5601729Z layer_outputs = layer_module( 2025-08-14T21:43:36.5601981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5602062Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5602312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5602432Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5602683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5602814Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5603091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5603178Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5603182Z 2025-08-14T21:43:36.5603282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5603461Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5603519Z return mod(**inputs) 2025-08-14T21:43:36.5603785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5603865Z outputs = self.mobilebert( 2025-08-14T21:43:36.5604122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5604214Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5604472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5604544Z layer_outputs = layer_module( 2025-08-14T21:43:36.5604799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5604886Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5605151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5605252Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5605513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5605590Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5605594Z 2025-08-14T21:43:36.5605685Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5605870Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5605930Z return mod(**inputs) 2025-08-14T21:43:36.5606189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5606260Z outputs = self.mobilebert( 2025-08-14T21:43:36.5606515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5606585Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5606842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5606905Z layer_outputs = layer_module( 2025-08-14T21:43:36.5607167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5607254Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5607513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5607612Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5607864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5607970Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5607975Z 2025-08-14T21:43:36.5608067Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5608250Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5608310Z return mod(**inputs) 2025-08-14T21:43:36.5608596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5608667Z outputs = self.mobilebert( 2025-08-14T21:43:36.5608922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5608985Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5609246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5609309Z layer_outputs = layer_module( 2025-08-14T21:43:36.5609571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5609676Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5609930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5610070Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5610320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5610404Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5610407Z 2025-08-14T21:43:36.5610500Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5610678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5610743Z return mod(**inputs) 2025-08-14T21:43:36.5611000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5611065Z outputs = self.mobilebert( 2025-08-14T21:43:36.5611324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5611392Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5611652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5611716Z layer_outputs = layer_module( 2025-08-14T21:43:36.5611968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5612058Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5612312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5612433Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5612687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5612800Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5613060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5613142Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5613145Z 2025-08-14T21:43:36.5613238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5613425Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5613484Z return mod(**inputs) 2025-08-14T21:43:36.5613746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5613813Z outputs = self.mobilebert( 2025-08-14T21:43:36.5614067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5614176Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5614455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5614527Z layer_outputs = layer_module( 2025-08-14T21:43:36.5614786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5614881Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5615144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5615245Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5615520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5615624Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5615628Z 2025-08-14T21:43:36.5615723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5615911Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5615971Z return mod(**inputs) 2025-08-14T21:43:36.5616227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5616299Z outputs = self.mobilebert( 2025-08-14T21:43:36.5616552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5616623Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5616878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5616940Z layer_outputs = layer_module( 2025-08-14T21:43:36.5617204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5617289Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5617542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5617649Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5617900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5618005Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5618010Z 2025-08-14T21:43:36.5618102Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5618280Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5618346Z return mod(**inputs) 2025-08-14T21:43:36.5618606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5618678Z outputs = self.mobilebert( 2025-08-14T21:43:36.5618932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5618996Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5619253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5619316Z layer_outputs = layer_module( 2025-08-14T21:43:36.5619569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5619663Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5619915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5620079Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5620338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5620414Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5620418Z 2025-08-14T21:43:36.5620516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5620696Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5620760Z return mod(**inputs) 2025-08-14T21:43:36.5621018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5621098Z outputs = self.mobilebert( 2025-08-14T21:43:36.5621360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5621442Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5621703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5621773Z layer_outputs = layer_module( 2025-08-14T21:43:36.5622033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5622122Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5622422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5622561Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5622877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5623015Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5623332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5623439Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5623443Z 2025-08-14T21:43:36.5623653Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5623884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5650886Z return mod(**inputs) 2025-08-14T21:43:36.5651287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5651386Z outputs = self.mobilebert( 2025-08-14T21:43:36.5651661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5651748Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5652020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5652085Z layer_outputs = layer_module( 2025-08-14T21:43:36.5652354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5652446Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5652708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5652816Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5653077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5653167Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5653174Z 2025-08-14T21:43:36.5653345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5653570Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5653647Z return mod(**inputs) 2025-08-14T21:43:36.5653915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5653994Z outputs = self.mobilebert( 2025-08-14T21:43:36.5654249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5654321Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5654615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5654682Z layer_outputs = layer_module( 2025-08-14T21:43:36.5654978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5655069Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5655326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5655439Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5655696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5655800Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5655812Z 2025-08-14T21:43:36.5655912Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5656101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5656172Z return mod(**inputs) 2025-08-14T21:43:36.5656436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5656505Z outputs = self.mobilebert( 2025-08-14T21:43:36.5656769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5656837Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5657099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5657166Z layer_outputs = layer_module( 2025-08-14T21:43:36.5657422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5657520Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5657774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5657896Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5658160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5658237Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5658241Z 2025-08-14T21:43:36.5658341Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5658528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5658589Z return mod(**inputs) 2025-08-14T21:43:36.5658855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5658921Z outputs = self.mobilebert( 2025-08-14T21:43:36.5659181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5659265Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5659539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5659613Z layer_outputs = layer_module( 2025-08-14T21:43:36.5659869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5659956Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5660221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5660354Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5660616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5660743Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5660998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5661087Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5661091Z 2025-08-14T21:43:36.5661184Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5661372Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5661434Z return mod(**inputs) 2025-08-14T21:43:36.5661689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5661763Z outputs = self.mobilebert( 2025-08-14T21:43:36.5662015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5662089Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5662344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5662410Z layer_outputs = layer_module( 2025-08-14T21:43:36.5662671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5662780Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5663031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5663115Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5663119Z 2025-08-14T21:43:36.5663212Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5663400Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5663464Z return mod(**inputs) 2025-08-14T21:43:36.5663722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5663795Z outputs = self.mobilebert( 2025-08-14T21:43:36.5664048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5664123Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5664376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5664441Z layer_outputs = layer_module( 2025-08-14T21:43:36.5664782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5664898Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5665178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5665307Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5665311Z 2025-08-14T21:43:36.5665407Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5665595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5665654Z return mod(**inputs) 2025-08-14T21:43:36.5665911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5665985Z outputs = self.mobilebert( 2025-08-14T21:43:36.5666264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5666340Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5667095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5667161Z layer_outputs = layer_module( 2025-08-14T21:43:36.5667422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5667571Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5667825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5667922Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5667926Z 2025-08-14T21:43:36.5668020Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5668210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5668270Z return mod(**inputs) 2025-08-14T21:43:36.5668529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5668603Z outputs = self.mobilebert( 2025-08-14T21:43:36.5668858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5668933Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5669184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5669248Z layer_outputs = layer_module( 2025-08-14T21:43:36.5669508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5669654Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5669910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5670037Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5670289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5670381Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5670385Z 2025-08-14T21:43:36.5670478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5670658Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5670726Z return mod(**inputs) 2025-08-14T21:43:36.5670984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5671058Z outputs = self.mobilebert( 2025-08-14T21:43:36.5671328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5671398Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5671679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5671746Z layer_outputs = layer_module( 2025-08-14T21:43:36.5672004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5672156Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5672408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5672559Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5672812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5672908Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5672919Z 2025-08-14T21:43:36.5673013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5673194Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5673260Z return mod(**inputs) 2025-08-14T21:43:36.5673513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5673578Z outputs = self.mobilebert( 2025-08-14T21:43:36.5673836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5673903Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5674158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5674225Z layer_outputs = layer_module( 2025-08-14T21:43:36.5674478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5674625Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5674875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5674986Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5675243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5675355Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5675612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5675698Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5675702Z 2025-08-14T21:43:36.5675795Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5675981Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5676039Z return mod(**inputs) 2025-08-14T21:43:36.5676299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5676364Z outputs = self.mobilebert( 2025-08-14T21:43:36.5676616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5676689Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5676940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5677006Z layer_outputs = layer_module( 2025-08-14T21:43:36.5677294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5677445Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5677707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5677809Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5678063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5678162Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5678166Z 2025-08-14T21:43:36.5678262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5678452Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5678530Z return mod(**inputs) 2025-08-14T21:43:36.5678790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5678862Z outputs = self.mobilebert( 2025-08-14T21:43:36.5679118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5679193Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5679449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5679515Z layer_outputs = layer_module( 2025-08-14T21:43:36.5679780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5679926Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5680187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5680297Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5680553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5680640Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5680897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5680979Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5680984Z 2025-08-14T21:43:36.5681087Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5681268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5681337Z return mod(**inputs) 2025-08-14T21:43:36.5681597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5681662Z outputs = self.mobilebert( 2025-08-14T21:43:36.5681927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5681994Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5682248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5682322Z layer_outputs = layer_module( 2025-08-14T21:43:36.5682576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5682663Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5682934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5683018Z self_outputs = self.self( 2025-08-14T21:43:36.5683289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5683356Z self.query(query_tensor) 2025-08-14T21:43:36.5683359Z 2025-08-14T21:43:36.5683462Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5683639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5683699Z return mod(**inputs) 2025-08-14T21:43:36.5683959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5684041Z outputs = self.mobilebert( 2025-08-14T21:43:36.5684296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5684386Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5684808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5684887Z layer_outputs = layer_module( 2025-08-14T21:43:36.5685142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5685220Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5685485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5685551Z self_outputs = self.self( 2025-08-14T21:43:36.5685814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5685879Z self.key(key_tensor) 2025-08-14T21:43:36.5685883Z 2025-08-14T21:43:36.5685979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5686160Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5686231Z return mod(**inputs) 2025-08-14T21:43:36.5686489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5686562Z outputs = self.mobilebert( 2025-08-14T21:43:36.5686815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5686882Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5687149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5687214Z layer_outputs = layer_module( 2025-08-14T21:43:36.5687472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5687559Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5687814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5687887Z self_outputs = self.self( 2025-08-14T21:43:36.5688139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5688202Z self.value(value_tensor) 2025-08-14T21:43:36.5688205Z 2025-08-14T21:43:36.5688289Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5688362Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5688455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5688640Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5688700Z return mod(**inputs) 2025-08-14T21:43:36.5689049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5689116Z outputs = self.mobilebert( 2025-08-14T21:43:36.5689370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5689444Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5689697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5689770Z layer_outputs = layer_module( 2025-08-14T21:43:36.5690051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5690127Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5690415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5690530Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5690786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5690871Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5690874Z 2025-08-14T21:43:36.5690964Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5691147Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5691207Z return mod(**inputs) 2025-08-14T21:43:36.5691465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5691537Z outputs = self.mobilebert( 2025-08-14T21:43:36.5691795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5691868Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5692120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5692184Z layer_outputs = layer_module( 2025-08-14T21:43:36.5692447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5692591Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5692847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5692957Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5693213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5693295Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5693300Z 2025-08-14T21:43:36.5693391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5693569Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5693637Z return mod(**inputs) 2025-08-14T21:43:36.5693893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5693961Z outputs = self.mobilebert( 2025-08-14T21:43:36.5694213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5694281Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5694540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5694621Z layer_outputs = layer_module( 2025-08-14T21:43:36.5694899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5694984Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5695236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5695354Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5695607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5695735Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5695993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5696096Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5696099Z 2025-08-14T21:43:36.5696200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5696379Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5696437Z return mod(**inputs) 2025-08-14T21:43:36.5696699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5696762Z outputs = self.mobilebert( 2025-08-14T21:43:36.5697021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5697087Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5697337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5697408Z layer_outputs = layer_module( 2025-08-14T21:43:36.5697662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5697749Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5698009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5698109Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5698368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5698443Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5698447Z 2025-08-14T21:43:36.5698539Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5698727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5698789Z return mod(**inputs) 2025-08-14T21:43:36.5699053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5699117Z outputs = self.mobilebert( 2025-08-14T21:43:36.5699371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5699443Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5699695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5699759Z layer_outputs = layer_module( 2025-08-14T21:43:36.5700023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5700107Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5700384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5700501Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5700758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5700866Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5700869Z 2025-08-14T21:43:36.5700961Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5701147Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5701207Z return mod(**inputs) 2025-08-14T21:43:36.5701480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5701551Z outputs = self.mobilebert( 2025-08-14T21:43:36.5701826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5701893Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5702154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5702217Z layer_outputs = layer_module( 2025-08-14T21:43:36.5702479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5702564Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5702819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5702942Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5703197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5703283Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5703288Z 2025-08-14T21:43:36.5703378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5703559Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5703627Z return mod(**inputs) 2025-08-14T21:43:36.5703882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5703947Z outputs = self.mobilebert( 2025-08-14T21:43:36.5704208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5704274Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5704538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5704606Z layer_outputs = layer_module( 2025-08-14T21:43:36.5704924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5705019Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5705274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5705393Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5705646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5705757Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5706020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5706123Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5706127Z 2025-08-14T21:43:36.5706247Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5706429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5706491Z return mod(**inputs) 2025-08-14T21:43:36.5706756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5706821Z outputs = self.mobilebert( 2025-08-14T21:43:36.5707075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5707169Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5707424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5707516Z layer_outputs = layer_module( 2025-08-14T21:43:36.5707773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5707858Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5708117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5708216Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5708467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5708549Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5708553Z 2025-08-14T21:43:36.5708644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5708828Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5708887Z return mod(**inputs) 2025-08-14T21:43:36.5709144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5709215Z outputs = self.mobilebert( 2025-08-14T21:43:36.5709466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5709537Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5709790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5709852Z layer_outputs = layer_module( 2025-08-14T21:43:36.5710110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5710194Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5710448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5710553Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5710804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5710909Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5710912Z 2025-08-14T21:43:36.5711004Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5711180Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5711248Z return mod(**inputs) 2025-08-14T21:43:36.5711512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5711576Z outputs = self.mobilebert( 2025-08-14T21:43:36.5711852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5711933Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5712196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5712259Z layer_outputs = layer_module( 2025-08-14T21:43:36.5712510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5712601Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5712854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5712984Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5713243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5713341Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5713344Z 2025-08-14T21:43:36.5713444Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5713625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5713683Z return mod(**inputs) 2025-08-14T21:43:36.5713949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5714011Z outputs = self.mobilebert( 2025-08-14T21:43:36.5714271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5714338Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5714592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5714665Z layer_outputs = layer_module( 2025-08-14T21:43:36.5714918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5715001Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5715259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5715369Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5715629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5715741Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5715994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5716083Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5716088Z 2025-08-14T21:43:36.5716181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5716368Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5716427Z return mod(**inputs) 2025-08-14T21:43:36.5716685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5716753Z outputs = self.mobilebert( 2025-08-14T21:43:36.5717005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5717070Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5717327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5717391Z layer_outputs = layer_module( 2025-08-14T21:43:36.5717709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5717795Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5718049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5718154Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5718404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5718482Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5718500Z 2025-08-14T21:43:36.5718596Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5718773Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5718854Z return mod(**inputs) 2025-08-14T21:43:36.5719118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5719188Z outputs = self.mobilebert( 2025-08-14T21:43:36.5719445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5719511Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5719774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5719838Z layer_outputs = layer_module( 2025-08-14T21:43:36.5720094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5720186Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5720445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5720552Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5720811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5720910Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5720913Z 2025-08-14T21:43:36.5721011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5721190Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5721256Z return mod(**inputs) 2025-08-14T21:43:36.5721517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5721581Z outputs = self.mobilebert( 2025-08-14T21:43:36.5721846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5721914Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5722172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5722243Z layer_outputs = layer_module( 2025-08-14T21:43:36.5722501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5722592Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5722849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5722963Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5723231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5723323Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5723327Z 2025-08-14T21:43:36.5723443Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5723623Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5723682Z return mod(**inputs) 2025-08-14T21:43:36.5723944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5724007Z outputs = self.mobilebert( 2025-08-14T21:43:36.5724260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5724348Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5724608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5724699Z layer_outputs = layer_module( 2025-08-14T21:43:36.5724960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5725043Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5725305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5725414Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5725674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5725784Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5726041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5726130Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5726134Z 2025-08-14T21:43:36.5726227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5726414Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5726473Z return mod(**inputs) 2025-08-14T21:43:36.5726733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5726801Z outputs = self.mobilebert( 2025-08-14T21:43:36.5727056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5727121Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5727387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5727453Z layer_outputs = layer_module( 2025-08-14T21:43:36.5727717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5727825Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5728080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5728163Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5728166Z 2025-08-14T21:43:36.5728255Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5728442Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5728502Z return mod(**inputs) 2025-08-14T21:43:36.5728763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5728834Z outputs = self.mobilebert( 2025-08-14T21:43:36.5729119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5729186Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5729445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5729509Z layer_outputs = layer_module( 2025-08-14T21:43:36.5729768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5729873Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5730146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5730252Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5730271Z 2025-08-14T21:43:36.5730364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5730552Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5730612Z return mod(**inputs) 2025-08-14T21:43:36.5730867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5730936Z outputs = self.mobilebert( 2025-08-14T21:43:36.5731189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5731253Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5731514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5731580Z layer_outputs = layer_module( 2025-08-14T21:43:36.5731838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5731985Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5732238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5732329Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5732332Z 2025-08-14T21:43:36.5732423Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5732607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5732664Z return mod(**inputs) 2025-08-14T21:43:36.5732922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5732989Z outputs = self.mobilebert( 2025-08-14T21:43:36.5733245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5733311Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5733571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5733633Z layer_outputs = layer_module( 2025-08-14T21:43:36.5733891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5734035Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5734289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5734406Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5734659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5734765Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5734783Z 2025-08-14T21:43:36.5734878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5735060Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5735126Z return mod(**inputs) 2025-08-14T21:43:36.5735384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5735448Z outputs = self.mobilebert( 2025-08-14T21:43:36.5735707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5735790Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5736056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5736146Z layer_outputs = layer_module( 2025-08-14T21:43:36.5736403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5736549Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5736804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5736922Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5737175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5737254Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5737257Z 2025-08-14T21:43:36.5737356Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5737538Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5737596Z return mod(**inputs) 2025-08-14T21:43:36.5737860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5737924Z outputs = self.mobilebert( 2025-08-14T21:43:36.5738183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5738248Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5738499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5738573Z layer_outputs = layer_module( 2025-08-14T21:43:36.5738825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5738976Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5739231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5739341Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5739603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5739711Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5739973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5740055Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5740058Z 2025-08-14T21:43:36.5740149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5740340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5740415Z return mod(**inputs) 2025-08-14T21:43:36.5740691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5740764Z outputs = self.mobilebert( 2025-08-14T21:43:36.5741019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5741092Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5741346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5741427Z layer_outputs = layer_module( 2025-08-14T21:43:36.5741689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5741852Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5742115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5742216Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5742467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5742549Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5742553Z 2025-08-14T21:43:36.5742644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5742822Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5742889Z return mod(**inputs) 2025-08-14T21:43:36.5743144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5743214Z outputs = self.mobilebert( 2025-08-14T21:43:36.5743466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5743530Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5743790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5743852Z layer_outputs = layer_module( 2025-08-14T21:43:36.5744107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5744250Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5744504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5744607Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5744963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5745054Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5745310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5745391Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5745395Z 2025-08-14T21:43:36.5745495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5745672Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5745734Z return mod(**inputs) 2025-08-14T21:43:36.5745999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5746063Z outputs = self.mobilebert( 2025-08-14T21:43:36.5746360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5746427Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5746681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5746755Z layer_outputs = layer_module( 2025-08-14T21:43:36.5747009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5747093Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5747347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5747428Z self_outputs = self.self( 2025-08-14T21:43:36.5747687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5747770Z self.query(query_tensor) 2025-08-14T21:43:36.5747773Z 2025-08-14T21:43:36.5747868Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5748056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5748115Z return mod(**inputs) 2025-08-14T21:43:36.5748379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5748443Z outputs = self.mobilebert( 2025-08-14T21:43:36.5748699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5748771Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5749025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5749091Z layer_outputs = layer_module( 2025-08-14T21:43:36.5749356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5749434Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5749696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5749760Z self_outputs = self.self( 2025-08-14T21:43:36.5750017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5750085Z self.key(key_tensor) 2025-08-14T21:43:36.5750089Z 2025-08-14T21:43:36.5750183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5750369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5750430Z return mod(**inputs) 2025-08-14T21:43:36.5750689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5750760Z outputs = self.mobilebert( 2025-08-14T21:43:36.5751015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5751080Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5751344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5751407Z layer_outputs = layer_module( 2025-08-14T21:43:36.5751670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5751748Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5752017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5752091Z self_outputs = self.self( 2025-08-14T21:43:36.5752358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5752432Z self.value(value_tensor) 2025-08-14T21:43:36.5752435Z 2025-08-14T21:43:36.5752509Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5752582Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5752681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5752861Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5752938Z return mod(**inputs) 2025-08-14T21:43:36.5753207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5753287Z outputs = self.mobilebert( 2025-08-14T21:43:36.5753557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5753622Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5753880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5753952Z layer_outputs = layer_module( 2025-08-14T21:43:36.5754209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5754283Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5754550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5754661Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5754928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5755005Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5755009Z 2025-08-14T21:43:36.5755100Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5755287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5755344Z return mod(**inputs) 2025-08-14T21:43:36.5755611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5755673Z outputs = self.mobilebert( 2025-08-14T21:43:36.5755928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5756001Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5756257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5756321Z layer_outputs = layer_module( 2025-08-14T21:43:36.5756585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5756731Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5756995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5757093Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5757350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5757430Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5757434Z 2025-08-14T21:43:36.5757524Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5757731Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5757804Z return mod(**inputs) 2025-08-14T21:43:36.5758063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5758132Z outputs = self.mobilebert( 2025-08-14T21:43:36.5758387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5758457Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5758708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5758797Z layer_outputs = layer_module( 2025-08-14T21:43:36.5759057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5759149Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5759405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5759522Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5759775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5759892Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5760146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5760229Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5760232Z 2025-08-14T21:43:36.5760331Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5760509Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5760577Z return mod(**inputs) 2025-08-14T21:43:36.5760837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5760899Z outputs = self.mobilebert( 2025-08-14T21:43:36.5761156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5761220Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5761472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5761542Z layer_outputs = layer_module( 2025-08-14T21:43:36.5761795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5761885Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5762142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5762240Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5762498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5762573Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5762576Z 2025-08-14T21:43:36.5762675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5762852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5762911Z return mod(**inputs) 2025-08-14T21:43:36.5763175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5763236Z outputs = self.mobilebert( 2025-08-14T21:43:36.5763521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5763593Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5763844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5763914Z layer_outputs = layer_module( 2025-08-14T21:43:36.5764167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5764251Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5764509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5764628Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5764887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5765005Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5765008Z 2025-08-14T21:43:36.5765101Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5765289Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5765347Z return mod(**inputs) 2025-08-14T21:43:36.5765602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5765672Z outputs = self.mobilebert( 2025-08-14T21:43:36.5765924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5765995Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5766252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5766315Z layer_outputs = layer_module( 2025-08-14T21:43:36.5766574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5766659Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5766919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5767031Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5767283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5767368Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5767371Z 2025-08-14T21:43:36.5767462Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5767651Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5767710Z return mod(**inputs) 2025-08-14T21:43:36.5767969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5768037Z outputs = self.mobilebert( 2025-08-14T21:43:36.5768291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5768354Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5768614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5768678Z layer_outputs = layer_module( 2025-08-14T21:43:36.5768935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5769020Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5769298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5769419Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5769672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5769779Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5770038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5770136Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5770139Z 2025-08-14T21:43:36.5770237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5770418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5770493Z return mod(**inputs) 2025-08-14T21:43:36.5770767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5770832Z outputs = self.mobilebert( 2025-08-14T21:43:36.5771091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5771157Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5771411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5771482Z layer_outputs = layer_module( 2025-08-14T21:43:36.5771738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5771828Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5772083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5772183Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5772444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5772521Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5772524Z 2025-08-14T21:43:36.5772616Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5772800Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5772862Z return mod(**inputs) 2025-08-14T21:43:36.5773128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5773192Z outputs = self.mobilebert( 2025-08-14T21:43:36.5773449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5773525Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5773779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5773849Z layer_outputs = layer_module( 2025-08-14T21:43:36.5774102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5774186Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5774446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5774547Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5774814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5774943Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5774947Z 2025-08-14T21:43:36.5775042Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5775227Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5775287Z return mod(**inputs) 2025-08-14T21:43:36.5775541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5775612Z outputs = self.mobilebert( 2025-08-14T21:43:36.5775861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5775948Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5776202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5776284Z layer_outputs = layer_module( 2025-08-14T21:43:36.5776546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5776630Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5776882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5777000Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5777253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5777333Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5777336Z 2025-08-14T21:43:36.5777426Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5777606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5777672Z return mod(**inputs) 2025-08-14T21:43:36.5777931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5778000Z outputs = self.mobilebert( 2025-08-14T21:43:36.5778250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5778315Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5778571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5778636Z layer_outputs = layer_module( 2025-08-14T21:43:36.5778888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5778977Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5779233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5779350Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5779600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5779710Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5779970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5780052Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5780056Z 2025-08-14T21:43:36.5780155Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5780334Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5780409Z return mod(**inputs) 2025-08-14T21:43:36.5780694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5780760Z outputs = self.mobilebert( 2025-08-14T21:43:36.5781016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5781088Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5781340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5781408Z layer_outputs = layer_module( 2025-08-14T21:43:36.5781680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5781763Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5782046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5782146Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5782407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5782481Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5782485Z 2025-08-14T21:43:36.5782577Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5782764Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5782824Z return mod(**inputs) 2025-08-14T21:43:36.5783082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5783153Z outputs = self.mobilebert( 2025-08-14T21:43:36.5783409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5783481Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5783733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5783795Z layer_outputs = layer_module( 2025-08-14T21:43:36.5784054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5784138Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5784396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5784496Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5785002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5785119Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5785123Z 2025-08-14T21:43:36.5785217Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5785405Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5785465Z return mod(**inputs) 2025-08-14T21:43:36.5785721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5785793Z outputs = self.mobilebert( 2025-08-14T21:43:36.5786046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5786112Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5786371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5786476Z layer_outputs = layer_module( 2025-08-14T21:43:36.5786764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5786852Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5787104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5787220Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5787472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5787576Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5787580Z 2025-08-14T21:43:36.5787672Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5787878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5787946Z return mod(**inputs) 2025-08-14T21:43:36.5788205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5788269Z outputs = self.mobilebert( 2025-08-14T21:43:36.5788531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5788596Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5788854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5788919Z layer_outputs = layer_module( 2025-08-14T21:43:36.5789172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5789263Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5789517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5789635Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5789887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5789996Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5790253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5790336Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5790340Z 2025-08-14T21:43:36.5790431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5790617Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5790678Z return mod(**inputs) 2025-08-14T21:43:36.5790939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5791002Z outputs = self.mobilebert( 2025-08-14T21:43:36.5791254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5791325Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5791575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5791643Z layer_outputs = layer_module( 2025-08-14T21:43:36.5791895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5792002Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5792293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5792370Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5792374Z 2025-08-14T21:43:36.5792471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5792653Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5792711Z return mod(**inputs) 2025-08-14T21:43:36.5792978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5793041Z outputs = self.mobilebert( 2025-08-14T21:43:36.5793313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5793382Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5793652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5793721Z layer_outputs = layer_module( 2025-08-14T21:43:36.5793973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5794078Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5794337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5794435Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5794438Z 2025-08-14T21:43:36.5794535Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5794711Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5794768Z return mod(**inputs) 2025-08-14T21:43:36.5795034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5795098Z outputs = self.mobilebert( 2025-08-14T21:43:36.5795348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5795418Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5795669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5795739Z layer_outputs = layer_module( 2025-08-14T21:43:36.5795989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5796134Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5796391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5796477Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5796482Z 2025-08-14T21:43:36.5796580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5796756Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5796814Z return mod(**inputs) 2025-08-14T21:43:36.5797074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5797137Z outputs = self.mobilebert( 2025-08-14T21:43:36.5797387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5797461Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5797712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5797796Z layer_outputs = layer_module( 2025-08-14T21:43:36.5798067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5798212Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5798470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5798581Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5798835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5798934Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5798937Z 2025-08-14T21:43:36.5799029Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5799236Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5799297Z return mod(**inputs) 2025-08-14T21:43:36.5799557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5799629Z outputs = self.mobilebert( 2025-08-14T21:43:36.5799884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5799955Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5800209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5800274Z layer_outputs = layer_module( 2025-08-14T21:43:36.5800536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5800680Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5800941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5801054Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5801309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5801390Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5801393Z 2025-08-14T21:43:36.5801483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5801665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5801731Z return mod(**inputs) 2025-08-14T21:43:36.5801989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5802061Z outputs = self.mobilebert( 2025-08-14T21:43:36.5802316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5802381Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5802644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5802707Z layer_outputs = layer_module( 2025-08-14T21:43:36.5802969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5803111Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5803364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5803507Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5803775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5803893Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5804147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5804228Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5804231Z 2025-08-14T21:43:36.5804330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5804510Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5804586Z return mod(**inputs) 2025-08-14T21:43:36.5804853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5804937Z outputs = self.mobilebert( 2025-08-14T21:43:36.5805202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5805266Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5805522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5805592Z layer_outputs = layer_module( 2025-08-14T21:43:36.5805850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5806003Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5806263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5806363Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5806626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5806699Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5806702Z 2025-08-14T21:43:36.5806793Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5806978Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5807035Z return mod(**inputs) 2025-08-14T21:43:36.5807299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5807363Z outputs = self.mobilebert( 2025-08-14T21:43:36.5807621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5807694Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5807952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5808021Z layer_outputs = layer_module( 2025-08-14T21:43:36.5808278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5808421Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5808686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5808782Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5809039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5809121Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5809418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5809521Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5809525Z 2025-08-14T21:43:36.5809618Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5809798Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5809863Z return mod(**inputs) 2025-08-14T21:43:36.5810118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5810190Z outputs = self.mobilebert( 2025-08-14T21:43:36.5810460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5810525Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5810801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5810865Z layer_outputs = layer_module( 2025-08-14T21:43:36.5811115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5811198Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5811450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5811517Z self_outputs = self.self( 2025-08-14T21:43:36.5811768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5811834Z self.query(query_tensor) 2025-08-14T21:43:36.5811837Z 2025-08-14T21:43:36.5811935Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5812117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5812182Z return mod(**inputs) 2025-08-14T21:43:36.5812437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5812500Z outputs = self.mobilebert( 2025-08-14T21:43:36.5812759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5812825Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5813076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5813148Z layer_outputs = layer_module( 2025-08-14T21:43:36.5813398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5813484Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5813738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5813799Z self_outputs = self.self( 2025-08-14T21:43:36.5814056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5814115Z self.key(key_tensor) 2025-08-14T21:43:36.5814118Z 2025-08-14T21:43:36.5814216Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5814392Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5814451Z return mod(**inputs) 2025-08-14T21:43:36.5814713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5814775Z outputs = self.mobilebert( 2025-08-14T21:43:36.5815044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5815134Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5815386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5815457Z layer_outputs = layer_module( 2025-08-14T21:43:36.5815708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5815783Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5816039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5816132Z self_outputs = self.self( 2025-08-14T21:43:36.5816394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5816479Z self.value(value_tensor) 2025-08-14T21:43:36.5816482Z 2025-08-14T21:43:36.5816557Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5816635Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5816727Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5816905Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5816969Z return mod(**inputs) 2025-08-14T21:43:36.5817223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5817293Z outputs = self.mobilebert( 2025-08-14T21:43:36.5817546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5817613Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5817875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5817939Z layer_outputs = layer_module( 2025-08-14T21:43:36.5818193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5818275Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5818526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5818643Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5818895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5818972Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5818976Z 2025-08-14T21:43:36.5819073Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5819253Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5819318Z return mod(**inputs) 2025-08-14T21:43:36.5819573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5819635Z outputs = self.mobilebert( 2025-08-14T21:43:36.5819894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5819958Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5820209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5820280Z layer_outputs = layer_module( 2025-08-14T21:43:36.5820533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5820703Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5820979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5821080Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5821343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5821417Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5821421Z 2025-08-14T21:43:36.5821518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5821714Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5821773Z return mod(**inputs) 2025-08-14T21:43:36.5822036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5822123Z outputs = self.mobilebert( 2025-08-14T21:43:36.5822378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5822448Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5822702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5822768Z layer_outputs = layer_module( 2025-08-14T21:43:36.5823022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5823098Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5823361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5823473Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5823734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5823847Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5824100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5824189Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5824192Z 2025-08-14T21:43:36.5824282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5824466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5824526Z return mod(**inputs) 2025-08-14T21:43:36.5824845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5824924Z outputs = self.mobilebert( 2025-08-14T21:43:36.5825180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5825245Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5825506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5825569Z layer_outputs = layer_module( 2025-08-14T21:43:36.5825831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5825917Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5826172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5826277Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5826550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5826650Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5826654Z 2025-08-14T21:43:36.5826749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5826927Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5826994Z return mod(**inputs) 2025-08-14T21:43:36.5827251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5827315Z outputs = self.mobilebert( 2025-08-14T21:43:36.5827595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5827661Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5827941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5828007Z layer_outputs = layer_module( 2025-08-14T21:43:36.5828261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5828353Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5828605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5828709Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5828962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5829062Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5829066Z 2025-08-14T21:43:36.5829164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5829344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5829403Z return mod(**inputs) 2025-08-14T21:43:36.5829668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5829730Z outputs = self.mobilebert( 2025-08-14T21:43:36.5829993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5830056Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5830306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5830377Z layer_outputs = layer_module( 2025-08-14T21:43:36.5830629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5830724Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5830975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5831087Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5831348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5831421Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5831424Z 2025-08-14T21:43:36.5831521Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5831704Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5831763Z return mod(**inputs) 2025-08-14T21:43:36.5832024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5832104Z outputs = self.mobilebert( 2025-08-14T21:43:36.5832374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5832449Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5832698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5832768Z layer_outputs = layer_module( 2025-08-14T21:43:36.5833016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5833116Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5833373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5833501Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5833757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5833873Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5834125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5834212Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5834215Z 2025-08-14T21:43:36.5834305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5834483Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5834551Z return mod(**inputs) 2025-08-14T21:43:36.5834807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5834877Z outputs = self.mobilebert( 2025-08-14T21:43:36.5835130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5835194Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5835451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5835514Z layer_outputs = layer_module( 2025-08-14T21:43:36.5835765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5835854Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5836106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5836208Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5836461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5836536Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5836539Z 2025-08-14T21:43:36.5836637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5836813Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5836877Z return mod(**inputs) 2025-08-14T21:43:36.5837133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5837195Z outputs = self.mobilebert( 2025-08-14T21:43:36.5837454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5837517Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5837792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5837871Z layer_outputs = layer_module( 2025-08-14T21:43:36.5838124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5838213Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5838464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5838560Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5838817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5838934Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5838937Z 2025-08-14T21:43:36.5839034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5839229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5839289Z return mod(**inputs) 2025-08-14T21:43:36.5839551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5839613Z outputs = self.mobilebert( 2025-08-14T21:43:36.5839870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5839935Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5840184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5840256Z layer_outputs = layer_module( 2025-08-14T21:43:36.5840505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5840591Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5840852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5840962Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5841218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5841292Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5841295Z 2025-08-14T21:43:36.5841386Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5841570Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5841630Z return mod(**inputs) 2025-08-14T21:43:36.5841890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5841956Z outputs = self.mobilebert( 2025-08-14T21:43:36.5842209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5842280Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5842530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5842594Z layer_outputs = layer_module( 2025-08-14T21:43:36.5842855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5842939Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5843197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5843308Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5843591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5843711Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5843966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5844052Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5844056Z 2025-08-14T21:43:36.5844146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5844325Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5844416Z return mod(**inputs) 2025-08-14T21:43:36.5844672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5844752Z outputs = self.mobilebert( 2025-08-14T21:43:36.5845022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5845085Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5845352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5845415Z layer_outputs = layer_module( 2025-08-14T21:43:36.5845675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5845764Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5846023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5846127Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5846389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5846464Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5846468Z 2025-08-14T21:43:36.5846565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5846748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5846806Z return mod(**inputs) 2025-08-14T21:43:36.5847074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5847136Z outputs = self.mobilebert( 2025-08-14T21:43:36.5847405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5847470Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5847730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5847802Z layer_outputs = layer_module( 2025-08-14T21:43:36.5848062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5848152Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5848410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5848510Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5848775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5848877Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5848881Z 2025-08-14T21:43:36.5848977Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5849178Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5849254Z return mod(**inputs) 2025-08-14T21:43:36.5849522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5849584Z outputs = self.mobilebert( 2025-08-14T21:43:36.5849835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5849905Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5850157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5850242Z layer_outputs = layer_module( 2025-08-14T21:43:36.5850500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5850603Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5850862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5850975Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5851230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5851306Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5851309Z 2025-08-14T21:43:36.5851400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5851584Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5851645Z return mod(**inputs) 2025-08-14T21:43:36.5851900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5851974Z outputs = self.mobilebert( 2025-08-14T21:43:36.5852226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5852296Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5852548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5852611Z layer_outputs = layer_module( 2025-08-14T21:43:36.5852867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5852950Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5853208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5853319Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5853576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5853691Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5853947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5854029Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5854038Z 2025-08-14T21:43:36.5854131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5854309Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5854376Z return mod(**inputs) 2025-08-14T21:43:36.5854631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5854695Z outputs = self.mobilebert( 2025-08-14T21:43:36.5854980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5855046Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5855303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5855366Z layer_outputs = layer_module( 2025-08-14T21:43:36.5855620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5855733Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5856007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5856082Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5856107Z 2025-08-14T21:43:36.5856204Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5856385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5856449Z return mod(**inputs) 2025-08-14T21:43:36.5856706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5856768Z outputs = self.mobilebert( 2025-08-14T21:43:36.5857029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5857094Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5857353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5857418Z layer_outputs = layer_module( 2025-08-14T21:43:36.5857673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5857788Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5858041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5858141Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5858151Z 2025-08-14T21:43:36.5858244Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5858423Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5858489Z return mod(**inputs) 2025-08-14T21:43:36.5858748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5858812Z outputs = self.mobilebert( 2025-08-14T21:43:36.5859072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5859136Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5859396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5859458Z layer_outputs = layer_module( 2025-08-14T21:43:36.5859710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5859859Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5860112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5860198Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5860208Z 2025-08-14T21:43:36.5860299Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5860493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5860574Z return mod(**inputs) 2025-08-14T21:43:36.5860832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5860895Z outputs = self.mobilebert( 2025-08-14T21:43:36.5861151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5861214Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5861470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5861550Z layer_outputs = layer_module( 2025-08-14T21:43:36.5861803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5861974Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5862228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5862345Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5862598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5862680Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5862683Z 2025-08-14T21:43:36.5862781Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5862963Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5863022Z return mod(**inputs) 2025-08-14T21:43:36.5863285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5863350Z outputs = self.mobilebert( 2025-08-14T21:43:36.5863610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5863673Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5863925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5863997Z layer_outputs = layer_module( 2025-08-14T21:43:36.5864249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5864394Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5864647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5864823Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5865093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5865170Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5865174Z 2025-08-14T21:43:36.5865266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5865451Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5865509Z return mod(**inputs) 2025-08-14T21:43:36.5865772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5865837Z outputs = self.mobilebert( 2025-08-14T21:43:36.5866092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5866166Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5866462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5866534Z layer_outputs = layer_module( 2025-08-14T21:43:36.5866789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5866930Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5867187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5867316Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5867570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5867700Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5867952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5868038Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5868041Z 2025-08-14T21:43:36.5868133Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5868313Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5868377Z return mod(**inputs) 2025-08-14T21:43:36.5868630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5868700Z outputs = self.mobilebert( 2025-08-14T21:43:36.5868953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5869019Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5869281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5869344Z layer_outputs = layer_module( 2025-08-14T21:43:36.5869603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5869748Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5870001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5870106Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5870358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5870430Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5870442Z 2025-08-14T21:43:36.5870536Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5870718Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5870784Z return mod(**inputs) 2025-08-14T21:43:36.5871040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5871103Z outputs = self.mobilebert( 2025-08-14T21:43:36.5871360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5871423Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5871683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5871746Z layer_outputs = layer_module( 2025-08-14T21:43:36.5872012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5872183Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5872439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5872538Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5872799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5872875Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5873148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5873228Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5873247Z 2025-08-14T21:43:36.5873342Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5873530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5873588Z return mod(**inputs) 2025-08-14T21:43:36.5873851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5873914Z outputs = self.mobilebert( 2025-08-14T21:43:36.5874168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5874240Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5874492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5874556Z layer_outputs = layer_module( 2025-08-14T21:43:36.5874816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5874896Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5875154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5875218Z self_outputs = self.self( 2025-08-14T21:43:36.5875470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5875538Z self.query(query_tensor) 2025-08-14T21:43:36.5875541Z 2025-08-14T21:43:36.5875633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5875817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5875877Z return mod(**inputs) 2025-08-14T21:43:36.5876132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5876207Z outputs = self.mobilebert( 2025-08-14T21:43:36.5876461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5876527Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5876790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5876853Z layer_outputs = layer_module( 2025-08-14T21:43:36.5877112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5877188Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5877443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5877513Z self_outputs = self.self( 2025-08-14T21:43:36.5877780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5877861Z self.key(key_tensor) 2025-08-14T21:43:36.5877865Z 2025-08-14T21:43:36.5877960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5878138Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5878203Z return mod(**inputs) 2025-08-14T21:43:36.5878456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5878516Z outputs = self.mobilebert( 2025-08-14T21:43:36.5878792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5878857Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5879115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5879196Z layer_outputs = layer_module( 2025-08-14T21:43:36.5879448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5879528Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5879779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5879848Z self_outputs = self.self( 2025-08-14T21:43:36.5880098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5880161Z self.value(value_tensor) 2025-08-14T21:43:36.5880164Z 2025-08-14T21:43:36.5880241Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5880311Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5880407Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5880595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5880654Z return mod(**inputs) 2025-08-14T21:43:36.5880912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5880974Z outputs = self.mobilebert( 2025-08-14T21:43:36.5881223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5881292Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5881542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5881607Z layer_outputs = layer_module( 2025-08-14T21:43:36.5881873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5881948Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5882206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5882318Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5882570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5882652Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5882655Z 2025-08-14T21:43:36.5882746Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5882933Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5882991Z return mod(**inputs) 2025-08-14T21:43:36.5883262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5883337Z outputs = self.mobilebert( 2025-08-14T21:43:36.5883605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5883672Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5883932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5883994Z layer_outputs = layer_module( 2025-08-14T21:43:36.5884256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5884416Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5884814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5884964Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5885219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5885301Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5885305Z 2025-08-14T21:43:36.5885400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5885583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5885652Z return mod(**inputs) 2025-08-14T21:43:36.5885911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5885980Z outputs = self.mobilebert( 2025-08-14T21:43:36.5886244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5886314Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5886579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5886647Z layer_outputs = layer_module( 2025-08-14T21:43:36.5886905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5886993Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5887248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5887372Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5887630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5887748Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5888015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5888102Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5888106Z 2025-08-14T21:43:36.5888207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5888389Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5888450Z return mod(**inputs) 2025-08-14T21:43:36.5888717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5888783Z outputs = self.mobilebert( 2025-08-14T21:43:36.5889038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5889121Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5889427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5889497Z layer_outputs = layer_module( 2025-08-14T21:43:36.5889744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5889825Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5890079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5890173Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5890553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5890628Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5890648Z 2025-08-14T21:43:36.5890743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5890932Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5890992Z return mod(**inputs) 2025-08-14T21:43:36.5891248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5891321Z outputs = self.mobilebert( 2025-08-14T21:43:36.5891575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5891646Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5891901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5891966Z layer_outputs = layer_module( 2025-08-14T21:43:36.5892231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5892318Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5892582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5892683Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5892939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5893047Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5893050Z 2025-08-14T21:43:36.5893142Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5893318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5893381Z return mod(**inputs) 2025-08-14T21:43:36.5893640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5893711Z outputs = self.mobilebert( 2025-08-14T21:43:36.5893960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5894022Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5894276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5894335Z layer_outputs = layer_module( 2025-08-14T21:43:36.5894589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5894675Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5894927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5895064Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5895333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5895411Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5895420Z 2025-08-14T21:43:36.5895513Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5895694Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5895759Z return mod(**inputs) 2025-08-14T21:43:36.5896016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5896098Z outputs = self.mobilebert( 2025-08-14T21:43:36.5896359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5896463Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5896723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5896787Z layer_outputs = layer_module( 2025-08-14T21:43:36.5897039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5897126Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5897373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5897482Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5897734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5897838Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5898103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5898184Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5898188Z 2025-08-14T21:43:36.5898280Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5898466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5898523Z return mod(**inputs) 2025-08-14T21:43:36.5898788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5898853Z outputs = self.mobilebert( 2025-08-14T21:43:36.5899106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5899179Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5899435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5899499Z layer_outputs = layer_module( 2025-08-14T21:43:36.5899757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5899841Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5900101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5900201Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5900455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5900539Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5900543Z 2025-08-14T21:43:36.5900636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5900850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5900913Z return mod(**inputs) 2025-08-14T21:43:36.5901168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5901235Z outputs = self.mobilebert( 2025-08-14T21:43:36.5901483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5901545Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5901797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5901879Z layer_outputs = layer_module( 2025-08-14T21:43:36.5902141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5902238Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5902491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5902590Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5902839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5902941Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5902944Z 2025-08-14T21:43:36.5903035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5903214Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5903281Z return mod(**inputs) 2025-08-14T21:43:36.5903536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5903605Z outputs = self.mobilebert( 2025-08-14T21:43:36.5903858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5903921Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5904179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5904242Z layer_outputs = layer_module( 2025-08-14T21:43:36.5904492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5904581Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5904888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5905011Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5905265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5905336Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5905340Z 2025-08-14T21:43:36.5905433Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5905609Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5905674Z return mod(**inputs) 2025-08-14T21:43:36.5905933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5906000Z outputs = self.mobilebert( 2025-08-14T21:43:36.5906262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5906327Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5906614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5906687Z layer_outputs = layer_module( 2025-08-14T21:43:36.5906940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5907031Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5907283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5907394Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5907671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5907779Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5908068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5908152Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5908155Z 2025-08-14T21:43:36.5908246Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5908433Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5908494Z return mod(**inputs) 2025-08-14T21:43:36.5908750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5908822Z outputs = self.mobilebert( 2025-08-14T21:43:36.5909074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5909145Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5909401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5909464Z layer_outputs = layer_module( 2025-08-14T21:43:36.5909722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5909801Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5910054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5910150Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5910399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5910476Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5910481Z 2025-08-14T21:43:36.5910569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5910755Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5910813Z return mod(**inputs) 2025-08-14T21:43:36.5911066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5911131Z outputs = self.mobilebert( 2025-08-14T21:43:36.5911382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5911446Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5911706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5911771Z layer_outputs = layer_module( 2025-08-14T21:43:36.5912029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5912129Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5912400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5912509Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5912761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5912860Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5912869Z 2025-08-14T21:43:36.5912960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5913159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5913223Z return mod(**inputs) 2025-08-14T21:43:36.5913478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5913558Z outputs = self.mobilebert( 2025-08-14T21:43:36.5913820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5913885Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5914143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5914206Z layer_outputs = layer_module( 2025-08-14T21:43:36.5914454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5914538Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5914786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5914897Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5915155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5915226Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5915229Z 2025-08-14T21:43:36.5915325Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5915503Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5915561Z return mod(**inputs) 2025-08-14T21:43:36.5915826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5915887Z outputs = self.mobilebert( 2025-08-14T21:43:36.5916144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5916211Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5916467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5916538Z layer_outputs = layer_module( 2025-08-14T21:43:36.5916791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5916881Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5917134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5917249Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5917509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5917618Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5917900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5917992Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5917996Z 2025-08-14T21:43:36.5918088Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5918274Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5918334Z return mod(**inputs) 2025-08-14T21:43:36.5918589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5918677Z outputs = self.mobilebert( 2025-08-14T21:43:36.5918937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5919008Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5919280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5919345Z layer_outputs = layer_module( 2025-08-14T21:43:36.5919603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5919712Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5919967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5920047Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5920050Z 2025-08-14T21:43:36.5920142Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5920331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5920390Z return mod(**inputs) 2025-08-14T21:43:36.5920649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5920722Z outputs = self.mobilebert( 2025-08-14T21:43:36.5920976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5921046Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5921299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5921364Z layer_outputs = layer_module( 2025-08-14T21:43:36.5921623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5921732Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5921983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5922093Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5922097Z 2025-08-14T21:43:36.5922191Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5922378Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5922438Z return mod(**inputs) 2025-08-14T21:43:36.5922696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5922767Z outputs = self.mobilebert( 2025-08-14T21:43:36.5923022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5923095Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5923347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5923428Z layer_outputs = layer_module( 2025-08-14T21:43:36.5923704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5923852Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5924108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5924199Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5924203Z 2025-08-14T21:43:36.5924290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5924484Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5924539Z return mod(**inputs) 2025-08-14T21:43:36.5924794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5924880Z outputs = self.mobilebert( 2025-08-14T21:43:36.5925135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5925205Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5925464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5925525Z layer_outputs = layer_module( 2025-08-14T21:43:36.5925788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5925931Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5926187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5926308Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5926564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5926650Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5926653Z 2025-08-14T21:43:36.5926743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5926923Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5926985Z return mod(**inputs) 2025-08-14T21:43:36.5927247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5927316Z outputs = self.mobilebert( 2025-08-14T21:43:36.5927571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5927639Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5927901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5927964Z layer_outputs = layer_module( 2025-08-14T21:43:36.5928229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5928371Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5928627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5928747Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5929000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5929073Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5929096Z 2025-08-14T21:43:36.5929199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5929378Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5929439Z return mod(**inputs) 2025-08-14T21:43:36.5929694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5929752Z outputs = self.mobilebert( 2025-08-14T21:43:36.5930009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5930096Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5930354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5930435Z layer_outputs = layer_module( 2025-08-14T21:43:36.5930698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5930846Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5931105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5931216Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5931481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5931591Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5931857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5931942Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5931945Z 2025-08-14T21:43:36.5932039Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5932234Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5932293Z return mod(**inputs) 2025-08-14T21:43:36.5932560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5932622Z outputs = self.mobilebert( 2025-08-14T21:43:36.5932878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5932946Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5933200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5933259Z layer_outputs = layer_module( 2025-08-14T21:43:36.5933519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5933667Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5933926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5934024Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5934281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5934362Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5934367Z 2025-08-14T21:43:36.5934459Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5934648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5934708Z return mod(**inputs) 2025-08-14T21:43:36.5935002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5935076Z outputs = self.mobilebert( 2025-08-14T21:43:36.5935329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5935401Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5935654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5935717Z layer_outputs = layer_module( 2025-08-14T21:43:36.5935974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5936136Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5936412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5936518Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5936775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.5936856Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.5937112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5937192Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5937195Z 2025-08-14T21:43:36.5937293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5937472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5937537Z return mod(**inputs) 2025-08-14T21:43:36.5937800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5937864Z outputs = self.mobilebert( 2025-08-14T21:43:36.5938127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5938190Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5938445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5938515Z layer_outputs = layer_module( 2025-08-14T21:43:36.5938768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5938853Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5939104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5939171Z self_outputs = self.self( 2025-08-14T21:43:36.5939432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.5939493Z self.query(query_tensor) 2025-08-14T21:43:36.5939496Z 2025-08-14T21:43:36.5939587Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5939764Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5939818Z return mod(**inputs) 2025-08-14T21:43:36.5940076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5940134Z outputs = self.mobilebert( 2025-08-14T21:43:36.5940385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5940453Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5940739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5940808Z layer_outputs = layer_module( 2025-08-14T21:43:36.5941061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5941138Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5941394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5941458Z self_outputs = self.self( 2025-08-14T21:43:36.5941733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.5941792Z self.key(key_tensor) 2025-08-14T21:43:36.5941795Z 2025-08-14T21:43:36.5941903Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5942091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5942149Z return mod(**inputs) 2025-08-14T21:43:36.5942404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5942473Z outputs = self.mobilebert( 2025-08-14T21:43:36.5942722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5942792Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5943043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5943106Z layer_outputs = layer_module( 2025-08-14T21:43:36.5943363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5943438Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5943693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.5943753Z self_outputs = self.self( 2025-08-14T21:43:36.5944000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.5944062Z self.value(value_tensor) 2025-08-14T21:43:36.5944065Z 2025-08-14T21:43:36.5944136Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5944207Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.5944310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5944489Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5944553Z return mod(**inputs) 2025-08-14T21:43:36.5944877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5944945Z outputs = self.mobilebert( 2025-08-14T21:43:36.5945207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5945271Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5945528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5945599Z layer_outputs = layer_module( 2025-08-14T21:43:36.5945853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5945939Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5946196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5946329Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5946608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.5946688Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5946692Z 2025-08-14T21:43:36.5946791Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5946971Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5947032Z return mod(**inputs) 2025-08-14T21:43:36.5947297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5947377Z outputs = self.mobilebert( 2025-08-14T21:43:36.5947631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5947721Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5947978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5948043Z layer_outputs = layer_module( 2025-08-14T21:43:36.5948297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5948443Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5948704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.5948804Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.5949068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5949144Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5949148Z 2025-08-14T21:43:36.5949242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5949428Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5949486Z return mod(**inputs) 2025-08-14T21:43:36.5949744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5949814Z outputs = self.mobilebert( 2025-08-14T21:43:36.5950068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5950142Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5950396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5950460Z layer_outputs = layer_module( 2025-08-14T21:43:36.5950722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.5950798Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.5951058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.5951169Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.5951422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.5951543Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5951799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5951882Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5951886Z 2025-08-14T21:43:36.5951990Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5952193Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5952257Z return mod(**inputs) 2025-08-14T21:43:36.5952513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5952575Z outputs = self.mobilebert( 2025-08-14T21:43:36.5952838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5952901Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5953180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5953243Z layer_outputs = layer_module( 2025-08-14T21:43:36.5953515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5953608Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5953860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5953967Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5954220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5954294Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5954297Z 2025-08-14T21:43:36.5954395Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5954571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5954630Z return mod(**inputs) 2025-08-14T21:43:36.5954896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5954961Z outputs = self.mobilebert( 2025-08-14T21:43:36.5955221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5955284Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5955538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5955606Z layer_outputs = layer_module( 2025-08-14T21:43:36.5955857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5955951Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5956203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5956303Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5956562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5956661Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5956664Z 2025-08-14T21:43:36.5956755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5956939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5956996Z return mod(**inputs) 2025-08-14T21:43:36.5957253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5957317Z outputs = self.mobilebert( 2025-08-14T21:43:36.5957569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5957651Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5957927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5957992Z layer_outputs = layer_module( 2025-08-14T21:43:36.5958247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5958332Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5958591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5958724Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5958980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5959079Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5959083Z 2025-08-14T21:43:36.5959178Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5959362Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5959420Z return mod(**inputs) 2025-08-14T21:43:36.5959675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5959745Z outputs = self.mobilebert( 2025-08-14T21:43:36.5960000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5960073Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5960328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5960393Z layer_outputs = layer_module( 2025-08-14T21:43:36.5960657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5960742Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5960995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5961107Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5961356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5961471Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5961729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5961809Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5961814Z 2025-08-14T21:43:36.5961913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5962093Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5962158Z return mod(**inputs) 2025-08-14T21:43:36.5962417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5962480Z outputs = self.mobilebert( 2025-08-14T21:43:36.5962743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5962805Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5963060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5963130Z layer_outputs = layer_module( 2025-08-14T21:43:36.5963400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5963505Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5963759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5963858Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5964117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5964187Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5964190Z 2025-08-14T21:43:36.5964299Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5964477Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5964535Z return mod(**inputs) 2025-08-14T21:43:36.5964814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5964876Z outputs = self.mobilebert( 2025-08-14T21:43:36.5965130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5965201Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5965457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5965527Z layer_outputs = layer_module( 2025-08-14T21:43:36.5965782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5965870Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5966138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5966243Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5966512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5966613Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5966617Z 2025-08-14T21:43:36.5966709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5966902Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5966963Z return mod(**inputs) 2025-08-14T21:43:36.5967243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5967308Z outputs = self.mobilebert( 2025-08-14T21:43:36.5967565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5967639Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5967903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5967968Z layer_outputs = layer_module( 2025-08-14T21:43:36.5968235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5968317Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5968577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5968695Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5968953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5969034Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5969053Z 2025-08-14T21:43:36.5969168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5969355Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5969410Z return mod(**inputs) 2025-08-14T21:43:36.5969674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5969745Z outputs = self.mobilebert( 2025-08-14T21:43:36.5970003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5970086Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5970362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5970443Z layer_outputs = layer_module( 2025-08-14T21:43:36.5970715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5970800Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5971067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5971190Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5971456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5971574Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5971844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5971926Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5971931Z 2025-08-14T21:43:36.5972026Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5972216Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5972274Z return mod(**inputs) 2025-08-14T21:43:36.5972545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5972605Z outputs = self.mobilebert( 2025-08-14T21:43:36.5972873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5972937Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5973207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5973279Z layer_outputs = layer_module( 2025-08-14T21:43:36.5973547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5973641Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5973906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5974004Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5974271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5974346Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5974349Z 2025-08-14T21:43:36.5974446Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5974642Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5974703Z return mod(**inputs) 2025-08-14T21:43:36.5975006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5975090Z outputs = self.mobilebert( 2025-08-14T21:43:36.5975351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5975425Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5975686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5975763Z layer_outputs = layer_module( 2025-08-14T21:43:36.5976061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5976165Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5976432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.5976551Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.5976813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5976918Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5976922Z 2025-08-14T21:43:36.5977016Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5977208Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5977268Z return mod(**inputs) 2025-08-14T21:43:36.5977531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5977604Z outputs = self.mobilebert( 2025-08-14T21:43:36.5977861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5977938Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5978199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5978263Z layer_outputs = layer_module( 2025-08-14T21:43:36.5978527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5978614Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5978874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5978995Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5979250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.5979330Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5979334Z 2025-08-14T21:43:36.5979428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5979607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5979670Z return mod(**inputs) 2025-08-14T21:43:36.5979928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5979994Z outputs = self.mobilebert( 2025-08-14T21:43:36.5980249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5980311Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5980571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5980633Z layer_outputs = layer_module( 2025-08-14T21:43:36.5980920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.5981009Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.5981272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.5981394Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.5981657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.5981769Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5982057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5982140Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5982159Z 2025-08-14T21:43:36.5982262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5982446Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5982506Z return mod(**inputs) 2025-08-14T21:43:36.5982775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5982840Z outputs = self.mobilebert( 2025-08-14T21:43:36.5983105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5983170Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5983428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5983498Z layer_outputs = layer_module( 2025-08-14T21:43:36.5983759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5983871Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5984135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.5984212Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.5984215Z 2025-08-14T21:43:36.5984316Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5984495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5984551Z return mod(**inputs) 2025-08-14T21:43:36.5984997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5985064Z outputs = self.mobilebert( 2025-08-14T21:43:36.5985327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5985392Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5985645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5985717Z layer_outputs = layer_module( 2025-08-14T21:43:36.5985977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.5986089Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.5986354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.5986458Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.5986461Z 2025-08-14T21:43:36.5986559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5986777Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5986863Z return mod(**inputs) 2025-08-14T21:43:36.5987139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5987202Z outputs = self.mobilebert( 2025-08-14T21:43:36.5987468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5987533Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5987793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5987927Z layer_outputs = layer_module( 2025-08-14T21:43:36.5988189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5988363Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5988631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.5988727Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.5988731Z 2025-08-14T21:43:36.5988828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5989006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5989066Z return mod(**inputs) 2025-08-14T21:43:36.5989335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5989402Z outputs = self.mobilebert( 2025-08-14T21:43:36.5989658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5989724Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5989982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5990052Z layer_outputs = layer_module( 2025-08-14T21:43:36.5990306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5990449Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5990715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.5990832Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.5991095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5991180Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5991183Z 2025-08-14T21:43:36.5991275Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5991463Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5991520Z return mod(**inputs) 2025-08-14T21:43:36.5991783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5991848Z outputs = self.mobilebert( 2025-08-14T21:43:36.5992103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5992180Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5992435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5992503Z layer_outputs = layer_module( 2025-08-14T21:43:36.5992796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5992941Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5993200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5993314Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5993569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.5993668Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.5993671Z 2025-08-14T21:43:36.5993761Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5993945Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5994031Z return mod(**inputs) 2025-08-14T21:43:36.5994294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5994366Z outputs = self.mobilebert( 2025-08-14T21:43:36.5994624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5994694Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5994953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5995017Z layer_outputs = layer_module( 2025-08-14T21:43:36.5995280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.5995419Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.5995683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.5995803Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.5996062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.5996179Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.5996440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.5996521Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.5996526Z 2025-08-14T21:43:36.5996625Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5996806Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5996874Z return mod(**inputs) 2025-08-14T21:43:36.5997138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5997202Z outputs = self.mobilebert( 2025-08-14T21:43:36.5997462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5997523Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.5997780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.5997842Z layer_outputs = layer_module( 2025-08-14T21:43:36.5998103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.5998250Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.5998536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.5998637Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.5998895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.5998967Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.5998971Z 2025-08-14T21:43:36.5999068Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.5999246Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.5999320Z return mod(**inputs) 2025-08-14T21:43:36.5999589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.5999652Z outputs = self.mobilebert( 2025-08-14T21:43:36.5999931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.5999998Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6000254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6000328Z layer_outputs = layer_module( 2025-08-14T21:43:36.6000583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6000728Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6000992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6001093Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6001357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6001435Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6001687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6001773Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6001776Z 2025-08-14T21:43:36.6001870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6002050Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6002113Z return mod(**inputs) 2025-08-14T21:43:36.6002371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6002445Z outputs = self.mobilebert( 2025-08-14T21:43:36.6002700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6002771Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6003033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6003099Z layer_outputs = layer_module( 2025-08-14T21:43:36.6003358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6003438Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6003693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6003767Z self_outputs = self.self( 2025-08-14T21:43:36.6004022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6004097Z self.query(query_tensor) 2025-08-14T21:43:36.6004116Z 2025-08-14T21:43:36.6004224Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6004405Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6004469Z return mod(**inputs) 2025-08-14T21:43:36.6004723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6004788Z outputs = self.mobilebert( 2025-08-14T21:43:36.6005042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6005124Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6005381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6005456Z layer_outputs = layer_module( 2025-08-14T21:43:36.6005708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6005789Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6006047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6006118Z self_outputs = self.self( 2025-08-14T21:43:36.6006373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6006435Z self.key(key_tensor) 2025-08-14T21:43:36.6006438Z 2025-08-14T21:43:36.6006538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6006722Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6006782Z return mod(**inputs) 2025-08-14T21:43:36.6007054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6007119Z outputs = self.mobilebert( 2025-08-14T21:43:36.6007381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6007446Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6007701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6007771Z layer_outputs = layer_module( 2025-08-14T21:43:36.6008028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6008115Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6008372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6008440Z self_outputs = self.self( 2025-08-14T21:43:36.6008706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6008770Z self.value(value_tensor) 2025-08-14T21:43:36.6008774Z 2025-08-14T21:43:36.6008846Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6008921Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6009011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6009193Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6009248Z return mod(**inputs) 2025-08-14T21:43:36.6009508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6009574Z outputs = self.mobilebert( 2025-08-14T21:43:36.6009853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6009940Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6010203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6010266Z layer_outputs = layer_module( 2025-08-14T21:43:36.6010525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6010602Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6010856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6010992Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6011244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6011345Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6011348Z 2025-08-14T21:43:36.6011443Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6011624Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6011688Z return mod(**inputs) 2025-08-14T21:43:36.6011940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6012001Z outputs = self.mobilebert( 2025-08-14T21:43:36.6012259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6012326Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6012582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6012646Z layer_outputs = layer_module( 2025-08-14T21:43:36.6012899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6013044Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6013296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6013398Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6013649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6013721Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6013724Z 2025-08-14T21:43:36.6013822Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6013998Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6014057Z return mod(**inputs) 2025-08-14T21:43:36.6014321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6014382Z outputs = self.mobilebert( 2025-08-14T21:43:36.6014637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6014702Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6014953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6015024Z layer_outputs = layer_module( 2025-08-14T21:43:36.6015273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6015353Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6015635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6015750Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6016007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6016119Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6016370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6016459Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6016485Z 2025-08-14T21:43:36.6016578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6016763Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6016837Z return mod(**inputs) 2025-08-14T21:43:36.6017100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6017172Z outputs = self.mobilebert( 2025-08-14T21:43:36.6017425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6017496Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6017750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6017813Z layer_outputs = layer_module( 2025-08-14T21:43:36.6018070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6018158Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6018414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6018525Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6018780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6018862Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6018865Z 2025-08-14T21:43:36.6018956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6019138Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6019205Z return mod(**inputs) 2025-08-14T21:43:36.6019462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6019530Z outputs = self.mobilebert( 2025-08-14T21:43:36.6019786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6019855Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6020114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6020178Z layer_outputs = layer_module( 2025-08-14T21:43:36.6020430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6020523Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6020776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6020883Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6021139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6021257Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6021261Z 2025-08-14T21:43:36.6021377Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6021559Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6021624Z return mod(**inputs) 2025-08-14T21:43:36.6021880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6021945Z outputs = self.mobilebert( 2025-08-14T21:43:36.6022205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6022285Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6022541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6022617Z layer_outputs = layer_module( 2025-08-14T21:43:36.6022869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6022957Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6023210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6023323Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6023577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6023653Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6023657Z 2025-08-14T21:43:36.6023754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6023932Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6023994Z return mod(**inputs) 2025-08-14T21:43:36.6024260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6024322Z outputs = self.mobilebert( 2025-08-14T21:43:36.6024581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6024646Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6024962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6025038Z layer_outputs = layer_module( 2025-08-14T21:43:36.6025296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6025381Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6025644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6025758Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6026023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6026132Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6026386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6026476Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6026481Z 2025-08-14T21:43:36.6026574Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6026762Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6026826Z return mod(**inputs) 2025-08-14T21:43:36.6027115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6027189Z outputs = self.mobilebert( 2025-08-14T21:43:36.6027443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6027508Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6027767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6027830Z layer_outputs = layer_module( 2025-08-14T21:43:36.6028089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6028188Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6028443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6028564Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6028819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6028898Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6028901Z 2025-08-14T21:43:36.6028989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6029166Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6029232Z return mod(**inputs) 2025-08-14T21:43:36.6029487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6029551Z outputs = self.mobilebert( 2025-08-14T21:43:36.6029813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6029878Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6030138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6030202Z layer_outputs = layer_module( 2025-08-14T21:43:36.6030455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6030543Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6030798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6030905Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6031160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6031262Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6031265Z 2025-08-14T21:43:36.6031365Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6031546Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6031610Z return mod(**inputs) 2025-08-14T21:43:36.6031868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6031930Z outputs = self.mobilebert( 2025-08-14T21:43:36.6032192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6032258Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6032511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6032579Z layer_outputs = layer_module( 2025-08-14T21:43:36.6032860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6032948Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6033201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6033308Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6033567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6033661Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6033664Z 2025-08-14T21:43:36.6033763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6033943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6034019Z return mod(**inputs) 2025-08-14T21:43:36.6034284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6034347Z outputs = self.mobilebert( 2025-08-14T21:43:36.6034599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6034671Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6034923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6034993Z layer_outputs = layer_module( 2025-08-14T21:43:36.6035248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6035333Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6035597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6035710Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6035965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6036074Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6036323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6036406Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6036410Z 2025-08-14T21:43:36.6036503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6036678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6036743Z return mod(**inputs) 2025-08-14T21:43:36.6036997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6037059Z outputs = self.mobilebert( 2025-08-14T21:43:36.6037309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6037374Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6037626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6037685Z layer_outputs = layer_module( 2025-08-14T21:43:36.6037941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6038026Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6038295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6038431Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6038687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6038762Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6038772Z 2025-08-14T21:43:36.6038864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6039044Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6039110Z return mod(**inputs) 2025-08-14T21:43:36.6039366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6039446Z outputs = self.mobilebert( 2025-08-14T21:43:36.6039706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6039787Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6040047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6040111Z layer_outputs = layer_module( 2025-08-14T21:43:36.6040362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6040451Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6040704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6040801Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6041057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6041153Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6041157Z 2025-08-14T21:43:36.6041256Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6041438Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6041497Z return mod(**inputs) 2025-08-14T21:43:36.6041762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6041826Z outputs = self.mobilebert( 2025-08-14T21:43:36.6042085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6042151Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6042404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6042475Z layer_outputs = layer_module( 2025-08-14T21:43:36.6042730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6042813Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6043072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6043184Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6043444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6043521Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6043524Z 2025-08-14T21:43:36.6043614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6043798Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6043872Z return mod(**inputs) 2025-08-14T21:43:36.6044142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6044205Z outputs = self.mobilebert( 2025-08-14T21:43:36.6044453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6044520Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6044767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6044830Z layer_outputs = layer_module( 2025-08-14T21:43:36.6045102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6045185Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6045462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6045573Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6045825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6045939Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6046190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6046277Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6046282Z 2025-08-14T21:43:36.6046373Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6046551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6046618Z return mod(**inputs) 2025-08-14T21:43:36.6046874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6046943Z outputs = self.mobilebert( 2025-08-14T21:43:36.6047196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6047260Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6047515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6047578Z layer_outputs = layer_module( 2025-08-14T21:43:36.6047825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6047934Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6048188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6048267Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6048270Z 2025-08-14T21:43:36.6048360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6048532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6048594Z return mod(**inputs) 2025-08-14T21:43:36.6048843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6048913Z outputs = self.mobilebert( 2025-08-14T21:43:36.6049165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6049230Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6049489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6049569Z layer_outputs = layer_module( 2025-08-14T21:43:36.6049842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6049959Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6050214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6050320Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6050323Z 2025-08-14T21:43:36.6050415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6050610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6050674Z return mod(**inputs) 2025-08-14T21:43:36.6050932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6051020Z outputs = self.mobilebert( 2025-08-14T21:43:36.6051276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6051342Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6051599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6051662Z layer_outputs = layer_module( 2025-08-14T21:43:36.6051913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6052066Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6052319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6052413Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6052416Z 2025-08-14T21:43:36.6052509Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6052687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6052744Z return mod(**inputs) 2025-08-14T21:43:36.6052996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6053061Z outputs = self.mobilebert( 2025-08-14T21:43:36.6053309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6053371Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6053622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6053686Z layer_outputs = layer_module( 2025-08-14T21:43:36.6053940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6054088Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6054342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6054460Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6054713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6054794Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6054797Z 2025-08-14T21:43:36.6054895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6055072Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6055155Z return mod(**inputs) 2025-08-14T21:43:36.6055426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6055492Z outputs = self.mobilebert( 2025-08-14T21:43:36.6055749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6055815Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6056066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6056136Z layer_outputs = layer_module( 2025-08-14T21:43:36.6056405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6056554Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6056828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6056939Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6057196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6057270Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6057274Z 2025-08-14T21:43:36.6057370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6057549Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6057608Z return mod(**inputs) 2025-08-14T21:43:36.6057867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6057932Z outputs = self.mobilebert( 2025-08-14T21:43:36.6058186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6058245Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6058497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6058564Z layer_outputs = layer_module( 2025-08-14T21:43:36.6058818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6058959Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6059211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6059321Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6059578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6059686Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6059939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6060028Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6060032Z 2025-08-14T21:43:36.6060123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6060306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6060366Z return mod(**inputs) 2025-08-14T21:43:36.6060624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6060697Z outputs = self.mobilebert( 2025-08-14T21:43:36.6060988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6061057Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6061317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6061380Z layer_outputs = layer_module( 2025-08-14T21:43:36.6061642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6061789Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6062059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6062166Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6062439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6062513Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6062517Z 2025-08-14T21:43:36.6062608Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6062788Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6062846Z return mod(**inputs) 2025-08-14T21:43:36.6063097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6063163Z outputs = self.mobilebert( 2025-08-14T21:43:36.6063415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6063477Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6063737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6063803Z layer_outputs = layer_module( 2025-08-14T21:43:36.6064057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6064209Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6064464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6064570Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6064887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6064973Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6065236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6065322Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6065326Z 2025-08-14T21:43:36.6065426Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6065607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6065667Z return mod(**inputs) 2025-08-14T21:43:36.6065929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6065992Z outputs = self.mobilebert( 2025-08-14T21:43:36.6066243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6066318Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6066567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6066654Z layer_outputs = layer_module( 2025-08-14T21:43:36.6066921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6067006Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6067266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6067330Z self_outputs = self.self( 2025-08-14T21:43:36.6067591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6067674Z self.query(query_tensor) 2025-08-14T21:43:36.6067677Z 2025-08-14T21:43:36.6067772Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6067960Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6068037Z return mod(**inputs) 2025-08-14T21:43:36.6068295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6068366Z outputs = self.mobilebert( 2025-08-14T21:43:36.6068620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6068687Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6068939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6069003Z layer_outputs = layer_module( 2025-08-14T21:43:36.6069263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6069339Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6069602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6069662Z self_outputs = self.self( 2025-08-14T21:43:36.6069915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6069977Z self.key(key_tensor) 2025-08-14T21:43:36.6069980Z 2025-08-14T21:43:36.6070066Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6070244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6070302Z return mod(**inputs) 2025-08-14T21:43:36.6070551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6070621Z outputs = self.mobilebert( 2025-08-14T21:43:36.6070876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6070940Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6071200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6071263Z layer_outputs = layer_module( 2025-08-14T21:43:36.6071515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6071599Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6071851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6071921Z self_outputs = self.self( 2025-08-14T21:43:36.6072171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6072237Z self.value(value_tensor) 2025-08-14T21:43:36.6072240Z 2025-08-14T21:43:36.6072334Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6072424Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6072530Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6072710Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6072768Z return mod(**inputs) 2025-08-14T21:43:36.6073038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6073104Z outputs = self.mobilebert( 2025-08-14T21:43:36.6073359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6073447Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6073703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6073791Z layer_outputs = layer_module( 2025-08-14T21:43:36.6074051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6074126Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6074391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6074503Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6074762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6074839Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6074843Z 2025-08-14T21:43:36.6074934Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6075128Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6075187Z return mod(**inputs) 2025-08-14T21:43:36.6075449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6075519Z outputs = self.mobilebert( 2025-08-14T21:43:36.6075777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6075848Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6076105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6076169Z layer_outputs = layer_module( 2025-08-14T21:43:36.6076432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6076581Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6076848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6076948Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6077206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6077287Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6077290Z 2025-08-14T21:43:36.6077382Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6077566Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6077635Z return mod(**inputs) 2025-08-14T21:43:36.6077896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6077968Z outputs = self.mobilebert( 2025-08-14T21:43:36.6078277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6078342Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6078601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6078665Z layer_outputs = layer_module( 2025-08-14T21:43:36.6078925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6079000Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6079275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6079392Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6079673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6079790Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6080048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6080132Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6080135Z 2025-08-14T21:43:36.6080235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6080416Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6080480Z return mod(**inputs) 2025-08-14T21:43:36.6080743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6080808Z outputs = self.mobilebert( 2025-08-14T21:43:36.6081070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6081138Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6081393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6081466Z layer_outputs = layer_module( 2025-08-14T21:43:36.6081721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6081809Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6082070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6082172Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6082433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6082508Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6082511Z 2025-08-14T21:43:36.6082601Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6082780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6082839Z return mod(**inputs) 2025-08-14T21:43:36.6083105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6083168Z outputs = self.mobilebert( 2025-08-14T21:43:36.6083421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6083496Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6083751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6083832Z layer_outputs = layer_module( 2025-08-14T21:43:36.6084111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6084201Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6084461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6084561Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6084960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6085110Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6085113Z 2025-08-14T21:43:36.6085207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6085426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6085487Z return mod(**inputs) 2025-08-14T21:43:36.6085750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6085822Z outputs = self.mobilebert( 2025-08-14T21:43:36.6086076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6086152Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6086405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6086469Z layer_outputs = layer_module( 2025-08-14T21:43:36.6086726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6086811Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6087065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6087191Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6087451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6087536Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6087539Z 2025-08-14T21:43:36.6087633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6087818Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6087888Z return mod(**inputs) 2025-08-14T21:43:36.6088151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6088224Z outputs = self.mobilebert( 2025-08-14T21:43:36.6088488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6088552Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6088816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6088880Z layer_outputs = layer_module( 2025-08-14T21:43:36.6089139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6089230Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6089490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6089604Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6089882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6090019Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6090291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6090375Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6090379Z 2025-08-14T21:43:36.6090481Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6090663Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6090742Z return mod(**inputs) 2025-08-14T21:43:36.6091015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6091081Z outputs = self.mobilebert( 2025-08-14T21:43:36.6091364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6091439Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6091706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6091778Z layer_outputs = layer_module( 2025-08-14T21:43:36.6092041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6092127Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6092395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6092497Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6092768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6092847Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6092851Z 2025-08-14T21:43:36.6092942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6093130Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6093190Z return mod(**inputs) 2025-08-14T21:43:36.6093456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6093530Z outputs = self.mobilebert( 2025-08-14T21:43:36.6093794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6093871Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6094135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6094203Z layer_outputs = layer_module( 2025-08-14T21:43:36.6094473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6094561Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6094835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6094936Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6095198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6095304Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6095308Z 2025-08-14T21:43:36.6095402Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6095612Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6095676Z return mod(**inputs) 2025-08-14T21:43:36.6095958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6096032Z outputs = self.mobilebert( 2025-08-14T21:43:36.6096293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6096360Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6096627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6096710Z layer_outputs = layer_module( 2025-08-14T21:43:36.6096977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6097079Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6097341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6097462Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6097724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6097801Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6097812Z 2025-08-14T21:43:36.6097906Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6098090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6098161Z return mod(**inputs) 2025-08-14T21:43:36.6098429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6098494Z outputs = self.mobilebert( 2025-08-14T21:43:36.6098761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6098830Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6099092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6099159Z layer_outputs = layer_module( 2025-08-14T21:43:36.6099414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6099508Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6099764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6099879Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6100142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6100256Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6100517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6100603Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6100606Z 2025-08-14T21:43:36.6100699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6100889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6100951Z return mod(**inputs) 2025-08-14T21:43:36.6101217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6101282Z outputs = self.mobilebert( 2025-08-14T21:43:36.6101552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6101642Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6101897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6101967Z layer_outputs = layer_module( 2025-08-14T21:43:36.6102216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6102298Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6102558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6102684Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6102941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6103038Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6103043Z 2025-08-14T21:43:36.6103135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6103320Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6103379Z return mod(**inputs) 2025-08-14T21:43:36.6103635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6103704Z outputs = self.mobilebert( 2025-08-14T21:43:36.6103956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6104024Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6104271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6104336Z layer_outputs = layer_module( 2025-08-14T21:43:36.6104592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6104718Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6104975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6105076Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6105323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6105430Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6105434Z 2025-08-14T21:43:36.6105524Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6105706Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6105772Z return mod(**inputs) 2025-08-14T21:43:36.6106030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6106098Z outputs = self.mobilebert( 2025-08-14T21:43:36.6106350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6106414Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6106673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6106738Z layer_outputs = layer_module( 2025-08-14T21:43:36.6106992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6107084Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6107364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6107483Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6107732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6107803Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6107806Z 2025-08-14T21:43:36.6107907Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6108086Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6108172Z return mod(**inputs) 2025-08-14T21:43:36.6108433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6108514Z outputs = self.mobilebert( 2025-08-14T21:43:36.6108782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6108847Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6109105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6109176Z layer_outputs = layer_module( 2025-08-14T21:43:36.6109431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6109520Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6109780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6109891Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6110161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6110273Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6110538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6110620Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6110624Z 2025-08-14T21:43:36.6110717Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6110906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6110965Z return mod(**inputs) 2025-08-14T21:43:36.6111236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6111301Z outputs = self.mobilebert( 2025-08-14T21:43:36.6111561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6111635Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6111891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6111950Z layer_outputs = layer_module( 2025-08-14T21:43:36.6112211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6112322Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6112585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6112661Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6112664Z 2025-08-14T21:43:36.6112759Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6112963Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6113616Z return mod(**inputs) 2025-08-14T21:43:36.6113884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6113949Z outputs = self.mobilebert( 2025-08-14T21:43:36.6114202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6114275Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6114528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6114613Z layer_outputs = layer_module( 2025-08-14T21:43:36.6114876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6115004Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6115267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6115364Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6115368Z 2025-08-14T21:43:36.6115460Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6115646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6115704Z return mod(**inputs) 2025-08-14T21:43:36.6115967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6116032Z outputs = self.mobilebert( 2025-08-14T21:43:36.6116283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6116358Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6116610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6116674Z layer_outputs = layer_module( 2025-08-14T21:43:36.6116929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6117072Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6117331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6117415Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6117419Z 2025-08-14T21:43:36.6117510Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6117694Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6117754Z return mod(**inputs) 2025-08-14T21:43:36.6118008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6118071Z outputs = self.mobilebert( 2025-08-14T21:43:36.6118322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6118393Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6118643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6118708Z layer_outputs = layer_module( 2025-08-14T21:43:36.6118968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6119108Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6119395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6119510Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6119762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6119850Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6119853Z 2025-08-14T21:43:36.6119944Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6120128Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6120203Z return mod(**inputs) 2025-08-14T21:43:36.6120463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6120551Z outputs = self.mobilebert( 2025-08-14T21:43:36.6120804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6120867Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6121123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6121184Z layer_outputs = layer_module( 2025-08-14T21:43:36.6121443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6121584Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6121836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6121952Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6122208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6122289Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6122292Z 2025-08-14T21:43:36.6122384Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6122564Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6122630Z return mod(**inputs) 2025-08-14T21:43:36.6122886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6122959Z outputs = self.mobilebert( 2025-08-14T21:43:36.6123213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6123277Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6123541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6123605Z layer_outputs = layer_module( 2025-08-14T21:43:36.6123859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6124005Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6124258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6124375Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6124631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6124740Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6125039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6125119Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6125123Z 2025-08-14T21:43:36.6125221Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6125402Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6125459Z return mod(**inputs) 2025-08-14T21:43:36.6125722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6125784Z outputs = self.mobilebert( 2025-08-14T21:43:36.6126056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6126126Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6126395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6126466Z layer_outputs = layer_module( 2025-08-14T21:43:36.6126719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6126867Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6127126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6127230Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6127490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6127564Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6127568Z 2025-08-14T21:43:36.6127660Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6127851Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6127910Z return mod(**inputs) 2025-08-14T21:43:36.6128172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6128236Z outputs = self.mobilebert( 2025-08-14T21:43:36.6128488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6128558Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6128810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6128874Z layer_outputs = layer_module( 2025-08-14T21:43:36.6129136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6129285Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6129546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6129645Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6129899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6129985Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6130239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6130330Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6130333Z 2025-08-14T21:43:36.6130425Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6130622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6130707Z return mod(**inputs) 2025-08-14T21:43:36.6130967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6131033Z outputs = self.mobilebert( 2025-08-14T21:43:36.6131294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6131358Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6131618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6131699Z layer_outputs = layer_module( 2025-08-14T21:43:36.6131953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6132056Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6132321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6132393Z self_outputs = self.self( 2025-08-14T21:43:36.6132653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6132717Z self.query(query_tensor) 2025-08-14T21:43:36.6132720Z 2025-08-14T21:43:36.6132818Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6132999Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6133058Z return mod(**inputs) 2025-08-14T21:43:36.6133330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6133393Z outputs = self.mobilebert( 2025-08-14T21:43:36.6133660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6133724Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6133984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6134054Z layer_outputs = layer_module( 2025-08-14T21:43:36.6134317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6134401Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6134663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6134726Z self_outputs = self.self( 2025-08-14T21:43:36.6134994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6135054Z self.key(key_tensor) 2025-08-14T21:43:36.6135058Z 2025-08-14T21:43:36.6135151Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6135336Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6135394Z return mod(**inputs) 2025-08-14T21:43:36.6135665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6135727Z outputs = self.mobilebert( 2025-08-14T21:43:36.6135987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6136061Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6136320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6136399Z layer_outputs = layer_module( 2025-08-14T21:43:36.6136678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6136758Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6137018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6137080Z self_outputs = self.self( 2025-08-14T21:43:36.6137331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6137417Z self.value(value_tensor) 2025-08-14T21:43:36.6137420Z 2025-08-14T21:43:36.6137494Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6137571Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6137664Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6137866Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6137934Z return mod(**inputs) 2025-08-14T21:43:36.6138190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6138253Z outputs = self.mobilebert( 2025-08-14T21:43:36.6138513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6138578Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6138838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6138903Z layer_outputs = layer_module( 2025-08-14T21:43:36.6139154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6139241Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6139494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6139607Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6139867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6139943Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6139946Z 2025-08-14T21:43:36.6140045Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6140224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6140284Z return mod(**inputs) 2025-08-14T21:43:36.6140547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6140613Z outputs = self.mobilebert( 2025-08-14T21:43:36.6140873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6140938Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6141189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6141260Z layer_outputs = layer_module( 2025-08-14T21:43:36.6141511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6141665Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6141918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6142018Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6142309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6142385Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6142389Z 2025-08-14T21:43:36.6142480Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6142667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6142725Z return mod(**inputs) 2025-08-14T21:43:36.6142990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6143069Z outputs = self.mobilebert( 2025-08-14T21:43:36.6143326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6143416Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6143680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6143753Z layer_outputs = layer_module( 2025-08-14T21:43:36.6144012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6144088Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6144351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6144460Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6144793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6144921Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6145181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6145274Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6145278Z 2025-08-14T21:43:36.6145371Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6145550Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6145620Z return mod(**inputs) 2025-08-14T21:43:36.6145877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6145951Z outputs = self.mobilebert( 2025-08-14T21:43:36.6146208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6146275Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6146543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6146609Z layer_outputs = layer_module( 2025-08-14T21:43:36.6146862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6146958Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6147211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6147318Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6147574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6147650Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6147654Z 2025-08-14T21:43:36.6147756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6147965Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6148048Z return mod(**inputs) 2025-08-14T21:43:36.6148308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6148372Z outputs = self.mobilebert( 2025-08-14T21:43:36.6148635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6148699Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6148954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6149040Z layer_outputs = layer_module( 2025-08-14T21:43:36.6149296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6149407Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6149666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6149765Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6150023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6150123Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6150126Z 2025-08-14T21:43:36.6150226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6150406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6150467Z return mod(**inputs) 2025-08-14T21:43:36.6150735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6150805Z outputs = self.mobilebert( 2025-08-14T21:43:36.6151061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6151134Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6151390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6151462Z layer_outputs = layer_module( 2025-08-14T21:43:36.6151718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6151802Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6152066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6152179Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6152445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6152522Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6152525Z 2025-08-14T21:43:36.6152616Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6152805Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6152865Z return mod(**inputs) 2025-08-14T21:43:36.6153130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6153195Z outputs = self.mobilebert( 2025-08-14T21:43:36.6153450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6153522Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6153812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6153879Z layer_outputs = layer_module( 2025-08-14T21:43:36.6154143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6154227Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6154487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6154602Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6154875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6154993Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6155263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6155352Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6155355Z 2025-08-14T21:43:36.6155449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6155629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6155697Z return mod(**inputs) 2025-08-14T21:43:36.6155954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6156018Z outputs = self.mobilebert( 2025-08-14T21:43:36.6156279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6156343Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6156603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6156671Z layer_outputs = layer_module( 2025-08-14T21:43:36.6156922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6157014Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6157264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6157371Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6157622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6157698Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6157701Z 2025-08-14T21:43:36.6157800Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6157982Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6158042Z return mod(**inputs) 2025-08-14T21:43:36.6158306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6158370Z outputs = self.mobilebert( 2025-08-14T21:43:36.6158627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6158692Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6158945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6159018Z layer_outputs = layer_module( 2025-08-14T21:43:36.6159271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6159378Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6159650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6159751Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6160008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6160108Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6160111Z 2025-08-14T21:43:36.6160202Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6160390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6160468Z return mod(**inputs) 2025-08-14T21:43:36.6160732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6160816Z outputs = self.mobilebert( 2025-08-14T21:43:36.6161077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6161148Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6161407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6161479Z layer_outputs = layer_module( 2025-08-14T21:43:36.6161737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6161820Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6162088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6162199Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6162462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6162545Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6162548Z 2025-08-14T21:43:36.6162642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6162828Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6162886Z return mod(**inputs) 2025-08-14T21:43:36.6163147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6163219Z outputs = self.mobilebert( 2025-08-14T21:43:36.6163481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6163551Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6163813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6163879Z layer_outputs = layer_module( 2025-08-14T21:43:36.6164144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6164226Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6164483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6164600Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6164860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6164975Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6165255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6165354Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6165358Z 2025-08-14T21:43:36.6165459Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6165636Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6165702Z return mod(**inputs) 2025-08-14T21:43:36.6165959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6166024Z outputs = self.mobilebert( 2025-08-14T21:43:36.6166303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6166370Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6166635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6166724Z layer_outputs = layer_module( 2025-08-14T21:43:36.6166978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6167068Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6167320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6167420Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6167679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6167755Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6167758Z 2025-08-14T21:43:36.6167857Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6168041Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6168103Z return mod(**inputs) 2025-08-14T21:43:36.6168366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6168431Z outputs = self.mobilebert( 2025-08-14T21:43:36.6168688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6168754Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6169005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6169077Z layer_outputs = layer_module( 2025-08-14T21:43:36.6169332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6169419Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6169687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6169787Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6170048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6170147Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6170150Z 2025-08-14T21:43:36.6170241Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6170429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6170489Z return mod(**inputs) 2025-08-14T21:43:36.6170754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6170865Z outputs = self.mobilebert( 2025-08-14T21:43:36.6171139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6171214Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6171470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6171533Z layer_outputs = layer_module( 2025-08-14T21:43:36.6171798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6171881Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6172161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6172272Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6172543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6172626Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6172629Z 2025-08-14T21:43:36.6172723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6172907Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6172964Z return mod(**inputs) 2025-08-14T21:43:36.6173216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6173284Z outputs = self.mobilebert( 2025-08-14T21:43:36.6173536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6173600Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6173860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6173925Z layer_outputs = layer_module( 2025-08-14T21:43:36.6174181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6174259Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6174510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6174629Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6174881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6174998Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6175252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6175333Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6175336Z 2025-08-14T21:43:36.6175433Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6175612Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6175671Z return mod(**inputs) 2025-08-14T21:43:36.6175934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6175997Z outputs = self.mobilebert( 2025-08-14T21:43:36.6176256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6176320Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6176583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6176653Z layer_outputs = layer_module( 2025-08-14T21:43:36.6176923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6177035Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6177284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6177355Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6177359Z 2025-08-14T21:43:36.6177452Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6177646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6177703Z return mod(**inputs) 2025-08-14T21:43:36.6177956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6178032Z outputs = self.mobilebert( 2025-08-14T21:43:36.6178288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6178351Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6178602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6178683Z layer_outputs = layer_module( 2025-08-14T21:43:36.6178934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6179046Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6179299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6179399Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6179402Z 2025-08-14T21:43:36.6179500Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6179682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6179746Z return mod(**inputs) 2025-08-14T21:43:36.6180005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6180068Z outputs = self.mobilebert( 2025-08-14T21:43:36.6180329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6180394Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6180649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6180720Z layer_outputs = layer_module( 2025-08-14T21:43:36.6180977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6181128Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6181383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6181467Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6181470Z 2025-08-14T21:43:36.6181569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6181749Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6181815Z return mod(**inputs) 2025-08-14T21:43:36.6182071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6182136Z outputs = self.mobilebert( 2025-08-14T21:43:36.6182427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6182493Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6182745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6182811Z layer_outputs = layer_module( 2025-08-14T21:43:36.6183064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6183210Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6183479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6183586Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6183864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6183943Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6183946Z 2025-08-14T21:43:36.6184043Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6184218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6184276Z return mod(**inputs) 2025-08-14T21:43:36.6184537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6184734Z outputs = self.mobilebert( 2025-08-14T21:43:36.6185002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6185080Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6185349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6185422Z layer_outputs = layer_module( 2025-08-14T21:43:36.6185684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6185830Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6186101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6186216Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6186512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6186594Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6186599Z 2025-08-14T21:43:36.6186694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6186884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6186946Z return mod(**inputs) 2025-08-14T21:43:36.6187212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6187277Z outputs = self.mobilebert( 2025-08-14T21:43:36.6187551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6187623Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6187885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6187949Z layer_outputs = layer_module( 2025-08-14T21:43:36.6188253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6188425Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6188700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6188814Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6189079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6189198Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6189491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6189583Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6189609Z 2025-08-14T21:43:36.6189708Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6189897Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6189966Z return mod(**inputs) 2025-08-14T21:43:36.6190232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6190297Z outputs = self.mobilebert( 2025-08-14T21:43:36.6190564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6190630Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6190898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6190965Z layer_outputs = layer_module( 2025-08-14T21:43:36.6191227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6191389Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6191653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6191763Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6192024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6192100Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6192104Z 2025-08-14T21:43:36.6192208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6192394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6192454Z return mod(**inputs) 2025-08-14T21:43:36.6192730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6192797Z outputs = self.mobilebert( 2025-08-14T21:43:36.6193068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6193134Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6193393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6193467Z layer_outputs = layer_module( 2025-08-14T21:43:36.6193727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6193885Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6194147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6194265Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6194550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6194632Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6194903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6194987Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6194990Z 2025-08-14T21:43:36.6195085Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6195296Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6195357Z return mod(**inputs) 2025-08-14T21:43:36.6195626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6195715Z outputs = self.mobilebert( 2025-08-14T21:43:36.6195978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6196050Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6196311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6196376Z layer_outputs = layer_module( 2025-08-14T21:43:36.6196642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6196722Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6196992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6197060Z self_outputs = self.self( 2025-08-14T21:43:36.6197321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6197394Z self.query(query_tensor) 2025-08-14T21:43:36.6197398Z 2025-08-14T21:43:36.6197493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6197679Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6197747Z return mod(**inputs) 2025-08-14T21:43:36.6198012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6198083Z outputs = self.mobilebert( 2025-08-14T21:43:36.6198345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6198411Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6198682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6198749Z layer_outputs = layer_module( 2025-08-14T21:43:36.6199016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6199099Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6199352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6199421Z self_outputs = self.self( 2025-08-14T21:43:36.6199677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6199739Z self.key(key_tensor) 2025-08-14T21:43:36.6199742Z 2025-08-14T21:43:36.6199842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6200037Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6200123Z return mod(**inputs) 2025-08-14T21:43:36.6200382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6200444Z outputs = self.mobilebert( 2025-08-14T21:43:36.6200704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6200769Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6201022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6201112Z layer_outputs = layer_module( 2025-08-14T21:43:36.6201369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6201469Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6201728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6201790Z self_outputs = self.self( 2025-08-14T21:43:36.6202051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6202115Z self.value(value_tensor) 2025-08-14T21:43:36.6202118Z 2025-08-14T21:43:36.6202197Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6202266Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6202359Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6202545Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6202604Z return mod(**inputs) 2025-08-14T21:43:36.6202862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6202936Z outputs = self.mobilebert( 2025-08-14T21:43:36.6203192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6203266Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6203517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6203580Z layer_outputs = layer_module( 2025-08-14T21:43:36.6203839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6203917Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6204169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6204290Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6204545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6204626Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6204629Z 2025-08-14T21:43:36.6204719Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6204896Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6204963Z return mod(**inputs) 2025-08-14T21:43:36.6205218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6205289Z outputs = self.mobilebert( 2025-08-14T21:43:36.6205542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6205610Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6205901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6205967Z layer_outputs = layer_module( 2025-08-14T21:43:36.6206223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6206378Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6206637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6206743Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6207018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6207093Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6207111Z 2025-08-14T21:43:36.6207213Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6207394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6207459Z return mod(**inputs) 2025-08-14T21:43:36.6207719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6207782Z outputs = self.mobilebert( 2025-08-14T21:43:36.6208045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6208111Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6208374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6208439Z layer_outputs = layer_module( 2025-08-14T21:43:36.6208699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6208785Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6209044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6209155Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6209418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6209532Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6209798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6209881Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6209886Z 2025-08-14T21:43:36.6209982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6210173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6210232Z return mod(**inputs) 2025-08-14T21:43:36.6210499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6210564Z outputs = self.mobilebert( 2025-08-14T21:43:36.6210818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6210892Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6211148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6211215Z layer_outputs = layer_module( 2025-08-14T21:43:36.6211476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6211582Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6211866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6211970Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6212222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6212306Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6212309Z 2025-08-14T21:43:36.6212401Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6212607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6212668Z return mod(**inputs) 2025-08-14T21:43:36.6212927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6213016Z outputs = self.mobilebert( 2025-08-14T21:43:36.6213273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6213340Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6213599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6213664Z layer_outputs = layer_module( 2025-08-14T21:43:36.6213924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6214014Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6214269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6214379Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6214638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6214746Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6214749Z 2025-08-14T21:43:36.6214842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6215023Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6215091Z return mod(**inputs) 2025-08-14T21:43:36.6215348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6215414Z outputs = self.mobilebert( 2025-08-14T21:43:36.6215678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6215745Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6216009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6216075Z layer_outputs = layer_module( 2025-08-14T21:43:36.6216329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6216422Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6216678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6216798Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6217055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6217133Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6217137Z 2025-08-14T21:43:36.6217255Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6217453Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6217522Z return mod(**inputs) 2025-08-14T21:43:36.6217778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6217841Z outputs = self.mobilebert( 2025-08-14T21:43:36.6218105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6218173Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6218623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6219009Z layer_outputs = layer_module( 2025-08-14T21:43:36.6219408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6219816Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6220216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6220635Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6221069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6221498Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6221929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6222326Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6222471Z 2025-08-14T21:43:36.6222566Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6222909Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6223213Z return mod(**inputs) 2025-08-14T21:43:36.6223573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6223991Z outputs = self.mobilebert( 2025-08-14T21:43:36.6224430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6224887Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6225262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6225648Z layer_outputs = layer_module( 2025-08-14T21:43:36.6226027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6226433Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6226828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6227245Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6227661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6228050Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6228181Z 2025-08-14T21:43:36.6228276Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6228613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6228915Z return mod(**inputs) 2025-08-14T21:43:36.6229271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6229676Z outputs = self.mobilebert( 2025-08-14T21:43:36.6230066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6230448Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6230814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6231193Z layer_outputs = layer_module( 2025-08-14T21:43:36.6231566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6231981Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6232370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6232804Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6233217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6233623Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6233782Z 2025-08-14T21:43:36.6233876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6234204Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6234501Z return mod(**inputs) 2025-08-14T21:43:36.6234854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6235233Z outputs = self.mobilebert( 2025-08-14T21:43:36.6235599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6235972Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6236338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6236710Z layer_outputs = layer_module( 2025-08-14T21:43:36.6237076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6237464Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6237859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6238278Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6238701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6239082Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6239219Z 2025-08-14T21:43:36.6239315Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6239645Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6239941Z return mod(**inputs) 2025-08-14T21:43:36.6240297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6240673Z outputs = self.mobilebert( 2025-08-14T21:43:36.6241035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6241402Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6241773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6242145Z layer_outputs = layer_module( 2025-08-14T21:43:36.6242534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6242941Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6243338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6243761Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6244185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6244598Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6245017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6245431Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6245565Z 2025-08-14T21:43:36.6245687Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6246019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6246321Z return mod(**inputs) 2025-08-14T21:43:36.6246688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6247070Z outputs = self.mobilebert( 2025-08-14T21:43:36.6247446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6247832Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6248213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6248594Z layer_outputs = layer_module( 2025-08-14T21:43:36.6248980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6249393Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6249793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6250218Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6250639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6251032Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6251159Z 2025-08-14T21:43:36.6251253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6251585Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6251890Z return mod(**inputs) 2025-08-14T21:43:36.6252253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6252636Z outputs = self.mobilebert( 2025-08-14T21:43:36.6253010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6253393Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6253767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6254151Z layer_outputs = layer_module( 2025-08-14T21:43:36.6254526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6254934Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6255329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6255752Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6256214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6256627Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6256778Z 2025-08-14T21:43:36.6256872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6257199Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6257495Z return mod(**inputs) 2025-08-14T21:43:36.6257849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6258252Z outputs = self.mobilebert( 2025-08-14T21:43:36.6258623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6259024Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6259394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6259775Z layer_outputs = layer_module( 2025-08-14T21:43:36.6260146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6260548Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6260940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6261364Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6261795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6262178Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6262314Z 2025-08-14T21:43:36.6262412Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6262744Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6263043Z return mod(**inputs) 2025-08-14T21:43:36.6263398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6263779Z outputs = self.mobilebert( 2025-08-14T21:43:36.6264146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6264523Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6264957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6265345Z layer_outputs = layer_module( 2025-08-14T21:43:36.6265725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6266130Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6266535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6266961Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6267394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6267818Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6268252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6268655Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6268792Z 2025-08-14T21:43:36.6268897Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6269256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6269554Z return mod(**inputs) 2025-08-14T21:43:36.6269910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6270279Z outputs = self.mobilebert( 2025-08-14T21:43:36.6270640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6271013Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6271383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6271768Z layer_outputs = layer_module( 2025-08-14T21:43:36.6272139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6272581Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6273013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6273404Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6273545Z 2025-08-14T21:43:36.6273642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6273978Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6274278Z return mod(**inputs) 2025-08-14T21:43:36.6274647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6275036Z outputs = self.mobilebert( 2025-08-14T21:43:36.6275411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6275789Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6276174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6276561Z layer_outputs = layer_module( 2025-08-14T21:43:36.6276941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6277360Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6277785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6278205Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6278360Z 2025-08-14T21:43:36.6278456Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6278793Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6279102Z return mod(**inputs) 2025-08-14T21:43:36.6279467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6279848Z outputs = self.mobilebert( 2025-08-14T21:43:36.6280224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6280606Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6280984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6281365Z layer_outputs = layer_module( 2025-08-14T21:43:36.6281743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6282208Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6282704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6283103Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6283245Z 2025-08-14T21:43:36.6283340Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6283666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6283955Z return mod(**inputs) 2025-08-14T21:43:36.6284312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6284859Z outputs = self.mobilebert( 2025-08-14T21:43:36.6285241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6285663Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6286047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6286435Z layer_outputs = layer_module( 2025-08-14T21:43:36.6286793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6287246Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6287714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6288151Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6288574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6288979Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6289125Z 2025-08-14T21:43:36.6289223Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6289559Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6289854Z return mod(**inputs) 2025-08-14T21:43:36.6290219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6290607Z outputs = self.mobilebert( 2025-08-14T21:43:36.6290973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6291356Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6291738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6292122Z layer_outputs = layer_module( 2025-08-14T21:43:36.6292496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6292963Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6293430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6293860Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6294284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6294680Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6294813Z 2025-08-14T21:43:36.6294916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6295249Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6295552Z return mod(**inputs) 2025-08-14T21:43:36.6295960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6296353Z outputs = self.mobilebert( 2025-08-14T21:43:36.6296720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6297112Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6297496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6297882Z layer_outputs = layer_module( 2025-08-14T21:43:36.6298256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6298746Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6299248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6299685Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6300122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6300543Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6300963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6301351Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6301496Z 2025-08-14T21:43:36.6301595Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6301926Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6302222Z return mod(**inputs) 2025-08-14T21:43:36.6302578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6302962Z outputs = self.mobilebert( 2025-08-14T21:43:36.6303328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6303708Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6304071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6304446Z layer_outputs = layer_module( 2025-08-14T21:43:36.6304877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6305342Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6305811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6306228Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6306645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6307028Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6307164Z 2025-08-14T21:43:36.6307260Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6307595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6307897Z return mod(**inputs) 2025-08-14T21:43:36.6308256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6308642Z outputs = self.mobilebert( 2025-08-14T21:43:36.6309036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6309409Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6309797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6310178Z layer_outputs = layer_module( 2025-08-14T21:43:36.6310547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6310993Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6311457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6311893Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6312300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6312730Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6313120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6313517Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6313651Z 2025-08-14T21:43:36.6313751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6314072Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6314372Z return mod(**inputs) 2025-08-14T21:43:36.6314735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6315108Z outputs = self.mobilebert( 2025-08-14T21:43:36.6315475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6315855Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6316226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6316597Z layer_outputs = layer_module( 2025-08-14T21:43:36.6316964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6317352Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6317731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6318107Z self_outputs = self.self( 2025-08-14T21:43:36.6318472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6318849Z self.query(query_tensor) 2025-08-14T21:43:36.6318955Z 2025-08-14T21:43:36.6319050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6319379Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6319677Z return mod(**inputs) 2025-08-14T21:43:36.6320027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6320405Z outputs = self.mobilebert( 2025-08-14T21:43:36.6320763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6321139Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6321503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6321876Z layer_outputs = layer_module( 2025-08-14T21:43:36.6322255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6322662Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6323043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6323424Z self_outputs = self.self( 2025-08-14T21:43:36.6323789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6324153Z self.key(key_tensor) 2025-08-14T21:43:36.6324257Z 2025-08-14T21:43:36.6324353Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6324706Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6325001Z return mod(**inputs) 2025-08-14T21:43:36.6325351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6325752Z outputs = self.mobilebert( 2025-08-14T21:43:36.6326120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6326502Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6326869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6327245Z layer_outputs = layer_module( 2025-08-14T21:43:36.6327617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6328006Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6328398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6328779Z self_outputs = self.self( 2025-08-14T21:43:36.6329149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6329525Z self.value(value_tensor) 2025-08-14T21:43:36.6329637Z 2025-08-14T21:43:36.6329713Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6329911Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6330123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6330454Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6330753Z return mod(**inputs) 2025-08-14T21:43:36.6331112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6331489Z outputs = self.mobilebert( 2025-08-14T21:43:36.6331858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6332239Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6332607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6332987Z layer_outputs = layer_module( 2025-08-14T21:43:36.6333357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6333748Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6334130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6334558Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6334981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6335379Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6335507Z 2025-08-14T21:43:36.6335615Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6335961Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6336261Z return mod(**inputs) 2025-08-14T21:43:36.6336612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6336989Z outputs = self.mobilebert( 2025-08-14T21:43:36.6337352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6337743Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6338109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6338485Z layer_outputs = layer_module( 2025-08-14T21:43:36.6338869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6339324Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6339775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6340182Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6340587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6340967Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6341096Z 2025-08-14T21:43:36.6341189Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6341514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6341811Z return mod(**inputs) 2025-08-14T21:43:36.6342163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6342540Z outputs = self.mobilebert( 2025-08-14T21:43:36.6342900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6343277Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6343638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6344010Z layer_outputs = layer_module( 2025-08-14T21:43:36.6344376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6344822Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6345214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6345643Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6346065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6346484Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6346910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6347307Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6347444Z 2025-08-14T21:43:36.6347549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6347872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6348174Z return mod(**inputs) 2025-08-14T21:43:36.6348558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6348950Z outputs = self.mobilebert( 2025-08-14T21:43:36.6349320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6349698Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6350073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6350449Z layer_outputs = layer_module( 2025-08-14T21:43:36.6350820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6351251Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6351645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6352071Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6352482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6352875Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6353003Z 2025-08-14T21:43:36.6353103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6353426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6353722Z return mod(**inputs) 2025-08-14T21:43:36.6354079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6354450Z outputs = self.mobilebert( 2025-08-14T21:43:36.6354813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6355188Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6355562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6355928Z layer_outputs = layer_module( 2025-08-14T21:43:36.6356293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6356690Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6357075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6357490Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6357897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6358309Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6358461Z 2025-08-14T21:43:36.6358556Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6358883Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6359181Z return mod(**inputs) 2025-08-14T21:43:36.6359540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6359911Z outputs = self.mobilebert( 2025-08-14T21:43:36.6360274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6360651Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6361015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6361390Z layer_outputs = layer_module( 2025-08-14T21:43:36.6361792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6362195Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6362583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6363013Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6363443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6363838Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6363985Z 2025-08-14T21:43:36.6364079Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6364404Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6364723Z return mod(**inputs) 2025-08-14T21:43:36.6365082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6365467Z outputs = self.mobilebert( 2025-08-14T21:43:36.6365832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6366210Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6366579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6366957Z layer_outputs = layer_module( 2025-08-14T21:43:36.6367329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6367732Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6368121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6368551Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6368980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6369397Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6369823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6370221Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6370356Z 2025-08-14T21:43:36.6370458Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6370781Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6371079Z return mod(**inputs) 2025-08-14T21:43:36.6371441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6371823Z outputs = self.mobilebert( 2025-08-14T21:43:36.6372184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6372562Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6372934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6373304Z layer_outputs = layer_module( 2025-08-14T21:43:36.6373677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6374080Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6374476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6374907Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6375335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6375725Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6375851Z 2025-08-14T21:43:36.6375951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6376271Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6376566Z return mod(**inputs) 2025-08-14T21:43:36.6376921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6377311Z outputs = self.mobilebert( 2025-08-14T21:43:36.6377675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6378072Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6378441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6378810Z layer_outputs = layer_module( 2025-08-14T21:43:36.6379181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6379577Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6379973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6380382Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6380792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6381207Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6381361Z 2025-08-14T21:43:36.6381454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6381786Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6382083Z return mod(**inputs) 2025-08-14T21:43:36.6382441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6382817Z outputs = self.mobilebert( 2025-08-14T21:43:36.6383182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6383560Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6383928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6384296Z layer_outputs = layer_module( 2025-08-14T21:43:36.6384874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6385291Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6385690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6386133Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6386571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6386955Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6387083Z 2025-08-14T21:43:36.6387177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6387509Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6387808Z return mod(**inputs) 2025-08-14T21:43:36.6388228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6388605Z outputs = self.mobilebert( 2025-08-14T21:43:36.6388969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6389347Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6389711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6390088Z layer_outputs = layer_module( 2025-08-14T21:43:36.6390458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6390878Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6391267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6391715Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6392137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6392552Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6392960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6393351Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6393484Z 2025-08-14T21:43:36.6393585Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6393909Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6394203Z return mod(**inputs) 2025-08-14T21:43:36.6394564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6394944Z outputs = self.mobilebert( 2025-08-14T21:43:36.6395301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6395673Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6396043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6396415Z layer_outputs = layer_module( 2025-08-14T21:43:36.6396774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6397170Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6397566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6397972Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6398383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6398766Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6398892Z 2025-08-14T21:43:36.6398992Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6399311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6399607Z return mod(**inputs) 2025-08-14T21:43:36.6399963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6400337Z outputs = self.mobilebert( 2025-08-14T21:43:36.6400691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6401086Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6401483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6401854Z layer_outputs = layer_module( 2025-08-14T21:43:36.6402219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6402613Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6403004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6403403Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6403835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6404260Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6404411Z 2025-08-14T21:43:36.6404514Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6404695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6404756Z return mod(**inputs) 2025-08-14T21:43:36.6405023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6405087Z outputs = self.mobilebert( 2025-08-14T21:43:36.6405338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6405413Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6405667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6405737Z layer_outputs = layer_module( 2025-08-14T21:43:36.6405992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6406077Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6406335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6406445Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6406705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6406781Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6406785Z 2025-08-14T21:43:36.6406878Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6407063Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6407123Z return mod(**inputs) 2025-08-14T21:43:36.6407379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6407449Z outputs = self.mobilebert( 2025-08-14T21:43:36.6407703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6407776Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6408027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6408090Z layer_outputs = layer_module( 2025-08-14T21:43:36.6408349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6408434Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6408705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6408832Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6409087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6409202Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6409455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6409546Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6409549Z 2025-08-14T21:43:36.6409642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6409839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6409904Z return mod(**inputs) 2025-08-14T21:43:36.6410179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6410244Z outputs = self.mobilebert( 2025-08-14T21:43:36.6410504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6410569Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6410826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6410889Z layer_outputs = layer_module( 2025-08-14T21:43:36.6411141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6411257Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6411509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6411594Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6411598Z 2025-08-14T21:43:36.6411690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6411870Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6411935Z return mod(**inputs) 2025-08-14T21:43:36.6412192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6412257Z outputs = self.mobilebert( 2025-08-14T21:43:36.6412516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6412582Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6412842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6412909Z layer_outputs = layer_module( 2025-08-14T21:43:36.6413163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6413279Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6413530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6413637Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6413640Z 2025-08-14T21:43:36.6413730Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6413909Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6413976Z return mod(**inputs) 2025-08-14T21:43:36.6414232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6414299Z outputs = self.mobilebert( 2025-08-14T21:43:36.6414592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6414660Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6414919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6414983Z layer_outputs = layer_module( 2025-08-14T21:43:36.6415234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6415385Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6415654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6415746Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6415766Z 2025-08-14T21:43:36.6415863Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6416044Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6416110Z return mod(**inputs) 2025-08-14T21:43:36.6416369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6416431Z outputs = self.mobilebert( 2025-08-14T21:43:36.6416691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6416755Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6417018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6417081Z layer_outputs = layer_module( 2025-08-14T21:43:36.6417340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6417491Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6417747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6417863Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6418117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6418201Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6418205Z 2025-08-14T21:43:36.6418303Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6418485Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6418551Z return mod(**inputs) 2025-08-14T21:43:36.6418810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6418874Z outputs = self.mobilebert( 2025-08-14T21:43:36.6419135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6419200Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6419454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6419525Z layer_outputs = layer_module( 2025-08-14T21:43:36.6419779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6419927Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6420198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6420327Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6420590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6420666Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6420670Z 2025-08-14T21:43:36.6420767Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6420947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6421005Z return mod(**inputs) 2025-08-14T21:43:36.6421286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6421351Z outputs = self.mobilebert( 2025-08-14T21:43:36.6421624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6421698Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6421957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6422027Z layer_outputs = layer_module( 2025-08-14T21:43:36.6422287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6422427Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6422695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6422809Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6423078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6423191Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6423451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6423541Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6423544Z 2025-08-14T21:43:36.6423637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6423828Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6423887Z return mod(**inputs) 2025-08-14T21:43:36.6424152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6424225Z outputs = self.mobilebert( 2025-08-14T21:43:36.6424486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6424554Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6424883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6424953Z layer_outputs = layer_module( 2025-08-14T21:43:36.6425213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6425361Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6425614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6425726Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6426006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6426095Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6426118Z 2025-08-14T21:43:36.6426214Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6426396Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6426463Z return mod(**inputs) 2025-08-14T21:43:36.6426723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6426788Z outputs = self.mobilebert( 2025-08-14T21:43:36.6427055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6427141Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6427402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6427487Z layer_outputs = layer_module( 2025-08-14T21:43:36.6427746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6427899Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6428158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6428264Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6428520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6428600Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6428862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6428948Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6428951Z 2025-08-14T21:43:36.6429045Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6429235Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6429294Z return mod(**inputs) 2025-08-14T21:43:36.6429563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6429625Z outputs = self.mobilebert( 2025-08-14T21:43:36.6429881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6429955Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6430213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6430285Z layer_outputs = layer_module( 2025-08-14T21:43:36.6430543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6430620Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6430886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6430952Z self_outputs = self.self( 2025-08-14T21:43:36.6431209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6431282Z self.query(query_tensor) 2025-08-14T21:43:36.6431287Z 2025-08-14T21:43:36.6431379Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6431564Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6431624Z return mod(**inputs) 2025-08-14T21:43:36.6431912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6431986Z outputs = self.mobilebert( 2025-08-14T21:43:36.6432240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6432315Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6432567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6432632Z layer_outputs = layer_module( 2025-08-14T21:43:36.6432893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6432989Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6433245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6433338Z self_outputs = self.self( 2025-08-14T21:43:36.6433598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6433665Z self.key(key_tensor) 2025-08-14T21:43:36.6433668Z 2025-08-14T21:43:36.6433762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6433940Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6434007Z return mod(**inputs) 2025-08-14T21:43:36.6434265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6434338Z outputs = self.mobilebert( 2025-08-14T21:43:36.6434592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6434659Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6434921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6434985Z layer_outputs = layer_module( 2025-08-14T21:43:36.6435240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6435326Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6435580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6435651Z self_outputs = self.self( 2025-08-14T21:43:36.6435906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6435970Z self.value(value_tensor) 2025-08-14T21:43:36.6435975Z 2025-08-14T21:43:36.6436057Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6436132Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6436226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6436415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6436475Z return mod(**inputs) 2025-08-14T21:43:36.6436739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6436803Z outputs = self.mobilebert( 2025-08-14T21:43:36.6437057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6437129Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6437384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6437454Z layer_outputs = layer_module( 2025-08-14T21:43:36.6437739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6437817Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6438077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6438189Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6438441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6438524Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6438543Z 2025-08-14T21:43:36.6438637Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6438824Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6438900Z return mod(**inputs) 2025-08-14T21:43:36.6439167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6439239Z outputs = self.mobilebert( 2025-08-14T21:43:36.6439501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6439572Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6439832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6439896Z layer_outputs = layer_module( 2025-08-14T21:43:36.6440165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6440312Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6440578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6440686Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6440946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6441027Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6441031Z 2025-08-14T21:43:36.6441122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6441304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6441373Z return mod(**inputs) 2025-08-14T21:43:36.6441638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6441709Z outputs = self.mobilebert( 2025-08-14T21:43:36.6441972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6442039Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6442307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6442370Z layer_outputs = layer_module( 2025-08-14T21:43:36.6442629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6442711Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6442976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6443097Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6443528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6443664Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6443928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6444013Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6444016Z 2025-08-14T21:43:36.6444116Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6444298Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6444357Z return mod(**inputs) 2025-08-14T21:43:36.6444620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6444704Z outputs = self.mobilebert( 2025-08-14T21:43:36.6444964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6445064Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6445319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6445390Z layer_outputs = layer_module( 2025-08-14T21:43:36.6445643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6445735Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6445992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6446095Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6446354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6446431Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6446434Z 2025-08-14T21:43:36.6446526Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6446712Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6446770Z return mod(**inputs) 2025-08-14T21:43:36.6447026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6447096Z outputs = self.mobilebert( 2025-08-14T21:43:36.6447346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6447418Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6447669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6447734Z layer_outputs = layer_module( 2025-08-14T21:43:36.6447994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6448080Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6448338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6448437Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6448689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6448799Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6448803Z 2025-08-14T21:43:36.6448893Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6449077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6449137Z return mod(**inputs) 2025-08-14T21:43:36.6449423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6449498Z outputs = self.mobilebert( 2025-08-14T21:43:36.6449756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6449820Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6450084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6450148Z layer_outputs = layer_module( 2025-08-14T21:43:36.6450423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6450508Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6450779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6450901Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6451156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6451236Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6451239Z 2025-08-14T21:43:36.6451330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6451506Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6451570Z return mod(**inputs) 2025-08-14T21:43:36.6451829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6451893Z outputs = self.mobilebert( 2025-08-14T21:43:36.6452154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6452220Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6452477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6452539Z layer_outputs = layer_module( 2025-08-14T21:43:36.6452790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6452879Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6466964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6467219Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6467540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6467666Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6467937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6468039Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6468045Z 2025-08-14T21:43:36.6468150Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6468345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6468422Z return mod(**inputs) 2025-08-14T21:43:36.6468689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6468772Z outputs = self.mobilebert( 2025-08-14T21:43:36.6469100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6469179Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6469468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6469540Z layer_outputs = layer_module( 2025-08-14T21:43:36.6469809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6469898Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6470158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6470329Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6470586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6470691Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6470701Z 2025-08-14T21:43:36.6470801Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6470989Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6471060Z return mod(**inputs) 2025-08-14T21:43:36.6471320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6471386Z outputs = self.mobilebert( 2025-08-14T21:43:36.6471650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6471720Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6471981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6472050Z layer_outputs = layer_module( 2025-08-14T21:43:36.6472309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6472404Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6472661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6472764Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6473026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6473127Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6473133Z 2025-08-14T21:43:36.6473233Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6473414Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6473476Z return mod(**inputs) 2025-08-14T21:43:36.6473745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6473810Z outputs = self.mobilebert( 2025-08-14T21:43:36.6474071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6474137Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6474394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6474465Z layer_outputs = layer_module( 2025-08-14T21:43:36.6474722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6474807Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6475088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6475225Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6475486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6475566Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6475570Z 2025-08-14T21:43:36.6475665Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6475856Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6475918Z return mod(**inputs) 2025-08-14T21:43:36.6476204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6476268Z outputs = self.mobilebert( 2025-08-14T21:43:36.6476542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6476618Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6476875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6476949Z layer_outputs = layer_module( 2025-08-14T21:43:36.6477203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6477289Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6477552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6477668Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6477925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6478046Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6478303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6478395Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6478399Z 2025-08-14T21:43:36.6478493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6478674Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6478744Z return mod(**inputs) 2025-08-14T21:43:36.6479003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6479076Z outputs = self.mobilebert( 2025-08-14T21:43:36.6479333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6479400Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6479663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6479727Z layer_outputs = layer_module( 2025-08-14T21:43:36.6479980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6480071Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6480326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6480435Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6480688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6480780Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6480784Z 2025-08-14T21:43:36.6480902Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6481082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6481147Z return mod(**inputs) 2025-08-14T21:43:36.6481403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6481466Z outputs = self.mobilebert( 2025-08-14T21:43:36.6481725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6481819Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6482075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6482162Z layer_outputs = layer_module( 2025-08-14T21:43:36.6482420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6482512Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6482766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6482864Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6483125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6483226Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6483231Z 2025-08-14T21:43:36.6483330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6483509Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6483569Z return mod(**inputs) 2025-08-14T21:43:36.6483837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6483901Z outputs = self.mobilebert( 2025-08-14T21:43:36.6484151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6484225Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6484480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6484550Z layer_outputs = layer_module( 2025-08-14T21:43:36.6485010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6485101Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6485375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6485492Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6485766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6485843Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6485847Z 2025-08-14T21:43:36.6485941Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6486138Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6486199Z return mod(**inputs) 2025-08-14T21:43:36.6486475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6486548Z outputs = self.mobilebert( 2025-08-14T21:43:36.6486847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6486946Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6487198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6487261Z layer_outputs = layer_module( 2025-08-14T21:43:36.6487527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6487611Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6487878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6488020Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6488280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6488430Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6488692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6488784Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6488787Z 2025-08-14T21:43:36.6488882Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6489066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6489133Z return mod(**inputs) 2025-08-14T21:43:36.6489397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6489465Z outputs = self.mobilebert( 2025-08-14T21:43:36.6489733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6489804Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6490071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6490137Z layer_outputs = layer_module( 2025-08-14T21:43:36.6490400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6490519Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6490779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6490867Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6490870Z 2025-08-14T21:43:36.6490963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6491148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6491216Z return mod(**inputs) 2025-08-14T21:43:36.6491484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6491551Z outputs = self.mobilebert( 2025-08-14T21:43:36.6491816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6491883Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6492151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6492217Z layer_outputs = layer_module( 2025-08-14T21:43:36.6492477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6492597Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6492887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6493000Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6493004Z 2025-08-14T21:43:36.6493098Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6493282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6493351Z return mod(**inputs) 2025-08-14T21:43:36.6493624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6493709Z outputs = self.mobilebert( 2025-08-14T21:43:36.6493977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6494060Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6494330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6494397Z layer_outputs = layer_module( 2025-08-14T21:43:36.6494657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6494813Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6495074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6495169Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6495174Z 2025-08-14T21:43:36.6495266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6495452Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6495522Z return mod(**inputs) 2025-08-14T21:43:36.6495788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6495853Z outputs = self.mobilebert( 2025-08-14T21:43:36.6496118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6496184Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6496447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6496511Z layer_outputs = layer_module( 2025-08-14T21:43:36.6496769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6496925Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6497186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6497309Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6497566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6497651Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6497655Z 2025-08-14T21:43:36.6497754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6497938Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6498007Z return mod(**inputs) 2025-08-14T21:43:36.6498269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6498332Z outputs = self.mobilebert( 2025-08-14T21:43:36.6498636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6498732Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6498994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6499067Z layer_outputs = layer_module( 2025-08-14T21:43:36.6499328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6499480Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6499739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6499876Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6500168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6500246Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6500250Z 2025-08-14T21:43:36.6500349Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6500528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6500588Z return mod(**inputs) 2025-08-14T21:43:36.6500851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6500915Z outputs = self.mobilebert( 2025-08-14T21:43:36.6501165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6501238Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6501495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6501569Z layer_outputs = layer_module( 2025-08-14T21:43:36.6501826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6501968Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6502230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6502342Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6502600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6502711Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6502966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6503060Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6503063Z 2025-08-14T21:43:36.6503155Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6503334Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6503401Z return mod(**inputs) 2025-08-14T21:43:36.6503657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6503728Z outputs = self.mobilebert( 2025-08-14T21:43:36.6503982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6504047Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6504305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6504383Z layer_outputs = layer_module( 2025-08-14T21:43:36.6504733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6504898Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6505152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6505258Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6505509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6505604Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6505613Z 2025-08-14T21:43:36.6505705Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6505903Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6505973Z return mod(**inputs) 2025-08-14T21:43:36.6506232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6506295Z outputs = self.mobilebert( 2025-08-14T21:43:36.6506557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6506622Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6506886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6506950Z layer_outputs = layer_module( 2025-08-14T21:43:36.6507205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6507361Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6507620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6507724Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6507980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6508059Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6508321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6508403Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6508406Z 2025-08-14T21:43:36.6508497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6508680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6508740Z return mod(**inputs) 2025-08-14T21:43:36.6509003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6509067Z outputs = self.mobilebert( 2025-08-14T21:43:36.6509319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6509390Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6509643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6509715Z layer_outputs = layer_module( 2025-08-14T21:43:36.6509967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6510047Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6510339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6510408Z self_outputs = self.self( 2025-08-14T21:43:36.6510664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6510736Z self.query(query_tensor) 2025-08-14T21:43:36.6510739Z 2025-08-14T21:43:36.6510831Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6511017Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6511076Z return mod(**inputs) 2025-08-14T21:43:36.6511351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6511420Z outputs = self.mobilebert( 2025-08-14T21:43:36.6511695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6511761Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6512023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6512086Z layer_outputs = layer_module( 2025-08-14T21:43:36.6512343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6512421Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6512675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6512746Z self_outputs = self.self( 2025-08-14T21:43:36.6512999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6513070Z self.key(key_tensor) 2025-08-14T21:43:36.6513073Z 2025-08-14T21:43:36.6513167Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6513343Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6513410Z return mod(**inputs) 2025-08-14T21:43:36.6513663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6513726Z outputs = self.mobilebert( 2025-08-14T21:43:36.6513986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6514052Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6514312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6514375Z layer_outputs = layer_module( 2025-08-14T21:43:36.6514627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6514711Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6514964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6515033Z self_outputs = self.self( 2025-08-14T21:43:36.6515285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6515350Z self.value(value_tensor) 2025-08-14T21:43:36.6515353Z 2025-08-14T21:43:36.6515435Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6515507Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6515602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6515788Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6515865Z return mod(**inputs) 2025-08-14T21:43:36.6516144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6516210Z outputs = self.mobilebert( 2025-08-14T21:43:36.6516465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6516539Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6516792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6516873Z layer_outputs = layer_module( 2025-08-14T21:43:36.6517134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6517226Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6517485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6517598Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6517851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6517931Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6517935Z 2025-08-14T21:43:36.6518026Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6518207Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6518266Z return mod(**inputs) 2025-08-14T21:43:36.6518520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6518589Z outputs = self.mobilebert( 2025-08-14T21:43:36.6518843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6518907Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6519168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6519231Z layer_outputs = layer_module( 2025-08-14T21:43:36.6519490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6519635Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6519891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6519994Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6520250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6520329Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6520332Z 2025-08-14T21:43:36.6520423Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6520600Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6520664Z return mod(**inputs) 2025-08-14T21:43:36.6520917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6520983Z outputs = self.mobilebert( 2025-08-14T21:43:36.6521237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6521302Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6521577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6521656Z layer_outputs = layer_module( 2025-08-14T21:43:36.6521911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6521994Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6522247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6522362Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6522619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6522753Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6523015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6523118Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6523122Z 2025-08-14T21:43:36.6523219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6523395Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6523453Z return mod(**inputs) 2025-08-14T21:43:36.6523711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6523772Z outputs = self.mobilebert( 2025-08-14T21:43:36.6524026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6524100Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6524350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6524420Z layer_outputs = layer_module( 2025-08-14T21:43:36.6524675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6524760Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6525019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6525118Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6525375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6525452Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6525455Z 2025-08-14T21:43:36.6525543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6525724Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6525783Z return mod(**inputs) 2025-08-14T21:43:36.6526039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6526106Z outputs = self.mobilebert( 2025-08-14T21:43:36.6526359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6526429Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6526690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6526754Z layer_outputs = layer_module( 2025-08-14T21:43:36.6527013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6527098Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6527389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6527497Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6527754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6527860Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6527863Z 2025-08-14T21:43:36.6527955Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6528133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6528217Z return mod(**inputs) 2025-08-14T21:43:36.6528479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6528559Z outputs = self.mobilebert( 2025-08-14T21:43:36.6528823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6528886Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6529144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6529207Z layer_outputs = layer_module( 2025-08-14T21:43:36.6529460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6529551Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6529807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6529927Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6530183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6530260Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6530264Z 2025-08-14T21:43:36.6530361Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6530539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6530597Z return mod(**inputs) 2025-08-14T21:43:36.6530861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6530923Z outputs = self.mobilebert( 2025-08-14T21:43:36.6531183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6531247Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6531500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6531571Z layer_outputs = layer_module( 2025-08-14T21:43:36.6531824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6531916Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6532169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6532283Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6532542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6532656Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6532907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6533013Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6533031Z 2025-08-14T21:43:36.6533123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6533307Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6533366Z return mod(**inputs) 2025-08-14T21:43:36.6533622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6533692Z outputs = self.mobilebert( 2025-08-14T21:43:36.6533945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6534033Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6534289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6534374Z layer_outputs = layer_module( 2025-08-14T21:43:36.6534632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6534718Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6534969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6535075Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6535326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6535407Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6535411Z 2025-08-14T21:43:36.6535501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6535678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6535747Z return mod(**inputs) 2025-08-14T21:43:36.6536003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6536071Z outputs = self.mobilebert( 2025-08-14T21:43:36.6536321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6536385Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6536642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6536706Z layer_outputs = layer_module( 2025-08-14T21:43:36.6536956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6537047Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6537301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6537408Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6537660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6537758Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6537762Z 2025-08-14T21:43:36.6537861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6538040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6538138Z return mod(**inputs) 2025-08-14T21:43:36.6538394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6538460Z outputs = self.mobilebert( 2025-08-14T21:43:36.6538750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6538818Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6539077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6539148Z layer_outputs = layer_module( 2025-08-14T21:43:36.6539404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6539497Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6539752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6539883Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6540163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6540239Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6540242Z 2025-08-14T21:43:36.6540338Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6540516Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6540576Z return mod(**inputs) 2025-08-14T21:43:36.6540835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6540897Z outputs = self.mobilebert( 2025-08-14T21:43:36.6541157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6541223Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6541477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6541548Z layer_outputs = layer_module( 2025-08-14T21:43:36.6541799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6541881Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6542138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6542249Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6542505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6542615Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6542867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6542959Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6542963Z 2025-08-14T21:43:36.6543053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6543236Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6543294Z return mod(**inputs) 2025-08-14T21:43:36.6543547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6543617Z outputs = self.mobilebert( 2025-08-14T21:43:36.6543869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6543935Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6544194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6544273Z layer_outputs = layer_module( 2025-08-14T21:43:36.6544549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6544635Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6544965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6545079Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6545334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6545440Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6545444Z 2025-08-14T21:43:36.6545536Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6545715Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6545814Z return mod(**inputs) 2025-08-14T21:43:36.6546078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6546143Z outputs = self.mobilebert( 2025-08-14T21:43:36.6546409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6546474Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6546736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6546800Z layer_outputs = layer_module( 2025-08-14T21:43:36.6547054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6547145Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6547406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6547508Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6547762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6547859Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6547863Z 2025-08-14T21:43:36.6547958Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6548136Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6548196Z return mod(**inputs) 2025-08-14T21:43:36.6548461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6548526Z outputs = self.mobilebert( 2025-08-14T21:43:36.6548788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6548853Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6549108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6549177Z layer_outputs = layer_module( 2025-08-14T21:43:36.6549428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6549519Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6549770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6549883Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6550163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6550254Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6550258Z 2025-08-14T21:43:36.6550360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6550538Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6550598Z return mod(**inputs) 2025-08-14T21:43:36.6550861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6550925Z outputs = self.mobilebert( 2025-08-14T21:43:36.6551176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6551265Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6551519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6551606Z layer_outputs = layer_module( 2025-08-14T21:43:36.6551868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6551952Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6552213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6552324Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6552579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6552696Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6552947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6553037Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6553042Z 2025-08-14T21:43:36.6553133Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6553311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6553375Z return mod(**inputs) 2025-08-14T21:43:36.6553630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6553701Z outputs = self.mobilebert( 2025-08-14T21:43:36.6553954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6554022Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6554282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6554350Z layer_outputs = layer_module( 2025-08-14T21:43:36.6554611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6554719Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6554971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6555051Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6555055Z 2025-08-14T21:43:36.6555146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6555324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6555390Z return mod(**inputs) 2025-08-14T21:43:36.6555644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6555729Z outputs = self.mobilebert( 2025-08-14T21:43:36.6555997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6556064Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6556323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6556385Z layer_outputs = layer_module( 2025-08-14T21:43:36.6556642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6556749Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6557022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6557125Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6557145Z 2025-08-14T21:43:36.6557238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6557420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6557484Z return mod(**inputs) 2025-08-14T21:43:36.6557742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6557810Z outputs = self.mobilebert( 2025-08-14T21:43:36.6558064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6558128Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6558391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6558453Z layer_outputs = layer_module( 2025-08-14T21:43:36.6558717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6558864Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6559119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6559212Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6559215Z 2025-08-14T21:43:36.6559308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6559485Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6559551Z return mod(**inputs) 2025-08-14T21:43:36.6559810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6559878Z outputs = self.mobilebert( 2025-08-14T21:43:36.6560136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6560201Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6560460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6560522Z layer_outputs = layer_module( 2025-08-14T21:43:36.6560782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6560925Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6561182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6561298Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6561567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6561695Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6561706Z 2025-08-14T21:43:36.6561798Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6561978Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6562044Z return mod(**inputs) 2025-08-14T21:43:36.6562300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6562362Z outputs = self.mobilebert( 2025-08-14T21:43:36.6562639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6562702Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6562963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6563046Z layer_outputs = layer_module( 2025-08-14T21:43:36.6563303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6563451Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6563702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6563816Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6564066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6564143Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6564147Z 2025-08-14T21:43:36.6564246Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6564429Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6564487Z return mod(**inputs) 2025-08-14T21:43:36.6564749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6564811Z outputs = self.mobilebert( 2025-08-14T21:43:36.6565068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6565132Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6565383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6565455Z layer_outputs = layer_module( 2025-08-14T21:43:36.6565706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6565854Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6566106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6566215Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6566472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6566582Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6566834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6566924Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6566928Z 2025-08-14T21:43:36.6567018Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6567218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6567298Z return mod(**inputs) 2025-08-14T21:43:36.6567555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6567624Z outputs = self.mobilebert( 2025-08-14T21:43:36.6567877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6567947Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6568200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6568287Z layer_outputs = layer_module( 2025-08-14T21:43:36.6568548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6568712Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6568974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6569073Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6569325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6569404Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6569407Z 2025-08-14T21:43:36.6569497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6569678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6569742Z return mod(**inputs) 2025-08-14T21:43:36.6569997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6570071Z outputs = self.mobilebert( 2025-08-14T21:43:36.6570325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6570390Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6570650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6570711Z layer_outputs = layer_module( 2025-08-14T21:43:36.6570970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6571117Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6571371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6571479Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6571733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6571810Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6572068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6572150Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6572154Z 2025-08-14T21:43:36.6572252Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6572430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6572489Z return mod(**inputs) 2025-08-14T21:43:36.6572753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6572818Z outputs = self.mobilebert( 2025-08-14T21:43:36.6573105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6573172Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6573424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6573492Z layer_outputs = layer_module( 2025-08-14T21:43:36.6573742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6573816Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6574093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6574156Z self_outputs = self.self( 2025-08-14T21:43:36.6574438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6574503Z self.query(query_tensor) 2025-08-14T21:43:36.6574507Z 2025-08-14T21:43:36.6574597Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6574779Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6574836Z return mod(**inputs) 2025-08-14T21:43:36.6575098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6575158Z outputs = self.mobilebert( 2025-08-14T21:43:36.6575409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6575480Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6575734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6575797Z layer_outputs = layer_module( 2025-08-14T21:43:36.6576060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6576135Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6576396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6576457Z self_outputs = self.self( 2025-08-14T21:43:36.6576711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6576780Z self.key(key_tensor) 2025-08-14T21:43:36.6576783Z 2025-08-14T21:43:36.6576871Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6577056Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6577116Z return mod(**inputs) 2025-08-14T21:43:36.6577373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6577441Z outputs = self.mobilebert( 2025-08-14T21:43:36.6577694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6577756Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6578016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6578078Z layer_outputs = layer_module( 2025-08-14T21:43:36.6578338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6578413Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6578680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6578766Z self_outputs = self.self( 2025-08-14T21:43:36.6579020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6579082Z self.value(value_tensor) 2025-08-14T21:43:36.6579092Z 2025-08-14T21:43:36.6579163Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6579233Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6579329Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6579506Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6579583Z return mod(**inputs) 2025-08-14T21:43:36.6579852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6579932Z outputs = self.mobilebert( 2025-08-14T21:43:36.6580196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6580260Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6580517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6580586Z layer_outputs = layer_module( 2025-08-14T21:43:36.6580839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6580914Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6581174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6581288Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6581556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6581631Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6581635Z 2025-08-14T21:43:36.6581725Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6581910Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6581969Z return mod(**inputs) 2025-08-14T21:43:36.6582233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6582297Z outputs = self.mobilebert( 2025-08-14T21:43:36.6582552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6582623Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6582880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6582943Z layer_outputs = layer_module( 2025-08-14T21:43:36.6583203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6583348Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6583607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6583704Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6583960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6584040Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6584045Z 2025-08-14T21:43:36.6584135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6584347Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6584410Z return mod(**inputs) 2025-08-14T21:43:36.6584823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6584901Z outputs = self.mobilebert( 2025-08-14T21:43:36.6585166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6585231Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6585498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6585605Z layer_outputs = layer_module( 2025-08-14T21:43:36.6585875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6585980Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6586252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6586371Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6586629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6586753Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6587009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6587095Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6587098Z 2025-08-14T21:43:36.6587202Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6587389Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6587452Z return mod(**inputs) 2025-08-14T21:43:36.6587726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6587789Z outputs = self.mobilebert( 2025-08-14T21:43:36.6588059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6588125Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6588388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6588461Z layer_outputs = layer_module( 2025-08-14T21:43:36.6588722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6588819Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6589080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6589183Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6589449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6589525Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6589528Z 2025-08-14T21:43:36.6589628Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6589811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6589872Z return mod(**inputs) 2025-08-14T21:43:36.6590142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6590206Z outputs = self.mobilebert( 2025-08-14T21:43:36.6590515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6590591Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6590850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6590922Z layer_outputs = layer_module( 2025-08-14T21:43:36.6591181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6591266Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6591550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6591652Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6591930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6592040Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6592044Z 2025-08-14T21:43:36.6592136Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6592327Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6592386Z return mod(**inputs) 2025-08-14T21:43:36.6592646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6592718Z outputs = self.mobilebert( 2025-08-14T21:43:36.6592980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6593053Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6593313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6593379Z layer_outputs = layer_module( 2025-08-14T21:43:36.6593644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6593728Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6593985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6594106Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6594365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6594451Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6594454Z 2025-08-14T21:43:36.6594549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6594732Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6594798Z return mod(**inputs) 2025-08-14T21:43:36.6595061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6595133Z outputs = self.mobilebert( 2025-08-14T21:43:36.6595391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6595457Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6595723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6595788Z layer_outputs = layer_module( 2025-08-14T21:43:36.6596052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6596154Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6596432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6596556Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6596815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6596929Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6597200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6597303Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6597306Z 2025-08-14T21:43:36.6597410Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6597611Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6597674Z return mod(**inputs) 2025-08-14T21:43:36.6597949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6598013Z outputs = self.mobilebert( 2025-08-14T21:43:36.6598272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6598337Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6598589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6598662Z layer_outputs = layer_module( 2025-08-14T21:43:36.6598914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6599001Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6599261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6599359Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6599618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6599692Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6599696Z 2025-08-14T21:43:36.6599786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6599973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6600033Z return mod(**inputs) 2025-08-14T21:43:36.6600294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6600360Z outputs = self.mobilebert( 2025-08-14T21:43:36.6600614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6600685Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6600938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6601001Z layer_outputs = layer_module( 2025-08-14T21:43:36.6601260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6601342Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6601604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6601702Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6601973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6602099Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6602103Z 2025-08-14T21:43:36.6602197Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6602383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6602442Z return mod(**inputs) 2025-08-14T21:43:36.6602700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6602772Z outputs = self.mobilebert( 2025-08-14T21:43:36.6603041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6603106Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6603386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6603451Z layer_outputs = layer_module( 2025-08-14T21:43:36.6603711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6603795Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6604047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6604165Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6604419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6604504Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6604507Z 2025-08-14T21:43:36.6604601Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6604782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6604846Z return mod(**inputs) 2025-08-14T21:43:36.6605103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6605164Z outputs = self.mobilebert( 2025-08-14T21:43:36.6605426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6605491Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6605751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6605814Z layer_outputs = layer_module( 2025-08-14T21:43:36.6606065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6606157Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6606410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6606528Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6606781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6606889Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6607146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6607229Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6607232Z 2025-08-14T21:43:36.6607327Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6607521Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6607596Z return mod(**inputs) 2025-08-14T21:43:36.6607858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6607920Z outputs = self.mobilebert( 2025-08-14T21:43:36.6608172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6608245Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6608496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6608579Z layer_outputs = layer_module( 2025-08-14T21:43:36.6608833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6608933Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6609194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6609293Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6609553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6609627Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6609631Z 2025-08-14T21:43:36.6609722Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6609906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6609965Z return mod(**inputs) 2025-08-14T21:43:36.6610219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6610290Z outputs = self.mobilebert( 2025-08-14T21:43:36.6610546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6610615Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6610869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6610932Z layer_outputs = layer_module( 2025-08-14T21:43:36.6611192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6611275Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6611536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6611635Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6611891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6611997Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6612000Z 2025-08-14T21:43:36.6612090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6612270Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6612336Z return mod(**inputs) 2025-08-14T21:43:36.6612592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6612661Z outputs = self.mobilebert( 2025-08-14T21:43:36.6612917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6612982Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6613263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6613344Z layer_outputs = layer_module( 2025-08-14T21:43:36.6613607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6613691Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6613945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6614063Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6614317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6614412Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6614422Z 2025-08-14T21:43:36.6614531Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6614711Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6614776Z return mod(**inputs) 2025-08-14T21:43:36.6615033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6615096Z outputs = self.mobilebert( 2025-08-14T21:43:36.6615356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6615423Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6615681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6615745Z layer_outputs = layer_module( 2025-08-14T21:43:36.6615998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6616092Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6616346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6616457Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6616721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6616831Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6617093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6617178Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6617181Z 2025-08-14T21:43:36.6617272Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6617463Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6617521Z return mod(**inputs) 2025-08-14T21:43:36.6617786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6617848Z outputs = self.mobilebert( 2025-08-14T21:43:36.6618103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6618176Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6618429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6618493Z layer_outputs = layer_module( 2025-08-14T21:43:36.6618753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6618863Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6619153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6619231Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6619235Z 2025-08-14T21:43:36.6619326Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6619511Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6619570Z return mod(**inputs) 2025-08-14T21:43:36.6619835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6619914Z outputs = self.mobilebert( 2025-08-14T21:43:36.6620177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6620270Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6620528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6620592Z layer_outputs = layer_module( 2025-08-14T21:43:36.6620854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6620960Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6621222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6621320Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6621325Z 2025-08-14T21:43:36.6621416Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6621604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6621664Z return mod(**inputs) 2025-08-14T21:43:36.6621930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6621993Z outputs = self.mobilebert( 2025-08-14T21:43:36.6622248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6622319Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6622574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6622642Z layer_outputs = layer_module( 2025-08-14T21:43:36.6622898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6623042Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6623305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6623391Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6623394Z 2025-08-14T21:43:36.6623485Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6623668Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6623726Z return mod(**inputs) 2025-08-14T21:43:36.6623990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6624053Z outputs = self.mobilebert( 2025-08-14T21:43:36.6624310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6624382Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6624653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6624804Z layer_outputs = layer_module( 2025-08-14T21:43:36.6625064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6625207Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6625466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6625579Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6625831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6625939Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6625942Z 2025-08-14T21:43:36.6626050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6626239Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6626299Z return mod(**inputs) 2025-08-14T21:43:36.6626558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6626631Z outputs = self.mobilebert( 2025-08-14T21:43:36.6626886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6626960Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6627215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6627279Z layer_outputs = layer_module( 2025-08-14T21:43:36.6627541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6627686Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6627939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6628057Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6628310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6628394Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6628397Z 2025-08-14T21:43:36.6628487Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6628667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6628732Z return mod(**inputs) 2025-08-14T21:43:36.6628991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6629063Z outputs = self.mobilebert( 2025-08-14T21:43:36.6629316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6629380Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6629639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6629703Z layer_outputs = layer_module( 2025-08-14T21:43:36.6629962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6630103Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6630356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6630490Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6630761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6630874Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6631135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6631216Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6631219Z 2025-08-14T21:43:36.6631315Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6631510Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6631569Z return mod(**inputs) 2025-08-14T21:43:36.6631831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6631920Z outputs = self.mobilebert( 2025-08-14T21:43:36.6632187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6632253Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6632507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6632576Z layer_outputs = layer_module( 2025-08-14T21:43:36.6632831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6632977Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6633242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6633344Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6633611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6633685Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6633689Z 2025-08-14T21:43:36.6633781Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6633969Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6634028Z return mod(**inputs) 2025-08-14T21:43:36.6634296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6634361Z outputs = self.mobilebert( 2025-08-14T21:43:36.6634616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6634694Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6634952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6635014Z layer_outputs = layer_module( 2025-08-14T21:43:36.6635278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6635421Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6635687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6635787Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6636042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6636129Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6636415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6636506Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6636509Z 2025-08-14T21:43:36.6636602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6636783Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6636851Z return mod(**inputs) 2025-08-14T21:43:36.6637110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6637197Z outputs = self.mobilebert( 2025-08-14T21:43:36.6637455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6637537Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6637799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6637863Z layer_outputs = layer_module( 2025-08-14T21:43:36.6638116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6638198Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6638448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6638519Z self_outputs = self.self( 2025-08-14T21:43:36.6638773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6638840Z self.query(query_tensor) 2025-08-14T21:43:36.6638844Z 2025-08-14T21:43:36.6638942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6639120Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6639186Z return mod(**inputs) 2025-08-14T21:43:36.6639440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6639502Z outputs = self.mobilebert( 2025-08-14T21:43:36.6639763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6639828Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6640081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6640151Z layer_outputs = layer_module( 2025-08-14T21:43:36.6640403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6640489Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6640747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6640811Z self_outputs = self.self( 2025-08-14T21:43:36.6641071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6641129Z self.key(key_tensor) 2025-08-14T21:43:36.6641133Z 2025-08-14T21:43:36.6641229Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6641406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6641465Z return mod(**inputs) 2025-08-14T21:43:36.6641730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6641795Z outputs = self.mobilebert( 2025-08-14T21:43:36.6642080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6642155Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6642412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6642482Z layer_outputs = layer_module( 2025-08-14T21:43:36.6642737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6642814Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6643091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6643153Z self_outputs = self.self( 2025-08-14T21:43:36.6643426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6643498Z self.value(value_tensor) 2025-08-14T21:43:36.6643502Z 2025-08-14T21:43:36.6643575Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6643652Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6643745Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6643921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6643985Z return mod(**inputs) 2025-08-14T21:43:36.6644243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6644306Z outputs = self.mobilebert( 2025-08-14T21:43:36.6644565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6644630Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6644891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6644959Z layer_outputs = layer_module( 2025-08-14T21:43:36.6645211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6645293Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6645544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6645661Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6645917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6645994Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6645999Z 2025-08-14T21:43:36.6646097Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6646277Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6646343Z return mod(**inputs) 2025-08-14T21:43:36.6646599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6646660Z outputs = self.mobilebert( 2025-08-14T21:43:36.6646921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6646985Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6647237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6647308Z layer_outputs = layer_module( 2025-08-14T21:43:36.6647574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6647741Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6647997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6648096Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6648357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6648430Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6648434Z 2025-08-14T21:43:36.6648550Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6648729Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6648787Z return mod(**inputs) 2025-08-14T21:43:36.6649067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6649130Z outputs = self.mobilebert( 2025-08-14T21:43:36.6649387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6649460Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6649712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6649781Z layer_outputs = layer_module( 2025-08-14T21:43:36.6650034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6650110Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6650368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6650480Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6650743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6650855Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6651108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6651196Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6651200Z 2025-08-14T21:43:36.6651291Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6651469Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6651536Z return mod(**inputs) 2025-08-14T21:43:36.6651790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6651862Z outputs = self.mobilebert( 2025-08-14T21:43:36.6652114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6652178Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6652436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6652498Z layer_outputs = layer_module( 2025-08-14T21:43:36.6652755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6652843Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6653094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6653201Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6653517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6653593Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6653603Z 2025-08-14T21:43:36.6653695Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6653876Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6653942Z return mod(**inputs) 2025-08-14T21:43:36.6654203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6654291Z outputs = self.mobilebert( 2025-08-14T21:43:36.6654551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6654636Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6654899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6654964Z layer_outputs = layer_module( 2025-08-14T21:43:36.6655217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6655311Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6655563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6655664Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6655928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6656030Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6656035Z 2025-08-14T21:43:36.6656137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6656317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6656376Z return mod(**inputs) 2025-08-14T21:43:36.6656642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6656706Z outputs = self.mobilebert( 2025-08-14T21:43:36.6656967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6657033Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6657288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6657360Z layer_outputs = layer_module( 2025-08-14T21:43:36.6657615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6657703Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6657963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6658075Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6658334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6658409Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6658412Z 2025-08-14T21:43:36.6658505Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6658695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6658756Z return mod(**inputs) 2025-08-14T21:43:36.6659038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6659124Z outputs = self.mobilebert( 2025-08-14T21:43:36.6659379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6659452Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6659703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6659766Z layer_outputs = layer_module( 2025-08-14T21:43:36.6660025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6660128Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6660389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6660521Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6660775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6660892Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6661145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6661235Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6661238Z 2025-08-14T21:43:36.6661329Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6661508Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6661576Z return mod(**inputs) 2025-08-14T21:43:36.6661835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6661907Z outputs = self.mobilebert( 2025-08-14T21:43:36.6662161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6662227Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6662487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6662551Z layer_outputs = layer_module( 2025-08-14T21:43:36.6662805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6662899Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6663153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6663260Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6663516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6663592Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6663595Z 2025-08-14T21:43:36.6663694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6663872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6663937Z return mod(**inputs) 2025-08-14T21:43:36.6664192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6664256Z outputs = self.mobilebert( 2025-08-14T21:43:36.6664516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6664581Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6664933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6665012Z layer_outputs = layer_module( 2025-08-14T21:43:36.6665266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6665356Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6665607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6665706Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6665983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6666084Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6666103Z 2025-08-14T21:43:36.6666207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6666388Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6666448Z return mod(**inputs) 2025-08-14T21:43:36.6666711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6666774Z outputs = self.mobilebert( 2025-08-14T21:43:36.6667029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6667102Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6667360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6667431Z layer_outputs = layer_module( 2025-08-14T21:43:36.6667685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6667771Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6668031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6668142Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6668400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6668474Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6668477Z 2025-08-14T21:43:36.6668568Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6668754Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6668813Z return mod(**inputs) 2025-08-14T21:43:36.6669070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6669143Z outputs = self.mobilebert( 2025-08-14T21:43:36.6669396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6669467Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6669718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6669781Z layer_outputs = layer_module( 2025-08-14T21:43:36.6670039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6670123Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6670385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6670515Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6670786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6670906Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6671161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6671250Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6671254Z 2025-08-14T21:43:36.6671347Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6671563Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6671628Z return mod(**inputs) 2025-08-14T21:43:36.6671884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6671977Z outputs = self.mobilebert( 2025-08-14T21:43:36.6672246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6672311Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6672578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6672642Z layer_outputs = layer_module( 2025-08-14T21:43:36.6672899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6672992Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6673250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6673357Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6673618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6673693Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6673696Z 2025-08-14T21:43:36.6673794Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6673972Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6674032Z return mod(**inputs) 2025-08-14T21:43:36.6674298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6674361Z outputs = self.mobilebert( 2025-08-14T21:43:36.6674623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6674692Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6674954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6675027Z layer_outputs = layer_module( 2025-08-14T21:43:36.6675288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6675380Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6675636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6675735Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6676000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6676099Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6676104Z 2025-08-14T21:43:36.6676210Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6676420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6676481Z return mod(**inputs) 2025-08-14T21:43:36.6676748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6676810Z outputs = self.mobilebert( 2025-08-14T21:43:36.6677064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6677136Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6677391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6677478Z layer_outputs = layer_module( 2025-08-14T21:43:36.6677734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6678166Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6678431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6678543Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6678799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6678881Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6678885Z 2025-08-14T21:43:36.6678976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6679164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6679221Z return mod(**inputs) 2025-08-14T21:43:36.6679484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6679556Z outputs = self.mobilebert( 2025-08-14T21:43:36.6679815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6679884Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6680141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6680204Z layer_outputs = layer_module( 2025-08-14T21:43:36.6680469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6680554Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6680810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6680931Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6681188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6681302Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6681560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6681641Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6681644Z 2025-08-14T21:43:36.6681745Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6681925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6681990Z return mod(**inputs) 2025-08-14T21:43:36.6682251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6682329Z outputs = self.mobilebert( 2025-08-14T21:43:36.6682603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6682671Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6682925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6682995Z layer_outputs = layer_module( 2025-08-14T21:43:36.6683247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6683379Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6683634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6683724Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6683727Z 2025-08-14T21:43:36.6683828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6684009Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6684075Z return mod(**inputs) 2025-08-14T21:43:36.6684382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6684447Z outputs = self.mobilebert( 2025-08-14T21:43:36.6684857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6684933Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6685196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6685270Z layer_outputs = layer_module( 2025-08-14T21:43:36.6685532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6685651Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6685912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6686013Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6686017Z 2025-08-14T21:43:36.6686121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6686316Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6686384Z return mod(**inputs) 2025-08-14T21:43:36.6686640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6686704Z outputs = self.mobilebert( 2025-08-14T21:43:36.6686970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6687037Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6687330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6687402Z layer_outputs = layer_module( 2025-08-14T21:43:36.6687662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6687818Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6688085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6688174Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6688179Z 2025-08-14T21:43:36.6688282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6688520Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6688589Z return mod(**inputs) 2025-08-14T21:43:36.6688859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6688925Z outputs = self.mobilebert( 2025-08-14T21:43:36.6689196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6689262Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6689534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6689624Z layer_outputs = layer_module( 2025-08-14T21:43:36.6689888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6690072Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6690335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6690448Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6690716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6690800Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6690804Z 2025-08-14T21:43:36.6690905Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6691089Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6691149Z return mod(**inputs) 2025-08-14T21:43:36.6691422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6691487Z outputs = self.mobilebert( 2025-08-14T21:43:36.6691753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6691819Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6692079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6692151Z layer_outputs = layer_module( 2025-08-14T21:43:36.6692411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6692556Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6692824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6692943Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6693208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6693284Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6693288Z 2025-08-14T21:43:36.6693380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6693569Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6693630Z return mod(**inputs) 2025-08-14T21:43:36.6693900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6693965Z outputs = self.mobilebert( 2025-08-14T21:43:36.6694222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6694313Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6696668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6696739Z layer_outputs = layer_module( 2025-08-14T21:43:36.6697015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6697163Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6697435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6697568Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6697826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6697947Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6698236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6698321Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6698324Z 2025-08-14T21:43:36.6698424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6698606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6698674Z return mod(**inputs) 2025-08-14T21:43:36.6698939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6699006Z outputs = self.mobilebert( 2025-08-14T21:43:36.6699271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6699339Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6699604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6699674Z layer_outputs = layer_module( 2025-08-14T21:43:36.6699925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6700076Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6700327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6700427Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6700687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6700760Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6700764Z 2025-08-14T21:43:36.6700860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6701039Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6701097Z return mod(**inputs) 2025-08-14T21:43:36.6701358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6701419Z outputs = self.mobilebert( 2025-08-14T21:43:36.6701677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6701742Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6701994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6702062Z layer_outputs = layer_module( 2025-08-14T21:43:36.6702356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6702551Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6702811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6702909Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6703166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6703261Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6703514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6703601Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6703605Z 2025-08-14T21:43:36.6703697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6703885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6703944Z return mod(**inputs) 2025-08-14T21:43:36.6704199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6704270Z outputs = self.mobilebert( 2025-08-14T21:43:36.6704521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6704586Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6704923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6704993Z layer_outputs = layer_module( 2025-08-14T21:43:36.6705259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6705338Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6705595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6705666Z self_outputs = self.self( 2025-08-14T21:43:36.6705920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6705992Z self.query(query_tensor) 2025-08-14T21:43:36.6705996Z 2025-08-14T21:43:36.6706088Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6706268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6706334Z return mod(**inputs) 2025-08-14T21:43:36.6706593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6706658Z outputs = self.mobilebert( 2025-08-14T21:43:36.6706924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6706988Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6707249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6707318Z layer_outputs = layer_module( 2025-08-14T21:43:36.6707572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6707654Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6707909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6707977Z self_outputs = self.self( 2025-08-14T21:43:36.6708264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6708350Z self.key(key_tensor) 2025-08-14T21:43:36.6708353Z 2025-08-14T21:43:36.6708452Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6708629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6708686Z return mod(**inputs) 2025-08-14T21:43:36.6708951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6709030Z outputs = self.mobilebert( 2025-08-14T21:43:36.6709291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6709355Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6709615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6709684Z layer_outputs = layer_module( 2025-08-14T21:43:36.6709940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6710015Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6710277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6710338Z self_outputs = self.self( 2025-08-14T21:43:36.6710605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6710669Z self.value(value_tensor) 2025-08-14T21:43:36.6710672Z 2025-08-14T21:43:36.6710747Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6710826Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6710919Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6711106Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6711164Z return mod(**inputs) 2025-08-14T21:43:36.6711426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6711494Z outputs = self.mobilebert( 2025-08-14T21:43:36.6711749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6711812Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6712076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6712136Z layer_outputs = layer_module( 2025-08-14T21:43:36.6712399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6712477Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6712735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6712855Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6713116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6713199Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6713205Z 2025-08-14T21:43:36.6713297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6713479Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6713543Z return mod(**inputs) 2025-08-14T21:43:36.6713823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6713922Z outputs = self.mobilebert( 2025-08-14T21:43:36.6714202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6714268Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6714530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6714593Z layer_outputs = layer_module( 2025-08-14T21:43:36.6714848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6715016Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6715270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6715375Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6715627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6715697Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6715701Z 2025-08-14T21:43:36.6715796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6715971Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6716030Z return mod(**inputs) 2025-08-14T21:43:36.6716292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6716356Z outputs = self.mobilebert( 2025-08-14T21:43:36.6716616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6716680Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6716934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6717004Z layer_outputs = layer_module( 2025-08-14T21:43:36.6717256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6717335Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6717584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6717694Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6717953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6718065Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6718316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6718407Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6718410Z 2025-08-14T21:43:36.6718500Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6718683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6718742Z return mod(**inputs) 2025-08-14T21:43:36.6718996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6719067Z outputs = self.mobilebert( 2025-08-14T21:43:36.6719318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6719401Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6719671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6719753Z layer_outputs = layer_module( 2025-08-14T21:43:36.6720012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6720099Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6720349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6720481Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6720734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6720817Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6720821Z 2025-08-14T21:43:36.6720912Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6721093Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6721158Z return mod(**inputs) 2025-08-14T21:43:36.6721414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6721484Z outputs = self.mobilebert( 2025-08-14T21:43:36.6721736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6721801Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6722064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6722126Z layer_outputs = layer_module( 2025-08-14T21:43:36.6722380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6722476Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6722729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6722835Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6723088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6723186Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6723190Z 2025-08-14T21:43:36.6723288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6723467Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6723532Z return mod(**inputs) 2025-08-14T21:43:36.6723791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6723856Z outputs = self.mobilebert( 2025-08-14T21:43:36.6724116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6724180Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6724447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6724511Z layer_outputs = layer_module( 2025-08-14T21:43:36.6724766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6724860Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6725114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6725255Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6725533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6725608Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6725611Z 2025-08-14T21:43:36.6725711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6725893Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6725952Z return mod(**inputs) 2025-08-14T21:43:36.6726216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6726295Z outputs = self.mobilebert( 2025-08-14T21:43:36.6726559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6726625Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6726882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6726950Z layer_outputs = layer_module( 2025-08-14T21:43:36.6727202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6727285Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6727545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6727657Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6727915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6728025Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6728280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6728372Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6728375Z 2025-08-14T21:43:36.6728465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6728647Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6728704Z return mod(**inputs) 2025-08-14T21:43:36.6728960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6729031Z outputs = self.mobilebert( 2025-08-14T21:43:36.6729283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6729348Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6729612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6729674Z layer_outputs = layer_module( 2025-08-14T21:43:36.6729933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6730017Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6730269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6730375Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6730628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6730709Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6730712Z 2025-08-14T21:43:36.6730816Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6731027Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6731094Z return mod(**inputs) 2025-08-14T21:43:36.6731353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6731416Z outputs = self.mobilebert( 2025-08-14T21:43:36.6731677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6731743Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6732020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6732084Z layer_outputs = layer_module( 2025-08-14T21:43:36.6732336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6732430Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6732683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6732786Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6733040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6733138Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6733143Z 2025-08-14T21:43:36.6733240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6733419Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6733477Z return mod(**inputs) 2025-08-14T21:43:36.6733742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6733806Z outputs = self.mobilebert( 2025-08-14T21:43:36.6734063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6734127Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6734379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6734448Z layer_outputs = layer_module( 2025-08-14T21:43:36.6734697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6734788Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6735040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6735152Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6735412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6735486Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6735489Z 2025-08-14T21:43:36.6735586Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6735762Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6735820Z return mod(**inputs) 2025-08-14T21:43:36.6736081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6736143Z outputs = self.mobilebert( 2025-08-14T21:43:36.6736397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6736481Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6736769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6736841Z layer_outputs = layer_module( 2025-08-14T21:43:36.6737092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6737173Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6737430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6737556Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6737815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6737923Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6738178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6738267Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6738270Z 2025-08-14T21:43:36.6738362Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6738544Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6738605Z return mod(**inputs) 2025-08-14T21:43:36.6738857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6738925Z outputs = self.mobilebert( 2025-08-14T21:43:36.6739176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6739241Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6739502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6739566Z layer_outputs = layer_module( 2025-08-14T21:43:36.6739821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6739904Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6740153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6740259Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6740513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6740588Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6740599Z 2025-08-14T21:43:36.6740691Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6740866Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6740931Z return mod(**inputs) 2025-08-14T21:43:36.6741188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6741250Z outputs = self.mobilebert( 2025-08-14T21:43:36.6741509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6741572Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6741833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6741894Z layer_outputs = layer_module( 2025-08-14T21:43:36.6742161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6742291Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6742542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6742638Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6742895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6742993Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6742996Z 2025-08-14T21:43:36.6743137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6743316Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6743372Z return mod(**inputs) 2025-08-14T21:43:36.6743632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6743697Z outputs = self.mobilebert( 2025-08-14T21:43:36.6743955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6744018Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6744270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6744336Z layer_outputs = layer_module( 2025-08-14T21:43:36.6744589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6744738Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6745007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6745117Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6745375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6745450Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6745454Z 2025-08-14T21:43:36.6745543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6745727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6745785Z return mod(**inputs) 2025-08-14T21:43:36.6746045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6746109Z outputs = self.mobilebert( 2025-08-14T21:43:36.6746359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6746432Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6746683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6746746Z layer_outputs = layer_module( 2025-08-14T21:43:36.6747001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6747083Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6747344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6747456Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6747707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6747839Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6748105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6748209Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6748213Z 2025-08-14T21:43:36.6748301Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6748480Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6748542Z return mod(**inputs) 2025-08-14T21:43:36.6748799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6748885Z outputs = self.mobilebert( 2025-08-14T21:43:36.6749138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6749203Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6749462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6749525Z layer_outputs = layer_module( 2025-08-14T21:43:36.6749780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6749898Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6750152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6750233Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6750237Z 2025-08-14T21:43:36.6750327Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6750507Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6750573Z return mod(**inputs) 2025-08-14T21:43:36.6750832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6750904Z outputs = self.mobilebert( 2025-08-14T21:43:36.6751158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6751222Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6751486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6751547Z layer_outputs = layer_module( 2025-08-14T21:43:36.6751801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6751916Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6752171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6752280Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6752283Z 2025-08-14T21:43:36.6752372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6752551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6752617Z return mod(**inputs) 2025-08-14T21:43:36.6752873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6752940Z outputs = self.mobilebert( 2025-08-14T21:43:36.6753195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6753257Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6753536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6753621Z layer_outputs = layer_module( 2025-08-14T21:43:36.6753892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6754041Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6754292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6754382Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6754386Z 2025-08-14T21:43:36.6754492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6754669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6754733Z return mod(**inputs) 2025-08-14T21:43:36.6754991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6755063Z outputs = self.mobilebert( 2025-08-14T21:43:36.6755315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6755380Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6755638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6755701Z layer_outputs = layer_module( 2025-08-14T21:43:36.6755953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6756104Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6756358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6756477Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6756731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6756813Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6756816Z 2025-08-14T21:43:36.6756913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6757090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6757154Z return mod(**inputs) 2025-08-14T21:43:36.6757410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6757472Z outputs = self.mobilebert( 2025-08-14T21:43:36.6757732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6757795Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6758049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6758117Z layer_outputs = layer_module( 2025-08-14T21:43:36.6758370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6758516Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6758770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6758880Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6759136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6759223Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6759243Z 2025-08-14T21:43:36.6759355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6759537Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6759595Z return mod(**inputs) 2025-08-14T21:43:36.6759857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6759921Z outputs = self.mobilebert( 2025-08-14T21:43:36.6760180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6760261Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6760514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6760583Z layer_outputs = layer_module( 2025-08-14T21:43:36.6760839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6760981Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6761241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6761350Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6761610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6761719Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6761973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6762063Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6762066Z 2025-08-14T21:43:36.6762161Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6762347Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6762407Z return mod(**inputs) 2025-08-14T21:43:36.6762663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6762734Z outputs = self.mobilebert( 2025-08-14T21:43:36.6762987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6763052Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6763312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6763376Z layer_outputs = layer_module( 2025-08-14T21:43:36.6763639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6763789Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6764044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6764152Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6764403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6764484Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6764487Z 2025-08-14T21:43:36.6764579Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6764758Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6764838Z return mod(**inputs) 2025-08-14T21:43:36.6765119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6765207Z outputs = self.mobilebert( 2025-08-14T21:43:36.6765467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6765533Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6765800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6765878Z layer_outputs = layer_module( 2025-08-14T21:43:36.6766138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6766294Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6766559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6766664Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6766925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6767002Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6767266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6767347Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6767352Z 2025-08-14T21:43:36.6767449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6767630Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6767688Z return mod(**inputs) 2025-08-14T21:43:36.6767961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6768025Z outputs = self.mobilebert( 2025-08-14T21:43:36.6768283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6768355Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6768613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6768684Z layer_outputs = layer_module( 2025-08-14T21:43:36.6768946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6769023Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6769295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6769358Z self_outputs = self.self( 2025-08-14T21:43:36.6769626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6769689Z self.query(query_tensor) 2025-08-14T21:43:36.6769692Z 2025-08-14T21:43:36.6769783Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6769971Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6770030Z return mod(**inputs) 2025-08-14T21:43:36.6770293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6770364Z outputs = self.mobilebert( 2025-08-14T21:43:36.6770622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6770708Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6770977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6771056Z layer_outputs = layer_module( 2025-08-14T21:43:36.6771316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6771392Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6771651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6771729Z self_outputs = self.self( 2025-08-14T21:43:36.6771988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6772053Z self.key(key_tensor) 2025-08-14T21:43:36.6772057Z 2025-08-14T21:43:36.6772149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6772331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6772398Z return mod(**inputs) 2025-08-14T21:43:36.6772659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6772728Z outputs = self.mobilebert( 2025-08-14T21:43:36.6772986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6773050Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6773315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6773379Z layer_outputs = layer_module( 2025-08-14T21:43:36.6773637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6773721Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6773981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6774049Z self_outputs = self.self( 2025-08-14T21:43:36.6774306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6774369Z self.value(value_tensor) 2025-08-14T21:43:36.6774373Z 2025-08-14T21:43:36.6774451Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6774526Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6774625Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6774804Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6774862Z return mod(**inputs) 2025-08-14T21:43:36.6775133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6775198Z outputs = self.mobilebert( 2025-08-14T21:43:36.6775455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6775526Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6775786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6775855Z layer_outputs = layer_module( 2025-08-14T21:43:36.6776115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6776190Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6776470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6776600Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6776878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6776952Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6776956Z 2025-08-14T21:43:36.6777045Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6777231Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6777289Z return mod(**inputs) 2025-08-14T21:43:36.6777561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6777632Z outputs = self.mobilebert( 2025-08-14T21:43:36.6777884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6777956Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6778210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6778272Z layer_outputs = layer_module( 2025-08-14T21:43:36.6778531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6778674Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6778935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6779035Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6779293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6779372Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6779378Z 2025-08-14T21:43:36.6779468Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6779644Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6779709Z return mod(**inputs) 2025-08-14T21:43:36.6779963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6780032Z outputs = self.mobilebert( 2025-08-14T21:43:36.6780285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6780351Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6780609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6780673Z layer_outputs = layer_module( 2025-08-14T21:43:36.6780930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6781007Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6781257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6781371Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6781624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6781738Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6781994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6782091Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6782095Z 2025-08-14T21:43:36.6782212Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6782408Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6782466Z return mod(**inputs) 2025-08-14T21:43:36.6782729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6782792Z outputs = self.mobilebert( 2025-08-14T21:43:36.6783052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6783132Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6783386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6783455Z layer_outputs = layer_module( 2025-08-14T21:43:36.6783712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6783799Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6784059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6784158Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6784416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6784490Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6784495Z 2025-08-14T21:43:36.6784760Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6784964Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6785025Z return mod(**inputs) 2025-08-14T21:43:36.6785303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6785369Z outputs = self.mobilebert( 2025-08-14T21:43:36.6785630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6785705Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6786037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6786100Z layer_outputs = layer_module( 2025-08-14T21:43:36.6786363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6786448Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6786717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6786818Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6787079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6787189Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6787193Z 2025-08-14T21:43:36.6787286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6787474Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6787535Z return mod(**inputs) 2025-08-14T21:43:36.6787801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6787872Z outputs = self.mobilebert( 2025-08-14T21:43:36.6788178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6788274Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6788574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6788638Z layer_outputs = layer_module( 2025-08-14T21:43:36.6788905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6788993Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6789253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6789400Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6789660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6789744Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6789750Z 2025-08-14T21:43:36.6789844Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6790026Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6790094Z return mod(**inputs) 2025-08-14T21:43:36.6790357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6790427Z outputs = self.mobilebert( 2025-08-14T21:43:36.6790687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6790756Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6791023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6791089Z layer_outputs = layer_module( 2025-08-14T21:43:36.6791352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6791445Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6791704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6791827Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6792088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6792200Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6792465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6792548Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6792552Z 2025-08-14T21:43:36.6792653Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6792834Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6792894Z return mod(**inputs) 2025-08-14T21:43:36.6793165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6793228Z outputs = self.mobilebert( 2025-08-14T21:43:36.6793489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6793564Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6793822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6793894Z layer_outputs = layer_module( 2025-08-14T21:43:36.6794184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6794305Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6794570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6794672Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6794937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6795012Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6795030Z 2025-08-14T21:43:36.6795125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6795313Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6795373Z return mod(**inputs) 2025-08-14T21:43:36.6795641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6795713Z outputs = self.mobilebert( 2025-08-14T21:43:36.6795972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6796043Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6796300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6796362Z layer_outputs = layer_module( 2025-08-14T21:43:36.6796627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6796710Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6796974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6797077Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6797339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6797447Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6797451Z 2025-08-14T21:43:36.6797543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6797725Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6797791Z return mod(**inputs) 2025-08-14T21:43:36.6798057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6798128Z outputs = self.mobilebert( 2025-08-14T21:43:36.6798388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6798454Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6798728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6798790Z layer_outputs = layer_module( 2025-08-14T21:43:36.6799049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6799131Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6799383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6799502Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6799755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6799843Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6799868Z 2025-08-14T21:43:36.6799977Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6800157Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6800223Z return mod(**inputs) 2025-08-14T21:43:36.6800477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6800537Z outputs = self.mobilebert( 2025-08-14T21:43:36.6800796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6800878Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6801136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6801200Z layer_outputs = layer_module( 2025-08-14T21:43:36.6801454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6801543Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6801795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6801902Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6802159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6802267Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6802524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6802605Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6802609Z 2025-08-14T21:43:36.6802700Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6802885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6802943Z return mod(**inputs) 2025-08-14T21:43:36.6803202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6803263Z outputs = self.mobilebert( 2025-08-14T21:43:36.6803512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6803585Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6803837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6803905Z layer_outputs = layer_module( 2025-08-14T21:43:36.6804158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6804239Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6804497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6804594Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6804844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6804924Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6804929Z 2025-08-14T21:43:36.6805018Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6805201Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6805258Z return mod(**inputs) 2025-08-14T21:43:36.6805542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6805629Z outputs = self.mobilebert( 2025-08-14T21:43:36.6805882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6805951Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6806201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6806262Z layer_outputs = layer_module( 2025-08-14T21:43:36.6806519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6806625Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6806877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6806983Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6807233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6807337Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6807340Z 2025-08-14T21:43:36.6807431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6807606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6807671Z return mod(**inputs) 2025-08-14T21:43:36.6807924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6807993Z outputs = self.mobilebert( 2025-08-14T21:43:36.6808243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6808308Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6808570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6808633Z layer_outputs = layer_module( 2025-08-14T21:43:36.6808882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6808973Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6809222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6809341Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6809591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6809667Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6809670Z 2025-08-14T21:43:36.6809771Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6809948Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6810014Z return mod(**inputs) 2025-08-14T21:43:36.6810265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6810328Z outputs = self.mobilebert( 2025-08-14T21:43:36.6810584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6810648Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6810899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6810985Z layer_outputs = layer_module( 2025-08-14T21:43:36.6811255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6811363Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6811612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6811723Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6811981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6812105Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6812361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6812441Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6812446Z 2025-08-14T21:43:36.6812536Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6812716Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6812773Z return mod(**inputs) 2025-08-14T21:43:36.6813026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6813095Z outputs = self.mobilebert( 2025-08-14T21:43:36.6813343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6813410Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6813659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6813720Z layer_outputs = layer_module( 2025-08-14T21:43:36.6813982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6814091Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6814348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6814420Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6814423Z 2025-08-14T21:43:36.6814512Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6814695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6814755Z return mod(**inputs) 2025-08-14T21:43:36.6815008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6815078Z outputs = self.mobilebert( 2025-08-14T21:43:36.6815331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6815401Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6815650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6815708Z layer_outputs = layer_module( 2025-08-14T21:43:36.6815966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6816068Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6816319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6816413Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6816416Z 2025-08-14T21:43:36.6816503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6816708Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6816780Z return mod(**inputs) 2025-08-14T21:43:36.6817042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6817099Z outputs = self.mobilebert( 2025-08-14T21:43:36.6817351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6817417Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6817668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6817743Z layer_outputs = layer_module( 2025-08-14T21:43:36.6818002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6818146Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6818405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6818489Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6818493Z 2025-08-14T21:43:36.6818582Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6818764Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6818821Z return mod(**inputs) 2025-08-14T21:43:36.6819085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6819147Z outputs = self.mobilebert( 2025-08-14T21:43:36.6819399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6819471Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6819725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6819788Z layer_outputs = layer_module( 2025-08-14T21:43:36.6820045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6820187Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6820444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6820555Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6820807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6820896Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6820902Z 2025-08-14T21:43:36.6820992Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6821175Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6821236Z return mod(**inputs) 2025-08-14T21:43:36.6821490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6821560Z outputs = self.mobilebert( 2025-08-14T21:43:36.6821812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6821874Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6822133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6822207Z layer_outputs = layer_module( 2025-08-14T21:43:36.6822482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6822637Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6822889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6822999Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6823253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6823344Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6823348Z 2025-08-14T21:43:36.6823697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6823878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6823943Z return mod(**inputs) 2025-08-14T21:43:36.6824204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6824275Z outputs = self.mobilebert( 2025-08-14T21:43:36.6824529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6824594Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6824919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6824992Z layer_outputs = layer_module( 2025-08-14T21:43:36.6825245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6825400Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6825656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6825777Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6826028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6826135Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6826393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6826477Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6826480Z 2025-08-14T21:43:36.6826580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6826760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6826820Z return mod(**inputs) 2025-08-14T21:43:36.6827083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6827146Z outputs = self.mobilebert( 2025-08-14T21:43:36.6827402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6827466Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6827719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6827788Z layer_outputs = layer_module( 2025-08-14T21:43:36.6828043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6828182Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6828468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6828590Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6828841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6828913Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6828916Z 2025-08-14T21:43:36.6829003Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6829185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6829262Z return mod(**inputs) 2025-08-14T21:43:36.6829525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6829586Z outputs = self.mobilebert( 2025-08-14T21:43:36.6829840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6829913Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6830165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6830227Z layer_outputs = layer_module( 2025-08-14T21:43:36.6830484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6830628Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6830889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6830984Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6831238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6831322Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6831570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6831652Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6831655Z 2025-08-14T21:43:36.6831746Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6831923Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6831988Z return mod(**inputs) 2025-08-14T21:43:36.6832242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6832307Z outputs = self.mobilebert( 2025-08-14T21:43:36.6832560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6832623Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6832880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6832940Z layer_outputs = layer_module( 2025-08-14T21:43:36.6833192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6833271Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6833521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6833590Z self_outputs = self.self( 2025-08-14T21:43:36.6833842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6833917Z self.query(query_tensor) 2025-08-14T21:43:36.6833921Z 2025-08-14T21:43:36.6834076Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6834254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6834317Z return mod(**inputs) 2025-08-14T21:43:36.6834571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6834632Z outputs = self.mobilebert( 2025-08-14T21:43:36.6834888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6834970Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6835222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6835291Z layer_outputs = layer_module( 2025-08-14T21:43:36.6835545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6835626Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6835877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6835939Z self_outputs = self.self( 2025-08-14T21:43:36.6836194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6836253Z self.key(key_tensor) 2025-08-14T21:43:36.6836258Z 2025-08-14T21:43:36.6836348Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6836530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6836588Z return mod(**inputs) 2025-08-14T21:43:36.6836844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6836904Z outputs = self.mobilebert( 2025-08-14T21:43:36.6837160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6837232Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6837486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6837554Z layer_outputs = layer_module( 2025-08-14T21:43:36.6837805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6837882Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6838141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6838202Z self_outputs = self.self( 2025-08-14T21:43:36.6838455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6838524Z self.value(value_tensor) 2025-08-14T21:43:36.6838527Z 2025-08-14T21:43:36.6838601Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6838677Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6838766Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6838942Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6839007Z return mod(**inputs) 2025-08-14T21:43:36.6839262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6839323Z outputs = self.mobilebert( 2025-08-14T21:43:36.6839606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6839688Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6839953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6840015Z layer_outputs = layer_module( 2025-08-14T21:43:36.6840267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6840345Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6840598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6840728Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6840982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6841055Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6841060Z 2025-08-14T21:43:36.6841158Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6841335Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6841391Z return mod(**inputs) 2025-08-14T21:43:36.6841658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6841720Z outputs = self.mobilebert( 2025-08-14T21:43:36.6841980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6842046Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6842300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6842370Z layer_outputs = layer_module( 2025-08-14T21:43:36.6842628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6842781Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6843033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6843133Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6843392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6843466Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6843470Z 2025-08-14T21:43:36.6843567Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6843747Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6843805Z return mod(**inputs) 2025-08-14T21:43:36.6844067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6844125Z outputs = self.mobilebert( 2025-08-14T21:43:36.6844380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6844449Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6844702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6844772Z layer_outputs = layer_module( 2025-08-14T21:43:36.6845027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6845101Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6845392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6845520Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6845784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6845898Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6846155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6846259Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6846263Z 2025-08-14T21:43:36.6846353Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6846530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6846594Z return mod(**inputs) 2025-08-14T21:43:36.6846849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6846917Z outputs = self.mobilebert( 2025-08-14T21:43:36.6847168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6847232Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6847488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6847549Z layer_outputs = layer_module( 2025-08-14T21:43:36.6847801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6847885Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6848138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6848244Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6848493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6848566Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6848574Z 2025-08-14T21:43:36.6848662Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6848841Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6848907Z return mod(**inputs) 2025-08-14T21:43:36.6849163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6849223Z outputs = self.mobilebert( 2025-08-14T21:43:36.6849483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6849548Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6849806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6849868Z layer_outputs = layer_module( 2025-08-14T21:43:36.6850122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6850208Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6850460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6850559Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6850828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6850948Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6850965Z 2025-08-14T21:43:36.6851065Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6851244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6851302Z return mod(**inputs) 2025-08-14T21:43:36.6851566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6851628Z outputs = self.mobilebert( 2025-08-14T21:43:36.6851889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6851971Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6852235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6852304Z layer_outputs = layer_module( 2025-08-14T21:43:36.6852565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6852647Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6852916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6853026Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6853294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6853369Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6853372Z 2025-08-14T21:43:36.6853461Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6853653Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6853711Z return mod(**inputs) 2025-08-14T21:43:36.6853984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6854046Z outputs = self.mobilebert( 2025-08-14T21:43:36.6854306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6854375Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6854633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6854696Z layer_outputs = layer_module( 2025-08-14T21:43:36.6854966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6855048Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6855317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6855432Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6855690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6855807Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6856068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6856151Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6856155Z 2025-08-14T21:43:36.6856245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6856424Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6856503Z return mod(**inputs) 2025-08-14T21:43:36.6856776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6856864Z outputs = self.mobilebert( 2025-08-14T21:43:36.6857118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6857182Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6857439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6857501Z layer_outputs = layer_module( 2025-08-14T21:43:36.6857770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6857857Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6858113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6858217Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6858474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6858549Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6858553Z 2025-08-14T21:43:36.6858655Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6858839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6858905Z return mod(**inputs) 2025-08-14T21:43:36.6859165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6859228Z outputs = self.mobilebert( 2025-08-14T21:43:36.6859493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6859561Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6859812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6859882Z layer_outputs = layer_module( 2025-08-14T21:43:36.6860135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6860227Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6860483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6860584Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6860844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6860947Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6860952Z 2025-08-14T21:43:36.6861050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6861229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6861289Z return mod(**inputs) 2025-08-14T21:43:36.6861548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6861612Z outputs = self.mobilebert( 2025-08-14T21:43:36.6861864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6861935Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6862189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6862271Z layer_outputs = layer_module( 2025-08-14T21:43:36.6862550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6862632Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6862882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6862990Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6863245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6863337Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6863341Z 2025-08-14T21:43:36.6863432Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6863622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6863680Z return mod(**inputs) 2025-08-14T21:43:36.6863940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6864009Z outputs = self.mobilebert( 2025-08-14T21:43:36.6864262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6864332Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6864585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6864649Z layer_outputs = layer_module( 2025-08-14T21:43:36.6864970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6865054Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6865308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6865421Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6865674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6865787Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6866040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6866122Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6866134Z 2025-08-14T21:43:36.6866224Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6866400Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6866467Z return mod(**inputs) 2025-08-14T21:43:36.6866726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6866789Z outputs = self.mobilebert( 2025-08-14T21:43:36.6867048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6867111Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6867370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6867431Z layer_outputs = layer_module( 2025-08-14T21:43:36.6867684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6867773Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6868065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6868182Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6868442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6868514Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6868518Z 2025-08-14T21:43:36.6868614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6868792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6868868Z return mod(**inputs) 2025-08-14T21:43:36.6869130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6869192Z outputs = self.mobilebert( 2025-08-14T21:43:36.6869448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6869511Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6869762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6869830Z layer_outputs = layer_module( 2025-08-14T21:43:36.6870085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6870167Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6870425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6870524Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6870783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6870884Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6870888Z 2025-08-14T21:43:36.6870979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6871168Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6871226Z return mod(**inputs) 2025-08-14T21:43:36.6871491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6871555Z outputs = self.mobilebert( 2025-08-14T21:43:36.6871807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6871879Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6872134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6872204Z layer_outputs = layer_module( 2025-08-14T21:43:36.6872460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6872544Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6872805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6872916Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6873169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6873250Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6873254Z 2025-08-14T21:43:36.6873345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6873553Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6873612Z return mod(**inputs) 2025-08-14T21:43:36.6873903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6873973Z outputs = self.mobilebert( 2025-08-14T21:43:36.6874226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6874294Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6874546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6874628Z layer_outputs = layer_module( 2025-08-14T21:43:36.6874883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6874962Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6875214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6875329Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6875581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6875695Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6875944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6876027Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6876030Z 2025-08-14T21:43:36.6876127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6876304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6876369Z return mod(**inputs) 2025-08-14T21:43:36.6876624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6876687Z outputs = self.mobilebert( 2025-08-14T21:43:36.6876942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6877005Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6877255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6877324Z layer_outputs = layer_module( 2025-08-14T21:43:36.6877575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6877687Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6877941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6878011Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6878014Z 2025-08-14T21:43:36.6878110Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6878288Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6878352Z return mod(**inputs) 2025-08-14T21:43:36.6878610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6878672Z outputs = self.mobilebert( 2025-08-14T21:43:36.6878931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6878995Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6879261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6879363Z layer_outputs = layer_module( 2025-08-14T21:43:36.6879617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6879729Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6879981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6880077Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6880081Z 2025-08-14T21:43:36.6880198Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6880377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6880441Z return mod(**inputs) 2025-08-14T21:43:36.6880698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6880764Z outputs = self.mobilebert( 2025-08-14T21:43:36.6881023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6881088Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6881344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6881409Z layer_outputs = layer_module( 2025-08-14T21:43:36.6881662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6881810Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6882066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6882152Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6882158Z 2025-08-14T21:43:36.6882257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6882434Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6882498Z return mod(**inputs) 2025-08-14T21:43:36.6882752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6882813Z outputs = self.mobilebert( 2025-08-14T21:43:36.6883070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6883136Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6883399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6883462Z layer_outputs = layer_module( 2025-08-14T21:43:36.6883718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6883874Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6884125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6884234Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6884489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6884677Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6884683Z 2025-08-14T21:43:36.6884788Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6885011Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6885098Z return mod(**inputs) 2025-08-14T21:43:36.6885391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6885453Z outputs = self.mobilebert( 2025-08-14T21:43:36.6885718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6885783Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6886042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6886135Z layer_outputs = layer_module( 2025-08-14T21:43:36.6886437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6886580Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6886839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6886947Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6887200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6887273Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6887277Z 2025-08-14T21:43:36.6887365Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6887551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6887610Z return mod(**inputs) 2025-08-14T21:43:36.6887873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6887934Z outputs = self.mobilebert( 2025-08-14T21:43:36.6888188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6888260Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6888511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6888573Z layer_outputs = layer_module( 2025-08-14T21:43:36.6888832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6888975Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6889229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6889334Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6889586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6889699Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6889947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6890034Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6890037Z 2025-08-14T21:43:36.6890125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6890302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6890367Z return mod(**inputs) 2025-08-14T21:43:36.6890620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6890703Z outputs = self.mobilebert( 2025-08-14T21:43:36.6890971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6891051Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6891308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6891370Z layer_outputs = layer_module( 2025-08-14T21:43:36.6891621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6891771Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6892050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6892154Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6892405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6892478Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6892481Z 2025-08-14T21:43:36.6892575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6892753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6892817Z return mod(**inputs) 2025-08-14T21:43:36.6893072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6893134Z outputs = self.mobilebert( 2025-08-14T21:43:36.6893390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6893454Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6893708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6893777Z layer_outputs = layer_module( 2025-08-14T21:43:36.6894031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6894177Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6894429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6894525Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6894788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6894865Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6895127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6895211Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6895214Z 2025-08-14T21:43:36.6895305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6895492Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6895550Z return mod(**inputs) 2025-08-14T21:43:36.6895812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6895873Z outputs = self.mobilebert( 2025-08-14T21:43:36.6896128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6896198Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6896463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6896543Z layer_outputs = layer_module( 2025-08-14T21:43:36.6896822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6896899Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6897158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6897220Z self_outputs = self.self( 2025-08-14T21:43:36.6897469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6897556Z self.query(query_tensor) 2025-08-14T21:43:36.6897560Z 2025-08-14T21:43:36.6897651Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6897839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6897897Z return mod(**inputs) 2025-08-14T21:43:36.6898164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6898233Z outputs = self.mobilebert( 2025-08-14T21:43:36.6898496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6898559Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6898826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6898890Z layer_outputs = layer_module( 2025-08-14T21:43:36.6899155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6899230Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6899494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6899564Z self_outputs = self.self( 2025-08-14T21:43:36.6899821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6899880Z self.key(key_tensor) 2025-08-14T21:43:36.6899890Z 2025-08-14T21:43:36.6899981Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6900163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6900228Z return mod(**inputs) 2025-08-14T21:43:36.6900493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6900554Z outputs = self.mobilebert( 2025-08-14T21:43:36.6900823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6900889Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6901155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6901217Z layer_outputs = layer_module( 2025-08-14T21:43:36.6901478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6901560Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6901820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6901883Z self_outputs = self.self( 2025-08-14T21:43:36.6902151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6902231Z self.value(value_tensor) 2025-08-14T21:43:36.6902235Z 2025-08-14T21:43:36.6902823Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6902905Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6902997Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6903184Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6903243Z return mod(**inputs) 2025-08-14T21:43:36.6903499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6903567Z outputs = self.mobilebert( 2025-08-14T21:43:36.6903837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6903905Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6904158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6904221Z layer_outputs = layer_module( 2025-08-14T21:43:36.6904479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6904554Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6904867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6904980Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6905231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6905315Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6905318Z 2025-08-14T21:43:36.6905407Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6905583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6905651Z return mod(**inputs) 2025-08-14T21:43:36.6905907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6905977Z outputs = self.mobilebert( 2025-08-14T21:43:36.6906230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6906292Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6906550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6906613Z layer_outputs = layer_module( 2025-08-14T21:43:36.6906873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6907018Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6907276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6907381Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6907636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6907713Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6907717Z 2025-08-14T21:43:36.6907809Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6907987Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6908052Z return mod(**inputs) 2025-08-14T21:43:36.6908309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6908389Z outputs = self.mobilebert( 2025-08-14T21:43:36.6908661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6908745Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6909004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6909064Z layer_outputs = layer_module( 2025-08-14T21:43:36.6909316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6909411Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6909663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6909770Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6910032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6910145Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6910404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6910486Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6910489Z 2025-08-14T21:43:36.6910579Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6910765Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6910823Z return mod(**inputs) 2025-08-14T21:43:36.6911084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6911146Z outputs = self.mobilebert( 2025-08-14T21:43:36.6911399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6911471Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6911723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6911791Z layer_outputs = layer_module( 2025-08-14T21:43:36.6912042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6912128Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6912383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6912483Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6912732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6912812Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6912815Z 2025-08-14T21:43:36.6912904Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6913087Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6913143Z return mod(**inputs) 2025-08-14T21:43:36.6913397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6913460Z outputs = self.mobilebert( 2025-08-14T21:43:36.6913713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6913781Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6914045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6914121Z layer_outputs = layer_module( 2025-08-14T21:43:36.6914408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6914492Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6914744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6914847Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6915098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6915219Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6915222Z 2025-08-14T21:43:36.6915312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6915493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6915561Z return mod(**inputs) 2025-08-14T21:43:36.6915816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6915884Z outputs = self.mobilebert( 2025-08-14T21:43:36.6916135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6916200Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6916457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6916520Z layer_outputs = layer_module( 2025-08-14T21:43:36.6916768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6916859Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6917111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6917228Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6917476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6917547Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6917550Z 2025-08-14T21:43:36.6917646Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6917825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6917887Z return mod(**inputs) 2025-08-14T21:43:36.6918140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6918203Z outputs = self.mobilebert( 2025-08-14T21:43:36.6918464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6918529Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6918783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6918849Z layer_outputs = layer_module( 2025-08-14T21:43:36.6919100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6919187Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6919440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6919550Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6919836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6919960Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6920218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6920301Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6920304Z 2025-08-14T21:43:36.6920394Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6920574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6920645Z return mod(**inputs) 2025-08-14T21:43:36.6920901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6920962Z outputs = self.mobilebert( 2025-08-14T21:43:36.6921211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6921280Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6921526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6921588Z layer_outputs = layer_module( 2025-08-14T21:43:36.6921841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6921920Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6922175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6922272Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6922521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6922603Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6922606Z 2025-08-14T21:43:36.6922696Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6922878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6922935Z return mod(**inputs) 2025-08-14T21:43:36.6923189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6923257Z outputs = self.mobilebert( 2025-08-14T21:43:36.6923510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6923573Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6923828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6923891Z layer_outputs = layer_module( 2025-08-14T21:43:36.6924142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6924220Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6924470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6924572Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6924823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6924921Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6924925Z 2025-08-14T21:43:36.6925015Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6925202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6925325Z return mod(**inputs) 2025-08-14T21:43:36.6925584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6925645Z outputs = self.mobilebert( 2025-08-14T21:43:36.6925904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6925968Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6926226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6926304Z layer_outputs = layer_module( 2025-08-14T21:43:36.6926557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6926646Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6926899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6927016Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6927269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6927342Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6927345Z 2025-08-14T21:43:36.6927443Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6927619Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6927678Z return mod(**inputs) 2025-08-14T21:43:36.6927942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6928003Z outputs = self.mobilebert( 2025-08-14T21:43:36.6928265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6928329Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6928581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6928649Z layer_outputs = layer_module( 2025-08-14T21:43:36.6928901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6928987Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6929236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6929346Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6929604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6929714Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6929964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6930049Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6930053Z 2025-08-14T21:43:36.6930143Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6930324Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6930383Z return mod(**inputs) 2025-08-14T21:43:36.6930646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6930714Z outputs = self.mobilebert( 2025-08-14T21:43:36.6930999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6931085Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6931338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6931400Z layer_outputs = layer_module( 2025-08-14T21:43:36.6931656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6931739Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6932006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6932110Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6932364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6932445Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6932449Z 2025-08-14T21:43:36.6932540Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6932718Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6932783Z return mod(**inputs) 2025-08-14T21:43:36.6933036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6933102Z outputs = self.mobilebert( 2025-08-14T21:43:36.6933355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6933419Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6933677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6933739Z layer_outputs = layer_module( 2025-08-14T21:43:36.6933991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6934077Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6934329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6934426Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6934673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6934771Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6934775Z 2025-08-14T21:43:36.6934870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6935047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6935109Z return mod(**inputs) 2025-08-14T21:43:36.6935361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6935422Z outputs = self.mobilebert( 2025-08-14T21:43:36.6935677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6935739Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6935989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6936055Z layer_outputs = layer_module( 2025-08-14T21:43:36.6936306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6936410Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6936692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6936821Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6937084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6937159Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6937163Z 2025-08-14T21:43:36.6937258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6937439Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6937513Z return mod(**inputs) 2025-08-14T21:43:36.6937783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6937848Z outputs = self.mobilebert( 2025-08-14T21:43:36.6938112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6938179Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6938435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6938499Z layer_outputs = layer_module( 2025-08-14T21:43:36.6938756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6938837Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6939099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6939207Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6939474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6939586Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6939844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6939931Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6939934Z 2025-08-14T21:43:36.6940026Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6940216Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6940275Z return mod(**inputs) 2025-08-14T21:43:36.6940538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6940609Z outputs = self.mobilebert( 2025-08-14T21:43:36.6940870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6940935Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6941199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6941260Z layer_outputs = layer_module( 2025-08-14T21:43:36.6941523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6941631Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6941891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6941971Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6941975Z 2025-08-14T21:43:36.6942079Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6942277Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6942353Z return mod(**inputs) 2025-08-14T21:43:36.6942610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6942678Z outputs = self.mobilebert( 2025-08-14T21:43:36.6942929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6942997Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6943261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6943341Z layer_outputs = layer_module( 2025-08-14T21:43:36.6943599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.6943708Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.6943960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6944064Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6944068Z 2025-08-14T21:43:36.6944159Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6944344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6944402Z return mod(**inputs) 2025-08-14T21:43:36.6944656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6944786Z outputs = self.mobilebert( 2025-08-14T21:43:36.6945047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6945111Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6945375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6945438Z layer_outputs = layer_module( 2025-08-14T21:43:36.6945702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6945847Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6946101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.6946196Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.6946200Z 2025-08-14T21:43:36.6946291Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6946480Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6946539Z return mod(**inputs) 2025-08-14T21:43:36.6946797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6946861Z outputs = self.mobilebert( 2025-08-14T21:43:36.6947115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6947179Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6947437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6947502Z layer_outputs = layer_module( 2025-08-14T21:43:36.6947763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6947926Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6948196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.6948329Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.6948580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6948664Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6948667Z 2025-08-14T21:43:36.6948756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6948954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6949020Z return mod(**inputs) 2025-08-14T21:43:36.6949275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6949345Z outputs = self.mobilebert( 2025-08-14T21:43:36.6949595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6949659Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6949919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6949980Z layer_outputs = layer_module( 2025-08-14T21:43:36.6950231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6950379Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6950631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6950748Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6951002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.6951076Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6951080Z 2025-08-14T21:43:36.6951174Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6951353Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6951414Z return mod(**inputs) 2025-08-14T21:43:36.6951669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6951732Z outputs = self.mobilebert( 2025-08-14T21:43:36.6951991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6952054Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6952308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6952374Z layer_outputs = layer_module( 2025-08-14T21:43:36.6952627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.6952770Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.6953023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.6953130Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.6953387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.6953494Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6953782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6953880Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6953884Z 2025-08-14T21:43:36.6953974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6954160Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6954216Z return mod(**inputs) 2025-08-14T21:43:36.6954478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6954560Z outputs = self.mobilebert( 2025-08-14T21:43:36.6954815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6954882Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6955137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6955200Z layer_outputs = layer_module( 2025-08-14T21:43:36.6955459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6955605Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6955865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6955965Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6956219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6956296Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6956299Z 2025-08-14T21:43:36.6956389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6956574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6956635Z return mod(**inputs) 2025-08-14T21:43:36.6956891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6956960Z outputs = self.mobilebert( 2025-08-14T21:43:36.6957210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6957273Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6957535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6957598Z layer_outputs = layer_module( 2025-08-14T21:43:36.6957857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6958003Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6958254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.6958358Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.6958610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.6958692Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.6958948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6959028Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6959032Z 2025-08-14T21:43:36.6959152Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6959345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6959417Z return mod(**inputs) 2025-08-14T21:43:36.6959674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6959737Z outputs = self.mobilebert( 2025-08-14T21:43:36.6959994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6960054Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6960307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6960394Z layer_outputs = layer_module( 2025-08-14T21:43:36.6960647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6960730Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6960982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6961045Z self_outputs = self.self( 2025-08-14T21:43:36.6961301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.6961364Z self.query(query_tensor) 2025-08-14T21:43:36.6961367Z 2025-08-14T21:43:36.6961458Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6961642Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6961701Z return mod(**inputs) 2025-08-14T21:43:36.6961964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6962027Z outputs = self.mobilebert( 2025-08-14T21:43:36.6962280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6962351Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6962603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6962668Z layer_outputs = layer_module( 2025-08-14T21:43:36.6962919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6962994Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6963252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6963311Z self_outputs = self.self( 2025-08-14T21:43:36.6963561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.6963627Z self.key(key_tensor) 2025-08-14T21:43:36.6963631Z 2025-08-14T21:43:36.6963720Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6963902Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6963958Z return mod(**inputs) 2025-08-14T21:43:36.6964209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6964270Z outputs = self.mobilebert( 2025-08-14T21:43:36.6964521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6964590Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6964857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6964937Z layer_outputs = layer_module( 2025-08-14T21:43:36.6965209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6965284Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6965532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.6965599Z self_outputs = self.self( 2025-08-14T21:43:36.6965847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.6965933Z self.value(value_tensor) 2025-08-14T21:43:36.6965936Z 2025-08-14T21:43:36.6966011Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6966082Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.6966183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6966365Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6966422Z return mod(**inputs) 2025-08-14T21:43:36.6966683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6966742Z outputs = self.mobilebert( 2025-08-14T21:43:36.6967002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6967064Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6967316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6967387Z layer_outputs = layer_module( 2025-08-14T21:43:36.6967642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6967721Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6967977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6968089Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6968347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.6968419Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6968423Z 2025-08-14T21:43:36.6968511Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6968695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6968755Z return mod(**inputs) 2025-08-14T21:43:36.6969017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6969077Z outputs = self.mobilebert( 2025-08-14T21:43:36.6969335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6969407Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6969665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6969734Z layer_outputs = layer_module( 2025-08-14T21:43:36.6969989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.6970136Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.6970400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.6970512Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.6970782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.6970876Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.6970880Z 2025-08-14T21:43:36.6970969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6971155Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6971212Z return mod(**inputs) 2025-08-14T21:43:36.6971466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6971550Z outputs = self.mobilebert( 2025-08-14T21:43:36.6971802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6971873Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6972129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6972191Z layer_outputs = layer_module( 2025-08-14T21:43:36.6972448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.6972522Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.6972772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.6972888Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.6973142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.6973260Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6973514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6973596Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6973600Z 2025-08-14T21:43:36.6973694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6973870Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6973931Z return mod(**inputs) 2025-08-14T21:43:36.6974183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6974243Z outputs = self.mobilebert( 2025-08-14T21:43:36.6974495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6974556Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6974814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6974877Z layer_outputs = layer_module( 2025-08-14T21:43:36.6975130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6975219Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6975470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6975570Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6975829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6975904Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6975907Z 2025-08-14T21:43:36.6976002Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6976205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6976285Z return mod(**inputs) 2025-08-14T21:43:36.6976547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6976608Z outputs = self.mobilebert( 2025-08-14T21:43:36.6976869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6976933Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6977181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6977279Z layer_outputs = layer_module( 2025-08-14T21:43:36.6977539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6977624Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6977892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6977990Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6978253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6978351Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6978354Z 2025-08-14T21:43:36.6978445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6978630Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6978685Z return mod(**inputs) 2025-08-14T21:43:36.6978949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6979008Z outputs = self.mobilebert( 2025-08-14T21:43:36.6979267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6979331Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6979584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6979645Z layer_outputs = layer_module( 2025-08-14T21:43:36.6979905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6979991Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6980253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6980366Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6980628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6980709Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6980712Z 2025-08-14T21:43:36.6980801Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6980987Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6981045Z return mod(**inputs) 2025-08-14T21:43:36.6981305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6981376Z outputs = self.mobilebert( 2025-08-14T21:43:36.6981632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6981694Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6981988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6982065Z layer_outputs = layer_module( 2025-08-14T21:43:36.6982329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6982412Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6982666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6982782Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6983060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6983174Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6983430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6983508Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6983512Z 2025-08-14T21:43:36.6983603Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6983782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6983837Z return mod(**inputs) 2025-08-14T21:43:36.6984095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6984157Z outputs = self.mobilebert( 2025-08-14T21:43:36.6984416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6984481Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6984898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6984973Z layer_outputs = layer_module( 2025-08-14T21:43:36.6985229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6985319Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6985577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6985680Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6985956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6986028Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6986031Z 2025-08-14T21:43:36.6986127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6986308Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6986368Z return mod(**inputs) 2025-08-14T21:43:36.6986631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6986696Z outputs = self.mobilebert( 2025-08-14T21:43:36.6986949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6987017Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6987271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6987345Z layer_outputs = layer_module( 2025-08-14T21:43:36.6987597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6987711Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6988010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6988111Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6988373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6988473Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6988477Z 2025-08-14T21:43:36.6988568Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6988779Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6988839Z return mod(**inputs) 2025-08-14T21:43:36.6989098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6989169Z outputs = self.mobilebert( 2025-08-14T21:43:36.6989423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6989493Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6989746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6989808Z layer_outputs = layer_module( 2025-08-14T21:43:36.6990068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6990151Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6990411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6990523Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6990777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.6990861Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.6990865Z 2025-08-14T21:43:36.6990956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6991131Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6991189Z return mod(**inputs) 2025-08-14T21:43:36.6991444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6991507Z outputs = self.mobilebert( 2025-08-14T21:43:36.6991759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6991820Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6992075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6992138Z layer_outputs = layer_module( 2025-08-14T21:43:36.6992398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6992481Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6992732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.6992847Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.6993104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.6993211Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.6993498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.6993595Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.6993598Z 2025-08-14T21:43:36.6993695Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6993878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6993934Z return mod(**inputs) 2025-08-14T21:43:36.6994198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6994276Z outputs = self.mobilebert( 2025-08-14T21:43:36.6994535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6994598Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6994854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6994921Z layer_outputs = layer_module( 2025-08-14T21:43:36.6995173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6995253Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6995513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6995609Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6995867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.6995940Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.6995944Z 2025-08-14T21:43:36.6996035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6996222Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6996281Z return mod(**inputs) 2025-08-14T21:43:36.6996542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6996601Z outputs = self.mobilebert( 2025-08-14T21:43:36.6996854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6996924Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6997179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6997242Z layer_outputs = layer_module( 2025-08-14T21:43:36.6997502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.6997587Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.6997851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.6997951Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.6998202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.6998308Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.6998312Z 2025-08-14T21:43:36.6998404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.6998589Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.6998646Z return mod(**inputs) 2025-08-14T21:43:36.6998918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.6998985Z outputs = self.mobilebert( 2025-08-14T21:43:36.6999277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.6999343Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.6999603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.6999667Z layer_outputs = layer_module( 2025-08-14T21:43:36.6999927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7000030Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7000280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.7000399Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.7000651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.7000734Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.7000737Z 2025-08-14T21:43:36.7000829Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7001006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7001074Z return mod(**inputs) 2025-08-14T21:43:36.7001328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7001397Z outputs = self.mobilebert( 2025-08-14T21:43:36.7001646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7001717Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7001980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7002044Z layer_outputs = layer_module( 2025-08-14T21:43:36.7002290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7002376Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7002623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.7002738Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.7002991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.7003096Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.7003354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7003436Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7003439Z 2025-08-14T21:43:36.7003536Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7003713Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7003771Z return mod(**inputs) 2025-08-14T21:43:36.7004030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7004094Z outputs = self.mobilebert( 2025-08-14T21:43:36.7004343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7004412Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7004690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7004776Z layer_outputs = layer_module( 2025-08-14T21:43:36.7005030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.7005137Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.7005393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.7005466Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.7005470Z 2025-08-14T21:43:36.7005583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7005761Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7005816Z return mod(**inputs) 2025-08-14T21:43:36.7006078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7006143Z outputs = self.mobilebert( 2025-08-14T21:43:36.7006394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7006459Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7006709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7006772Z layer_outputs = layer_module( 2025-08-14T21:43:36.7007023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.7007131Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.7007388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.7007489Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.7007494Z 2025-08-14T21:43:36.7007592Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7007769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7007825Z return mod(**inputs) 2025-08-14T21:43:36.7008086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7008147Z outputs = self.mobilebert( 2025-08-14T21:43:36.7008400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7008471Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7008725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7008794Z layer_outputs = layer_module( 2025-08-14T21:43:36.7009046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.7009189Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.7009448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.7009531Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.7009534Z 2025-08-14T21:43:36.7009626Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7009803Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7009858Z return mod(**inputs) 2025-08-14T21:43:36.7010120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7010198Z outputs = self.mobilebert( 2025-08-14T21:43:36.7010486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7010559Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7010812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7010880Z layer_outputs = layer_module( 2025-08-14T21:43:36.7011132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.7011294Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.7011557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.7011669Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.7011936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7012019Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7012023Z 2025-08-14T21:43:36.7012116Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7012307Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7012365Z return mod(**inputs) 2025-08-14T21:43:36.7012631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7012696Z outputs = self.mobilebert( 2025-08-14T21:43:36.7012950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7013022Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7013279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7013344Z layer_outputs = layer_module( 2025-08-14T21:43:36.7013606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.7013748Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.7014003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.7014112Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.7014365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.7014444Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.7014449Z 2025-08-14T21:43:36.7014541Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7014728Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7014787Z return mod(**inputs) 2025-08-14T21:43:36.7015041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7015107Z outputs = self.mobilebert( 2025-08-14T21:43:36.7015359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7015423Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7015684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7015747Z layer_outputs = layer_module( 2025-08-14T21:43:36.7016066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.7016221Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.7016474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.7016586Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.7016838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.7016950Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.7017221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7017302Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7017306Z 2025-08-14T21:43:36.7017401Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7017581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7017642Z return mod(**inputs) 2025-08-14T21:43:36.7017892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7017952Z outputs = self.mobilebert( 2025-08-14T21:43:36.7018204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7018267Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7018520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7018591Z layer_outputs = layer_module( 2025-08-14T21:43:36.7018842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.7018996Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.7019249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.7019348Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.7019607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.7019678Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.7019683Z 2025-08-14T21:43:36.7019778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7019954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7020013Z return mod(**inputs) 2025-08-14T21:43:36.7020273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7020336Z outputs = self.mobilebert( 2025-08-14T21:43:36.7020586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7020655Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7020906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7020973Z layer_outputs = layer_module( 2025-08-14T21:43:36.7021223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.7021369Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.7021640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 444, in forward 2025-08-14T21:43:36.7021776Z shared_attention_input = self.attention(hidden_states) 2025-08-14T21:43:36.7022037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 410, in forward 2025-08-14T21:43:36.7022113Z layer_input = self.LayerNorm(layer_input) 2025-08-14T21:43:36.7022363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7022447Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7022450Z 2025-08-14T21:43:36.7022557Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7022736Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7022794Z return mod(**inputs) 2025-08-14T21:43:36.7023054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7023119Z outputs = self.mobilebert( 2025-08-14T21:43:36.7023372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7023437Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7023696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7023756Z layer_outputs = layer_module( 2025-08-14T21:43:36.7024014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.7024093Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.7024347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.7024417Z self_outputs = self.self( 2025-08-14T21:43:36.7024730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 245, in forward 2025-08-14T21:43:36.7024805Z self.query(query_tensor) 2025-08-14T21:43:36.7024815Z 2025-08-14T21:43:36.7024910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7025089Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7025155Z return mod(**inputs) 2025-08-14T21:43:36.7025413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7025479Z outputs = self.mobilebert( 2025-08-14T21:43:36.7025741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7025806Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7026070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7026136Z layer_outputs = layer_module( 2025-08-14T21:43:36.7026390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.7026475Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.7026730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.7026794Z self_outputs = self.self( 2025-08-14T21:43:36.7027057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 250, in forward 2025-08-14T21:43:36.7027115Z self.key(key_tensor) 2025-08-14T21:43:36.7027118Z 2025-08-14T21:43:36.7027213Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7027424Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7027501Z return mod(**inputs) 2025-08-14T21:43:36.7027763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7027822Z outputs = self.mobilebert( 2025-08-14T21:43:36.7028082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7028146Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7028398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7028482Z layer_outputs = layer_module( 2025-08-14T21:43:36.7028734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.7028811Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.7029072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 334, in forward 2025-08-14T21:43:36.7029134Z self_outputs = self.self( 2025-08-14T21:43:36.7029393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 255, in forward 2025-08-14T21:43:36.7029456Z self.value(value_tensor) 2025-08-14T21:43:36.7029459Z 2025-08-14T21:43:36.7029533Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.7029612Z cudagraph partition due to non gpu ops 2025-08-14T21:43:36.7029703Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7029880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7029946Z return mod(**inputs) 2025-08-14T21:43:36.7030203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7030278Z outputs = self.mobilebert( 2025-08-14T21:43:36.7030531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7030596Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7030855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7030918Z layer_outputs = layer_module( 2025-08-14T21:43:36.7031175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.7031253Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.7031503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.7031622Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.7031878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 292, in forward 2025-08-14T21:43:36.7031953Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.7031963Z 2025-08-14T21:43:36.7032054Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7032231Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7032295Z return mod(**inputs) 2025-08-14T21:43:36.7032551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7032615Z outputs = self.mobilebert( 2025-08-14T21:43:36.7032874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7032952Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7033230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7033307Z layer_outputs = layer_module( 2025-08-14T21:43:36.7033563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 496, in forward 2025-08-14T21:43:36.7033714Z query_tensor, key_tensor, value_tensor, layer_input = self.bottleneck(hidden_states) 2025-08-14T21:43:36.7033969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 440, in forward 2025-08-14T21:43:36.7034084Z bottlenecked_hidden_states = self.input(hidden_states) 2025-08-14T21:43:36.7034344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 409, in forward 2025-08-14T21:43:36.7034416Z layer_input = self.dense(hidden_states) 2025-08-14T21:43:36.7034419Z 2025-08-14T21:43:36.7034518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7034699Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7034755Z return mod(**inputs) 2025-08-14T21:43:36.7035018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7035078Z outputs = self.mobilebert( 2025-08-14T21:43:36.7035336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7035402Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7035653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7035723Z layer_outputs = layer_module( 2025-08-14T21:43:36.7035975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 500, in forward 2025-08-14T21:43:36.7036051Z self_attention_outputs = self.attention( 2025-08-14T21:43:36.7036308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 344, in forward 2025-08-14T21:43:36.7036415Z attention_output = self.output(self_outputs[0], layer_input) 2025-08-14T21:43:36.7036672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 295, in forward 2025-08-14T21:43:36.7036781Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.7037031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7037115Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7037118Z 2025-08-14T21:43:36.7037207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7037390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7037447Z return mod(**inputs) 2025-08-14T21:43:36.7037699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7037766Z outputs = self.mobilebert( 2025-08-14T21:43:36.7038016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7038078Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7038337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7038399Z layer_outputs = layer_module( 2025-08-14T21:43:36.7038671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7038790Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7039045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.7039150Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.7039401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.7039480Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.7039483Z 2025-08-14T21:43:36.7039600Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7039778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7039842Z return mod(**inputs) 2025-08-14T21:43:36.7040098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7040171Z outputs = self.mobilebert( 2025-08-14T21:43:36.7040423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7040487Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7040745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7040807Z layer_outputs = layer_module( 2025-08-14T21:43:36.7041056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7041142Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7041391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.7041490Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.7041743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.7041838Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.7041841Z 2025-08-14T21:43:36.7041935Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7042113Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7042178Z return mod(**inputs) 2025-08-14T21:43:36.7042432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7042496Z outputs = self.mobilebert( 2025-08-14T21:43:36.7042755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7042819Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7043070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7043141Z layer_outputs = layer_module( 2025-08-14T21:43:36.7043392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7043484Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7043736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.7043850Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.7044108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.7044183Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.7044209Z 2025-08-14T21:43:36.7044319Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7044514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7044571Z return mod(**inputs) 2025-08-14T21:43:36.7044829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7044888Z outputs = self.mobilebert( 2025-08-14T21:43:36.7045139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7045220Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7045470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7045534Z layer_outputs = layer_module( 2025-08-14T21:43:36.7045785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7045868Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7046122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.7046232Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.7046489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.7046595Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.7046847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7046933Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7046936Z 2025-08-14T21:43:36.7047027Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7047206Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7047271Z return mod(**inputs) 2025-08-14T21:43:36.7047525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7047593Z outputs = self.mobilebert( 2025-08-14T21:43:36.7047844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7047907Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7048164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7048225Z layer_outputs = layer_module( 2025-08-14T21:43:36.7048482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7048569Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7048820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.7048923Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.7049172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.7049247Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.7049256Z 2025-08-14T21:43:36.7049348Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7049526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7049591Z return mod(**inputs) 2025-08-14T21:43:36.7049859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7049954Z outputs = self.mobilebert( 2025-08-14T21:43:36.7050216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7050279Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7050538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7050599Z layer_outputs = layer_module( 2025-08-14T21:43:36.7050851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7050957Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7051214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.7051312Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.7051572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.7051669Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.7051672Z 2025-08-14T21:43:36.7051767Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7051943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7052001Z return mod(**inputs) 2025-08-14T21:43:36.7052262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7052323Z outputs = self.mobilebert( 2025-08-14T21:43:36.7052583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7052648Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7052901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7052969Z layer_outputs = layer_module( 2025-08-14T21:43:36.7053224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7053307Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7053566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.7053677Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.7053937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.7054011Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.7054016Z 2025-08-14T21:43:36.7054106Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7054293Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7054350Z return mod(**inputs) 2025-08-14T21:43:36.7054618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7054678Z outputs = self.mobilebert( 2025-08-14T21:43:36.7054931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7055002Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7055258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7055323Z layer_outputs = layer_module( 2025-08-14T21:43:36.7055601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7055702Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7055959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.7056065Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.7056315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.7056428Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.7056700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7056785Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7056789Z 2025-08-14T21:43:36.7056880Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7057062Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7057127Z return mod(**inputs) 2025-08-14T21:43:36.7057381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7057449Z outputs = self.mobilebert( 2025-08-14T21:43:36.7057700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7057765Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7058028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7058090Z layer_outputs = layer_module( 2025-08-14T21:43:36.7058343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7058436Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7058690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.7058796Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.7059050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.7059124Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.7059128Z 2025-08-14T21:43:36.7059227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7059405Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7059468Z return mod(**inputs) 2025-08-14T21:43:36.7059725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7059791Z outputs = self.mobilebert( 2025-08-14T21:43:36.7060051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7060122Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7060374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7060442Z layer_outputs = layer_module( 2025-08-14T21:43:36.7060693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7060784Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7061036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 469, in forward 2025-08-14T21:43:36.7061149Z intermediate_output = self.intermediate(hidden_states) 2025-08-14T21:43:36.7061426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.7061541Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.7061545Z 2025-08-14T21:43:36.7061641Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7061819Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7061876Z return mod(**inputs) 2025-08-14T21:43:36.7062135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7062223Z outputs = self.mobilebert( 2025-08-14T21:43:36.7062480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7062550Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7062806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7062874Z layer_outputs = layer_module( 2025-08-14T21:43:36.7063132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7063215Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7063477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.7063588Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.7063849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 457, in forward 2025-08-14T21:43:36.7063923Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.7063928Z 2025-08-14T21:43:36.7064020Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7064211Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7064269Z return mod(**inputs) 2025-08-14T21:43:36.7064528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7064596Z outputs = self.mobilebert( 2025-08-14T21:43:36.7064915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7064995Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7065248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7065312Z layer_outputs = layer_module( 2025-08-14T21:43:36.7065574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 515, in forward 2025-08-14T21:43:36.7065658Z attention_output = ffn_module(attention_output) 2025-08-14T21:43:36.7065920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 470, in forward 2025-08-14T21:43:36.7066030Z layer_outputs = self.output(intermediate_output, hidden_states) 2025-08-14T21:43:36.7066279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 458, in forward 2025-08-14T21:43:36.7066394Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.7066649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7066737Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7066741Z 2025-08-14T21:43:36.7066856Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7067052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7067156Z return mod(**inputs) 2025-08-14T21:43:36.7067417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7067478Z outputs = self.mobilebert( 2025-08-14T21:43:36.7067741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7067804Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7068076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7068137Z layer_outputs = layer_module( 2025-08-14T21:43:36.7068388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.7068500Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.7068756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 359, in forward 2025-08-14T21:43:36.7068836Z hidden_states = self.dense(hidden_states) 2025-08-14T21:43:36.7068840Z 2025-08-14T21:43:36.7068930Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7069107Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7069171Z return mod(**inputs) 2025-08-14T21:43:36.7069426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7069487Z outputs = self.mobilebert( 2025-08-14T21:43:36.7069749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7069815Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7070073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7070136Z layer_outputs = layer_module( 2025-08-14T21:43:36.7070386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 518, in forward 2025-08-14T21:43:36.7070498Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:43:36.7070748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 360, in forward 2025-08-14T21:43:36.7070854Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:43:36.7070857Z 2025-08-14T21:43:36.7070947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7071124Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7071188Z return mod(**inputs) 2025-08-14T21:43:36.7071446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7071506Z outputs = self.mobilebert( 2025-08-14T21:43:36.7071766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7071828Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7072085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7072147Z layer_outputs = layer_module( 2025-08-14T21:43:36.7072400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.7072565Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.7072836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 392, in forward 2025-08-14T21:43:36.7072947Z layer_output = self.dense(intermediate_states) 2025-08-14T21:43:36.7072950Z 2025-08-14T21:43:36.7073041Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7073218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7073283Z return mod(**inputs) 2025-08-14T21:43:36.7073535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7073616Z outputs = self.mobilebert( 2025-08-14T21:43:36.7073877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7073943Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7074201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7074265Z layer_outputs = layer_module( 2025-08-14T21:43:36.7074516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.7074662Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.7074914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 397, in forward 2025-08-14T21:43:36.7075033Z layer_output = self.LayerNorm(layer_output + residual_tensor_1) 2025-08-14T21:43:36.7075284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7075366Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7075369Z 2025-08-14T21:43:36.7075465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7075645Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7075708Z return mod(**inputs) 2025-08-14T21:43:36.7075963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7076025Z outputs = self.mobilebert( 2025-08-14T21:43:36.7076283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7076347Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7076600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7076668Z layer_outputs = layer_module( 2025-08-14T21:43:36.7076918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.7077065Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.7077318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.7077426Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.7077683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 372, in forward 2025-08-14T21:43:36.7077759Z layer_outputs = self.dense(hidden_states) 2025-08-14T21:43:36.7077762Z 2025-08-14T21:43:36.7077859Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7078036Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7078108Z return mod(**inputs) 2025-08-14T21:43:36.7078383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1242, in forward 2025-08-14T21:43:36.7078465Z outputs = self.mobilebert( 2025-08-14T21:43:36.7078723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 794, in forward 2025-08-14T21:43:36.7078793Z encoder_outputs = self.encoder( 2025-08-14T21:43:36.7079049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 557, in forward 2025-08-14T21:43:36.7079116Z layer_outputs = layer_module( 2025-08-14T21:43:36.7079383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 519, in forward 2025-08-14T21:43:36.7079522Z layer_output = self.output(intermediate_output, attention_output, hidden_states) 2025-08-14T21:43:36.7079782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 398, in forward 2025-08-14T21:43:36.7079894Z layer_output = self.bottleneck(layer_output, residual_tensor_2) 2025-08-14T21:43:36.7080148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 374, in forward 2025-08-14T21:43:36.7080256Z layer_outputs = self.LayerNorm(layer_outputs + residual_tensor) 2025-08-14T21:43:36.7080507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 138, in forward 2025-08-14T21:43:36.7080593Z return input_tensor * self.weight + self.bias 2025-08-14T21:43:36.7080598Z 2025-08-14T21:43:36.7080689Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7080871Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7080928Z return mod(**inputs) 2025-08-14T21:43:36.7081183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1256, in forward 2025-08-14T21:43:36.7081264Z logits = self.qa_outputs(sequence_output) 2025-08-14T21:43:36.7081267Z 2025-08-14T21:43:36.7081357Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7081534Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7081599Z return mod(**inputs) 2025-08-14T21:43:36.7081850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1274, in forward 2025-08-14T21:43:36.7081951Z start_loss = loss_fct(start_logits, start_positions) 2025-08-14T21:43:36.7081955Z 2025-08-14T21:43:36.7082044Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:43:36.7082221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:43:36.7082283Z return mod(**inputs) 2025-08-14T21:43:36.7082540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/mobilebert/modeling_mobilebert.py", line 1275, in forward 2025-08-14T21:43:36.7082628Z end_loss = loss_fct(end_logits, end_positions) 2025-08-14T21:43:36.7082631Z 2025-08-14T21:43:47.4406878Z Compilation time (from dynamo_timed): 33.834980259 2025-08-14T21:43:47.4409728Z pass 2025-08-14T21:43:47.4411830Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:43:47.4412689Z TIMING: _recursive_pre_grad_passes:0.01834 _recursive_joint_graph_passes:1.20979 _recursive_post_grad_passes:0.20108 async_compile.wait:0.31431 code_gen:7.98345 inductor_compile:12.04012 backend_compile:23.39385 gc:0.00055 entire_frame_compile:33.83498 total_wall_time:33.83498 2025-08-14T21:43:47.4417305Z STATS: call_* op count: 1453 | FakeTensorMode.__torch_dispatch__:56761 | FakeTensor.__torch_dispatch__:16441 | ProxyTorchDispatchMode.__torch_dispatch__:21655 2025-08-14T21:43:47.4421290Z Dynamo produced 1 graphs covering 1453 ops with 0 graph breaks (0 unique) 2025-08-14T21:43:52.0971299Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:43:52.0972282Z from pkg_resources import resource_filename 2025-08-14T21:43:52.6242642Z 2025-08-14T21:43:54.4262822Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:43:54.4267249Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:43:54.4274664Z cpu eval OPTForCausalLM 2025-08-14T21:43:55.7451167Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:43:56.3448662Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:43:56.9922021Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:03.6605726Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6609915Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6611495Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6611733Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6611932Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6612242Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6616249Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6618025Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6618396Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6622890Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6624569Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6625015Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6627731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6628305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6628703Z return mod(**inputs) 2025-08-14T21:44:03.6633608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6635276Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6635827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6641097Z outputs = self.model.decoder( 2025-08-14T21:44:03.6643139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6643643Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6647962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6649667Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6650190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6655048Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6657519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6658095Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6659226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.6661231Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.6661431Z 2025-08-14T21:44:03.6661929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6662364Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6662688Z return mod(**inputs) 2025-08-14T21:44:03.6663337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6663754Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6664114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6664481Z outputs = self.model.decoder( 2025-08-14T21:44:03.6664921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6665262Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6665606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6666038Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6666381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6666728Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6667108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6667491Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6667867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.6668218Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.6668359Z 2025-08-14T21:44:03.6668459Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6668798Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6669101Z return mod(**inputs) 2025-08-14T21:44:03.6669656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6669986Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6670334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6670687Z outputs = self.model.decoder( 2025-08-14T21:44:03.6671012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6671344Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6671692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6672037Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6672364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6672707Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6673070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6673449Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6673827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.6674189Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.6674318Z 2025-08-14T21:44:03.6674393Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6674589Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6674780Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6674967Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6675176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6675515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6675819Z return mod(**inputs) 2025-08-14T21:44:03.6676119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6676480Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6676848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6677218Z outputs = self.model.decoder( 2025-08-14T21:44:03.6677526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6677850Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6678199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6678536Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6678889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6679222Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6679571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6679935Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6680304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6680686Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6681092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.6681539Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.6681713Z 2025-08-14T21:44:03.6681811Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6682146Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6682442Z return mod(**inputs) 2025-08-14T21:44:03.6682748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6683077Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6683424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6683765Z outputs = self.model.decoder( 2025-08-14T21:44:03.6684081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6684407Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6684945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6685302Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6685627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6685963Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6686305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6686682Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6687055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6687418Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6687832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.6688263Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.6688414Z 2025-08-14T21:44:03.6688520Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6688847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6689147Z return mod(**inputs) 2025-08-14T21:44:03.6689489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6689838Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6690205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6690550Z outputs = self.model.decoder( 2025-08-14T21:44:03.6690868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6691185Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6691531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6691903Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6692222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6692548Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6692897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6693269Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6693627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.6693979Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.6694110Z 2025-08-14T21:44:03.6694206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6694536Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6694828Z return mod(**inputs) 2025-08-14T21:44:03.6695130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6695453Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6695793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6696131Z outputs = self.model.decoder( 2025-08-14T21:44:03.6696447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6696764Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6697096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6697437Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6697749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6698078Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6698412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.6698759Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.6698883Z 2025-08-14T21:44:03.6698987Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6699307Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6699608Z return mod(**inputs) 2025-08-14T21:44:03.6699907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6700227Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6700563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6700906Z outputs = self.model.decoder( 2025-08-14T21:44:03.6701221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6701532Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6701874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6702236Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6702616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6702945Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6703290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.6703659Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.6703797Z 2025-08-14T21:44:03.6703897Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6704218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6704538Z return mod(**inputs) 2025-08-14T21:44:03.6704897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6705220Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6705568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6705916Z outputs = self.model.decoder( 2025-08-14T21:44:03.6706231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6706547Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6706887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6707232Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6707548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6707888Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6708238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.6708595Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.6708721Z 2025-08-14T21:44:03.6708822Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6709154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6709454Z return mod(**inputs) 2025-08-14T21:44:03.6709751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6710077Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6710414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6710760Z outputs = self.model.decoder( 2025-08-14T21:44:03.6711065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6711385Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6711730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6712076Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6712390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6712723Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6713068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6713428Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6713793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.6714175Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.6714326Z 2025-08-14T21:44:03.6714428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6714769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6715104Z return mod(**inputs) 2025-08-14T21:44:03.6715409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6715723Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6716068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6716415Z outputs = self.model.decoder( 2025-08-14T21:44:03.6716729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6717600Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6717947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6718294Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6718618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6718947Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6719299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6719685Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6720043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.6720393Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.6720526Z 2025-08-14T21:44:03.6720623Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6720956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6721248Z return mod(**inputs) 2025-08-14T21:44:03.6721548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6721873Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6722211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6722556Z outputs = self.model.decoder( 2025-08-14T21:44:03.6722872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6723195Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6723527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6723870Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6724187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6724511Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6724859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6725228Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6725587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.6725930Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.6726068Z 2025-08-14T21:44:03.6726141Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6726338Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6726530Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6726712Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6726929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6727263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6727556Z return mod(**inputs) 2025-08-14T21:44:03.6727900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6728248Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6728584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6728928Z outputs = self.model.decoder( 2025-08-14T21:44:03.6729239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6729561Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6729895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6730260Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6730581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6730918Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6731266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6731644Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6732016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6732378Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6732793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.6733240Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.6733411Z 2025-08-14T21:44:03.6733515Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6733847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6734149Z return mod(**inputs) 2025-08-14T21:44:03.6734454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6734778Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6735124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6735472Z outputs = self.model.decoder( 2025-08-14T21:44:03.6735794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6736111Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6736456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6736805Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6737128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6737457Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6737807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6738182Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6738542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6738912Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6739323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.6739753Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.6739901Z 2025-08-14T21:44:03.6739996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6740330Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6740651Z return mod(**inputs) 2025-08-14T21:44:03.6740971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6741314Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6741658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6742000Z outputs = self.model.decoder( 2025-08-14T21:44:03.6742309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6742629Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6742995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6743337Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6743653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6743993Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6744350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6744801Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6745190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.6745560Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.6745690Z 2025-08-14T21:44:03.6745797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6746136Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6746447Z return mod(**inputs) 2025-08-14T21:44:03.6746759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6747096Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6747461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6747831Z outputs = self.model.decoder( 2025-08-14T21:44:03.6748151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6748471Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6748819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6749165Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6749485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6749815Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6750164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.6750523Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.6750653Z 2025-08-14T21:44:03.6750749Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6751082Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6751379Z return mod(**inputs) 2025-08-14T21:44:03.6751683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6752003Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6752348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6752699Z outputs = self.model.decoder( 2025-08-14T21:44:03.6753007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6753333Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6753733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6754096Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6754405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6754734Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6755079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.6755438Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.6755583Z 2025-08-14T21:44:03.6755697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6756029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6756328Z return mod(**inputs) 2025-08-14T21:44:03.6756626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6756956Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6757303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6757652Z outputs = self.model.decoder( 2025-08-14T21:44:03.6757960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6758286Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6758626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6758964Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6759281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6759611Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6759961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.6760311Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.6760444Z 2025-08-14T21:44:03.6760538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6760867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6761156Z return mod(**inputs) 2025-08-14T21:44:03.6761461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6761787Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6762134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6762479Z outputs = self.model.decoder( 2025-08-14T21:44:03.6762798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6763127Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6763464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6763810Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6764132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6764465Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6764806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6765184Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6765555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.6765940Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.6766114Z 2025-08-14T21:44:03.6766210Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6766579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6766881Z return mod(**inputs) 2025-08-14T21:44:03.6767172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6767498Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6767842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6768186Z outputs = self.model.decoder( 2025-08-14T21:44:03.6768519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6768847Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6769190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6769531Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6769852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6770186Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6770537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6770901Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6771269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.6771625Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.6771751Z 2025-08-14T21:44:03.6771854Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6772175Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6772476Z return mod(**inputs) 2025-08-14T21:44:03.6772779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6773099Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6773445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6773789Z outputs = self.model.decoder( 2025-08-14T21:44:03.6774102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6774418Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6774762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6775110Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6775423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6775754Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6776105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6776476Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6776836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.6777190Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.6777318Z 2025-08-14T21:44:03.6777399Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6777587Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6777785Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6777974Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6778188Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6778539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6778842Z return mod(**inputs) 2025-08-14T21:44:03.6779181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6779505Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6779850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6780203Z outputs = self.model.decoder( 2025-08-14T21:44:03.6780525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6780868Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6781210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6781554Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6781864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6782195Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6782539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6782905Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6783259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6783625Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6784037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.6784480Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.6784854Z 2025-08-14T21:44:03.6784957Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6785295Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6785597Z return mod(**inputs) 2025-08-14T21:44:03.6785896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6786225Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6786570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6786918Z outputs = self.model.decoder( 2025-08-14T21:44:03.6787228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6787554Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6787900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6788239Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6788562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6788897Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6789240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6789599Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6789962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6807072Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6807533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.6807986Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.6808141Z 2025-08-14T21:44:03.6808243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6808782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6809137Z return mod(**inputs) 2025-08-14T21:44:03.6809451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6809791Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6810151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6810512Z outputs = self.model.decoder( 2025-08-14T21:44:03.6810833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6811204Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6811559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6811903Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6812237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6812675Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6813038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6813413Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6813832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.6814196Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.6814329Z 2025-08-14T21:44:03.6814439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6814779Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6815088Z return mod(**inputs) 2025-08-14T21:44:03.6815399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6815726Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6816074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6816425Z outputs = self.model.decoder( 2025-08-14T21:44:03.6816743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6817061Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6817407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6817754Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6818071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6818410Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6818760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.6819115Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.6819243Z 2025-08-14T21:44:03.6819341Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6819679Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6819982Z return mod(**inputs) 2025-08-14T21:44:03.6820279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6820606Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6820951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6821301Z outputs = self.model.decoder( 2025-08-14T21:44:03.6821631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6821983Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6822344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6822689Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6823004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6823339Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6823684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.6824062Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.6824212Z 2025-08-14T21:44:03.6824307Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6824640Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6825007Z return mod(**inputs) 2025-08-14T21:44:03.6825307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6825644Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6825990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6826331Z outputs = self.model.decoder( 2025-08-14T21:44:03.6826654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6826981Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6827331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6827672Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6828001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6828344Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6828687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.6829047Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.6829182Z 2025-08-14T21:44:03.6829279Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6829615Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6829910Z return mod(**inputs) 2025-08-14T21:44:03.6830216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6830544Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6830887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6831227Z outputs = self.model.decoder( 2025-08-14T21:44:03.6831546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6831874Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6832212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6832557Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6832879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6833212Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6833555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 291, in forward 2025-08-14T21:44:03.6833954Z hidden_states = (residual + hidden_states).view(hidden_states_shape) 2025-08-14T21:44:03.6834129Z 2025-08-14T21:44:03.6834232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6834597Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6834935Z return mod(**inputs) 2025-08-14T21:44:03.6835239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6835565Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6835901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6836252Z outputs = self.model.decoder( 2025-08-14T21:44:03.6836572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6836915Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6837251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6837598Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6837920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6838248Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6838599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6838972Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6839339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.6839713Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.6839877Z 2025-08-14T21:44:03.6839973Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6840303Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6840595Z return mod(**inputs) 2025-08-14T21:44:03.6840900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6841228Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6841570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6841910Z outputs = self.model.decoder( 2025-08-14T21:44:03.6842224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6842549Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6842883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6843235Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6843555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6843883Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6844232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6844604Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6844971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.6845315Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.6845450Z 2025-08-14T21:44:03.6845547Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6845874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6846167Z return mod(**inputs) 2025-08-14T21:44:03.6846467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6846792Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6847168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6847524Z outputs = self.model.decoder( 2025-08-14T21:44:03.6847838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6848158Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6848493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6848837Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6849154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6849506Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6849846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6850216Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6850582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.6850937Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.6851067Z 2025-08-14T21:44:03.6851143Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6851342Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6851534Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6851716Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6851929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6852259Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6852550Z return mod(**inputs) 2025-08-14T21:44:03.6852851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6853174Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6853517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6853855Z outputs = self.model.decoder( 2025-08-14T21:44:03.6854167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6854491Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6854823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6855167Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6855486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6855821Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6856157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6856526Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6856891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6857267Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6857671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.6858115Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.6858286Z 2025-08-14T21:44:03.6858390Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6858714Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6859014Z return mod(**inputs) 2025-08-14T21:44:03.6859316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6859675Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6860030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6860393Z outputs = self.model.decoder( 2025-08-14T21:44:03.6860710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6861024Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6861365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6861704Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6862081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6862407Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6862759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6863132Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6863497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6863860Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6864269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.6864780Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.6864938Z 2025-08-14T21:44:03.6865034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6865370Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6865671Z return mod(**inputs) 2025-08-14T21:44:03.6865978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6866295Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6866646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6866995Z outputs = self.model.decoder( 2025-08-14T21:44:03.6867308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6867623Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6867965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6868310Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6868623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6868954Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6869299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6869671Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6870033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.6870384Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.6870507Z 2025-08-14T21:44:03.6870607Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6870927Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6871226Z return mod(**inputs) 2025-08-14T21:44:03.6871527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6871842Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6872211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6872565Z outputs = self.model.decoder( 2025-08-14T21:44:03.6872910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6873224Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6873562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6873905Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6874216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6874544Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6874905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.6875257Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.6875381Z 2025-08-14T21:44:03.6875477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6875810Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6876111Z return mod(**inputs) 2025-08-14T21:44:03.6876403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6876723Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6877062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6877405Z outputs = self.model.decoder( 2025-08-14T21:44:03.6877715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6878038Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6878378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6878719Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6879030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6879360Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6879701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.6880061Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.6880205Z 2025-08-14T21:44:03.6880296Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6880622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6880922Z return mod(**inputs) 2025-08-14T21:44:03.6881212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6881534Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6881880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6882220Z outputs = self.model.decoder( 2025-08-14T21:44:03.6882533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6882853Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6883193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6883528Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6883847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6884179Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6884515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.6885048Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.6885189Z 2025-08-14T21:44:03.6885308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6885672Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6885976Z return mod(**inputs) 2025-08-14T21:44:03.6886284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6886621Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6886972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6887332Z outputs = self.model.decoder( 2025-08-14T21:44:03.6887645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6887967Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6888301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6888651Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6888968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6889300Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6889639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6890005Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6890369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.6890745Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.6890901Z 2025-08-14T21:44:03.6890994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6891325Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6891623Z return mod(**inputs) 2025-08-14T21:44:03.6891919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6892239Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6892582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6892922Z outputs = self.model.decoder( 2025-08-14T21:44:03.6893229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6893549Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6893886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6894221Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6894539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6894873Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6895217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6895576Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6895939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.6896286Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.6896410Z 2025-08-14T21:44:03.6896505Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6896840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6897136Z return mod(**inputs) 2025-08-14T21:44:03.6897449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6897770Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6898138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6898495Z outputs = self.model.decoder( 2025-08-14T21:44:03.6898801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6899126Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6899469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6899836Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6900152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6900488Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6900840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6901215Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6901573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.6901928Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.6902056Z 2025-08-14T21:44:03.6902139Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6902328Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6902519Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6902708Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6902926Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6903253Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6903552Z return mod(**inputs) 2025-08-14T21:44:03.6903860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6904183Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6904532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6904934Z outputs = self.model.decoder( 2025-08-14T21:44:03.6905255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6905579Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6905924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6906274Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6906589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6906924Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6907279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6907653Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6908015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6908385Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6908797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.6909239Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.6909415Z 2025-08-14T21:44:03.6909507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6909839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6910139Z return mod(**inputs) 2025-08-14T21:44:03.6910463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6910806Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6911150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6911491Z outputs = self.model.decoder( 2025-08-14T21:44:03.6911798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6912116Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6912458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6912813Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6913131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6913465Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6913810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6914171Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6914533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6914898Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6915297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.6915717Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.6915874Z 2025-08-14T21:44:03.6915969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6916296Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6916585Z return mod(**inputs) 2025-08-14T21:44:03.6916886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6917209Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6917550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6917887Z outputs = self.model.decoder( 2025-08-14T21:44:03.6918200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6918524Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6918857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6919200Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6919518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6919847Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6920184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6920554Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6920917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.6921263Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.6921393Z 2025-08-14T21:44:03.6921486Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6921817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6922114Z return mod(**inputs) 2025-08-14T21:44:03.6922403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6922720Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6923084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6923441Z outputs = self.model.decoder( 2025-08-14T21:44:03.6923746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6924066Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6924406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6924740Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6925055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6925399Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6925745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.6926096Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.6926229Z 2025-08-14T21:44:03.6926326Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6926656Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6926946Z return mod(**inputs) 2025-08-14T21:44:03.6927245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6927568Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6927912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6928249Z outputs = self.model.decoder( 2025-08-14T21:44:03.6928563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6928886Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6929221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6929563Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6929882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6930214Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6930553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.6930917Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.6931051Z 2025-08-14T21:44:03.6931143Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6931461Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6931747Z return mod(**inputs) 2025-08-14T21:44:03.6932040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6932359Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6932695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6933036Z outputs = self.model.decoder( 2025-08-14T21:44:03.6933352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6933677Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6934012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6934354Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6934674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6934993Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6935348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.6935725Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.6935849Z 2025-08-14T21:44:03.6935945Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6936258Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6936548Z return mod(**inputs) 2025-08-14T21:44:03.6936837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6937144Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6937499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6937841Z outputs = self.model.decoder( 2025-08-14T21:44:03.6938153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6938469Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6938810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6939156Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6939478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6939808Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6940156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 291, in forward 2025-08-14T21:44:03.6940556Z hidden_states = (residual + hidden_states).view(hidden_states_shape) 2025-08-14T21:44:03.6940732Z 2025-08-14T21:44:03.6940828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6941163Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6941466Z return mod(**inputs) 2025-08-14T21:44:03.6941770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6942095Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6942440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6942792Z outputs = self.model.decoder( 2025-08-14T21:44:03.6943103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6943430Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6943777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6944125Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6944440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6944840Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6945190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6945562Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6945915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.6946294Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.6946444Z 2025-08-14T21:44:03.6946543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6946868Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6947176Z return mod(**inputs) 2025-08-14T21:44:03.6947480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6947829Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6948181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6948555Z outputs = self.model.decoder( 2025-08-14T21:44:03.6948864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6949176Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6949518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6949861Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6950201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6950531Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6950877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6951255Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6951624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.6951982Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.6952114Z 2025-08-14T21:44:03.6952211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6952548Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6952844Z return mod(**inputs) 2025-08-14T21:44:03.6953147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6953477Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6953821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6954161Z outputs = self.model.decoder( 2025-08-14T21:44:03.6954477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6954804Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6955142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6955491Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6955812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6956149Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6956492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6956868Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6957234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.6957586Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.6957727Z 2025-08-14T21:44:03.6957804Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6958007Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6958200Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6958387Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.6958609Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6958946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6959244Z return mod(**inputs) 2025-08-14T21:44:03.6959553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6959885Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6960233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6960595Z outputs = self.model.decoder( 2025-08-14T21:44:03.6960928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6961266Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6961599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6961943Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6962261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6962592Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6962944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6963309Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6963671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6964034Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6964436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.6964879Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.6965047Z 2025-08-14T21:44:03.6965149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6965471Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6965768Z return mod(**inputs) 2025-08-14T21:44:03.6966067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6966393Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6966727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6967074Z outputs = self.model.decoder( 2025-08-14T21:44:03.6967387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6967698Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6968040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6968378Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6968693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6969018Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6969357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6969723Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6970087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.6970444Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.6970848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.6971265Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.6971411Z 2025-08-14T21:44:03.6971503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6971832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6972129Z return mod(**inputs) 2025-08-14T21:44:03.6972423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6972740Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6973095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6973470Z outputs = self.model.decoder( 2025-08-14T21:44:03.6973778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6974098Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6974438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6974780Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6975092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6975439Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6975788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6976152Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6976514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.6976869Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.6976993Z 2025-08-14T21:44:03.6977095Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6977418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6977718Z return mod(**inputs) 2025-08-14T21:44:03.6978014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6978335Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6978672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6979012Z outputs = self.model.decoder( 2025-08-14T21:44:03.6979327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6979645Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6979986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6980327Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6980643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6980969Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6981314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.6981668Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.6981791Z 2025-08-14T21:44:03.6981894Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6982218Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6982517Z return mod(**inputs) 2025-08-14T21:44:03.6982817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6983134Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6983478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6983818Z outputs = self.model.decoder( 2025-08-14T21:44:03.6984127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6984443Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6984972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6985327Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6985676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6986011Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6986396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.6986768Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.6986909Z 2025-08-14T21:44:03.6987004Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6987335Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6987634Z return mod(**inputs) 2025-08-14T21:44:03.6987926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6988276Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6988618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6988962Z outputs = self.model.decoder( 2025-08-14T21:44:03.6989270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6989592Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6989936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6990267Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6990586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6990917Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6991265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.6991606Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.6991738Z 2025-08-14T21:44:03.6991830Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6992170Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6992472Z return mod(**inputs) 2025-08-14T21:44:03.6992766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6993089Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6993432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6993766Z outputs = self.model.decoder( 2025-08-14T21:44:03.6994079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6994402Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6994742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.6995075Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.6995394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.6995723Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.6996060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.6996425Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.6996790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.6997170Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.6997325Z 2025-08-14T21:44:03.6997417Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.6997747Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.6998043Z return mod(**inputs) 2025-08-14T21:44:03.6998374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.6998748Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.6999093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.6999434Z outputs = self.model.decoder( 2025-08-14T21:44:03.6999742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7000060Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7000398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7000760Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7001072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7001406Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7001754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7002116Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7002479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.7002826Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.7002950Z 2025-08-14T21:44:03.7003050Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7003369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7003670Z return mod(**inputs) 2025-08-14T21:44:03.7003969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7004288Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7004629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7004975Z outputs = self.model.decoder( 2025-08-14T21:44:03.7005286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7005596Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7005935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7006278Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7006595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7006920Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7007268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7007638Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7007994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.7008350Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.7008484Z 2025-08-14T21:44:03.7008559Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7008754Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7008822Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7008891Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7008992Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7009179Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7009241Z return mod(**inputs) 2025-08-14T21:44:03.7009449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7009517Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7009771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7009854Z outputs = self.model.decoder( 2025-08-14T21:44:03.7010052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7010125Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7010343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7010409Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7010620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7010711Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7010937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7011029Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7011250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7011350Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7011618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.7011748Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.7011752Z 2025-08-14T21:44:03.7011845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7012029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7012097Z return mod(**inputs) 2025-08-14T21:44:03.7012296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7012364Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7012592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7012661Z outputs = self.model.decoder( 2025-08-14T21:44:03.7012866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7012933Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7013151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7013224Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7013427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7013497Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7013721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7013810Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7014038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7014127Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7014391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.7014497Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.7014501Z 2025-08-14T21:44:03.7014593Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7014785Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7014846Z return mod(**inputs) 2025-08-14T21:44:03.7015043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7015130Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7015366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7015449Z outputs = self.model.decoder( 2025-08-14T21:44:03.7015653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7015718Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7015942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7016008Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7016225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7016302Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7016523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7016620Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7016839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.7016911Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.7016914Z 2025-08-14T21:44:03.7017013Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7017195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7017255Z return mod(**inputs) 2025-08-14T21:44:03.7017460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7017527Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7017751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7017819Z outputs = self.model.decoder( 2025-08-14T21:44:03.7018018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7018095Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7018312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7018376Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7018584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7018654Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7018880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.7018953Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.7018956Z 2025-08-14T21:44:03.7019049Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7019242Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7019304Z return mod(**inputs) 2025-08-14T21:44:03.7019507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7019574Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7019791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7019866Z outputs = self.model.decoder( 2025-08-14T21:44:03.7020061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7020128Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7020354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7020419Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7020654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7020749Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7020968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.7021065Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.7021068Z 2025-08-14T21:44:03.7021158Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7021350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7021426Z return mod(**inputs) 2025-08-14T21:44:03.7021626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7021699Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7021921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7021990Z outputs = self.model.decoder( 2025-08-14T21:44:03.7022194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7022260Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7022486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7022549Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7022748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7022827Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7023046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.7023119Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.7023124Z 2025-08-14T21:44:03.7023225Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7023406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7023471Z return mod(**inputs) 2025-08-14T21:44:03.7023668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7023734Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7023959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7024024Z outputs = self.model.decoder( 2025-08-14T21:44:03.7024222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7024296Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7024517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7024590Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7024863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7024940Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7025165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 291, in forward 2025-08-14T21:44:03.7025286Z hidden_states = (residual + hidden_states).view(hidden_states_shape) 2025-08-14T21:44:03.7025289Z 2025-08-14T21:44:03.7025389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7025574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7025634Z return mod(**inputs) 2025-08-14T21:44:03.7025842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7025936Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7026190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7026266Z outputs = self.model.decoder( 2025-08-14T21:44:03.7026469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7026547Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7026769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7026836Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7027062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7027134Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7027354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7027451Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7027668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.7027776Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.7027780Z 2025-08-14T21:44:03.7027870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7028049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7028114Z return mod(**inputs) 2025-08-14T21:44:03.7028313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7028384Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7028601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7028666Z outputs = self.model.decoder( 2025-08-14T21:44:03.7028871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7028935Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7029151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7029221Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7029420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7029497Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7029716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7029804Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7030030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.7030104Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.7030108Z 2025-08-14T21:44:03.7030207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7030389Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7030447Z return mod(**inputs) 2025-08-14T21:44:03.7030652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7030717Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7030935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7031013Z outputs = self.model.decoder( 2025-08-14T21:44:03.7031208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7031295Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7031529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7031633Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7031847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7031917Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7032141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7032239Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7032473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.7032557Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.7032560Z 2025-08-14T21:44:03.7032635Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7032706Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7032788Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7032857Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7032949Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7033138Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7033198Z return mod(**inputs) 2025-08-14T21:44:03.7033401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7033465Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7033685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7033760Z outputs = self.model.decoder( 2025-08-14T21:44:03.7033958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7034033Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7034257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7034323Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7034535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7034607Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7034824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7034922Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7035140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7035237Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7035503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.7035626Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.7035629Z 2025-08-14T21:44:03.7035730Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7035908Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7035974Z return mod(**inputs) 2025-08-14T21:44:03.7036170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7036240Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7036457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7036519Z outputs = self.model.decoder( 2025-08-14T21:44:03.7036728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7036830Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7037044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7037110Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7037304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7037372Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7037590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7037692Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7037909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7037998Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7038259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.7038361Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.7038365Z 2025-08-14T21:44:03.7038453Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7038627Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7038686Z return mod(**inputs) 2025-08-14T21:44:03.7038879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7038945Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7039160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7039222Z outputs = self.model.decoder( 2025-08-14T21:44:03.7039420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7039484Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7039700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7039764Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7039958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7040029Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7040242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7040327Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7040546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.7040617Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.7040620Z 2025-08-14T21:44:03.7040712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7040887Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7040943Z return mod(**inputs) 2025-08-14T21:44:03.7041138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7041201Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7041413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7041479Z outputs = self.model.decoder( 2025-08-14T21:44:03.7041670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7041735Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7041978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7042054Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7042254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7042323Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7042538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.7042612Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.7042615Z 2025-08-14T21:44:03.7042705Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7042912Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7042967Z return mod(**inputs) 2025-08-14T21:44:03.7043161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7043228Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7043445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7043507Z outputs = self.model.decoder( 2025-08-14T21:44:03.7043708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7043770Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7043990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7044053Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7044251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7044326Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7044542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.7044633Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.7044636Z 2025-08-14T21:44:03.7044724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7044901Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7044962Z return mod(**inputs) 2025-08-14T21:44:03.7045157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7045219Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7045439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7045502Z outputs = self.model.decoder( 2025-08-14T21:44:03.7045698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7045762Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7045979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7046047Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7046250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7046321Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7046548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.7046621Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.7046626Z 2025-08-14T21:44:03.7046726Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7046906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7046964Z return mod(**inputs) 2025-08-14T21:44:03.7047194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7047276Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7047509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7047578Z outputs = self.model.decoder( 2025-08-14T21:44:03.7047778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7047852Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7048074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7048156Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7048365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7048438Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7048668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7048758Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7048979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.7049088Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.7049091Z 2025-08-14T21:44:03.7049183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7049377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7049438Z return mod(**inputs) 2025-08-14T21:44:03.7049640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7049712Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7049935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7050003Z outputs = self.model.decoder( 2025-08-14T21:44:03.7050210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7050276Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7050506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7050571Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7050775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7050855Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7051078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7051169Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7051400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.7051474Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.7051477Z 2025-08-14T21:44:03.7051578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7051762Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7051821Z return mod(**inputs) 2025-08-14T21:44:03.7052029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7052097Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7052326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7052391Z outputs = self.model.decoder( 2025-08-14T21:44:03.7052619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7052706Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7052926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7052989Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7053197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7053270Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7053495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7053599Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7053819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.7053903Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.7053907Z 2025-08-14T21:44:03.7053980Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7054050Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7054124Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7054191Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7054290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7054471Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7054529Z return mod(**inputs) 2025-08-14T21:44:03.7054733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7054801Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7055021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7055095Z outputs = self.model.decoder( 2025-08-14T21:44:03.7055294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7055367Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7055585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7055650Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7055857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7055927Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7056148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7056244Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7056468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7056564Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7056833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.7056956Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.7056959Z 2025-08-14T21:44:03.7057056Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7057238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7057302Z return mod(**inputs) 2025-08-14T21:44:03.7057503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7057570Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7057795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7057877Z outputs = self.model.decoder( 2025-08-14T21:44:03.7058109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7058186Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7058406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7058477Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7058678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7058750Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7058991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7059081Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7059309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7059400Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7059665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.7059773Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.7059777Z 2025-08-14T21:44:03.7059868Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7060052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7060119Z return mod(**inputs) 2025-08-14T21:44:03.7060315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7060389Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7060608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7060676Z outputs = self.model.decoder( 2025-08-14T21:44:03.7060883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7060947Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7061172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7061236Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7061436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7061514Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7061733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7061821Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7062048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.7062123Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.7062126Z 2025-08-14T21:44:03.7062226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7062408Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7062467Z return mod(**inputs) 2025-08-14T21:44:03.7062671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7062735Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7062955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7063030Z outputs = self.model.decoder( 2025-08-14T21:44:03.7063241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7063318Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7063572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7063638Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7063848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7063918Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7064141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.7064231Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.7064234Z 2025-08-14T21:44:03.7064327Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7064514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7064576Z return mod(**inputs) 2025-08-14T21:44:03.7064848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7064928Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7065148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7065222Z outputs = self.model.decoder( 2025-08-14T21:44:03.7065419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7065485Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7065711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7065777Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7065979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7066060Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7066282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.7066380Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.7066384Z 2025-08-14T21:44:03.7066477Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7066660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7066725Z return mod(**inputs) 2025-08-14T21:44:03.7066923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7066999Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7067220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7067289Z outputs = self.model.decoder( 2025-08-14T21:44:03.7067496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7067565Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7067784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7067856Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7068055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7068133Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7068350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.7068423Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.7068426Z 2025-08-14T21:44:03.7068525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7068721Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7068814Z return mod(**inputs) 2025-08-14T21:44:03.7069020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7069086Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7069312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7069378Z outputs = self.model.decoder( 2025-08-14T21:44:03.7069575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7069667Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7069886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7069957Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7070155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7070228Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7070454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 291, in forward 2025-08-14T21:44:03.7070577Z hidden_states = (residual + hidden_states).view(hidden_states_shape) 2025-08-14T21:44:03.7070580Z 2025-08-14T21:44:03.7070672Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7070858Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7070918Z return mod(**inputs) 2025-08-14T21:44:03.7071126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7071191Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7071411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7071489Z outputs = self.model.decoder( 2025-08-14T21:44:03.7071690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7071754Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7071982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7072048Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7072256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7072328Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7072545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7072644Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7072866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.7072981Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.7072984Z 2025-08-14T21:44:03.7073074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7073255Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7073321Z return mod(**inputs) 2025-08-14T21:44:03.7073519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7073585Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7073811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7073876Z outputs = self.model.decoder( 2025-08-14T21:44:03.7074093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7074176Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7074413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7074487Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7074690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7074766Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7074986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7075089Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7075316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.7075390Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.7075395Z 2025-08-14T21:44:03.7075489Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7075684Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7075742Z return mod(**inputs) 2025-08-14T21:44:03.7075951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7076018Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7076238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7076312Z outputs = self.model.decoder( 2025-08-14T21:44:03.7076511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7076576Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7076804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7076872Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7077084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7077156Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7077375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7077471Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7077692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.7077774Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.7077778Z 2025-08-14T21:44:03.7077850Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7077921Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7077996Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7078065Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7078158Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7078350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7078408Z return mod(**inputs) 2025-08-14T21:44:03.7078616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7078680Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7078903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7078978Z outputs = self.model.decoder( 2025-08-14T21:44:03.7079180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7079245Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7079489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7079583Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7079791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7079860Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7080076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7080170Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7080387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7080492Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7080764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.7080887Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.7080893Z 2025-08-14T21:44:03.7080993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7081174Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7081232Z return mod(**inputs) 2025-08-14T21:44:03.7081436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7081502Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7081727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7081795Z outputs = self.model.decoder( 2025-08-14T21:44:03.7081990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7082063Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7082282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7082352Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7082560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7082630Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7082857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7082945Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7083161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7083257Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7083522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.7083628Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.7083633Z 2025-08-14T21:44:03.7083724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7083904Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7083971Z return mod(**inputs) 2025-08-14T21:44:03.7084169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7084233Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7084458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7084524Z outputs = self.model.decoder( 2025-08-14T21:44:03.7084888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7084997Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7085251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7085349Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7085551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7085624Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7085851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7085941Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7086190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.7086266Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.7086269Z 2025-08-14T21:44:03.7086364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7086560Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7086623Z return mod(**inputs) 2025-08-14T21:44:03.7086832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7086896Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7087115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7087189Z outputs = self.model.decoder( 2025-08-14T21:44:03.7087385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7087451Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7087676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7087740Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7087952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7088027Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7088246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.7088327Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.7088331Z 2025-08-14T21:44:03.7088424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7088611Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7088671Z return mod(**inputs) 2025-08-14T21:44:03.7088866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7088941Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7089162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7089229Z outputs = self.model.decoder( 2025-08-14T21:44:03.7089434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7089499Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7089724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7089788Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7089985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7090064Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7090290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.7090380Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.7090399Z 2025-08-14T21:44:03.7090518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7090723Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7090791Z return mod(**inputs) 2025-08-14T21:44:03.7090998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7091066Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7091297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7091363Z outputs = self.model.decoder( 2025-08-14T21:44:03.7091583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7091656Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7091882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7091956Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7092161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7092232Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7092464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.7092537Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.7092541Z 2025-08-14T21:44:03.7092642Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7092829Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7092888Z return mod(**inputs) 2025-08-14T21:44:03.7093099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7093167Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7093394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7093468Z outputs = self.model.decoder( 2025-08-14T21:44:03.7093671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7093745Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7093970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7094035Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7094245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7094316Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7094541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7094641Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7094864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.7094976Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.7094980Z 2025-08-14T21:44:03.7095072Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7095255Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7095324Z return mod(**inputs) 2025-08-14T21:44:03.7095526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7095601Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7095827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7095907Z outputs = self.model.decoder( 2025-08-14T21:44:03.7096133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7096214Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7096441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7096514Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7096718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7096799Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7097041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7097132Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7097368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.7097445Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.7097450Z 2025-08-14T21:44:03.7097552Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7097740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7097802Z return mod(**inputs) 2025-08-14T21:44:03.7098015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7098084Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7098316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7098398Z outputs = self.model.decoder( 2025-08-14T21:44:03.7098603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7098680Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7098909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7098975Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7099191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7099340Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7099576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7099669Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7099899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.7099984Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.7099987Z 2025-08-14T21:44:03.7100061Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7100134Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7100215Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7100286Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7100385Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7100575Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7100634Z return mod(**inputs) 2025-08-14T21:44:03.7100848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7100915Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7101151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7101227Z outputs = self.model.decoder( 2025-08-14T21:44:03.7101431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7101520Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7102474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7102548Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7102764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7102838Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7103065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7103166Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7103407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7103506Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7103781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.7103908Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.7103912Z 2025-08-14T21:44:03.7104017Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7104206Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7104274Z return mod(**inputs) 2025-08-14T21:44:03.7104481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7104550Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7104855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7104929Z outputs = self.model.decoder( 2025-08-14T21:44:03.7105136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7105214Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7105443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7105520Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7105727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7105800Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7106033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7106127Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7106354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7106453Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7106734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.7106845Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.7106849Z 2025-08-14T21:44:03.7106941Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7107122Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7107189Z return mod(**inputs) 2025-08-14T21:44:03.7107388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7107465Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7107685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7107750Z outputs = self.model.decoder( 2025-08-14T21:44:03.7107982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7108079Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7108298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7108372Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7108571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7108650Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7108868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7108974Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7109199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.7109275Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.7109278Z 2025-08-14T21:44:03.7109378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7109560Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7109619Z return mod(**inputs) 2025-08-14T21:44:03.7109821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7109886Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7110103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7110177Z outputs = self.model.decoder( 2025-08-14T21:44:03.7110373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7110445Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7110662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7110731Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7110939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7111009Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7111231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.7111311Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.7111314Z 2025-08-14T21:44:03.7111405Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7111595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7111655Z return mod(**inputs) 2025-08-14T21:44:03.7111850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7111923Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7112143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7112220Z outputs = self.model.decoder( 2025-08-14T21:44:03.7112416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7112480Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7112704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7112768Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7112966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7113044Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7113276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.7113386Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.7113406Z 2025-08-14T21:44:03.7113499Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7113682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7113747Z return mod(**inputs) 2025-08-14T21:44:03.7113947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7114012Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7114239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7114321Z outputs = self.model.decoder( 2025-08-14T21:44:03.7114525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7114594Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7114814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7114889Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7115089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7115168Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7115388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.7115461Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.7115466Z 2025-08-14T21:44:03.7115565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7115746Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7115805Z return mod(**inputs) 2025-08-14T21:44:03.7116010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7116080Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7116304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7116369Z outputs = self.model.decoder( 2025-08-14T21:44:03.7116566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7116639Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7116856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7116924Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7117130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7117202Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7117430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 291, in forward 2025-08-14T21:44:03.7117558Z hidden_states = (residual + hidden_states).view(hidden_states_shape) 2025-08-14T21:44:03.7117561Z 2025-08-14T21:44:03.7117654Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7117843Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7117903Z return mod(**inputs) 2025-08-14T21:44:03.7118106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7118173Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7118390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7118462Z outputs = self.model.decoder( 2025-08-14T21:44:03.7118674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7118799Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7119027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7119090Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7119297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7119368Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7119586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7119699Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7119918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 159, in forward 2025-08-14T21:44:03.7120020Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:44:03.7120030Z 2025-08-14T21:44:03.7120125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7120306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7120373Z return mod(**inputs) 2025-08-14T21:44:03.7120570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7120637Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7120863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7120931Z outputs = self.model.decoder( 2025-08-14T21:44:03.7121135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7121201Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7121422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7121494Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7121694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7121765Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7121989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7122078Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7122305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 162, in forward 2025-08-14T21:44:03.7122378Z key_states = self.k_proj(hidden_states) 2025-08-14T21:44:03.7122382Z 2025-08-14T21:44:03.7122473Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7122661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7122722Z return mod(**inputs) 2025-08-14T21:44:03.7122928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7122993Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7123213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7123286Z outputs = self.model.decoder( 2025-08-14T21:44:03.7123482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7123548Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7123775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7123838Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7124062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7124160Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7124380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7124477Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7124694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 163, in forward 2025-08-14T21:44:03.7124770Z value_states = self.v_proj(hidden_states) 2025-08-14T21:44:03.7124783Z 2025-08-14T21:44:03.7124856Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7124951Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7125032Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7125103Z cudagraph partition due to non gpu ops 2025-08-14T21:44:03.7125195Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7125386Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7125448Z return mod(**inputs) 2025-08-14T21:44:03.7125643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7125718Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7125939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7126012Z outputs = self.model.decoder( 2025-08-14T21:44:03.7126209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7126276Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7126503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7126567Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7126770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7126851Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7127069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7127164Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7127385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7127472Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7127750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:03.7127874Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:03.7127877Z 2025-08-14T21:44:03.7127977Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7128161Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7128221Z return mod(**inputs) 2025-08-14T21:44:03.7128427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7128495Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7128714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7128787Z outputs = self.model.decoder( 2025-08-14T21:44:03.7128985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7129061Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7129279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7129360Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7129584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7129669Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7129892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7129984Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7130201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 184, in forward 2025-08-14T21:44:03.7130296Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:03.7130580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:03.7130680Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:03.7130691Z 2025-08-14T21:44:03.7130783Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7130967Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7131033Z return mod(**inputs) 2025-08-14T21:44:03.7131233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7131299Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7131527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7131593Z outputs = self.model.decoder( 2025-08-14T21:44:03.7131800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7131866Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7132084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7132157Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7132363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7132434Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7132662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 259, in forward 2025-08-14T21:44:03.7132749Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:03.7132976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 196, in forward 2025-08-14T21:44:03.7133052Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:03.7133056Z 2025-08-14T21:44:03.7133148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7133337Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7133397Z return mod(**inputs) 2025-08-14T21:44:03.7133596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7133671Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7133892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7133964Z outputs = self.model.decoder( 2025-08-14T21:44:03.7134160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7134224Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7134451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7134516Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7134725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7134810Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7135045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 285, in forward 2025-08-14T21:44:03.7135142Z hidden_states = self.fc1(hidden_states) 2025-08-14T21:44:03.7135146Z 2025-08-14T21:44:03.7135240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7135420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7135487Z return mod(**inputs) 2025-08-14T21:44:03.7135683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7135774Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7135992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7136058Z outputs = self.model.decoder( 2025-08-14T21:44:03.7136266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7136334Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7136550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7136624Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7136822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7136901Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7137121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 286, in forward 2025-08-14T21:44:03.7137209Z hidden_states = self.activation_fn(hidden_states) 2025-08-14T21:44:03.7137212Z 2025-08-14T21:44:03.7137311Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7137495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7137565Z return mod(**inputs) 2025-08-14T21:44:03.7137762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7137828Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7138052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 826, in forward 2025-08-14T21:44:03.7138116Z outputs = self.model.decoder( 2025-08-14T21:44:03.7138312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7138390Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7138606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 653, in forward 2025-08-14T21:44:03.7138677Z layer_outputs = decoder_layer( 2025-08-14T21:44:03.7138876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:03.7138951Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:03.7139175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 288, in forward 2025-08-14T21:44:03.7139247Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:03.7139250Z 2025-08-14T21:44:03.7139348Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7139528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7139586Z return mod(**inputs) 2025-08-14T21:44:03.7139791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7139857Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7140089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 841, in forward 2025-08-14T21:44:03.7140201Z logits = self.lm_head(outputs[0]).contiguous() 2025-08-14T21:44:03.7140219Z 2025-08-14T21:44:03.7140313Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:03.7140501Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:03.7140559Z return mod(**inputs) 2025-08-14T21:44:03.7140755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/generic.py", line 961, in wrapper 2025-08-14T21:44:03.7140828Z output = func(self, *args, **kwargs) 2025-08-14T21:44:03.7141047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/opt/modeling_opt.py", line 847, in forward 2025-08-14T21:44:03.7141129Z loss = self.loss_function( 2025-08-14T21:44:03.7141367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 67, in ForCausalLMLoss 2025-08-14T21:44:03.7141529Z loss = fixed_cross_entropy(logits, shift_labels, num_items_in_batch, ignore_index, **kwargs) 2025-08-14T21:44:03.7141775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 36, in fixed_cross_entropy 2025-08-14T21:44:03.7141958Z loss = nn.functional.cross_entropy(source, target, ignore_index=ignore_index, reduction=reduction) 2025-08-14T21:44:03.7141962Z 2025-08-14T21:44:12.9089722Z Compilation time (from dynamo_timed): 14.060638451 2025-08-14T21:44:12.9458601Z pass 2025-08-14T21:44:12.9462342Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:12.9466634Z TIMING: _recursive_pre_grad_passes:0.00691 _recursive_joint_graph_passes:0.52462 _recursive_post_grad_passes:0.085 async_compile.wait:0.70955 code_gen:8.07515 inductor_compile:9.16588 backend_compile:11.97902 gc:0.0006 entire_frame_compile:14.06064 total_wall_time:14.06064 2025-08-14T21:44:12.9468116Z STATS: call_* op count: 415 | FakeTensorMode.__torch_dispatch__:12797 | FakeTensor.__torch_dispatch__:4472 | ProxyTorchDispatchMode.__torch_dispatch__:4707 2025-08-14T21:44:12.9468631Z Dynamo produced 1 graphs covering 415 ops with 0 graph breaks (0 unique) 2025-08-14T21:44:17.1109256Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:44:17.1110197Z from pkg_resources import resource_filename 2025-08-14T21:44:17.6375107Z 2025-08-14T21:44:18.8588235Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:44:18.8592367Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:44:18.8599177Z cpu eval PLBartForCausalLM 2025-08-14T21:44:19.4306918Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:19.6551812Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:19.9185840Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:24.1922972Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1926629Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1928659Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1928983Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1932831Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1934975Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1935339Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.1939847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.1941713Z return mod(**inputs) 2025-08-14T21:44:24.1943177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.1947494Z outputs = self.model.decoder( 2025-08-14T21:44:24.1951803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.1952881Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.1953306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.1953726Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.1958403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.1959206Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.1960148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:24.1960751Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:24.1964702Z 2025-08-14T21:44:24.1967098Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.1967599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.1972063Z return mod(**inputs) 2025-08-14T21:44:24.1976372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.1980432Z outputs = self.model.decoder( 2025-08-14T21:44:24.1984263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.1985134Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.1985661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.1986006Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.1986397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.1986803Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.1987193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:24.1987569Z key_states = self.k_proj(current_states) 2025-08-14T21:44:24.1987705Z 2025-08-14T21:44:24.1987806Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.1988147Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.1988444Z return mod(**inputs) 2025-08-14T21:44:24.1988792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.1989160Z outputs = self.model.decoder( 2025-08-14T21:44:24.1989519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.1989878Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.1990200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.1990532Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.1990894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.1991277Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.1991656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:24.1992027Z value_states = self.v_proj(current_states) 2025-08-14T21:44:24.1992157Z 2025-08-14T21:44:24.1992234Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1992431Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1992803Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1992997Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.1993292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.1993635Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.1993941Z return mod(**inputs) 2025-08-14T21:44:24.1994287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.1994661Z outputs = self.model.decoder( 2025-08-14T21:44:24.1995027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.1995425Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.1995748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.1996080Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.1996444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.1996824Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.1997206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.1997593Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.1997997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:24.1998444Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:24.1998624Z 2025-08-14T21:44:24.1998723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.1999052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.1999342Z return mod(**inputs) 2025-08-14T21:44:24.1999684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2000049Z outputs = self.model.decoder( 2025-08-14T21:44:24.2000404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2000755Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2001073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2001404Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2001759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2002140Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2002519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2002897Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2003296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:24.2003713Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:24.2003865Z 2025-08-14T21:44:24.2003960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2004284Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2004571Z return mod(**inputs) 2025-08-14T21:44:24.2004908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2005266Z outputs = self.model.decoder( 2025-08-14T21:44:24.2005629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2005994Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2006347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2006681Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2007034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2007414Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2007794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:24.2008186Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:24.2008311Z 2025-08-14T21:44:24.2008406Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2008738Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2009038Z return mod(**inputs) 2025-08-14T21:44:24.2009378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2009746Z outputs = self.model.decoder( 2025-08-14T21:44:24.2010105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2010468Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2010784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2011115Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2011482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2011884Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2012053Z 2025-08-14T21:44:24.2012151Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2012482Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2012802Z return mod(**inputs) 2025-08-14T21:44:24.2013137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2013503Z outputs = self.model.decoder( 2025-08-14T21:44:24.2013863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2014223Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2014536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2014869Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2015235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2015631Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2015993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:24.2016311Z return self.act(input) 2025-08-14T21:44:24.2016412Z 2025-08-14T21:44:24.2016513Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2016837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2017136Z return mod(**inputs) 2025-08-14T21:44:24.2017478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2017838Z outputs = self.model.decoder( 2025-08-14T21:44:24.2018196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2018576Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2018913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2019254Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2019615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:24.2019981Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:24.2020105Z 2025-08-14T21:44:24.2020206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2020526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2020874Z return mod(**inputs) 2025-08-14T21:44:24.2021207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2021560Z outputs = self.model.decoder( 2025-08-14T21:44:24.2021918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2022276Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2022591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2022918Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2023279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2023663Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2024037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:24.2024471Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:24.2024746Z 2025-08-14T21:44:24.2024847Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2025184Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2025477Z return mod(**inputs) 2025-08-14T21:44:24.2025817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2026182Z outputs = self.model.decoder( 2025-08-14T21:44:24.2026537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2026889Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2027210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2027550Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2027906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2028294Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2028674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:24.2029063Z key_states = self.k_proj(current_states) 2025-08-14T21:44:24.2029188Z 2025-08-14T21:44:24.2029283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2029607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2029905Z return mod(**inputs) 2025-08-14T21:44:24.2030235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2030600Z outputs = self.model.decoder( 2025-08-14T21:44:24.2030953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2031312Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2031654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2031998Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2032355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2032732Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2033099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:24.2033466Z value_states = self.v_proj(current_states) 2025-08-14T21:44:24.2033890Z 2025-08-14T21:44:24.2033971Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2034161Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2034358Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2034548Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2034765Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2035091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2035389Z return mod(**inputs) 2025-08-14T21:44:24.2035729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2036082Z outputs = self.model.decoder( 2025-08-14T21:44:24.2036436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2036796Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2037114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2037437Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2037797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2038178Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2038549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2038930Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2039337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:24.2039782Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:24.2039948Z 2025-08-14T21:44:24.2040041Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2040368Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2040660Z return mod(**inputs) 2025-08-14T21:44:24.2040996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2041355Z outputs = self.model.decoder( 2025-08-14T21:44:24.2041713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2042076Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2042388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2042721Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2043084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2043469Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2043839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2044217Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2044653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:24.2045085Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:24.2045234Z 2025-08-14T21:44:24.2045329Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2045656Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2045954Z return mod(**inputs) 2025-08-14T21:44:24.2046287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2046667Z outputs = self.model.decoder( 2025-08-14T21:44:24.2047018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2047380Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2047695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2048028Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2048388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2048759Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2049133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:24.2049499Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:24.2049621Z 2025-08-14T21:44:24.2049721Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2050040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2050338Z return mod(**inputs) 2025-08-14T21:44:24.2050677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2051041Z outputs = self.model.decoder( 2025-08-14T21:44:24.2051384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2051739Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2052056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2052378Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2052739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2053138Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2053295Z 2025-08-14T21:44:24.2053398Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2053720Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2054015Z return mod(**inputs) 2025-08-14T21:44:24.2054352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2054701Z outputs = self.model.decoder( 2025-08-14T21:44:24.2055053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2055405Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2055720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2056046Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2056407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2056804Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2057187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:24.2057512Z return self.act(input) 2025-08-14T21:44:24.2057620Z 2025-08-14T21:44:24.2057714Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2058042Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2058328Z return mod(**inputs) 2025-08-14T21:44:24.2058668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2059029Z outputs = self.model.decoder( 2025-08-14T21:44:24.2059399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2059756Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2060071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2060405Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2060763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:24.2061131Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:24.2061261Z 2025-08-14T21:44:24.2061355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2061682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2061973Z return mod(**inputs) 2025-08-14T21:44:24.2062306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2062665Z outputs = self.model.decoder( 2025-08-14T21:44:24.2063013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2063367Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2063687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2064019Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2064370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2064827Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2065213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:24.2065650Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:24.2065841Z 2025-08-14T21:44:24.2065937Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2066266Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2066569Z return mod(**inputs) 2025-08-14T21:44:24.2066913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2067272Z outputs = self.model.decoder( 2025-08-14T21:44:24.2067627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2067990Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2068300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2068634Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2068997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2069379Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2069768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:24.2070171Z key_states = self.k_proj(current_states) 2025-08-14T21:44:24.2070297Z 2025-08-14T21:44:24.2070397Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2070718Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2071015Z return mod(**inputs) 2025-08-14T21:44:24.2071351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2071713Z outputs = self.model.decoder( 2025-08-14T21:44:24.2072075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2072432Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2072750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2073083Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2073442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2073823Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2074199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:24.2074561Z value_states = self.v_proj(current_states) 2025-08-14T21:44:24.2074696Z 2025-08-14T21:44:24.2074771Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2074967Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2075163Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2075347Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2075560Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2075889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2076180Z return mod(**inputs) 2025-08-14T21:44:24.2076521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2076882Z outputs = self.model.decoder( 2025-08-14T21:44:24.2077235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2077585Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2077903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2078234Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2078586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2078965Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2079345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2079727Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2080125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:24.2080561Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:24.2080728Z 2025-08-14T21:44:24.2080829Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2081156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2081449Z return mod(**inputs) 2025-08-14T21:44:24.2081783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2082139Z outputs = self.model.decoder( 2025-08-14T21:44:24.2082513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2082889Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2083203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2083532Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2083886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2084266Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2084762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2085164Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2085571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:24.2085992Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:24.2086142Z 2025-08-14T21:44:24.2086245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2086568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2086868Z return mod(**inputs) 2025-08-14T21:44:24.2087212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2087578Z outputs = self.model.decoder( 2025-08-14T21:44:24.2087925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2088285Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2088603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2088926Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2089288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2089671Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2090051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:24.2090407Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:24.2090537Z 2025-08-14T21:44:24.2090629Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2090956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2091254Z return mod(**inputs) 2025-08-14T21:44:24.2091584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2091946Z outputs = self.model.decoder( 2025-08-14T21:44:24.2092316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2092669Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2092986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2093313Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2093676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2094067Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2094230Z 2025-08-14T21:44:24.2094324Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2094649Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2094936Z return mod(**inputs) 2025-08-14T21:44:24.2095342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2095769Z outputs = self.model.decoder( 2025-08-14T21:44:24.2096121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2096472Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2096791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2097121Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2097479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2097893Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2098248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:24.2098567Z return self.act(input) 2025-08-14T21:44:24.2098668Z 2025-08-14T21:44:24.2098765Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2099091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2099384Z return mod(**inputs) 2025-08-14T21:44:24.2099720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2100072Z outputs = self.model.decoder( 2025-08-14T21:44:24.2100427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2100782Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2101093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2101422Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2101787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:24.2102152Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:24.2102275Z 2025-08-14T21:44:24.2102368Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2102691Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2102985Z return mod(**inputs) 2025-08-14T21:44:24.2103315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2103678Z outputs = self.model.decoder( 2025-08-14T21:44:24.2104029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2104384Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2104763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2105107Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2105475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2105862Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2106236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:24.2106670Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:24.2106859Z 2025-08-14T21:44:24.2106964Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2107355Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2107801Z return mod(**inputs) 2025-08-14T21:44:24.2108243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2108763Z outputs = self.model.decoder( 2025-08-14T21:44:24.2109186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2109556Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2109876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2110208Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2110563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2110969Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2111352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:24.2111718Z key_states = self.k_proj(current_states) 2025-08-14T21:44:24.2111852Z 2025-08-14T21:44:24.2111951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2112288Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2112588Z return mod(**inputs) 2025-08-14T21:44:24.2112925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2113291Z outputs = self.model.decoder( 2025-08-14T21:44:24.2113647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2114005Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2114326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2114661Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2115029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2115407Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2115791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:24.2116160Z value_states = self.v_proj(current_states) 2025-08-14T21:44:24.2116290Z 2025-08-14T21:44:24.2116373Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2116568Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2116761Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2116954Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2117162Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2117495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2117796Z return mod(**inputs) 2025-08-14T21:44:24.2118133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2118615Z outputs = self.model.decoder( 2025-08-14T21:44:24.2119031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2119390Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2119699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2120031Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2120389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2120773Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2121144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2121539Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2121998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:24.2122440Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:24.2122616Z 2025-08-14T21:44:24.2122710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2123040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2123338Z return mod(**inputs) 2025-08-14T21:44:24.2123669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2124047Z outputs = self.model.decoder( 2025-08-14T21:44:24.2124399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2124761Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2125071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2125402Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2125763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2126135Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2126513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2126893Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2127298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:24.2127710Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:24.2127865Z 2025-08-14T21:44:24.2127960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2128287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2128583Z return mod(**inputs) 2025-08-14T21:44:24.2128913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2129273Z outputs = self.model.decoder( 2025-08-14T21:44:24.2129625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2129981Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2130297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2130625Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2130983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2131358Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2131733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:24.2132097Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:24.2132220Z 2025-08-14T21:44:24.2132313Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2132637Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2132930Z return mod(**inputs) 2025-08-14T21:44:24.2133266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2133616Z outputs = self.model.decoder( 2025-08-14T21:44:24.2133986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2134364Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2134695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2135019Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2135380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2135777Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2135936Z 2025-08-14T21:44:24.2136031Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2136378Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2136670Z return mod(**inputs) 2025-08-14T21:44:24.2137005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2137360Z outputs = self.model.decoder( 2025-08-14T21:44:24.2137716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2138081Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2138390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2138721Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2139084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2139484Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2139834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:24.2140149Z return self.act(input) 2025-08-14T21:44:24.2140248Z 2025-08-14T21:44:24.2140350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2140678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2140968Z return mod(**inputs) 2025-08-14T21:44:24.2141301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2141656Z outputs = self.model.decoder( 2025-08-14T21:44:24.2141999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2142359Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2142677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2143009Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2143363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:24.2143727Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:24.2143854Z 2025-08-14T21:44:24.2143954Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2144274Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2144569Z return mod(**inputs) 2025-08-14T21:44:24.2145007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2145371Z outputs = self.model.decoder( 2025-08-14T21:44:24.2145714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2146080Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2146399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2146736Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2147156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2147555Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2147931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:24.2148349Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:24.2148543Z 2025-08-14T21:44:24.2148638Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2148964Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2149274Z return mod(**inputs) 2025-08-14T21:44:24.2149603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2149965Z outputs = self.model.decoder( 2025-08-14T21:44:24.2150319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2150680Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2150989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2151320Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2151681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2152053Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2152431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:24.2152794Z key_states = self.k_proj(current_states) 2025-08-14T21:44:24.2152917Z 2025-08-14T21:44:24.2153017Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2153338Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2153636Z return mod(**inputs) 2025-08-14T21:44:24.2153974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2154328Z outputs = self.model.decoder( 2025-08-14T21:44:24.2154680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2155037Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2155355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2155679Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2156038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2156417Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2156805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:24.2157168Z value_states = self.v_proj(current_states) 2025-08-14T21:44:24.2157301Z 2025-08-14T21:44:24.2157374Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2157568Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2157754Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2157946Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2158157Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2158481Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2158778Z return mod(**inputs) 2025-08-14T21:44:24.2159118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2159492Z outputs = self.model.decoder( 2025-08-14T21:44:24.2159854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2160229Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2160545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2160872Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2161224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2161603Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2161998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2162370Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2162784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:24.2163229Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:24.2163396Z 2025-08-14T21:44:24.2163496Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2163814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2164112Z return mod(**inputs) 2025-08-14T21:44:24.2164450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2164812Z outputs = self.model.decoder( 2025-08-14T21:44:24.2165157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2165514Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2165832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2166160Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2166519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2166899Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2167272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2167642Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2168047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:24.2168472Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:24.2168618Z 2025-08-14T21:44:24.2168717Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2169041Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2169342Z return mod(**inputs) 2025-08-14T21:44:24.2169683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2170039Z outputs = self.model.decoder( 2025-08-14T21:44:24.2170389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2170748Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2171063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2171387Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2171799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2172199Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2172594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:24.2172981Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:24.2173106Z 2025-08-14T21:44:24.2173202Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2173532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2173832Z return mod(**inputs) 2025-08-14T21:44:24.2174170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2174541Z outputs = self.model.decoder( 2025-08-14T21:44:24.2174899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2175260Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2175571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2175905Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2176268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2176665Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2176823Z 2025-08-14T21:44:24.2176918Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2177249Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2177544Z return mod(**inputs) 2025-08-14T21:44:24.2177880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2178232Z outputs = self.model.decoder( 2025-08-14T21:44:24.2178585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2178946Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2179258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2179589Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2179947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2180340Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2180688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:24.2181002Z return self.act(input) 2025-08-14T21:44:24.2181103Z 2025-08-14T21:44:24.2181204Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2181525Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2181820Z return mod(**inputs) 2025-08-14T21:44:24.2182161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2182519Z outputs = self.model.decoder( 2025-08-14T21:44:24.2182862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2183222Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2183538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2183872Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2184228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:24.2184850Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:24.2185037Z 2025-08-14T21:44:24.2185231Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2185631Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2185984Z return mod(**inputs) 2025-08-14T21:44:24.2186322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2186684Z outputs = self.model.decoder( 2025-08-14T21:44:24.2187031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2187385Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2187735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2188071Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2188448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2188845Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2189236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:24.2189670Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:24.2189871Z 2025-08-14T21:44:24.2189966Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2190303Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2190606Z return mod(**inputs) 2025-08-14T21:44:24.2190946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2191317Z outputs = self.model.decoder( 2025-08-14T21:44:24.2191678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2192038Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2192361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2192700Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2193068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2193452Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2193837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:24.2194213Z key_states = self.k_proj(current_states) 2025-08-14T21:44:24.2194336Z 2025-08-14T21:44:24.2194439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2194767Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2195070Z return mod(**inputs) 2025-08-14T21:44:24.2195414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2195777Z outputs = self.model.decoder( 2025-08-14T21:44:24.2196135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2196499Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2196821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2197149Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2197519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2197905Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2198299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:24.2198708Z value_states = self.v_proj(current_states) 2025-08-14T21:44:24.2198857Z 2025-08-14T21:44:24.2198930Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2199122Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2199306Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2199494Z cudagraph partition due to non gpu ops 2025-08-14T21:44:24.2199706Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2200027Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2200339Z return mod(**inputs) 2025-08-14T21:44:24.2200674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2201033Z outputs = self.model.decoder( 2025-08-14T21:44:24.2201379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2201742Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2202058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2202380Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2202740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2203119Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2203494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2203864Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2204269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:24.2204707Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:24.2204877Z 2025-08-14T21:44:24.2204976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2205296Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2205589Z return mod(**inputs) 2025-08-14T21:44:24.2205925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2206280Z outputs = self.model.decoder( 2025-08-14T21:44:24.2206631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2206988Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2207303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2207626Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2207988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2208370Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2208748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:24.2209122Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:24.2209525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:24.2209943Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:24.2210089Z 2025-08-14T21:44:24.2210183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2210508Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2210821Z return mod(**inputs) 2025-08-14T21:44:24.2211173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2211547Z outputs = self.model.decoder( 2025-08-14T21:44:24.2211899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2212257Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2212568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2212898Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2213284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:24.2213665Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:24.2214037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:24.2214407Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:24.2214538Z 2025-08-14T21:44:24.2214631Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2214958Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2215249Z return mod(**inputs) 2025-08-14T21:44:24.2215586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2215945Z outputs = self.model.decoder( 2025-08-14T21:44:24.2216291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2216649Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2216964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2217295Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2217650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2218053Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2218211Z 2025-08-14T21:44:24.2218311Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2218637Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2218928Z return mod(**inputs) 2025-08-14T21:44:24.2219264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2219625Z outputs = self.model.decoder( 2025-08-14T21:44:24.2219972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2220339Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2220657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2220988Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2221343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:24.2221740Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:24.2222095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:24.2222401Z return self.act(input) 2025-08-14T21:44:24.2222514Z 2025-08-14T21:44:24.2222608Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2222936Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2223231Z return mod(**inputs) 2025-08-14T21:44:24.2223594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1678, in forward 2025-08-14T21:44:24.2223972Z outputs = self.model.decoder( 2025-08-14T21:44:24.2224325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:24.2224747Z layer_outputs = decoder_layer( 2025-08-14T21:44:24.2225067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:24.2225394Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:24.2225757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:24.2226140Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:24.2226273Z 2025-08-14T21:44:24.2226368Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2226700Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2227000Z return mod(**inputs) 2025-08-14T21:44:24.2227335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1694, in forward 2025-08-14T21:44:24.2227702Z logits = self.lm_head(outputs[0]) 2025-08-14T21:44:24.2227821Z 2025-08-14T21:44:24.2227922Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:24.2228240Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:24.2228532Z return mod(**inputs) 2025-08-14T21:44:24.2228866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1700, in forward 2025-08-14T21:44:24.2229288Z loss = loss_fct(logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:44:24.2229469Z 2025-08-14T21:44:31.0744534Z Compilation time (from dynamo_timed): 9.949909471 2025-08-14T21:44:31.1017461Z pass 2025-08-14T21:44:31.1021830Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:31.1026217Z TIMING: _recursive_pre_grad_passes:0.00492 _recursive_joint_graph_passes:0.22262 _recursive_post_grad_passes:0.04778 async_compile.wait:0.71042 code_gen:6.37459 inductor_compile:7.19572 backend_compile:8.82707 gc:0.00042 entire_frame_compile:9.94991 total_wall_time:9.94991 2025-08-14T21:44:31.1027760Z STATS: call_* op count: 198 | FakeTensorMode.__torch_dispatch__:7102 | FakeTensor.__torch_dispatch__:2588 | ProxyTorchDispatchMode.__torch_dispatch__:2533 2025-08-14T21:44:31.1028252Z Dynamo produced 1 graphs covering 198 ops with 0 graph breaks (0 unique) 2025-08-14T21:44:35.1917864Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:44:35.1918873Z from pkg_resources import resource_filename 2025-08-14T21:44:35.7823119Z 2025-08-14T21:44:37.9885054Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:44:37.9889193Z loading model: 0it [00:02, ?it/s] 2025-08-14T21:44:37.9904353Z cpu eval PLBartForConditionalGeneration 2025-08-14T21:44:38.9042614Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:39.3445068Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:39.7950696Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:48.2204242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2207742Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2212160Z return mod(**inputs) 2025-08-14T21:44:48.2217234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1357, in forward 2025-08-14T21:44:48.2218208Z decoder_input_ids = shift_tokens_right(labels, self.config.pad_token_id) 2025-08-14T21:44:48.2218694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1084, in shift_tokens_right 2025-08-14T21:44:48.2219189Z index_of_eos = (prev_output_tokens.ne(pad_token_id).sum(dim=1) - 1).unsqueeze(-1) 2025-08-14T21:44:48.2219403Z 2025-08-14T21:44:48.2219483Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2219682Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2219933Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2220111Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2220296Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2220482Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2220697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2221045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2221356Z return mod(**inputs) 2025-08-14T21:44:48.2221703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2222081Z outputs = self.model( 2025-08-14T21:44:48.2222429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2222792Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2223153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2223521Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2223853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2224293Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2224754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2225154Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2225539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2225964Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2226163Z 2025-08-14T21:44:48.2226264Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2226605Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2226902Z return mod(**inputs) 2025-08-14T21:44:48.2227236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2227599Z outputs = self.model( 2025-08-14T21:44:48.2227944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2228311Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2228663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2229024Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2229342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2229669Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2230033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2230418Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2231841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2232273Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2232409Z 2025-08-14T21:44:48.2232507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2232846Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2233147Z return mod(**inputs) 2025-08-14T21:44:48.2233481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2233839Z outputs = self.model( 2025-08-14T21:44:48.2234198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2234551Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2234909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2235265Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2235591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2235926Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2236300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2236676Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2237042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2237415Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2237554Z 2025-08-14T21:44:48.2237629Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2237824Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2238007Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2238197Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2238416Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2238743Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2239043Z return mod(**inputs) 2025-08-14T21:44:48.2239389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2239750Z outputs = self.model( 2025-08-14T21:44:48.2240081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2240445Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2240799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2241149Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2241475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2241812Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2242173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2242542Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2242913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2243295Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2243704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2244144Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2244321Z 2025-08-14T21:44:48.2244419Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2244810Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2245121Z return mod(**inputs) 2025-08-14T21:44:48.2245455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2245831Z outputs = self.model( 2025-08-14T21:44:48.2246170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2246525Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2246879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2247258Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2247569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2247905Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2248305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2248688Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2249056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2249435Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2249854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2250275Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2250430Z 2025-08-14T21:44:48.2250527Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2250856Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2251153Z return mod(**inputs) 2025-08-14T21:44:48.2251486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2251847Z outputs = self.model( 2025-08-14T21:44:48.2252187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2252547Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2252897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2253252Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2253575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2253905Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2254268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2254648Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2255022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2255381Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2255513Z 2025-08-14T21:44:48.2255606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2255936Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2256226Z return mod(**inputs) 2025-08-14T21:44:48.2256577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2256931Z outputs = self.model( 2025-08-14T21:44:48.2257272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2257642Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2258025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2258407Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2258730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2259059Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2259426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2259833Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2260010Z 2025-08-14T21:44:48.2260111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2260450Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2260756Z return mod(**inputs) 2025-08-14T21:44:48.2261117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2261482Z outputs = self.model( 2025-08-14T21:44:48.2261836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2262207Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2262566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2262942Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2263272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2263613Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2263979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2264396Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2264992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2265312Z return self.act(input) 2025-08-14T21:44:48.2265413Z 2025-08-14T21:44:48.2265508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2265840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2266139Z return mod(**inputs) 2025-08-14T21:44:48.2266471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2266833Z outputs = self.model( 2025-08-14T21:44:48.2267175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2267537Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2267888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2268246Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2268564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2268890Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2269249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 509, in forward 2025-08-14T21:44:48.2269612Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2269739Z 2025-08-14T21:44:48.2269845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2270172Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2270465Z return mod(**inputs) 2025-08-14T21:44:48.2270817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2271205Z outputs = self.model( 2025-08-14T21:44:48.2271538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2271898Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2272252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2272598Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2272917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2273270Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2273635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2274006Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2274385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2274818Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2275009Z 2025-08-14T21:44:48.2275113Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2275437Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2275737Z return mod(**inputs) 2025-08-14T21:44:48.2276077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2276428Z outputs = self.model( 2025-08-14T21:44:48.2276770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2277134Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2277492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2277846Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2278167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2278499Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2278855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2279230Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2279607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2279970Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2280093Z 2025-08-14T21:44:48.2280190Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2280522Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2280820Z return mod(**inputs) 2025-08-14T21:44:48.2281161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2281511Z outputs = self.model( 2025-08-14T21:44:48.2281848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2282209Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2282558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2282916Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2283238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2283584Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2283960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2284354Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2284908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2285281Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2285418Z 2025-08-14T21:44:48.2285491Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2285686Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2285879Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2286125Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2286338Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2286666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2286955Z return mod(**inputs) 2025-08-14T21:44:48.2287296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2287650Z outputs = self.model( 2025-08-14T21:44:48.2287991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2288344Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2288696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2289053Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2289369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2289705Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2290068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2290443Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2290811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2291192Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2291598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2292038Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2292206Z 2025-08-14T21:44:48.2292300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2292629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2292927Z return mod(**inputs) 2025-08-14T21:44:48.2293255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2293614Z outputs = self.model( 2025-08-14T21:44:48.2293951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2294308Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2294653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2295011Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2295326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2295652Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2296015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2296388Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2296813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2297214Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2297622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2298045Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2298192Z 2025-08-14T21:44:48.2298294Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2298616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2298935Z return mod(**inputs) 2025-08-14T21:44:48.2299279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2299633Z outputs = self.model( 2025-08-14T21:44:48.2299979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2300344Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2300702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2301055Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2301375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2301708Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2302068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2302437Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2302810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2303176Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2303302Z 2025-08-14T21:44:48.2303402Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2303733Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2304031Z return mod(**inputs) 2025-08-14T21:44:48.2304368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2304787Z outputs = self.model( 2025-08-14T21:44:48.2305132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2305503Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2305851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2306213Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2306540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2306878Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2307236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2307643Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2307805Z 2025-08-14T21:44:48.2307911Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2308244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2308539Z return mod(**inputs) 2025-08-14T21:44:48.2308882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2309260Z outputs = self.model( 2025-08-14T21:44:48.2309613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2310017Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2310373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2310729Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2311037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2311366Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2311726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2312138Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2312497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2312813Z return self.act(input) 2025-08-14T21:44:48.2312913Z 2025-08-14T21:44:48.2313019Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2313347Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2313647Z return mod(**inputs) 2025-08-14T21:44:48.2313988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2314346Z outputs = self.model( 2025-08-14T21:44:48.2314682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2315048Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2315406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2315760Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2316080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2316416Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2316781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 509, in forward 2025-08-14T21:44:48.2317144Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2317275Z 2025-08-14T21:44:48.2317370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2317701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2317991Z return mod(**inputs) 2025-08-14T21:44:48.2318335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2318694Z outputs = self.model( 2025-08-14T21:44:48.2319036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2319392Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2319752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2320113Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2320426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2320759Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2321124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2321503Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2321876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2322322Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2322519Z 2025-08-14T21:44:48.2322631Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2322977Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2323267Z return mod(**inputs) 2025-08-14T21:44:48.2323606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2323966Z outputs = self.model( 2025-08-14T21:44:48.2324300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2324684Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2325043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2325403Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2325717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2326053Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2326416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2326792Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2327163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2327531Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2327655Z 2025-08-14T21:44:48.2327759Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2328084Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2328382Z return mod(**inputs) 2025-08-14T21:44:48.2328720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2329078Z outputs = self.model( 2025-08-14T21:44:48.2329413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2329777Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2330131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2330478Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2330800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2331135Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2331494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2331863Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2332242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2332616Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2332745Z 2025-08-14T21:44:48.2332824Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2333013Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2333205Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2333394Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2333603Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2333934Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2334232Z return mod(**inputs) 2025-08-14T21:44:48.2334565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2334923Z outputs = self.model( 2025-08-14T21:44:48.2335290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2335682Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2336031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2336392Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2336712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2337046Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2337405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2337796Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2338168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2338546Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2338958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2339397Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2339566Z 2025-08-14T21:44:48.2339668Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2339996Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2340295Z return mod(**inputs) 2025-08-14T21:44:48.2340636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2340987Z outputs = self.model( 2025-08-14T21:44:48.2341326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2341686Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2342043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2342396Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2342716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2343045Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2343410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2343777Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2344155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2344535Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2345015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2345443Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2345601Z 2025-08-14T21:44:48.2345698Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2346034Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2346328Z return mod(**inputs) 2025-08-14T21:44:48.2346671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2347035Z outputs = self.model( 2025-08-14T21:44:48.2347384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2347742Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2348116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2348499Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2348830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2349161Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2349522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2349895Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2350255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2350639Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2350764Z 2025-08-14T21:44:48.2350865Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2351188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2351485Z return mod(**inputs) 2025-08-14T21:44:48.2351826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2352184Z outputs = self.model( 2025-08-14T21:44:48.2352517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2352878Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2353231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2353594Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2353908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2354241Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2354604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2355003Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2355170Z 2025-08-14T21:44:48.2355267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2355596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2355894Z return mod(**inputs) 2025-08-14T21:44:48.2356226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2356588Z outputs = self.model( 2025-08-14T21:44:48.2356930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2357286Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2357642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2358002Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2358323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2358647Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2359010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2359411Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2359769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2360077Z return self.act(input) 2025-08-14T21:44:48.2360183Z 2025-08-14T21:44:48.2360279Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2360610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2360917Z return mod(**inputs) 2025-08-14T21:44:48.2361274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2361649Z outputs = self.model( 2025-08-14T21:44:48.2361991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2362349Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2362706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2363067Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2363393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2363727Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2364091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 509, in forward 2025-08-14T21:44:48.2364455Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2364580Z 2025-08-14T21:44:48.2364675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2365003Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2365298Z return mod(**inputs) 2025-08-14T21:44:48.2365629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2365983Z outputs = self.model( 2025-08-14T21:44:48.2366319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2366680Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2367022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2367379Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2367698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2368030Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2368381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2368754Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2369129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2369551Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2369748Z 2025-08-14T21:44:48.2369845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2370174Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2370471Z return mod(**inputs) 2025-08-14T21:44:48.2370805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2371163Z outputs = self.model( 2025-08-14T21:44:48.2371500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2371858Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2372211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2372568Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2372899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2373225Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2373603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2373999Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2374394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2374753Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2374885Z 2025-08-14T21:44:48.2374982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2375317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2375611Z return mod(**inputs) 2025-08-14T21:44:48.2375951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2376325Z outputs = self.model( 2025-08-14T21:44:48.2376667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2377025Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2377384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2377746Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2378067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2378393Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2378759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2379136Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2379502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2379877Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2380013Z 2025-08-14T21:44:48.2380086Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2380282Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2380471Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2392713Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2393054Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2393419Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2393740Z return mod(**inputs) 2025-08-14T21:44:48.2394114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2394495Z outputs = self.model( 2025-08-14T21:44:48.2394873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2395244Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2395619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2396001Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2396334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2396672Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2397053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2397443Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2397830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2398216Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2398635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2399191Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2399370Z 2025-08-14T21:44:48.2399552Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2399900Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2400207Z return mod(**inputs) 2025-08-14T21:44:48.2400565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2400930Z outputs = self.model( 2025-08-14T21:44:48.2401283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2401689Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2402043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2402405Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2402733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2403071Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2403429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2403809Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2404186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2404569Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2404974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2405400Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2405548Z 2025-08-14T21:44:48.2405659Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2405990Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2406293Z return mod(**inputs) 2025-08-14T21:44:48.2406639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2407001Z outputs = self.model( 2025-08-14T21:44:48.2407337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2407701Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2408061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2408426Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2408741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2409082Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2409452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2409826Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2410205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2410574Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2410704Z 2025-08-14T21:44:48.2410811Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2411137Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2411439Z return mod(**inputs) 2025-08-14T21:44:48.2411781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2412133Z outputs = self.model( 2025-08-14T21:44:48.2412504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2412895Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2413252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2413603Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2413925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2414259Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2414640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2415041Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2415213Z 2025-08-14T21:44:48.2415310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2415644Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2415938Z return mod(**inputs) 2025-08-14T21:44:48.2416282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2416640Z outputs = self.model( 2025-08-14T21:44:48.2416979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2417331Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2417686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2418045Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2418354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2418689Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2419051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2419449Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2419797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2420109Z return self.act(input) 2025-08-14T21:44:48.2420208Z 2025-08-14T21:44:48.2420308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2420639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2420931Z return mod(**inputs) 2025-08-14T21:44:48.2421270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2421626Z outputs = self.model( 2025-08-14T21:44:48.2421959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2422321Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2422675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2423030Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2423340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2423671Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2424031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 509, in forward 2025-08-14T21:44:48.2424394Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2424530Z 2025-08-14T21:44:48.2424697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2425067Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2425412Z return mod(**inputs) 2025-08-14T21:44:48.2425753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2426120Z outputs = self.model( 2025-08-14T21:44:48.2426751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2427114Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2427461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2427857Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2428178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2428501Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2428870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2429253Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2429624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2430048Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2430243Z 2025-08-14T21:44:48.2430346Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2430673Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2430964Z return mod(**inputs) 2025-08-14T21:44:48.2431305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2431662Z outputs = self.model( 2025-08-14T21:44:48.2431994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2432362Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2432715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2433071Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2433383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2433716Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2434081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2434451Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2434828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2435195Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2435320Z 2025-08-14T21:44:48.2435427Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2435750Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2436044Z return mod(**inputs) 2025-08-14T21:44:48.2436386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2436745Z outputs = self.model( 2025-08-14T21:44:48.2437075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2437438Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2437791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2438144Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2438499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2438845Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2439209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2439584Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2439958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2440333Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2440479Z 2025-08-14T21:44:48.2440552Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2440745Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2440934Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2441122Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2441328Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2441657Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2441953Z return mod(**inputs) 2025-08-14T21:44:48.2442283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2442642Z outputs = self.model( 2025-08-14T21:44:48.2442979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2443341Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2443687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2444045Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2444362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2444685Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2445050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2445423Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2445795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2446166Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2446573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2447016Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2447184Z 2025-08-14T21:44:48.2447285Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2447611Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2447910Z return mod(**inputs) 2025-08-14T21:44:48.2448256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2448610Z outputs = self.model( 2025-08-14T21:44:48.2448948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2449309Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2449662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2450010Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2450326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2450657Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2451024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2451428Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2451804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2452182Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2452582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2453005Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2453179Z 2025-08-14T21:44:48.2453276Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2453606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2453899Z return mod(**inputs) 2025-08-14T21:44:48.2454238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2454601Z outputs = self.model( 2025-08-14T21:44:48.2454933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2455299Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2455655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2456009Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2456322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2456655Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2457017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2457389Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2457762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2458132Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2458255Z 2025-08-14T21:44:48.2458355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2458677Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2458977Z return mod(**inputs) 2025-08-14T21:44:48.2459309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2459656Z outputs = self.model( 2025-08-14T21:44:48.2459995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2460355Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2460707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2461055Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2461372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2461699Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2462054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2462456Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2462622Z 2025-08-14T21:44:48.2462719Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2463044Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2463333Z return mod(**inputs) 2025-08-14T21:44:48.2463700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2464092Z outputs = self.model( 2025-08-14T21:44:48.2464425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2464863Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2465220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2465579Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2465893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2466243Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2466604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2467004Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2467357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2467674Z return self.act(input) 2025-08-14T21:44:48.2467774Z 2025-08-14T21:44:48.2467874Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2468195Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2468488Z return mod(**inputs) 2025-08-14T21:44:48.2468823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2469170Z outputs = self.model( 2025-08-14T21:44:48.2469503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2469863Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2470215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2470567Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2470885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2471215Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2471575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 509, in forward 2025-08-14T21:44:48.2471933Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2472067Z 2025-08-14T21:44:48.2472160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2472490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2472788Z return mod(**inputs) 2025-08-14T21:44:48.2473117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2473470Z outputs = self.model( 2025-08-14T21:44:48.2473805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2474155Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2474506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2474857Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2475173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2475495Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2475855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2476226Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2476604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2477075Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2477278Z 2025-08-14T21:44:48.2477373Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2477703Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2477995Z return mod(**inputs) 2025-08-14T21:44:48.2478339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2478697Z outputs = self.model( 2025-08-14T21:44:48.2479058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2479411Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2479768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2480125Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2480435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2480763Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2481122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2481488Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2481851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2482214Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2482336Z 2025-08-14T21:44:48.2482436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2482759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2483055Z return mod(**inputs) 2025-08-14T21:44:48.2483393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2483746Z outputs = self.model( 2025-08-14T21:44:48.2484071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2484433Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2484909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2485273Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2485586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2485916Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2486277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2486645Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2487021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2487394Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2487524Z 2025-08-14T21:44:48.2487606Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2487797Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2487992Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2488183Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2488396Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2488731Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2489030Z return mod(**inputs) 2025-08-14T21:44:48.2489404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2489801Z outputs = self.model( 2025-08-14T21:44:48.2490143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2490502Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2490845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2491201Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2491518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2491877Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2492234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2492615Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2492989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2493368Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2493780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2494223Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2494392Z 2025-08-14T21:44:48.2494492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2494814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2495112Z return mod(**inputs) 2025-08-14T21:44:48.2495452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2495811Z outputs = self.model( 2025-08-14T21:44:48.2496142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2496500Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2496848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2497198Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2497517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2497846Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2498205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2498568Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2498934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2499313Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2499717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2500128Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2500280Z 2025-08-14T21:44:48.2500372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2500699Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2500988Z return mod(**inputs) 2025-08-14T21:44:48.2501327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2501683Z outputs = self.model( 2025-08-14T21:44:48.2502037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2502394Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2502782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2503144Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2503456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2503792Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2504155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 496, in forward 2025-08-14T21:44:48.2504547Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:44:48.2504983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2505359Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2505486Z 2025-08-14T21:44:48.2505591Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2505930Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2506232Z return mod(**inputs) 2025-08-14T21:44:48.2506580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2506946Z outputs = self.model( 2025-08-14T21:44:48.2507281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2507649Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2508013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2508373Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2508691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2509025Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2509393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2509794Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2509961Z 2025-08-14T21:44:48.2510055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2510384Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2510682Z return mod(**inputs) 2025-08-14T21:44:48.2511014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2511377Z outputs = self.model( 2025-08-14T21:44:48.2511717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2512083Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2512434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2512797Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2513115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2513440Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2513803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 507, in forward 2025-08-14T21:44:48.2514203Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2514562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2514867Z return self.act(input) 2025-08-14T21:44:48.2514976Z 2025-08-14T21:44:48.2515089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2515435Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2515743Z return mod(**inputs) 2025-08-14T21:44:48.2516081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2516437Z outputs = self.model( 2025-08-14T21:44:48.2516773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1189, in forward 2025-08-14T21:44:48.2517127Z encoder_outputs = self.encoder( 2025-08-14T21:44:48.2517479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 669, in forward 2025-08-14T21:44:48.2517854Z layer_outputs = encoder_layer( 2025-08-14T21:44:48.2518172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2518500Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2518863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 509, in forward 2025-08-14T21:44:48.2519231Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2519356Z 2025-08-14T21:44:48.2519449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2519778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2520071Z return mod(**inputs) 2025-08-14T21:44:48.2520409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2520764Z outputs = self.model( 2025-08-14T21:44:48.2521103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2521464Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2521814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2522176Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2522483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2522811Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2523164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2523549Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2523929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2524360Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2524548Z 2025-08-14T21:44:48.2524644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2524975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2525271Z return mod(**inputs) 2025-08-14T21:44:48.2525603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2525958Z outputs = self.model( 2025-08-14T21:44:48.2526296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2526657Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2527004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2527361Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2527676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2528019Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2528420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2528807Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2529186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2529545Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2529675Z 2025-08-14T21:44:48.2529769Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2530091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2530402Z return mod(**inputs) 2025-08-14T21:44:48.2530727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2531081Z outputs = self.model( 2025-08-14T21:44:48.2531419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2531774Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2532129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2532487Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2532802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2533123Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2533483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2533866Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2534238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2534611Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2534744Z 2025-08-14T21:44:48.2534818Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2535005Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2535184Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2535369Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2535583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2535904Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2536199Z return mod(**inputs) 2025-08-14T21:44:48.2536531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2536881Z outputs = self.model( 2025-08-14T21:44:48.2537213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2537578Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2537932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2538281Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2538597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2538925Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2539288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2539665Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2540045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2540426Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2540862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2541320Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2541497Z 2025-08-14T21:44:48.2541593Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2541925Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2542214Z return mod(**inputs) 2025-08-14T21:44:48.2542553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2542924Z outputs = self.model( 2025-08-14T21:44:48.2543265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2543620Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2543974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2544330Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2544701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2545030Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2545385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2545767Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2546137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2546511Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2546917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2547334Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2547483Z 2025-08-14T21:44:48.2547578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2547904Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2548193Z return mod(**inputs) 2025-08-14T21:44:48.2548527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2548876Z outputs = self.model( 2025-08-14T21:44:48.2549213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2549569Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2549915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2550274Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2550595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2550927Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2551283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2551662Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2552040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2552399Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2552528Z 2025-08-14T21:44:48.2552622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2552945Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2553252Z return mod(**inputs) 2025-08-14T21:44:48.2553602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2553974Z outputs = self.model( 2025-08-14T21:44:48.2554312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2554672Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2555023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2555379Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2555708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2556028Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2556392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2556781Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2557171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2557590Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2557782Z 2025-08-14T21:44:48.2557876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2558207Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2558501Z return mod(**inputs) 2025-08-14T21:44:48.2558836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2559191Z outputs = self.model( 2025-08-14T21:44:48.2559528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2559882Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2560240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2560597Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2560913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2561234Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2561596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2561983Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2562363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2562725Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2562853Z 2025-08-14T21:44:48.2562946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2563279Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2563565Z return mod(**inputs) 2025-08-14T21:44:48.2563898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2564249Z outputs = self.model( 2025-08-14T21:44:48.2564580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2564939Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2565292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2565649Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2565976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2566328Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2566706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2567095Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2567474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2567842Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2567971Z 2025-08-14T21:44:48.2568051Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2568256Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2568451Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2568640Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2568854Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2569180Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2569478Z return mod(**inputs) 2025-08-14T21:44:48.2569817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2570165Z outputs = self.model( 2025-08-14T21:44:48.2570500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2570859Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2571213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2571566Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2571883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2572216Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2572574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2572964Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2573350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2573729Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2574128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2574566Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2574736Z 2025-08-14T21:44:48.2574837Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2575165Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2575457Z return mod(**inputs) 2025-08-14T21:44:48.2575797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2576150Z outputs = self.model( 2025-08-14T21:44:48.2576483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2576844Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2577198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2577554Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2577864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2578195Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2578583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2578980Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2579386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2579481Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2579749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2579844Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2579848Z 2025-08-14T21:44:48.2579950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2580149Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2580216Z return mod(**inputs) 2025-08-14T21:44:48.2580455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2580516Z outputs = self.model( 2025-08-14T21:44:48.2580763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2580831Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2581066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2581140Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2581340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2581418Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2581653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2581748Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2581990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2582067Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2582072Z 2025-08-14T21:44:48.2582170Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2582349Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2582408Z return mod(**inputs) 2025-08-14T21:44:48.2582650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2582711Z outputs = self.model( 2025-08-14T21:44:48.2582946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2583020Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2583255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2583328Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2583534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2583604Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2583844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2583955Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2583959Z 2025-08-14T21:44:48.2584060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2584242Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2584300Z return mod(**inputs) 2025-08-14T21:44:48.2584720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2584798Z outputs = self.model( 2025-08-14T21:44:48.2585092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2585172Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2585409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2585483Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2585687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2585780Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2586022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2586128Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2586325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2586399Z return self.act(input) 2025-08-14T21:44:48.2586402Z 2025-08-14T21:44:48.2586495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2586683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2586743Z return mod(**inputs) 2025-08-14T21:44:48.2586982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2587051Z outputs = self.model( 2025-08-14T21:44:48.2587291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2587357Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2587604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2587668Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2587877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2587947Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2588183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:48.2588264Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2588267Z 2025-08-14T21:44:48.2588359Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2588545Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2588605Z return mod(**inputs) 2025-08-14T21:44:48.2588841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2588911Z outputs = self.model( 2025-08-14T21:44:48.2589147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2589214Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2589455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2589518Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2589724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2589794Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2590028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2590126Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2590381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2590540Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2590557Z 2025-08-14T21:44:48.2590650Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2590832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2590900Z return mod(**inputs) 2025-08-14T21:44:48.2591139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2591200Z outputs = self.model( 2025-08-14T21:44:48.2591458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2591524Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2591766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2591830Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2592034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2592112Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2592345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2592441Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2592673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2592748Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2592751Z 2025-08-14T21:44:48.2592849Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2593029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2593090Z return mod(**inputs) 2025-08-14T21:44:48.2593333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2593395Z outputs = self.model( 2025-08-14T21:44:48.2593637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2593702Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2593937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2594010Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2594212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2594282Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2594524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2594614Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2594856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2594933Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2594936Z 2025-08-14T21:44:48.2595008Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2595087Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2595155Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2595228Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2595323Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2595503Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2595569Z return mod(**inputs) 2025-08-14T21:44:48.2595820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2595914Z outputs = self.model( 2025-08-14T21:44:48.2596162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2596229Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2596475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2596542Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2596742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2596844Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2597082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2597172Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2597420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2597513Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2597789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2597910Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2597913Z 2025-08-14T21:44:48.2598007Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2598196Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2598256Z return mod(**inputs) 2025-08-14T21:44:48.2598503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2598567Z outputs = self.model( 2025-08-14T21:44:48.2598805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2598883Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2599119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2599184Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2599393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2599465Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2599706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2599797Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2600034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2600132Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2600399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2600510Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2600513Z 2025-08-14T21:44:48.2600606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2600791Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2600855Z return mod(**inputs) 2025-08-14T21:44:48.2601094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2601157Z outputs = self.model( 2025-08-14T21:44:48.2601403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2601485Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2601757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2601823Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2602024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2602104Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2602338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2602440Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2602681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2602753Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2602756Z 2025-08-14T21:44:48.2602856Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2603037Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2603095Z return mod(**inputs) 2025-08-14T21:44:48.2603339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2603400Z outputs = self.model( 2025-08-14T21:44:48.2603643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2603709Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2603945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2604015Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2604217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2604289Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2604531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2604627Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2604865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2605000Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2605004Z 2025-08-14T21:44:48.2605096Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2605283Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2605341Z return mod(**inputs) 2025-08-14T21:44:48.2605584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2605645Z outputs = self.model( 2025-08-14T21:44:48.2605882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2605953Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2606188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2606252Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2606458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2606531Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2606770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2606863Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2607122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2607211Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2607214Z 2025-08-14T21:44:48.2607306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2607488Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2607545Z return mod(**inputs) 2025-08-14T21:44:48.2607780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2607865Z outputs = self.model( 2025-08-14T21:44:48.2608103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2608166Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2608404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2608472Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2608674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2608740Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2608971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2609066Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2609296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2609371Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2609381Z 2025-08-14T21:44:48.2609447Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2609513Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2609582Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2609652Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2609744Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2609931Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2609990Z return mod(**inputs) 2025-08-14T21:44:48.2610229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2610298Z outputs = self.model( 2025-08-14T21:44:48.2610536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2610610Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2610847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2610912Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2611119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2611190Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2611432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2611527Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2611762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2611855Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2612123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2612242Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2612252Z 2025-08-14T21:44:48.2612355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2612553Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2612632Z return mod(**inputs) 2025-08-14T21:44:48.2612873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2612931Z outputs = self.model( 2025-08-14T21:44:48.2613175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2613240Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2613493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2613554Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2613757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2613833Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2614070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2614165Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2614404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2614490Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2614760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2614856Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2614859Z 2025-08-14T21:44:48.2614951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2615137Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2615198Z return mod(**inputs) 2025-08-14T21:44:48.2615441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2615501Z outputs = self.model( 2025-08-14T21:44:48.2615734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2615806Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2616039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2616106Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2616312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2616381Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2616622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2616717Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2616949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2617027Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2617030Z 2025-08-14T21:44:48.2617121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2617304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2617364Z return mod(**inputs) 2025-08-14T21:44:48.2617599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2617663Z outputs = self.model( 2025-08-14T21:44:48.2617910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2618017Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2618262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2618326Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2618531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2618600Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2618832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2618966Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2618969Z 2025-08-14T21:44:48.2619060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2619241Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2619306Z return mod(**inputs) 2025-08-14T21:44:48.2619543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2619610Z outputs = self.model( 2025-08-14T21:44:48.2619844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2619909Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2620151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2620216Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2620419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2620488Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2620722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2620836Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2621029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2621090Z return self.act(input) 2025-08-14T21:44:48.2621093Z 2025-08-14T21:44:48.2621188Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2621366Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2621431Z return mod(**inputs) 2025-08-14T21:44:48.2621669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2621730Z outputs = self.model( 2025-08-14T21:44:48.2621974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2622039Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2622276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2622347Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2622545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2622621Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2622854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:48.2622930Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2622934Z 2025-08-14T21:44:48.2623032Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2623209Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2623287Z return mod(**inputs) 2025-08-14T21:44:48.2623539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2623616Z outputs = self.model( 2025-08-14T21:44:48.2623868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2623934Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2624176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2624248Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2624463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2624536Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2624835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2624930Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2625166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2625301Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2625306Z 2025-08-14T21:44:48.2625403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2625582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2625638Z return mod(**inputs) 2025-08-14T21:44:48.2625886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2625947Z outputs = self.model( 2025-08-14T21:44:48.2626187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2626266Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2626506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2626578Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2626776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2626845Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2627084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2627174Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2627408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2627487Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2627492Z 2025-08-14T21:44:48.2627585Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2627778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2627837Z return mod(**inputs) 2025-08-14T21:44:48.2628075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2628142Z outputs = self.model( 2025-08-14T21:44:48.2628377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2628448Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2628683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2628746Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2628968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2629051Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2629298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2629390Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2629622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2629698Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2629701Z 2025-08-14T21:44:48.2629769Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2629852Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2629921Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2629985Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2630077Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2630261Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2630323Z return mod(**inputs) 2025-08-14T21:44:48.2630563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2630622Z outputs = self.model( 2025-08-14T21:44:48.2630855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2630925Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2631158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2631230Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2631429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2631498Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2631739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2631829Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2632060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2632153Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2632417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2632539Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2632544Z 2025-08-14T21:44:48.2632636Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2632815Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2632878Z return mod(**inputs) 2025-08-14T21:44:48.2633116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2633183Z outputs = self.model( 2025-08-14T21:44:48.2633418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2633482Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2633721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2633784Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2633984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2634061Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2634307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2634416Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2634663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2634749Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2635015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2635108Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2635111Z 2025-08-14T21:44:48.2635200Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2635396Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2635454Z return mod(**inputs) 2025-08-14T21:44:48.2635693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2635753Z outputs = self.model( 2025-08-14T21:44:48.2635994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2636066Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2636302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2636374Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2636575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2636648Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2636891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2636979Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2637217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2637299Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2637302Z 2025-08-14T21:44:48.2637395Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2637582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2637647Z return mod(**inputs) 2025-08-14T21:44:48.2637887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2637956Z outputs = self.model( 2025-08-14T21:44:48.2638197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2638270Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2638511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2638579Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2638789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2638861Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2639095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2639201Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2639439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2639583Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2639587Z 2025-08-14T21:44:48.2639681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2639881Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2639974Z return mod(**inputs) 2025-08-14T21:44:48.2640227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2640295Z outputs = self.model( 2025-08-14T21:44:48.2640530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2640595Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2640838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2640917Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2641121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2641199Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2641437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2641542Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2641776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2641848Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2641852Z 2025-08-14T21:44:48.2641952Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2642132Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2642199Z return mod(**inputs) 2025-08-14T21:44:48.2642436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2642497Z outputs = self.model( 2025-08-14T21:44:48.2642742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2642812Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2643048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2643119Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2643319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2643397Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2643632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2643729Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2643970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2644047Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2644050Z 2025-08-14T21:44:48.2644124Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2644203Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2644273Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2644347Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2644439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2644619Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2644685Z return mod(**inputs) 2025-08-14T21:44:48.2644925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2644986Z outputs = self.model( 2025-08-14T21:44:48.2645231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2645309Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2645567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2645646Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2645846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2645924Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2646160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2646263Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2646738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2646827Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2647100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2647223Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2647226Z 2025-08-14T21:44:48.2647320Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2647509Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2647568Z return mod(**inputs) 2025-08-14T21:44:48.2647810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2647871Z outputs = self.model( 2025-08-14T21:44:48.2648110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2648187Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2648423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2648498Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2648701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2648773Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2649022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2649119Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2649353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2649449Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2649714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2649821Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2649824Z 2025-08-14T21:44:48.2649920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2650100Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2650168Z return mod(**inputs) 2025-08-14T21:44:48.2650405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2650473Z outputs = self.model( 2025-08-14T21:44:48.2650707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2650774Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2651016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2651081Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2651309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2651404Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2651642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2651746Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2651983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2652057Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2652073Z 2025-08-14T21:44:48.2652176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2652359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2652418Z return mod(**inputs) 2025-08-14T21:44:48.2652662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2652725Z outputs = self.model( 2025-08-14T21:44:48.2652968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2653034Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2653269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2653341Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2653543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2653622Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2653856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2653966Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2653969Z 2025-08-14T21:44:48.2654070Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2654252Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2654310Z return mod(**inputs) 2025-08-14T21:44:48.2654553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2654612Z outputs = self.model( 2025-08-14T21:44:48.2654853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2654921Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2655155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2655227Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2655431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2655503Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2655746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2655853Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2656055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2656118Z return self.act(input) 2025-08-14T21:44:48.2656122Z 2025-08-14T21:44:48.2656215Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2656404Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2656463Z return mod(**inputs) 2025-08-14T21:44:48.2656721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2656799Z outputs = self.model( 2025-08-14T21:44:48.2657053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2657126Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2657363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2657427Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2657635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2657722Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2657967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:48.2658040Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2658043Z 2025-08-14T21:44:48.2658137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2658327Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2658385Z return mod(**inputs) 2025-08-14T21:44:48.2658629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2658690Z outputs = self.model( 2025-08-14T21:44:48.2658928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2659000Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2659237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2659302Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2659511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2659583Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2659825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2659914Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2660149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2660293Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2660297Z 2025-08-14T21:44:48.2660388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2660631Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2660696Z return mod(**inputs) 2025-08-14T21:44:48.2660937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2661006Z outputs = self.model( 2025-08-14T21:44:48.2661243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2661308Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2661558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2661622Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2661836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2661911Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2662157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2662264Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2662547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2662640Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2662652Z 2025-08-14T21:44:48.2662750Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2662941Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2663011Z return mod(**inputs) 2025-08-14T21:44:48.2663258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2663340Z outputs = self.model( 2025-08-14T21:44:48.2663594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2663663Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2663916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2663985Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2664196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2664277Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2664520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2664613Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2664925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2665012Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2665016Z 2025-08-14T21:44:48.2665098Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2665172Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2665247Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2665327Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2665424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2665615Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2665682Z return mod(**inputs) 2025-08-14T21:44:48.2665981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2666050Z outputs = self.model( 2025-08-14T21:44:48.2666294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2666363Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2666617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2666684Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2666892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2666975Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2667219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2667322Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2667564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2667653Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2667938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2668063Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2668066Z 2025-08-14T21:44:48.2668186Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2668391Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2668468Z return mod(**inputs) 2025-08-14T21:44:48.2668730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2668790Z outputs = self.model( 2025-08-14T21:44:48.2669037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2669112Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2669400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2669474Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2669680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2669753Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2670005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2670095Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2670344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2670432Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2670704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2670813Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2670817Z 2025-08-14T21:44:48.2670911Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2671107Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2671170Z return mod(**inputs) 2025-08-14T21:44:48.2671416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2671485Z outputs = self.model( 2025-08-14T21:44:48.2671727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2671796Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2672048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2672115Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2672327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2672398Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2672639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2672737Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2672979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2673053Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2673064Z 2025-08-14T21:44:48.2673159Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2673340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2673408Z return mod(**inputs) 2025-08-14T21:44:48.2673651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2673712Z outputs = self.model( 2025-08-14T21:44:48.2673975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2674057Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2674320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2674386Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2674591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2674669Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2674909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2675029Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2675280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2675421Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2675424Z 2025-08-14T21:44:48.2675527Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2675713Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2675773Z return mod(**inputs) 2025-08-14T21:44:48.2676021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2676082Z outputs = self.model( 2025-08-14T21:44:48.2676331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2676401Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2676645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2676716Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2676924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2676997Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2677245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2677343Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2677590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2677665Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2677670Z 2025-08-14T21:44:48.2677775Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2677962Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2678019Z return mod(**inputs) 2025-08-14T21:44:48.2678258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2678328Z outputs = self.model( 2025-08-14T21:44:48.2678563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2678638Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2678873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2678936Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2679144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2679215Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2679457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2679567Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2679815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2679917Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2679920Z 2025-08-14T21:44:48.2679993Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2680065Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2680142Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2680210Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2680312Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2680517Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2680575Z return mod(**inputs) 2025-08-14T21:44:48.2680819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2680881Z outputs = self.model( 2025-08-14T21:44:48.2681117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2681194Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2681427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2681497Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2681698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2681769Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2682011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2682107Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2682341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2682440Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2682705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2682831Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2682834Z 2025-08-14T21:44:48.2682927Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2683107Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2683175Z return mod(**inputs) 2025-08-14T21:44:48.2683413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2683480Z outputs = self.model( 2025-08-14T21:44:48.2683717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2683783Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2684027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2684091Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2684292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2684372Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2684742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2684862Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2685104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2685226Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2685534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2685660Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2685663Z 2025-08-14T21:44:48.2685766Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2685958Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2686019Z return mod(**inputs) 2025-08-14T21:44:48.2686263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2686348Z outputs = self.model( 2025-08-14T21:44:48.2686586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2686661Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2686898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2686973Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2687172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2687243Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2687484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2687578Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2687818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2687891Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2687894Z 2025-08-14T21:44:48.2687985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2688173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2688231Z return mod(**inputs) 2025-08-14T21:44:48.2688466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2688535Z outputs = self.model( 2025-08-14T21:44:48.2688768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2688839Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2689073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2689138Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2689344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2689415Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2689657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2689766Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2689769Z 2025-08-14T21:44:48.2689861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2690048Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2690107Z return mod(**inputs) 2025-08-14T21:44:48.2690342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2690412Z outputs = self.model( 2025-08-14T21:44:48.2690646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2690719Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2690982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2691062Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2691269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2691340Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2691574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2691690Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2691900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2691970Z return self.act(input) 2025-08-14T21:44:48.2691974Z 2025-08-14T21:44:48.2692067Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2692250Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2692322Z return mod(**inputs) 2025-08-14T21:44:48.2692558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2692626Z outputs = self.model( 2025-08-14T21:44:48.2692863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2692929Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2693172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2693240Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2693440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2693519Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2693754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:48.2693837Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2693840Z 2025-08-14T21:44:48.2693931Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2694108Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2694173Z return mod(**inputs) 2025-08-14T21:44:48.2694407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2694475Z outputs = self.model( 2025-08-14T21:44:48.2694709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2694774Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2695016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2695082Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2695282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2695360Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2695594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2695690Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2695922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2696059Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2696062Z 2025-08-14T21:44:48.2696160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2696355Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2696448Z return mod(**inputs) 2025-08-14T21:44:48.2696688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2696749Z outputs = self.model( 2025-08-14T21:44:48.2696990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2697055Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2697288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2697388Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2697593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2697673Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2697912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2698003Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2698248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2698321Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2698324Z 2025-08-14T21:44:48.2698416Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2698604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2698664Z return mod(**inputs) 2025-08-14T21:44:48.2698910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2698972Z outputs = self.model( 2025-08-14T21:44:48.2699211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2699286Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2699525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2699598Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2699800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2699871Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2700115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2700205Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2700441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2700527Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2700530Z 2025-08-14T21:44:48.2700604Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2700686Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2700755Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2700822Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2700925Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2701105Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2701164Z return mod(**inputs) 2025-08-14T21:44:48.2701410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2701472Z outputs = self.model( 2025-08-14T21:44:48.2701715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2701795Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2702047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2702140Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2702343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2702414Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2702656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2702746Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2703005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2703092Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2703357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2703489Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2703492Z 2025-08-14T21:44:48.2703584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2703771Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2703831Z return mod(**inputs) 2025-08-14T21:44:48.2704065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2704135Z outputs = self.model( 2025-08-14T21:44:48.2704374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2704440Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2704739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2704812Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2705023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2705094Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2705326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2705423Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2705655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2705750Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2706016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2706115Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2706118Z 2025-08-14T21:44:48.2706224Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2706406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2706466Z return mod(**inputs) 2025-08-14T21:44:48.2706713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2706775Z outputs = self.model( 2025-08-14T21:44:48.2707024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2707094Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2707334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2707410Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2707645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2707747Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2707988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2708078Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2708321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2708394Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2708399Z 2025-08-14T21:44:48.2708506Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2708694Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2708755Z return mod(**inputs) 2025-08-14T21:44:48.2709004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2709071Z outputs = self.model( 2025-08-14T21:44:48.2709309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2709386Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2709624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2709690Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2709901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2709978Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2710225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2710328Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2710565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2710716Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2710719Z 2025-08-14T21:44:48.2710817Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2711009Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2711073Z return mod(**inputs) 2025-08-14T21:44:48.2711313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2711387Z outputs = self.model( 2025-08-14T21:44:48.2711624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2711695Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2711943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2712013Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2712222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2712297Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2712534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2712643Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2712881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2712961Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2712964Z 2025-08-14T21:44:48.2713061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2713275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2713359Z return mod(**inputs) 2025-08-14T21:44:48.2713596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2713659Z outputs = self.model( 2025-08-14T21:44:48.2713905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2713970Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2714215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2714296Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2714494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2714574Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2714809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2714914Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2715146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2715223Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2715226Z 2025-08-14T21:44:48.2715304Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2715373Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2715444Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2715518Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2715610Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2715796Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2715857Z return mod(**inputs) 2025-08-14T21:44:48.2716094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2716163Z outputs = self.model( 2025-08-14T21:44:48.2716399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2716464Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2716708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2716772Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2716982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2717053Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2717291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2717396Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2717629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2717717Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2717987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2718107Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2718110Z 2025-08-14T21:44:48.2718211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2718393Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2718452Z return mod(**inputs) 2025-08-14T21:44:48.2718713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2718807Z outputs = self.model( 2025-08-14T21:44:48.2719055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2719121Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2719359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2719432Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2719635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2719723Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2719971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2720068Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2720317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2720407Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2720673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2720780Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2720783Z 2025-08-14T21:44:48.2720876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2721066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2721127Z return mod(**inputs) 2025-08-14T21:44:48.2721364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2721433Z outputs = self.model( 2025-08-14T21:44:48.2721672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2721740Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2721987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2722051Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2722260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2722329Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2722563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2722666Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2722904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2722986Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2722992Z 2025-08-14T21:44:48.2723085Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2723265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2723330Z return mod(**inputs) 2025-08-14T21:44:48.2723567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2723628Z outputs = self.model( 2025-08-14T21:44:48.2723873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2723941Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2724188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2724266Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2724491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2724592Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2724826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2724941Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2724944Z 2025-08-14T21:44:48.2725037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2725215Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2725300Z return mod(**inputs) 2025-08-14T21:44:48.2725542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2725602Z outputs = self.model( 2025-08-14T21:44:48.2725852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2725920Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2726169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2726233Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2726437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2726518Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2726759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2726868Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2727074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2727136Z return self.act(input) 2025-08-14T21:44:48.2727140Z 2025-08-14T21:44:48.2727243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2727426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2727485Z return mod(**inputs) 2025-08-14T21:44:48.2727732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2727792Z outputs = self.model( 2025-08-14T21:44:48.2728040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2728107Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2728346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2728418Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2728626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2728699Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2728951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:48.2729026Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2729029Z 2025-08-14T21:44:48.2729127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2729310Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2729370Z return mod(**inputs) 2025-08-14T21:44:48.2729617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2729677Z outputs = self.model( 2025-08-14T21:44:48.2729934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2730038Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2730274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2730345Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2730544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2730614Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2730855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2730960Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2731201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2731338Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2731344Z 2025-08-14T21:44:48.2731436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2731622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2731680Z return mod(**inputs) 2025-08-14T21:44:48.2731917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2731984Z outputs = self.model( 2025-08-14T21:44:48.2732217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2732292Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2732527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2732590Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2732801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2732871Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2733112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2733203Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2733435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2733514Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2733518Z 2025-08-14T21:44:48.2733611Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2733791Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2733858Z return mod(**inputs) 2025-08-14T21:44:48.2734095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2734165Z outputs = self.model( 2025-08-14T21:44:48.2734400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2734463Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2734705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2734768Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2734976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2735047Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2735281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2735390Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2735641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2735733Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2735736Z 2025-08-14T21:44:48.2735814Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2735885Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2735961Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2736027Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2736118Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2736327Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2736387Z return mod(**inputs) 2025-08-14T21:44:48.2736626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2736696Z outputs = self.model( 2025-08-14T21:44:48.2736935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2737011Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2737247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2737311Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2737517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2737587Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2737826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2737920Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2738157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2738254Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2738521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2738642Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2738645Z 2025-08-14T21:44:48.2738747Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2738929Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2738998Z return mod(**inputs) 2025-08-14T21:44:48.2739237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2739298Z outputs = self.model( 2025-08-14T21:44:48.2739544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2739612Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2739850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2739921Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2740122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2740200Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2740435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2740523Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2740767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2740872Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2741159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2741274Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2741277Z 2025-08-14T21:44:48.2741370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2741558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2741618Z return mod(**inputs) 2025-08-14T21:44:48.2741854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2741936Z outputs = self.model( 2025-08-14T21:44:48.2742170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2742241Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2742478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2742543Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2742749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2742819Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2743060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 760, in forward 2025-08-14T21:44:48.2743146Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:44:48.2743379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2743458Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2743461Z 2025-08-14T21:44:48.2743551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2743732Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2743801Z return mod(**inputs) 2025-08-14T21:44:48.2744037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2744105Z outputs = self.model( 2025-08-14T21:44:48.2744340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2744405Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2744712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2744789Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2744994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2745076Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2745314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2745423Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2745656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 400, in forward 2025-08-14T21:44:48.2745796Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:44:48.2745800Z 2025-08-14T21:44:48.2745899Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2746081Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2746155Z return mod(**inputs) 2025-08-14T21:44:48.2746397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2746460Z outputs = self.model( 2025-08-14T21:44:48.2746750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2746835Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2747076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2747151Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2747356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2747437Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2747691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2747789Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2748035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 419, in forward 2025-08-14T21:44:48.2748114Z key_states = self.k_proj(current_states) 2025-08-14T21:44:48.2748119Z 2025-08-14T21:44:48.2748218Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2748397Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2748456Z return mod(**inputs) 2025-08-14T21:44:48.2748699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2748760Z outputs = self.model( 2025-08-14T21:44:48.2748995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2749070Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2749305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2749377Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2749579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2749649Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2749892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2749987Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2750226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 420, in forward 2025-08-14T21:44:48.2750305Z value_states = self.v_proj(current_states) 2025-08-14T21:44:48.2750309Z 2025-08-14T21:44:48.2750381Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2750457Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2750526Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2750596Z cudagraph partition due to non gpu ops 2025-08-14T21:44:48.2750696Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2750878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2750943Z return mod(**inputs) 2025-08-14T21:44:48.2751179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2751240Z outputs = self.model( 2025-08-14T21:44:48.2751485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2751552Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2751789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2751862Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2752079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2752173Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2752427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2752522Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2752761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2752846Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2753115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:44:48.2753259Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:44:48.2753263Z 2025-08-14T21:44:48.2753355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2753547Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2753610Z return mod(**inputs) 2025-08-14T21:44:48.2753848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2753917Z outputs = self.model( 2025-08-14T21:44:48.2754155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2754226Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2754460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2754525Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2754730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2754801Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2755037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2755140Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2755375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 438, in forward 2025-08-14T21:44:48.2755468Z attn_output, attn_weights = attention_interface( 2025-08-14T21:44:48.2755730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:44:48.2755827Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:44:48.2755831Z 2025-08-14T21:44:48.2755930Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2756108Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2756177Z return mod(**inputs) 2025-08-14T21:44:48.2756415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2756476Z outputs = self.model( 2025-08-14T21:44:48.2756718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2756783Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2757016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2757085Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2757286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2757364Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2757610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 777, in forward 2025-08-14T21:44:48.2757721Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:44:48.2758011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 452, in forward 2025-08-14T21:44:48.2758084Z attn_output = self.out_proj(attn_output) 2025-08-14T21:44:48.2758088Z 2025-08-14T21:44:48.2758186Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2758366Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2758426Z return mod(**inputs) 2025-08-14T21:44:48.2758670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2758745Z outputs = self.model( 2025-08-14T21:44:48.2758983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2759058Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2759300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2759375Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2759577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2759646Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2759890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2759998Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2760002Z 2025-08-14T21:44:48.2760103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2760286Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2760346Z return mod(**inputs) 2025-08-14T21:44:48.2760592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2760654Z outputs = self.model( 2025-08-14T21:44:48.2760891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2760966Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2761201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2761272Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2761476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2761547Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2761793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 792, in forward 2025-08-14T21:44:48.2761901Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:44:48.2762096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:44:48.2762168Z return self.act(input) 2025-08-14T21:44:48.2762171Z 2025-08-14T21:44:48.2762264Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2762453Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2762512Z return mod(**inputs) 2025-08-14T21:44:48.2762748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1359, in forward 2025-08-14T21:44:48.2762817Z outputs = self.model( 2025-08-14T21:44:48.2763052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1207, in forward 2025-08-14T21:44:48.2763125Z decoder_outputs = self.decoder( 2025-08-14T21:44:48.2763391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1031, in forward 2025-08-14T21:44:48.2763469Z layer_outputs = decoder_layer( 2025-08-14T21:44:48.2763676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:44:48.2763746Z return super().__call__(*args, **kwargs) 2025-08-14T21:44:48.2763979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 794, in forward 2025-08-14T21:44:48.2764060Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:44:48.2764085Z 2025-08-14T21:44:48.2764180Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2764367Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2764427Z return mod(**inputs) 2025-08-14T21:44:48.2764667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1377, in forward 2025-08-14T21:44:48.2764750Z lm_logits = self.lm_head(outputs[0]) 2025-08-14T21:44:48.2764753Z 2025-08-14T21:44:48.2764845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:44:48.2765032Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:44:48.2765091Z return mod(**inputs) 2025-08-14T21:44:48.2765326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/plbart/modeling_plbart.py", line 1383, in forward 2025-08-14T21:44:48.2765486Z masked_lm_loss = loss_fct(lm_logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:44:48.2765491Z 2025-08-14T21:44:56.5502159Z Compilation time (from dynamo_timed): 15.344394592 2025-08-14T21:44:56.5703531Z pass 2025-08-14T21:44:56.5708577Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:44:56.5713277Z TIMING: _recursive_pre_grad_passes:0.0077 _recursive_joint_graph_passes:0.41619 _recursive_post_grad_passes:0.09603 async_compile.wait:0.66954 code_gen:7.6506 inductor_compile:9.10873 backend_compile:12.70793 gc:0.00128 entire_frame_compile:15.34439 total_wall_time:15.34439 2025-08-14T21:44:56.5714223Z STATS: call_* op count: 517 | FakeTensorMode.__torch_dispatch__:17501 | FakeTensor.__torch_dispatch__:6218 | ProxyTorchDispatchMode.__torch_dispatch__:6406 2025-08-14T21:44:56.5714684Z Dynamo produced 1 graphs covering 517 ops with 0 graph breaks (0 unique) 2025-08-14T21:45:00.7567419Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:45:00.7568603Z from pkg_resources import resource_filename 2025-08-14T21:45:01.3031623Z 2025-08-14T21:45:04.5434387Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:45:04.5438540Z loading model: 0it [00:03, ?it/s] 2025-08-14T21:45:04.5451907Z cpu eval PegasusForCausalLM 2025-08-14T21:45:04.8631911Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:45:04.9859997Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:45:05.0959910Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:45:11.8244192Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8247874Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8249900Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8254595Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8256656Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8261141Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8266227Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8266718Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8267038Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8271779Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8273795Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8274140Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8278895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8283298Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8285371Z return mod(**inputs) 2025-08-14T21:45:11.8285907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8290011Z outputs = self.model.decoder( 2025-08-14T21:45:11.8292458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8292978Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8297591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8299896Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8300512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8301055Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8301469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8301943Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8302151Z 2025-08-14T21:45:11.8305300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8309570Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8310059Z return mod(**inputs) 2025-08-14T21:45:11.8311030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8311624Z outputs = self.model.decoder( 2025-08-14T21:45:11.8312030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8312467Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8312812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8313166Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8313550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8313952Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8314350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8314735Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8314871Z 2025-08-14T21:45:11.8314974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8315319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8315620Z return mod(**inputs) 2025-08-14T21:45:11.8315978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8316364Z outputs = self.model.decoder( 2025-08-14T21:45:11.8316755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8317120Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8318337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8318748Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8319147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8319537Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8319952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8320330Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8320461Z 2025-08-14T21:45:11.8320538Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8320765Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8320956Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8321136Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8321352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8321688Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8321993Z return mod(**inputs) 2025-08-14T21:45:11.8322333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8322702Z outputs = self.model.decoder( 2025-08-14T21:45:11.8323069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8323428Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8323756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8324092Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8324457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8324837Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8325222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8325612Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8326022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8326461Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8326637Z 2025-08-14T21:45:11.8326733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8327064Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8327357Z return mod(**inputs) 2025-08-14T21:45:11.8327703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8328070Z outputs = self.model.decoder( 2025-08-14T21:45:11.8328434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8328791Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8329113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8329444Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8329813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8330193Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8330580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8330963Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8331394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8331839Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8331998Z 2025-08-14T21:45:11.8332099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8332440Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8332742Z return mod(**inputs) 2025-08-14T21:45:11.8333098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8333477Z outputs = self.model.decoder( 2025-08-14T21:45:11.8333867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8334237Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8334571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8334923Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8335304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8335693Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8336080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8336454Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8336579Z 2025-08-14T21:45:11.8336675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8337009Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8337309Z return mod(**inputs) 2025-08-14T21:45:11.8337649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8338016Z outputs = self.model.decoder( 2025-08-14T21:45:11.8338380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8338742Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8339058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8339394Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8339764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8340177Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8340340Z 2025-08-14T21:45:11.8340437Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8340764Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8341065Z return mod(**inputs) 2025-08-14T21:45:11.8341407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8341775Z outputs = self.model.decoder( 2025-08-14T21:45:11.8342133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8342497Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8342811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8343144Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8343514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8343921Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8344282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8344725Z return self.act(input) 2025-08-14T21:45:11.8344842Z 2025-08-14T21:45:11.8344949Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8345271Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8345572Z return mod(**inputs) 2025-08-14T21:45:11.8345917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8346286Z outputs = self.model.decoder( 2025-08-14T21:45:11.8346640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8347022Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8347342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8347670Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8348043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8348419Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8348545Z 2025-08-14T21:45:11.8348649Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8348968Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8349264Z return mod(**inputs) 2025-08-14T21:45:11.8349605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8349962Z outputs = self.model.decoder( 2025-08-14T21:45:11.8350322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8350685Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8351008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8351330Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8351698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8352086Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8352470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8352897Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8353095Z 2025-08-14T21:45:11.8353191Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8353518Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8353806Z return mod(**inputs) 2025-08-14T21:45:11.8354150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8354514Z outputs = self.model.decoder( 2025-08-14T21:45:11.8354871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8355223Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8355540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8355869Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8356236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8356616Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8357019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8357408Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8357568Z 2025-08-14T21:45:11.8357662Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8357990Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8358285Z return mod(**inputs) 2025-08-14T21:45:11.8358625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8358980Z outputs = self.model.decoder( 2025-08-14T21:45:11.8359336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8359718Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8360026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8360358Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8360724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8361108Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8361482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8361854Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8361990Z 2025-08-14T21:45:11.8362062Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8362258Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8362444Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8362632Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8362845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8363170Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8363466Z return mod(**inputs) 2025-08-14T21:45:11.8363816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8364181Z outputs = self.model.decoder( 2025-08-14T21:45:11.8364533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8364897Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8365216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8365540Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8365909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8366296Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8366682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8367061Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8367470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8367916Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8368085Z 2025-08-14T21:45:11.8368187Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8368509Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8368808Z return mod(**inputs) 2025-08-14T21:45:11.8369147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8369504Z outputs = self.model.decoder( 2025-08-14T21:45:11.8369893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8370278Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8370602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8370930Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8371302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8371689Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8372071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8372476Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8372882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8373301Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8373450Z 2025-08-14T21:45:11.8373545Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8373873Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8374168Z return mod(**inputs) 2025-08-14T21:45:11.8374508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8374867Z outputs = self.model.decoder( 2025-08-14T21:45:11.8375456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8375822Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8376135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8376466Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8376836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8377223Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8377598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8377977Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8378108Z 2025-08-14T21:45:11.8378203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8378535Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8378832Z return mod(**inputs) 2025-08-14T21:45:11.8379179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8379552Z outputs = self.model.decoder( 2025-08-14T21:45:11.8379907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8380277Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8380597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8380933Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8381298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8381710Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8381870Z 2025-08-14T21:45:11.8381974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8382304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8382599Z return mod(**inputs) 2025-08-14T21:45:11.8382974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8383363Z outputs = self.model.decoder( 2025-08-14T21:45:11.8383728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8384093Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8384410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8385091Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8385459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8385913Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8386272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8386579Z return self.act(input) 2025-08-14T21:45:11.8386691Z 2025-08-14T21:45:11.8386786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8387111Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8387410Z return mod(**inputs) 2025-08-14T21:45:11.8387745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8388108Z outputs = self.model.decoder( 2025-08-14T21:45:11.8388524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8388891Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8389220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8389563Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8389932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8390300Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8390431Z 2025-08-14T21:45:11.8390527Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8390851Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8391149Z return mod(**inputs) 2025-08-14T21:45:11.8391483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8391847Z outputs = self.model.decoder( 2025-08-14T21:45:11.8392204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8392560Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8392881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8393212Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8393576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8393955Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8394339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8394773Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8394963Z 2025-08-14T21:45:11.8395064Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8395381Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8395679Z return mod(**inputs) 2025-08-14T21:45:11.8396067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8396447Z outputs = self.model.decoder( 2025-08-14T21:45:11.8396801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8397165Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8397486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8397811Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8398184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8398589Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8398976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8399339Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8399469Z 2025-08-14T21:45:11.8399563Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8399888Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8400174Z return mod(**inputs) 2025-08-14T21:45:11.8400512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8400872Z outputs = self.model.decoder( 2025-08-14T21:45:11.8401223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8401578Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8401894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8402221Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8402580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8402965Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8403348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8403719Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8403846Z 2025-08-14T21:45:11.8403919Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8404118Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8404311Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8404496Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8404711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8405040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8405338Z return mod(**inputs) 2025-08-14T21:45:11.8405675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8406041Z outputs = self.model.decoder( 2025-08-14T21:45:11.8406397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8406751Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8407069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8407394Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8407756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8408131Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8408529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8408958Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8409370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8409811Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8409986Z 2025-08-14T21:45:11.8410080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8410414Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8410715Z return mod(**inputs) 2025-08-14T21:45:11.8411068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8411431Z outputs = self.model.decoder( 2025-08-14T21:45:11.8411790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8412148Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8412469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8412803Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8413169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8413547Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8413928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8414310Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8414708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8415128Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8415282Z 2025-08-14T21:45:11.8415376Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8415707Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8415997Z return mod(**inputs) 2025-08-14T21:45:11.8416336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8416698Z outputs = self.model.decoder( 2025-08-14T21:45:11.8417057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8417411Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8417729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8418059Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8418420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8418806Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8419189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8419559Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8419683Z 2025-08-14T21:45:11.8419776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8420103Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8420404Z return mod(**inputs) 2025-08-14T21:45:11.8420737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8421103Z outputs = self.model.decoder( 2025-08-14T21:45:11.8421488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8421865Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8422175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8422505Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8422867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8423269Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8423443Z 2025-08-14T21:45:11.8423537Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8423867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8424168Z return mod(**inputs) 2025-08-14T21:45:11.8424502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8424949Z outputs = self.model.decoder( 2025-08-14T21:45:11.8425314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8425682Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8425996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8426334Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8426704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8427114Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8427465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8427782Z return self.act(input) 2025-08-14T21:45:11.8427884Z 2025-08-14T21:45:11.8427989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8428317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8428619Z return mod(**inputs) 2025-08-14T21:45:11.8428966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8429334Z outputs = self.model.decoder( 2025-08-14T21:45:11.8429688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8430056Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8430374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8430699Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8431070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8431443Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8431568Z 2025-08-14T21:45:11.8431670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8431992Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8432287Z return mod(**inputs) 2025-08-14T21:45:11.8432628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8432996Z outputs = self.model.decoder( 2025-08-14T21:45:11.8433346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8433710Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8434050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8434391Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8434773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 442, in forward 2025-08-14T21:45:11.8435143Z hidden_states = residual + hidden_states 2025-08-14T21:45:11.8435268Z 2025-08-14T21:45:11.8435371Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8435690Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8435988Z return mod(**inputs) 2025-08-14T21:45:11.8436329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8436713Z outputs = self.model.decoder( 2025-08-14T21:45:11.8437077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8437448Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8437769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8438097Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8438466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8438860Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8439248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8439681Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8439873Z 2025-08-14T21:45:11.8439966Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8440294Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8440582Z return mod(**inputs) 2025-08-14T21:45:11.8440926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8441295Z outputs = self.model.decoder( 2025-08-14T21:45:11.8441654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8442010Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8442328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8442660Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8443029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8443411Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8443800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8444174Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8444296Z 2025-08-14T21:45:11.8444390Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8444720Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8445020Z return mod(**inputs) 2025-08-14T21:45:11.8445359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8445718Z outputs = self.model.decoder( 2025-08-14T21:45:11.8446079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8446441Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8446767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8447115Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8447494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8447894Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8448265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8448637Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8448771Z 2025-08-14T21:45:11.8448844Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8449059Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8449242Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8449429Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8449639Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8449959Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8450258Z return mod(**inputs) 2025-08-14T21:45:11.8450602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8450960Z outputs = self.model.decoder( 2025-08-14T21:45:11.8451322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8451687Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8452002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8452328Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8452697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8453081Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8453463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8453841Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8454249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8454689Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8454856Z 2025-08-14T21:45:11.8454954Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8455274Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8455572Z return mod(**inputs) 2025-08-14T21:45:11.8455911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8456272Z outputs = self.model.decoder( 2025-08-14T21:45:11.8456632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8456995Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8457311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8457636Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8458008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8458394Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8458771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8459155Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8459588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8460040Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8460192Z 2025-08-14T21:45:11.8460286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8460614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8460911Z return mod(**inputs) 2025-08-14T21:45:11.8461255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8461611Z outputs = self.model.decoder( 2025-08-14T21:45:11.8461987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8462347Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8462660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8462993Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8463361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8463746Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8464120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8464488Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8464611Z 2025-08-14T21:45:11.8464785Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8465125Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8465418Z return mod(**inputs) 2025-08-14T21:45:11.8465764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8466132Z outputs = self.model.decoder( 2025-08-14T21:45:11.8466486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8466851Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8467170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8467504Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8467868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8468278Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8468436Z 2025-08-14T21:45:11.8468538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8468855Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8469153Z return mod(**inputs) 2025-08-14T21:45:11.8469498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8469860Z outputs = self.model.decoder( 2025-08-14T21:45:11.8470209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8470571Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8470887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8471220Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8471577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8471979Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8472358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8472688Z return self.act(input) 2025-08-14T21:45:11.8472814Z 2025-08-14T21:45:11.8472910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8473238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8473537Z return mod(**inputs) 2025-08-14T21:45:11.8473876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8474245Z outputs = self.model.decoder( 2025-08-14T21:45:11.8474603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8474976Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8475289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8475623Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8475990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8476356Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8476487Z 2025-08-14T21:45:11.8476580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8476905Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8477201Z return mod(**inputs) 2025-08-14T21:45:11.8477535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8477898Z outputs = self.model.decoder( 2025-08-14T21:45:11.8478251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8478604Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8478922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8479253Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8479614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8479992Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8480376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8480808Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8480994Z 2025-08-14T21:45:11.8481097Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8481415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8481710Z return mod(**inputs) 2025-08-14T21:45:11.8482051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8482411Z outputs = self.model.decoder( 2025-08-14T21:45:11.8482766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8483128Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8483442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8483764Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8484131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8484512Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8485088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8485485Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8485638Z 2025-08-14T21:45:11.8485732Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8486059Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8486349Z return mod(**inputs) 2025-08-14T21:45:11.8486689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8487054Z outputs = self.model.decoder( 2025-08-14T21:45:11.8487411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8487788Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8488101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8488434Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8488791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8489177Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8489555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8489923Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8490052Z 2025-08-14T21:45:11.8490126Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8490324Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8490518Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8490702Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8490916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8491247Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8491547Z return mod(**inputs) 2025-08-14T21:45:11.8491889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8492254Z outputs = self.model.decoder( 2025-08-14T21:45:11.8492614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8492972Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8493290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8493620Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8493988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8494365Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8494749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8495137Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8495543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8495972Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8496147Z 2025-08-14T21:45:11.8496243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8496573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8496865Z return mod(**inputs) 2025-08-14T21:45:11.8497207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8497574Z outputs = self.model.decoder( 2025-08-14T21:45:11.8497959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8498332Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8498650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8498980Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8499339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8499725Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8500107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8500520Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8500925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8501346Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8501503Z 2025-08-14T21:45:11.8501597Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8501926Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8502217Z return mod(**inputs) 2025-08-14T21:45:11.8502558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8502925Z outputs = self.model.decoder( 2025-08-14T21:45:11.8503278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8503642Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8503960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8504292Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8504724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8505127Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8505510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8505881Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8506008Z 2025-08-14T21:45:11.8506102Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8506430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8506730Z return mod(**inputs) 2025-08-14T21:45:11.8507063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8507430Z outputs = self.model.decoder( 2025-08-14T21:45:11.8507793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8508160Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8508471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8508806Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8509171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8509574Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8509736Z 2025-08-14T21:45:11.8509830Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8510157Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8510454Z return mod(**inputs) 2025-08-14T21:45:11.8510819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8511201Z outputs = self.model.decoder( 2025-08-14T21:45:11.8511567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8511930Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8512237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8512565Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8512930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8513344Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8513698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8514013Z return self.act(input) 2025-08-14T21:45:11.8514114Z 2025-08-14T21:45:11.8514219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8514540Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8514838Z return mod(**inputs) 2025-08-14T21:45:11.8515180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8515548Z outputs = self.model.decoder( 2025-08-14T21:45:11.8515898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8516263Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8516580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8516903Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8517273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8517646Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8517769Z 2025-08-14T21:45:11.8517869Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8518190Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8518490Z return mod(**inputs) 2025-08-14T21:45:11.8518831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8519189Z outputs = self.model.decoder( 2025-08-14T21:45:11.8519547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8519913Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8520231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8520558Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8520924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 442, in forward 2025-08-14T21:45:11.8521292Z hidden_states = residual + hidden_states 2025-08-14T21:45:11.8521414Z 2025-08-14T21:45:11.8521516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8521837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8522134Z return mod(**inputs) 2025-08-14T21:45:11.8522475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8522854Z outputs = self.model.decoder( 2025-08-14T21:45:11.8523232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8523617Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8523953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8524276Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8524638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8525024Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8525400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8525848Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8526042Z 2025-08-14T21:45:11.8526137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8526468Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8526758Z return mod(**inputs) 2025-08-14T21:45:11.8527103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8527466Z outputs = self.model.decoder( 2025-08-14T21:45:11.8527819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8528174Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8528490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8528818Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8529175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8529562Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8529947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8530316Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8530440Z 2025-08-14T21:45:11.8530533Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8530860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8531159Z return mod(**inputs) 2025-08-14T21:45:11.8531497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8531856Z outputs = self.model.decoder( 2025-08-14T21:45:11.8532217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8532580Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8532894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8533227Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8533589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8533975Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8534348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8534719Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8534846Z 2025-08-14T21:45:11.8534929Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8535115Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8535305Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8535494Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8535704Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8536051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8536365Z return mod(**inputs) 2025-08-14T21:45:11.8536704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8537058Z outputs = self.model.decoder( 2025-08-14T21:45:11.8537419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8537779Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8538096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8538438Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8538805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8539189Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8539567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8539957Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8540367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8540808Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8540976Z 2025-08-14T21:45:11.8541070Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8541402Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8541699Z return mod(**inputs) 2025-08-14T21:45:11.8542042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8542402Z outputs = self.model.decoder( 2025-08-14T21:45:11.8542760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8543126Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8543436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8543767Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8544139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8544525Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8544978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8545369Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8545780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8546205Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8546353Z 2025-08-14T21:45:11.8546446Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8546776Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8547073Z return mod(**inputs) 2025-08-14T21:45:11.8547408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8547775Z outputs = self.model.decoder( 2025-08-14T21:45:11.8548132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8548493Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8548823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8549202Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8549570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8549953Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8550328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8550695Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8550818Z 2025-08-14T21:45:11.8550918Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8551256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8551554Z return mod(**inputs) 2025-08-14T21:45:11.8551899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8552264Z outputs = self.model.decoder( 2025-08-14T21:45:11.8552618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8552983Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8553307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8553634Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8554005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8554416Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8554576Z 2025-08-14T21:45:11.8554680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8555001Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8555300Z return mod(**inputs) 2025-08-14T21:45:11.8555650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8556017Z outputs = self.model.decoder( 2025-08-14T21:45:11.8556370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8556734Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8557051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8557379Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8557748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8558152Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8558512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8558823Z return self.act(input) 2025-08-14T21:45:11.8558930Z 2025-08-14T21:45:11.8559025Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8559352Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8559650Z return mod(**inputs) 2025-08-14T21:45:11.8559984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8560351Z outputs = self.model.decoder( 2025-08-14T21:45:11.8560710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8561068Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8561403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8561738Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8562130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8562499Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8562629Z 2025-08-14T21:45:11.8562727Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8563053Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8563343Z return mod(**inputs) 2025-08-14T21:45:11.8563686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8564071Z outputs = self.model.decoder( 2025-08-14T21:45:11.8564429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8564784Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8565100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8565430Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8565788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8566168Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8566547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8566976Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8567164Z 2025-08-14T21:45:11.8567258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8567582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8567878Z return mod(**inputs) 2025-08-14T21:45:11.8568219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8568577Z outputs = self.model.decoder( 2025-08-14T21:45:11.8568929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8569290Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8569601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8569932Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8570300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8570687Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8571062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8571435Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8571565Z 2025-08-14T21:45:11.8571661Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8571987Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8572275Z return mod(**inputs) 2025-08-14T21:45:11.8572614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8572977Z outputs = self.model.decoder( 2025-08-14T21:45:11.8573328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8573688Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8574073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8574407Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8574808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8575200Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8575591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8575968Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8576096Z 2025-08-14T21:45:11.8576169Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8576384Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8576578Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8576761Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8576974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8577304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8577597Z return mod(**inputs) 2025-08-14T21:45:11.8577941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8578310Z outputs = self.model.decoder( 2025-08-14T21:45:11.8578669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8579028Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8579352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8579686Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8580052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8580430Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8580818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8581205Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8581598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8582035Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8582210Z 2025-08-14T21:45:11.8582305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8582632Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8582925Z return mod(**inputs) 2025-08-14T21:45:11.8583267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8583632Z outputs = self.model.decoder( 2025-08-14T21:45:11.8583990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8584347Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8584845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8585190Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8585555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8585943Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8586333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8586719Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8587160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8587603Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8587773Z 2025-08-14T21:45:11.8587876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8588205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8588497Z return mod(**inputs) 2025-08-14T21:45:11.8588846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8589216Z outputs = self.model.decoder( 2025-08-14T21:45:11.8589590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8589954Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8590276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8590611Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8590973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8591356Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8591735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8592099Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8592230Z 2025-08-14T21:45:11.8592325Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8592652Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8592949Z return mod(**inputs) 2025-08-14T21:45:11.8593283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8593653Z outputs = self.model.decoder( 2025-08-14T21:45:11.8594010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8594376Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8594686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8595017Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8595382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8595778Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8595947Z 2025-08-14T21:45:11.8596042Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8596365Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8596662Z return mod(**inputs) 2025-08-14T21:45:11.8596998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8597366Z outputs = self.model.decoder( 2025-08-14T21:45:11.8597722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8598083Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8598392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8598720Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8599083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8599478Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8599847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8600191Z return self.act(input) 2025-08-14T21:45:11.8600308Z 2025-08-14T21:45:11.8600408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8600729Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8601026Z return mod(**inputs) 2025-08-14T21:45:11.8601367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8601723Z outputs = self.model.decoder( 2025-08-14T21:45:11.8602080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8602466Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8602785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8603113Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8603483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8603858Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8603984Z 2025-08-14T21:45:11.8604084Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8604403Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8604699Z return mod(**inputs) 2025-08-14T21:45:11.8605041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8605399Z outputs = self.model.decoder( 2025-08-14T21:45:11.8605755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8606116Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8606433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8606759Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8607124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 442, in forward 2025-08-14T21:45:11.8607492Z hidden_states = residual + hidden_states 2025-08-14T21:45:11.8607615Z 2025-08-14T21:45:11.8607709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8608035Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8608330Z return mod(**inputs) 2025-08-14T21:45:11.8608667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8609023Z outputs = self.model.decoder( 2025-08-14T21:45:11.8609383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8609748Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8610059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8610388Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8610751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8611136Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8611514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8611947Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8612143Z 2025-08-14T21:45:11.8612237Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8612575Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8612905Z return mod(**inputs) 2025-08-14T21:45:11.8613250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8613617Z outputs = self.model.decoder( 2025-08-14T21:45:11.8613966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8614330Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8614647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8614993Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8615350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8615738Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8616121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8616488Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8616609Z 2025-08-14T21:45:11.8616702Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8617027Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8617323Z return mod(**inputs) 2025-08-14T21:45:11.8617653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8618020Z outputs = self.model.decoder( 2025-08-14T21:45:11.8618377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8618736Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8619045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8619378Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8619741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8620123Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8620496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8620873Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8621003Z 2025-08-14T21:45:11.8621084Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8621274Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8621467Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8621655Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8621862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8622192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8622492Z return mod(**inputs) 2025-08-14T21:45:11.8622829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8623186Z outputs = self.model.decoder( 2025-08-14T21:45:11.8623541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8626147Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8626497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8626838Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8627234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8627630Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8628028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8628414Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8628824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8629267Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8629440Z 2025-08-14T21:45:11.8629567Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8629918Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8630208Z return mod(**inputs) 2025-08-14T21:45:11.8630551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8630918Z outputs = self.model.decoder( 2025-08-14T21:45:11.8631274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8631646Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8631973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8632312Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8632685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8633084Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8633475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8633865Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8634273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8634701Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8634857Z 2025-08-14T21:45:11.8634961Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8635290Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8635594Z return mod(**inputs) 2025-08-14T21:45:11.8635947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8636323Z outputs = self.model.decoder( 2025-08-14T21:45:11.8636684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8637053Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8637382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8637723Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8638091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8638484Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8638872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8639290Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8639429Z 2025-08-14T21:45:11.8639525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8639863Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8640168Z return mod(**inputs) 2025-08-14T21:45:11.8640529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8640925Z outputs = self.model.decoder( 2025-08-14T21:45:11.8641299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8641683Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8642004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8642348Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8642729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8643159Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8643320Z 2025-08-14T21:45:11.8643415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8643755Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8644061Z return mod(**inputs) 2025-08-14T21:45:11.8644413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8644779Z outputs = self.model.decoder( 2025-08-14T21:45:11.8645143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8645518Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8645834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8646197Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8646560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8646961Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8647309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8647623Z return self.act(input) 2025-08-14T21:45:11.8647723Z 2025-08-14T21:45:11.8647821Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8648145Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8648435Z return mod(**inputs) 2025-08-14T21:45:11.8648772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8649133Z outputs = self.model.decoder( 2025-08-14T21:45:11.8649482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8649843Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8650162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8650489Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8650849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8651218Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8651340Z 2025-08-14T21:45:11.8651441Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8651760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8652054Z return mod(**inputs) 2025-08-14T21:45:11.8652442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8652809Z outputs = self.model.decoder( 2025-08-14T21:45:11.8653173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8653542Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8653877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8654205Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8654562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8654947Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8655330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8655488Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8655492Z 2025-08-14T21:45:11.8655594Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8655777Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8655837Z return mod(**inputs) 2025-08-14T21:45:11.8656088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8656153Z outputs = self.model.decoder( 2025-08-14T21:45:11.8656391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8656463Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8656665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8656745Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8656985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8657073Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8657318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8657392Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8657395Z 2025-08-14T21:45:11.8657488Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8657677Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8657735Z return mod(**inputs) 2025-08-14T21:45:11.8657980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8658046Z outputs = self.model.decoder( 2025-08-14T21:45:11.8658288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8658358Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8658560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8658636Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8658875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8658962Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8659207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8659284Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8659287Z 2025-08-14T21:45:11.8659378Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8659463Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8659535Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8659611Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8659705Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8659901Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8659984Z return mod(**inputs) 2025-08-14T21:45:11.8660225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8660290Z outputs = self.model.decoder( 2025-08-14T21:45:11.8660536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8660601Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8660807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8660892Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8661134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8661230Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8661472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8661570Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8661835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8661955Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8661959Z 2025-08-14T21:45:11.8662055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8662237Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8662296Z return mod(**inputs) 2025-08-14T21:45:11.8662545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8662611Z outputs = self.model.decoder( 2025-08-14T21:45:11.8662856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8662920Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8663123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8663202Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8663443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8663537Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8663777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8663863Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8664137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8664235Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8664239Z 2025-08-14T21:45:11.8664329Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8664522Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8664582Z return mod(**inputs) 2025-08-14T21:45:11.8664914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8665010Z outputs = self.model.decoder( 2025-08-14T21:45:11.8665261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8665337Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8665564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8665649Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8665919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8666009Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8666268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8666341Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8666345Z 2025-08-14T21:45:11.8666437Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8666645Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8666704Z return mod(**inputs) 2025-08-14T21:45:11.8666956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8667024Z outputs = self.model.decoder( 2025-08-14T21:45:11.8667272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8667344Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8667552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8667622Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8667877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8667986Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8667989Z 2025-08-14T21:45:11.8668092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8668275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8668338Z return mod(**inputs) 2025-08-14T21:45:11.8668595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8668662Z outputs = self.model.decoder( 2025-08-14T21:45:11.8668913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8668979Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8669182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8669264Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8669512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8669620Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8669825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8669892Z return self.act(input) 2025-08-14T21:45:11.8669895Z 2025-08-14T21:45:11.8669998Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8670181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8670240Z return mod(**inputs) 2025-08-14T21:45:11.8670494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8670562Z outputs = self.model.decoder( 2025-08-14T21:45:11.8670827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8670898Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8671106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8671200Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8671459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8671533Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8671545Z 2025-08-14T21:45:11.8671638Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8671821Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8671888Z return mod(**inputs) 2025-08-14T21:45:11.8672133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8672223Z outputs = self.model.decoder( 2025-08-14T21:45:11.8672480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8672546Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8672758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8672831Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8673076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 442, in forward 2025-08-14T21:45:11.8673157Z hidden_states = residual + hidden_states 2025-08-14T21:45:11.8673160Z 2025-08-14T21:45:11.8673254Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8673439Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8673507Z return mod(**inputs) 2025-08-14T21:45:11.8673754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8673827Z outputs = self.model.decoder( 2025-08-14T21:45:11.8674072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8674140Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8674353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8674426Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8674669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8674768Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8675013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8675163Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8675166Z 2025-08-14T21:45:11.8675259Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8675443Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8675512Z return mod(**inputs) 2025-08-14T21:45:11.8675760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8675833Z outputs = self.model.decoder( 2025-08-14T21:45:11.8676076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8676142Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8676366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8676441Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8676687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8676796Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8677059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8677138Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8677141Z 2025-08-14T21:45:11.8677235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8677420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8677487Z return mod(**inputs) 2025-08-14T21:45:11.8677732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8677823Z outputs = self.model.decoder( 2025-08-14T21:45:11.8678071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8678137Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8678349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8678431Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8678671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8678764Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8679003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8679088Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8679092Z 2025-08-14T21:45:11.8679163Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8679234Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8679310Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8679379Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8679472Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8679660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8679718Z return mod(**inputs) 2025-08-14T21:45:11.8679966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8680030Z outputs = self.model.decoder( 2025-08-14T21:45:11.8680272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8680344Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8680544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8680623Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8680864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8680950Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8681197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8681284Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8681549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8681677Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8681698Z 2025-08-14T21:45:11.8681794Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8681982Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8682041Z return mod(**inputs) 2025-08-14T21:45:11.8682301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8682391Z outputs = self.model.decoder( 2025-08-14T21:45:11.8682630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8682700Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8682900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8682969Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8683216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8683324Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8683569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8683665Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8683930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8684036Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8684040Z 2025-08-14T21:45:11.8684131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8684311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8684379Z return mod(**inputs) 2025-08-14T21:45:11.8684859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8684945Z outputs = self.model.decoder( 2025-08-14T21:45:11.8685196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8685264Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8685478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8685554Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8685803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8685903Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8686152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8686237Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8686241Z 2025-08-14T21:45:11.8686335Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8686526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8686595Z return mod(**inputs) 2025-08-14T21:45:11.8686837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8686913Z outputs = self.model.decoder( 2025-08-14T21:45:11.8687158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8687223Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8687432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8687502Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8687779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8687898Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8687901Z 2025-08-14T21:45:11.8687993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8688203Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8688283Z return mod(**inputs) 2025-08-14T21:45:11.8688532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8688606Z outputs = self.model.decoder( 2025-08-14T21:45:11.8688845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8688918Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8689119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8689210Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8689456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8689563Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8689755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8689828Z return self.act(input) 2025-08-14T21:45:11.8689831Z 2025-08-14T21:45:11.8689924Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8690110Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8690169Z return mod(**inputs) 2025-08-14T21:45:11.8690408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8690482Z outputs = self.model.decoder( 2025-08-14T21:45:11.8690721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8690785Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8690990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8691061Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8691303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8691374Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8691377Z 2025-08-14T21:45:11.8691467Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8691651Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8691709Z return mod(**inputs) 2025-08-14T21:45:11.8691956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8692019Z outputs = self.model.decoder( 2025-08-14T21:45:11.8692258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8692329Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8692526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8692596Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8692840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8692926Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8693187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8693328Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8693332Z 2025-08-14T21:45:11.8693424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8693625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8693709Z return mod(**inputs) 2025-08-14T21:45:11.8693956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8694021Z outputs = self.model.decoder( 2025-08-14T21:45:11.8694259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8694330Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8694531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8694617Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8694862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8694951Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8695197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8695272Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8695275Z 2025-08-14T21:45:11.8695366Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8695550Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8695608Z return mod(**inputs) 2025-08-14T21:45:11.8695856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8695922Z outputs = self.model.decoder( 2025-08-14T21:45:11.8696163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8696235Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8696435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8696509Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8696751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8696838Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8697082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8697159Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8697164Z 2025-08-14T21:45:11.8697235Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8697314Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8697382Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8697448Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8697547Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8697725Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8697802Z return mod(**inputs) 2025-08-14T21:45:11.8698042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8698107Z outputs = self.model.decoder( 2025-08-14T21:45:11.8698354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8698418Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8698640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8698713Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8698953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8699062Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8699320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8699406Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8699681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8699802Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8699805Z 2025-08-14T21:45:11.8699904Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8700100Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8700158Z return mod(**inputs) 2025-08-14T21:45:11.8700407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8700471Z outputs = self.model.decoder( 2025-08-14T21:45:11.8700718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8700782Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8700982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8701061Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8701301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8701390Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8701635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8701721Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8701993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8702092Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8702096Z 2025-08-14T21:45:11.8702186Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8702373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8702431Z return mod(**inputs) 2025-08-14T21:45:11.8702676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8702743Z outputs = self.model.decoder( 2025-08-14T21:45:11.8702983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8703052Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8703253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8703324Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8703567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8703654Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8703898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8703969Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8703986Z 2025-08-14T21:45:11.8704080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8704268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8704326Z return mod(**inputs) 2025-08-14T21:45:11.8704588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8704731Z outputs = self.model.decoder( 2025-08-14T21:45:11.8704976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8715208Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8715546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8715631Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8715913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8716121Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8716128Z 2025-08-14T21:45:11.8716236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8716448Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8716514Z return mod(**inputs) 2025-08-14T21:45:11.8716776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8716861Z outputs = self.model.decoder( 2025-08-14T21:45:11.8717113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8717183Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8717400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8717477Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8717728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8717837Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8718038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8718115Z return self.act(input) 2025-08-14T21:45:11.8718119Z 2025-08-14T21:45:11.8718219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8718413Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8718473Z return mod(**inputs) 2025-08-14T21:45:11.8718720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8718799Z outputs = self.model.decoder( 2025-08-14T21:45:11.8719043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8719110Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8719327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8719399Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8719652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8719725Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8719729Z 2025-08-14T21:45:11.8719823Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8720015Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8720077Z return mod(**inputs) 2025-08-14T21:45:11.8720355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8720434Z outputs = self.model.decoder( 2025-08-14T21:45:11.8720677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8720772Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8720978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8721072Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8721322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 442, in forward 2025-08-14T21:45:11.8721393Z hidden_states = residual + hidden_states 2025-08-14T21:45:11.8721396Z 2025-08-14T21:45:11.8721497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8721681Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8721756Z return mod(**inputs) 2025-08-14T21:45:11.8722001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8722066Z outputs = self.model.decoder( 2025-08-14T21:45:11.8722307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8722379Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8722580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8722658Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8722899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8722995Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8723243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:11.8723383Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:11.8723387Z 2025-08-14T21:45:11.8723488Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8723667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8723727Z return mod(**inputs) 2025-08-14T21:45:11.8723976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8724041Z outputs = self.model.decoder( 2025-08-14T21:45:11.8724280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8724353Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8724555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8724631Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8724870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8724963Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8725210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:11.8725281Z key_states = self.k_proj(current_states) 2025-08-14T21:45:11.8725284Z 2025-08-14T21:45:11.8725383Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8725561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8725620Z return mod(**inputs) 2025-08-14T21:45:11.8725884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8725953Z outputs = self.model.decoder( 2025-08-14T21:45:11.8726193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8726282Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8726485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8726577Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8726819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8726910Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8727162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:11.8727256Z value_states = self.v_proj(current_states) 2025-08-14T21:45:11.8727259Z 2025-08-14T21:45:11.8727343Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8727414Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8727483Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8727560Z cudagraph partition due to non gpu ops 2025-08-14T21:45:11.8727654Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8727840Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8727907Z return mod(**inputs) 2025-08-14T21:45:11.8728152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8728217Z outputs = self.model.decoder( 2025-08-14T21:45:11.8728469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8728536Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8728746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8728816Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8729062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8729160Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8729404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8729501Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8729771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:11.8729896Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:11.8729901Z 2025-08-14T21:45:11.8730001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8730184Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8730251Z return mod(**inputs) 2025-08-14T21:45:11.8730499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8730567Z outputs = self.model.decoder( 2025-08-14T21:45:11.8730821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8730886Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8731088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8731165Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8731424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8731526Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8731770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:11.8731879Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:11.8732169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:11.8732270Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:11.8732274Z 2025-08-14T21:45:11.8732376Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8732557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8732617Z return mod(**inputs) 2025-08-14T21:45:11.8732869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8732955Z outputs = self.model.decoder( 2025-08-14T21:45:11.8733195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8733271Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8733472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8733554Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8733790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:11.8733877Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:11.8734125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:11.8734200Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:11.8734203Z 2025-08-14T21:45:11.8734302Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8734479Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8734540Z return mod(**inputs) 2025-08-14T21:45:11.8734789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8734855Z outputs = self.model.decoder( 2025-08-14T21:45:11.8735096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8735168Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8735366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8735443Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8735681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8735789Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8735792Z 2025-08-14T21:45:11.8735889Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8736070Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8736131Z return mod(**inputs) 2025-08-14T21:45:11.8736377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8736443Z outputs = self.model.decoder( 2025-08-14T21:45:11.8736690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8736754Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8736970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8737053Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8737298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:11.8737429Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:11.8737647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:11.8737710Z return self.act(input) 2025-08-14T21:45:11.8737714Z 2025-08-14T21:45:11.8737814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8737993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8738053Z return mod(**inputs) 2025-08-14T21:45:11.8738307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1634, in forward 2025-08-14T21:45:11.8738389Z outputs = self.model.decoder( 2025-08-14T21:45:11.8738639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:11.8738703Z layer_outputs = decoder_layer( 2025-08-14T21:45:11.8738909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:11.8738992Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:11.8739233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:11.8739312Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:11.8739316Z 2025-08-14T21:45:11.8739408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8739589Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8739657Z return mod(**inputs) 2025-08-14T21:45:11.8739901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1650, in forward 2025-08-14T21:45:11.8739971Z logits = self.lm_head(outputs[0]) 2025-08-14T21:45:11.8739975Z 2025-08-14T21:45:11.8740075Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:11.8740254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:11.8740323Z return mod(**inputs) 2025-08-14T21:45:11.8740567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1656, in forward 2025-08-14T21:45:11.8740701Z loss = loss_fct(logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:45:11.8740705Z 2025-08-14T21:45:19.8401485Z Compilation time (from dynamo_timed): 13.712306663 2025-08-14T21:45:19.8411880Z pass 2025-08-14T21:45:19.8414085Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:45:19.8414967Z TIMING: _recursive_pre_grad_passes:0.00627 _recursive_joint_graph_passes:0.54894 _recursive_post_grad_passes:0.07522 async_compile.wait:0.72407 code_gen:7.35802 inductor_compile:8.49565 backend_compile:11.40036 gc:9e-05 entire_frame_compile:13.71231 total_wall_time:13.71231 2025-08-14T21:45:19.8417436Z STATS: call_* op count: 369 | FakeTensorMode.__torch_dispatch__:13170 | FakeTensor.__torch_dispatch__:4856 | ProxyTorchDispatchMode.__torch_dispatch__:4803 2025-08-14T21:45:19.8417986Z Dynamo produced 1 graphs covering 369 ops with 0 graph breaks (0 unique) 2025-08-14T21:45:23.9227313Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:45:23.9228332Z from pkg_resources import resource_filename 2025-08-14T21:45:24.4670867Z 2025-08-14T21:45:29.8373902Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:45:29.8377905Z loading model: 0it [00:05, ?it/s] 2025-08-14T21:45:29.8401706Z cpu eval PegasusForConditionalGeneration 2025-08-14T21:45:30.4202686Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:45:30.6805273Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:45:30.9030649Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:45:46.1076719Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1078975Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1079367Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1079661Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1079900Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1080521Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1088495Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1089215Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1089489Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1089700Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1091898Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1092253Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1092503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1092868Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1093184Z return mod(**inputs) 2025-08-14T21:45:46.1093573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1093960Z outputs = self.model( 2025-08-14T21:45:46.1094326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1094702Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1095074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1095447Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1095778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1096121Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1096499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1096889Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1097273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1097723Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1097924Z 2025-08-14T21:45:46.1098025Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1098371Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1098672Z return mod(**inputs) 2025-08-14T21:45:46.1099026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1099415Z outputs = self.model( 2025-08-14T21:45:46.1099768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1100134Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1100500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1101083Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1101414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1101753Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1102188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1102621Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1102993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1103368Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1103494Z 2025-08-14T21:45:46.1103599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1103935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1104233Z return mod(**inputs) 2025-08-14T21:45:46.1104615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1105100Z outputs = self.model( 2025-08-14T21:45:46.1105449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1105825Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1106198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1106568Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1106889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1107237Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1107612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1107992Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1108377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1108757Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1108888Z 2025-08-14T21:45:46.1108970Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1109162Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1109359Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1109551Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1109762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1110096Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1110401Z return mod(**inputs) 2025-08-14T21:45:46.1110750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1111109Z outputs = self.model( 2025-08-14T21:45:46.1111454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1111826Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1112182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1112560Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1112891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1113225Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1113588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1113971Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1114377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1114768Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1115196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1115647Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1115838Z 2025-08-14T21:45:46.1115941Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1116271Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1116563Z return mod(**inputs) 2025-08-14T21:45:46.1116908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1117265Z outputs = self.model( 2025-08-14T21:45:46.1117605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1117992Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1118354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1118721Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1119037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1119374Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1119749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1120125Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1120508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1120899Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1121309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1121730Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1121889Z 2025-08-14T21:45:46.1121985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1122319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1122617Z return mod(**inputs) 2025-08-14T21:45:46.1122956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1123330Z outputs = self.model( 2025-08-14T21:45:46.1123677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1124039Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1124402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1124767Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1125092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1125417Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1125789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1126168Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1126545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1126912Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1127043Z 2025-08-14T21:45:46.1127156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1127490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1127782Z return mod(**inputs) 2025-08-14T21:45:46.1128152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1128516Z outputs = self.model( 2025-08-14T21:45:46.1128880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1129236Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1129594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1129955Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1130270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1130617Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1130981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1131389Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1131553Z 2025-08-14T21:45:46.1131650Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1131986Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1132286Z return mod(**inputs) 2025-08-14T21:45:46.1132632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1132986Z outputs = self.model( 2025-08-14T21:45:46.1133330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1133696Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1134050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1134415Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1134736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1135071Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1135433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1135844Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1136214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1136526Z return self.act(input) 2025-08-14T21:45:46.1136626Z 2025-08-14T21:45:46.1136724Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1137055Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1137351Z return mod(**inputs) 2025-08-14T21:45:46.1137688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1138049Z outputs = self.model( 2025-08-14T21:45:46.1138395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1138762Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1139115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1139480Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1139801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1140143Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1140512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1140885Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1141015Z 2025-08-14T21:45:46.1141137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1141468Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1141800Z return mod(**inputs) 2025-08-14T21:45:46.1142151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1142512Z outputs = self.model( 2025-08-14T21:45:46.1142853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1143225Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1143603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1143958Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1144275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1144700Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1145107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1145505Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1145909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1146348Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1146538Z 2025-08-14T21:45:46.1146646Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1146975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1147285Z return mod(**inputs) 2025-08-14T21:45:46.1147648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1148018Z outputs = self.model( 2025-08-14T21:45:46.1148380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1148759Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1149138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1149507Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1149838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1150183Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1150559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1150959Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1151352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1151735Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1151861Z 2025-08-14T21:45:46.1151960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1152312Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1152624Z return mod(**inputs) 2025-08-14T21:45:46.1152979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1153368Z outputs = self.model( 2025-08-14T21:45:46.1153722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1154095Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1154470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1154847Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1155191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1155524Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1155893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1156280Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1156663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1157063Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1157194Z 2025-08-14T21:45:46.1157270Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1157469Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1157669Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1157856Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1158078Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1158414Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1158720Z return mod(**inputs) 2025-08-14T21:45:46.1159064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1159431Z outputs = self.model( 2025-08-14T21:45:46.1159778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1160139Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1160500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1160862Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1161181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1161507Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1161870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1162247Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1162616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1163004Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1163415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1163855Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1164023Z 2025-08-14T21:45:46.1164120Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1164450Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1164748Z return mod(**inputs) 2025-08-14T21:45:46.1165084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1165457Z outputs = self.model( 2025-08-14T21:45:46.1165807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1166187Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1166545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1166909Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1167229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1167578Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1167955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1168335Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1168713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1169092Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1169499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1169939Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1170087Z 2025-08-14T21:45:46.1170188Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1170514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1170816Z return mod(**inputs) 2025-08-14T21:45:46.1171160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1171520Z outputs = self.model( 2025-08-14T21:45:46.1171853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1172215Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1172576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1172930Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1173249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1173581Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1173948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1174320Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1174697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1175080Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1175210Z 2025-08-14T21:45:46.1175317Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1175641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1175942Z return mod(**inputs) 2025-08-14T21:45:46.1176288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1176642Z outputs = self.model( 2025-08-14T21:45:46.1176988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1177351Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1177710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1178066Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1178391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1178729Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1179104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1179514Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1179681Z 2025-08-14T21:45:46.1179776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1180124Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1180423Z return mod(**inputs) 2025-08-14T21:45:46.1180784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1181143Z outputs = self.model( 2025-08-14T21:45:46.1181488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1181849Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1182209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1182594Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1182908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1183244Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1183614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1184023Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1184372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1185002Z return self.act(input) 2025-08-14T21:45:46.1185111Z 2025-08-14T21:45:46.1185243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1185596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1185910Z return mod(**inputs) 2025-08-14T21:45:46.1186260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1186624Z outputs = self.model( 2025-08-14T21:45:46.1186963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1187329Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1187692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1188056Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1188370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1188701Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1189089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1189458Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1189590Z 2025-08-14T21:45:46.1189688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1190019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1190320Z return mod(**inputs) 2025-08-14T21:45:46.1190656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1191019Z outputs = self.model( 2025-08-14T21:45:46.1191929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1192300Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1192655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1193072Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1193396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1193719Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1194135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1194533Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1194941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1195367Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1195565Z 2025-08-14T21:45:46.1195659Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1195987Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1196285Z return mod(**inputs) 2025-08-14T21:45:46.1196648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1197009Z outputs = self.model( 2025-08-14T21:45:46.1197352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1197712Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1198071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1198444Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1198770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1199094Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1199464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1199842Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1200212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1200580Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1200710Z 2025-08-14T21:45:46.1200804Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1201134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1201421Z return mod(**inputs) 2025-08-14T21:45:46.1201760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1202118Z outputs = self.model( 2025-08-14T21:45:46.1202459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1202819Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1203180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1203543Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1203856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1204189Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1204556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1204944Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1205313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1205687Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1205815Z 2025-08-14T21:45:46.1205912Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1206112Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1206298Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1206485Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1206700Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1207043Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1207359Z return mod(**inputs) 2025-08-14T21:45:46.1207703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1208056Z outputs = self.model( 2025-08-14T21:45:46.1208402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1208769Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1209128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1209505Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1209825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1210163Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1210528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1210917Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1211300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1211684Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1212084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1212528Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1212704Z 2025-08-14T21:45:46.1212799Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1213130Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1213422Z return mod(**inputs) 2025-08-14T21:45:46.1213767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1214131Z outputs = self.model( 2025-08-14T21:45:46.1214465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1214830Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1215187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1215550Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1215862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1216195Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1216560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1216936Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1217308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1217693Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1218100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1218514Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1218669Z 2025-08-14T21:45:46.1218780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1219119Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1219418Z return mod(**inputs) 2025-08-14T21:45:46.1219773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1220140Z outputs = self.model( 2025-08-14T21:45:46.1220502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1220867Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1221220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1221582Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1221900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1222270Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1222635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1223025Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1223407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1223778Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1223912Z 2025-08-14T21:45:46.1224011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1224345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1224710Z return mod(**inputs) 2025-08-14T21:45:46.1225060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1225425Z outputs = self.model( 2025-08-14T21:45:46.1225771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1226129Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1226503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1226874Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1227205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1227534Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1227980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1228390Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1228549Z 2025-08-14T21:45:46.1228654Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1228975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1229273Z return mod(**inputs) 2025-08-14T21:45:46.1229617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1229977Z outputs = self.model( 2025-08-14T21:45:46.1230316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1230681Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1231044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1231404Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1231739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1232083Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1232461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1232870Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1233263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1233597Z return self.act(input) 2025-08-14T21:45:46.1233700Z 2025-08-14T21:45:46.1233807Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1234136Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1234441Z return mod(**inputs) 2025-08-14T21:45:46.1234794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1235154Z outputs = self.model( 2025-08-14T21:45:46.1235519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1235882Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1236242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1236599Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1236919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1237250Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1237615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1237974Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1238104Z 2025-08-14T21:45:46.1238199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1238528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1238818Z return mod(**inputs) 2025-08-14T21:45:46.1239161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1239520Z outputs = self.model( 2025-08-14T21:45:46.1239861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1240219Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1240578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1240939Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1241250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1241581Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1241948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 327, in forward 2025-08-14T21:45:46.1242315Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1242439Z 2025-08-14T21:45:46.1242533Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1242860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1243156Z return mod(**inputs) 2025-08-14T21:45:46.1243494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1243841Z outputs = self.model( 2025-08-14T21:45:46.1244184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1244544Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1244918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1245282Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1245599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1245942Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1246313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1246689Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1247065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1247488Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1247683Z 2025-08-14T21:45:46.1247778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1248126Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1248419Z return mod(**inputs) 2025-08-14T21:45:46.1248754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1249112Z outputs = self.model( 2025-08-14T21:45:46.1249454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1249814Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1250164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1250524Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1250840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1251166Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1251532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1251910Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1252287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1252651Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1252780Z 2025-08-14T21:45:46.1252873Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1253202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1253499Z return mod(**inputs) 2025-08-14T21:45:46.1253829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1254185Z outputs = self.model( 2025-08-14T21:45:46.1254527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1254880Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1255239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1255600Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1255918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1256241Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1256605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1256980Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1257365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1257744Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1257882Z 2025-08-14T21:45:46.1257956Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1258153Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1258357Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1258549Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1258778Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1259104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1259405Z return mod(**inputs) 2025-08-14T21:45:46.1259751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1260111Z outputs = self.model( 2025-08-14T21:45:46.1260452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1260839Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1261200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1261569Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1261915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1262257Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1262632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1263017Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1263416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1263815Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1264233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1264766Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1264956Z 2025-08-14T21:45:46.1265062Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1265410Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1265716Z return mod(**inputs) 2025-08-14T21:45:46.1266077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1266452Z outputs = self.model( 2025-08-14T21:45:46.1266804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1267171Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1267532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1267898Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1268214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1268553Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1268925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1269307Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1269677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1270058Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1270486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1270904Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1271060Z 2025-08-14T21:45:46.1271155Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1271503Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1271804Z return mod(**inputs) 2025-08-14T21:45:46.1272162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1272520Z outputs = self.model( 2025-08-14T21:45:46.1272862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1273225Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1273578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1273953Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1274272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1274598Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1274968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1275345Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1275718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1276077Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1276208Z 2025-08-14T21:45:46.1276302Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1276633Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1276923Z return mod(**inputs) 2025-08-14T21:45:46.1277270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1277629Z outputs = self.model( 2025-08-14T21:45:46.1277977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1278337Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1278698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1279057Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1279377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1279700Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1280072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1280478Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1280637Z 2025-08-14T21:45:46.1280731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1281062Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1281359Z return mod(**inputs) 2025-08-14T21:45:46.1281703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1282053Z outputs = self.model( 2025-08-14T21:45:46.1282399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1282763Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1283129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1283500Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1283819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1284151Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1284528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1285084Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1285444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1285758Z return self.act(input) 2025-08-14T21:45:46.1285859Z 2025-08-14T21:45:46.1285953Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1286285Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1286585Z return mod(**inputs) 2025-08-14T21:45:46.1286980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1287356Z outputs = self.model( 2025-08-14T21:45:46.1287711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1288085Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1288449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1288823Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1289154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1289489Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1289870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1290254Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1290386Z 2025-08-14T21:45:46.1290493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1290826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1291135Z return mod(**inputs) 2025-08-14T21:45:46.1291488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1291856Z outputs = self.model( 2025-08-14T21:45:46.1292200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1292575Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1292943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1293311Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1293639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1293981Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1294357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1294739Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1295130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1295573Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1295766Z 2025-08-14T21:45:46.1295873Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1296205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1296577Z return mod(**inputs) 2025-08-14T21:45:46.1296929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1297285Z outputs = self.model( 2025-08-14T21:45:46.1297650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1298027Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1298410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1298771Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1299100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1299442Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1299808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1300207Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1300585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1300951Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1301074Z 2025-08-14T21:45:46.1301168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1301499Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1301797Z return mod(**inputs) 2025-08-14T21:45:46.1302138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1302489Z outputs = self.model( 2025-08-14T21:45:46.1302833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1303195Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1303549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1303908Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1304228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1304558Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1304971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1305356Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1305734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1306107Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1306233Z 2025-08-14T21:45:46.1306310Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1306512Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1306706Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1306888Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1307104Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1307437Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1307733Z return mod(**inputs) 2025-08-14T21:45:46.1308077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1308438Z outputs = self.model( 2025-08-14T21:45:46.1308779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1309137Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1309519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1309893Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1310208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1310546Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1310936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1311337Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1311705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1312093Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1312506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1312949Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1313135Z 2025-08-14T21:45:46.1313230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1313563Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1313864Z return mod(**inputs) 2025-08-14T21:45:46.1314204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1314565Z outputs = self.model( 2025-08-14T21:45:46.1314907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1315272Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1315622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1315983Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1316302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1316633Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1316993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1317373Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1317752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1318127Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1318534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1318952Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1319101Z 2025-08-14T21:45:46.1319203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1319526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1319822Z return mod(**inputs) 2025-08-14T21:45:46.1320163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1320518Z outputs = self.model( 2025-08-14T21:45:46.1320855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1321217Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1321577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1321932Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1322266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1322602Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1322967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1323336Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1323729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1324118Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1324243Z 2025-08-14T21:45:46.1324346Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1324672Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1324972Z return mod(**inputs) 2025-08-14T21:45:46.1325315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1325672Z outputs = self.model( 2025-08-14T21:45:46.1326041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1326410Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1326773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1327137Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1327465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1327801Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1328165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1328586Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1328757Z 2025-08-14T21:45:46.1328856Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1329191Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1329485Z return mod(**inputs) 2025-08-14T21:45:46.1329834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1330202Z outputs = self.model( 2025-08-14T21:45:46.1330544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1330915Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1331276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1331645Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1331964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1332307Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1332680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1333090Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1333449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1333769Z return self.act(input) 2025-08-14T21:45:46.1333872Z 2025-08-14T21:45:46.1333975Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1334303Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1334605Z return mod(**inputs) 2025-08-14T21:45:46.1334953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1336190Z outputs = self.model( 2025-08-14T21:45:46.1336541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1336916Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1337300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1337662Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1338016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1338350Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1338722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1339088Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1339220Z 2025-08-14T21:45:46.1339318Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1339663Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1339959Z return mod(**inputs) 2025-08-14T21:45:46.1340294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1340655Z outputs = self.model( 2025-08-14T21:45:46.1340998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1341360Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1341719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1342084Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1342402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1342729Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1343097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 327, in forward 2025-08-14T21:45:46.1343467Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1343589Z 2025-08-14T21:45:46.1343692Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1344014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1344313Z return mod(**inputs) 2025-08-14T21:45:46.1344719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1345089Z outputs = self.model( 2025-08-14T21:45:46.1345439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1345815Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1346185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1346548Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1346872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1347212Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1347578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1347965Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1348347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1348783Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1348972Z 2025-08-14T21:45:46.1349088Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1349427Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1349734Z return mod(**inputs) 2025-08-14T21:45:46.1350092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1350472Z outputs = self.model( 2025-08-14T21:45:46.1350820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1351201Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1351552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1351913Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1352233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1352565Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1352944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1353323Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1353702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1354065Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1354198Z 2025-08-14T21:45:46.1354293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1354625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1354922Z return mod(**inputs) 2025-08-14T21:45:46.1355257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1355617Z outputs = self.model( 2025-08-14T21:45:46.1355963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1356328Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1356680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1357041Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1357359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1357681Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1358045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1358422Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1358801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1359170Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1359307Z 2025-08-14T21:45:46.1359382Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1359576Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1359762Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1359955Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1360173Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1360505Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1360798Z return mod(**inputs) 2025-08-14T21:45:46.1361145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1361512Z outputs = self.model( 2025-08-14T21:45:46.1361869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1362242Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1362604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1362965Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1363292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1363651Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1364020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1364394Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1364774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1365163Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1365597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1366030Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1366209Z 2025-08-14T21:45:46.1366306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1366637Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1366938Z return mod(**inputs) 2025-08-14T21:45:46.1367276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1367639Z outputs = self.model( 2025-08-14T21:45:46.1367983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1368341Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1368705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1369072Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1369391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1369716Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1370083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1370464Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1370840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1371218Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1371625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1372044Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1372191Z 2025-08-14T21:45:46.1372285Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1372614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1372915Z return mod(**inputs) 2025-08-14T21:45:46.1373261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1373617Z outputs = self.model( 2025-08-14T21:45:46.1373965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1374330Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1374706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1375071Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1375404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1375750Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1376143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1376541Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1376919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1377290Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1377414Z 2025-08-14T21:45:46.1377508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1377837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1378139Z return mod(**inputs) 2025-08-14T21:45:46.1378497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1378858Z outputs = self.model( 2025-08-14T21:45:46.1379204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1379569Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1379926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1380292Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1380612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1380946Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1381308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1381716Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1381873Z 2025-08-14T21:45:46.1381974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1382295Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1382594Z return mod(**inputs) 2025-08-14T21:45:46.1382937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1383301Z outputs = self.model( 2025-08-14T21:45:46.1383639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1384005Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1384368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1384935Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1385267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1385629Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1386008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1386416Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1386778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1387099Z return self.act(input) 2025-08-14T21:45:46.1387204Z 2025-08-14T21:45:46.1387308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1387638Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1387941Z return mod(**inputs) 2025-08-14T21:45:46.1388330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1388684Z outputs = self.model( 2025-08-14T21:45:46.1389028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1389428Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1389810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1390164Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1390486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1390819Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1391180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1391576Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1391709Z 2025-08-14T21:45:46.1391804Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1392131Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1392426Z return mod(**inputs) 2025-08-14T21:45:46.1392770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1393136Z outputs = self.model( 2025-08-14T21:45:46.1393484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1393844Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1394207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1394569Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1394882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1395214Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1395582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1395962Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1396333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1396763Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1396954Z 2025-08-14T21:45:46.1397059Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1397387Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1397677Z return mod(**inputs) 2025-08-14T21:45:46.1398019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1398376Z outputs = self.model( 2025-08-14T21:45:46.1398713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1399075Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1399434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1399794Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1400104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1400435Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1400818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1401196Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1401573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1401943Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1402065Z 2025-08-14T21:45:46.1402180Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1402521Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1402817Z return mod(**inputs) 2025-08-14T21:45:46.1403161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1403519Z outputs = self.model( 2025-08-14T21:45:46.1403854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1404216Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1404594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1404951Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1405277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1405615Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1405988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1406365Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1406746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1407124Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1407255Z 2025-08-14T21:45:46.1407338Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1407534Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1407730Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1407922Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1408134Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1408472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1408778Z return mod(**inputs) 2025-08-14T21:45:46.1409118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1409486Z outputs = self.model( 2025-08-14T21:45:46.1409837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1410207Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1410564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1410936Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1411260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1411587Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1411960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1412346Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1412727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1413108Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1413523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1413977Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1414152Z 2025-08-14T21:45:46.1414253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1414580Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1414879Z return mod(**inputs) 2025-08-14T21:45:46.1415240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1415609Z outputs = self.model( 2025-08-14T21:45:46.1415955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1416316Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1416674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1417030Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1417370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1417708Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1418084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1418463Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1418850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1419241Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1419650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1420076Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1420233Z 2025-08-14T21:45:46.1420333Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1420677Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1420976Z return mod(**inputs) 2025-08-14T21:45:46.1421326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1421696Z outputs = self.model( 2025-08-14T21:45:46.1422042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1422415Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1422781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1423149Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1423471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1423818Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1424193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1424582Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1425035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1425416Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1425542Z 2025-08-14T21:45:46.1425647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1425973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1426273Z return mod(**inputs) 2025-08-14T21:45:46.1426620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1427006Z outputs = self.model( 2025-08-14T21:45:46.1427353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1427722Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1428105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1428475Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1428810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1429142Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1429508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1429908Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1430075Z 2025-08-14T21:45:46.1430171Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1430522Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1430823Z return mod(**inputs) 2025-08-14T21:45:46.1431155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1431519Z outputs = self.model( 2025-08-14T21:45:46.1431864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1432219Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1432577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1432934Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1433254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1433579Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1433943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1434342Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1434693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1435001Z return self.act(input) 2025-08-14T21:45:46.1435108Z 2025-08-14T21:45:46.1435203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1435532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1435820Z return mod(**inputs) 2025-08-14T21:45:46.1436159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1436520Z outputs = self.model( 2025-08-14T21:45:46.1436863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1437216Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1437572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1437931Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1438239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1438571Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1438934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1439301Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1439424Z 2025-08-14T21:45:46.1439518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1439877Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1440181Z return mod(**inputs) 2025-08-14T21:45:46.1440528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1440903Z outputs = self.model( 2025-08-14T21:45:46.1441252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1441639Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1441991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1442360Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1442684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1443017Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1443395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 327, in forward 2025-08-14T21:45:46.1443763Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1443885Z 2025-08-14T21:45:46.1443988Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1444308Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1444609Z return mod(**inputs) 2025-08-14T21:45:46.1444951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1445309Z outputs = self.model( 2025-08-14T21:45:46.1445647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1446009Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1446367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1446729Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1447039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1447368Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1447731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1448105Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1448485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1448918Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1449105Z 2025-08-14T21:45:46.1449209Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1449532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1449825Z return mod(**inputs) 2025-08-14T21:45:46.1450167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1450525Z outputs = self.model( 2025-08-14T21:45:46.1450861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1451222Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1451577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1451933Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1452249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1452597Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1452967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1453337Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1453728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1454113Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1454234Z 2025-08-14T21:45:46.1454329Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1454659Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1454956Z return mod(**inputs) 2025-08-14T21:45:46.1455296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1455648Z outputs = self.model( 2025-08-14T21:45:46.1456008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1456372Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1456730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1457083Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1457401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1457731Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1458090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1458476Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1458855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1459227Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1459356Z 2025-08-14T21:45:46.1459429Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1459626Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1459815Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1459996Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1460209Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1460540Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1460838Z return mod(**inputs) 2025-08-14T21:45:46.1461171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1461528Z outputs = self.model( 2025-08-14T21:45:46.1461872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1462232Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1462590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1462948Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1463269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1463596Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1463960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1464334Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1464773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1465168Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1465600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1466049Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1466220Z 2025-08-14T21:45:46.1466333Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1466666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1466987Z return mod(**inputs) 2025-08-14T21:45:46.1467333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1467686Z outputs = self.model( 2025-08-14T21:45:46.1468039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1468403Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1468760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1469145Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1469464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1469800Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1470165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1470547Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1470925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1471311Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1471715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1472136Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1472284Z 2025-08-14T21:45:46.1472387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1472713Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1473015Z return mod(**inputs) 2025-08-14T21:45:46.1473361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1473729Z outputs = self.model( 2025-08-14T21:45:46.1474071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1474437Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1474797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1475162Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1475479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1475814Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1476183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1476558Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1476936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1477306Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1477429Z 2025-08-14T21:45:46.1477530Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1477854Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1478170Z return mod(**inputs) 2025-08-14T21:45:46.1478518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1478871Z outputs = self.model( 2025-08-14T21:45:46.1479235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1479602Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1479980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1480340Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1480664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1480998Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1481366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1481796Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1481962Z 2025-08-14T21:45:46.1482057Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1482383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1482673Z return mod(**inputs) 2025-08-14T21:45:46.1483016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1483381Z outputs = self.model( 2025-08-14T21:45:46.1483725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1484078Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1484435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1484919Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1485242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1485581Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1485956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1486371Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1486732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1487056Z return self.act(input) 2025-08-14T21:45:46.1487158Z 2025-08-14T21:45:46.1487262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1487595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1487893Z return mod(**inputs) 2025-08-14T21:45:46.1488245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1488615Z outputs = self.model( 2025-08-14T21:45:46.1488958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1489329Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1489698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1490069Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1490384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1490719Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1491087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1491488Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1491623Z 2025-08-14T21:45:46.1491720Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1492050Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1492346Z return mod(**inputs) 2025-08-14T21:45:46.1492701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1493093Z outputs = self.model( 2025-08-14T21:45:46.1493442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1493810Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1494164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1494532Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1494899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1495224Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1495592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1495970Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1496349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1496777Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1496971Z 2025-08-14T21:45:46.1497066Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1497398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1497696Z return mod(**inputs) 2025-08-14T21:45:46.1498034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1498394Z outputs = self.model( 2025-08-14T21:45:46.1498737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1499094Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1499454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1499813Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1500129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1500454Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1500819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1501199Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1501567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1501934Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1502061Z 2025-08-14T21:45:46.1502156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1502490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1502777Z return mod(**inputs) 2025-08-14T21:45:46.1503119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1503477Z outputs = self.model( 2025-08-14T21:45:46.1503817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1504199Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1504562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1504982Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1505315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1505651Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1506035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1506415Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1506787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1507165Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1507294Z 2025-08-14T21:45:46.1507380Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1507592Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1507792Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1507980Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1508191Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1508513Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1508812Z return mod(**inputs) 2025-08-14T21:45:46.1509155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1509509Z outputs = self.model( 2025-08-14T21:45:46.1509856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1510220Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1510581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1510938Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1511259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1511590Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1511949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1512331Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1512709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1513092Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1513491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1513936Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1514113Z 2025-08-14T21:45:46.1514207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1514536Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1514823Z return mod(**inputs) 2025-08-14T21:45:46.1515167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1515527Z outputs = self.model( 2025-08-14T21:45:46.1515861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1516225Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1516584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1516941Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1517271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1517607Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1517987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1518371Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1518765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1519149Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1519556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1519972Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1520126Z 2025-08-14T21:45:46.1520223Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1520574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1520870Z return mod(**inputs) 2025-08-14T21:45:46.1521204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1521567Z outputs = self.model( 2025-08-14T21:45:46.1521912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1522268Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1522638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1523002Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1523322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1523646Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1524010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1524390Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1524764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1525128Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1525258Z 2025-08-14T21:45:46.1525352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1525680Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1525971Z return mod(**inputs) 2025-08-14T21:45:46.1526314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1526675Z outputs = self.model( 2025-08-14T21:45:46.1527022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1527381Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1527742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1528105Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1528429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1528751Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1529113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1529515Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1529672Z 2025-08-14T21:45:46.1529782Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1530116Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1530411Z return mod(**inputs) 2025-08-14T21:45:46.1530770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1531126Z outputs = self.model( 2025-08-14T21:45:46.1531503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1531870Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1532220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1532583Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1532900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1533258Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1533627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1534040Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1534407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1534730Z return self.act(input) 2025-08-14T21:45:46.1534835Z 2025-08-14T21:45:46.1534934Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1535275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1535583Z return mod(**inputs) 2025-08-14T21:45:46.1535928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1536302Z outputs = self.model( 2025-08-14T21:45:46.1536659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1537032Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1537395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1537768Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1538097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1538432Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1538814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1539197Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1539326Z 2025-08-14T21:45:46.1539432Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1539766Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1540076Z return mod(**inputs) 2025-08-14T21:45:46.1540425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1540797Z outputs = self.model( 2025-08-14T21:45:46.1541141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1541516Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1541881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1542247Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1542573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1542930Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1543300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 327, in forward 2025-08-14T21:45:46.1543662Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1543790Z 2025-08-14T21:45:46.1543884Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1544230Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1544539Z return mod(**inputs) 2025-08-14T21:45:46.1544954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1545323Z outputs = self.model( 2025-08-14T21:45:46.1545671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1546032Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1546396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1546781Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1547103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1547431Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1547801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1548185Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1548561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1549001Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1549197Z 2025-08-14T21:45:46.1549293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1549627Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1549920Z return mod(**inputs) 2025-08-14T21:45:46.1550265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1550628Z outputs = self.model( 2025-08-14T21:45:46.1550970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1551335Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1551698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1552061Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1552374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1552708Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1553079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1553460Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1553833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1554205Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1554329Z 2025-08-14T21:45:46.1554432Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1554754Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1555053Z return mod(**inputs) 2025-08-14T21:45:46.1555397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1555759Z outputs = self.model( 2025-08-14T21:45:46.1556115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1556484Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1556858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1557222Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1557551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1557885Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1558248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1558619Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1559001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1559391Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1559518Z 2025-08-14T21:45:46.1559598Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1559785Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1559977Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1560164Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1560369Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1560701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1561003Z return mod(**inputs) 2025-08-14T21:45:46.1561338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1561701Z outputs = self.model( 2025-08-14T21:45:46.1562049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1562416Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1562766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1563124Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1563444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1563774Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1564131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1564508Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1564884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1565257Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1565667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1566111Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1566278Z 2025-08-14T21:45:46.1566380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1566704Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1567004Z return mod(**inputs) 2025-08-14T21:45:46.1567350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1567711Z outputs = self.model( 2025-08-14T21:45:46.1568050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1568416Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1568790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1569151Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1569474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1569827Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1570200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1570591Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1570971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1571358Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1571769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1572199Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1572353Z 2025-08-14T21:45:46.1572450Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1572784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1573084Z return mod(**inputs) 2025-08-14T21:45:46.1573433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1573800Z outputs = self.model( 2025-08-14T21:45:46.1574153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1574516Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1574879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1575249Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1575569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1575909Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1576285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1576673Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1576921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1576998Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1577002Z 2025-08-14T21:45:46.1577106Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1577291Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1577364Z return mod(**inputs) 2025-08-14T21:45:46.1577611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1577675Z outputs = self.model( 2025-08-14T21:45:46.1577930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1577999Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1578245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1578321Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1578526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1578607Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1578869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1578983Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1578987Z 2025-08-14T21:45:46.1579089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1579269Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1579360Z return mod(**inputs) 2025-08-14T21:45:46.1579605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1579683Z outputs = self.model( 2025-08-14T21:45:46.1579935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1580001Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1580240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1580315Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1580537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1580617Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1580863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1580975Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1581186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1581253Z return self.act(input) 2025-08-14T21:45:46.1581256Z 2025-08-14T21:45:46.1581360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1581545Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1581608Z return mod(**inputs) 2025-08-14T21:45:46.1581862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1581930Z outputs = self.model( 2025-08-14T21:45:46.1582174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1582252Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1582497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1582574Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1582777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1582853Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1583106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1583186Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1583191Z 2025-08-14T21:45:46.1583297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1583482Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1583545Z return mod(**inputs) 2025-08-14T21:45:46.1583799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1583865Z outputs = self.model( 2025-08-14T21:45:46.1584113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1584191Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1584437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1584515Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1584922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1585004Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1585259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1585380Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1585653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1585802Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1585806Z 2025-08-14T21:45:46.1585900Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1586098Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1586158Z return mod(**inputs) 2025-08-14T21:45:46.1586405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1586503Z outputs = self.model( 2025-08-14T21:45:46.1586746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1586822Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1587067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1587134Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1587344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1587418Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1587661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1587755Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1587995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1588076Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1588079Z 2025-08-14T21:45:46.1588173Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1588357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1588424Z return mod(**inputs) 2025-08-14T21:45:46.1588667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1588734Z outputs = self.model( 2025-08-14T21:45:46.1588979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1589045Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1589294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1589358Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1589560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1589639Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1589882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1589971Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1590209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1590286Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1590289Z 2025-08-14T21:45:46.1590401Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1590477Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1590555Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1590624Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1590717Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1590926Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1590987Z return mod(**inputs) 2025-08-14T21:45:46.1591249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1591319Z outputs = self.model( 2025-08-14T21:45:46.1591562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1591635Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1591874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1591959Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1592168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1592238Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1592479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1592569Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1592808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1592906Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1593173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1593297Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1593302Z 2025-08-14T21:45:46.1593403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1593585Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1593651Z return mod(**inputs) 2025-08-14T21:45:46.1593897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1593959Z outputs = self.model( 2025-08-14T21:45:46.1594206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1594273Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1594511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1594583Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1594783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1594863Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1595103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1595186Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1595436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1595526Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1595793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1595901Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1595904Z 2025-08-14T21:45:46.1596014Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1596205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1596265Z return mod(**inputs) 2025-08-14T21:45:46.1596504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1596588Z outputs = self.model( 2025-08-14T21:45:46.1596835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1596928Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1597169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1597233Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1597443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1597515Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1597773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1597862Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1598103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1598186Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1598190Z 2025-08-14T21:45:46.1598282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1598463Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1598530Z return mod(**inputs) 2025-08-14T21:45:46.1598770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1598840Z outputs = self.model( 2025-08-14T21:45:46.1599084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1599148Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1599396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1599462Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1599662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1599739Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1599979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1600095Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1600098Z 2025-08-14T21:45:46.1600191Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1600373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1600439Z return mod(**inputs) 2025-08-14T21:45:46.1600680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1600748Z outputs = self.model( 2025-08-14T21:45:46.1600991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1601058Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1601299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1601363Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1601563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1601667Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1601913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1602028Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1602237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1602320Z return self.act(input) 2025-08-14T21:45:46.1602323Z 2025-08-14T21:45:46.1602422Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1602605Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1602670Z return mod(**inputs) 2025-08-14T21:45:46.1602914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1602976Z outputs = self.model( 2025-08-14T21:45:46.1603226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1603308Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1603552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1603624Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1603829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1603907Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1604153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1604226Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1604230Z 2025-08-14T21:45:46.1604329Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1604512Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1604572Z return mod(**inputs) 2025-08-14T21:45:46.1604825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1604887Z outputs = self.model( 2025-08-14T21:45:46.1605139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1605205Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1605449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1605521Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1605725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1605803Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1606049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 327, in forward 2025-08-14T21:45:46.1606121Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1606124Z 2025-08-14T21:45:46.1606223Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1606407Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1606466Z return mod(**inputs) 2025-08-14T21:45:46.1606719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1606780Z outputs = self.model( 2025-08-14T21:45:46.1607030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1607092Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1607350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1607425Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1607629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1607716Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1607965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1608067Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1608313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1608451Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1608455Z 2025-08-14T21:45:46.1608549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1608733Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1608811Z return mod(**inputs) 2025-08-14T21:45:46.1609060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1609122Z outputs = self.model( 2025-08-14T21:45:46.1609364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1609438Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1609674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1609744Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1609945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1610016Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1610262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1610345Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1610584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1610663Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1610668Z 2025-08-14T21:45:46.1610761Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1610947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1611007Z return mod(**inputs) 2025-08-14T21:45:46.1611247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1611317Z outputs = self.model( 2025-08-14T21:45:46.1611560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1611625Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1611874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1611940Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1612150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1612220Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1612454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1612545Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1612798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1612890Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1612893Z 2025-08-14T21:45:46.1612966Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1613038Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1613115Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1613210Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1613307Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1613515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1613575Z return mod(**inputs) 2025-08-14T21:45:46.1613825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1613887Z outputs = self.model( 2025-08-14T21:45:46.1614128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1614217Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1614455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1614520Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1614729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1614800Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1615042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1615124Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1615361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1615457Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1615722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1615850Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1615854Z 2025-08-14T21:45:46.1615945Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1616127Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1616195Z return mod(**inputs) 2025-08-14T21:45:46.1616436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1616496Z outputs = self.model( 2025-08-14T21:45:46.1616746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1616812Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1617057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1617125Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1617326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1617402Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1617641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1617732Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1617970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1618058Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1618343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1618446Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1618450Z 2025-08-14T21:45:46.1618543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1618730Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1618804Z return mod(**inputs) 2025-08-14T21:45:46.1619056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1619134Z outputs = self.model( 2025-08-14T21:45:46.1619375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1619447Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1619687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1619759Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1619988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1620060Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1620306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 312, in forward 2025-08-14T21:45:46.1620388Z hidden_states, attn_weights = self.self_attn( 2025-08-14T21:45:46.1620634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1620715Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1620718Z 2025-08-14T21:45:46.1620809Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1620998Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1621055Z return mod(**inputs) 2025-08-14T21:45:46.1621302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1621372Z outputs = self.model( 2025-08-14T21:45:46.1621619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1621685Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1621935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1621999Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1622206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1622275Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1622516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1622633Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1622636Z 2025-08-14T21:45:46.1622729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1622918Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1622978Z return mod(**inputs) 2025-08-14T21:45:46.1623223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1623294Z outputs = self.model( 2025-08-14T21:45:46.1623537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1623603Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1623849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1623931Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1624142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1624212Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1624469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 323, in forward 2025-08-14T21:45:46.1624586Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1624861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1624937Z return self.act(input) 2025-08-14T21:45:46.1624941Z 2025-08-14T21:45:46.1625036Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1625216Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1625282Z return mod(**inputs) 2025-08-14T21:45:46.1625531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1625614Z outputs = self.model( 2025-08-14T21:45:46.1625871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1279, in forward 2025-08-14T21:45:46.1625940Z encoder_outputs = self.encoder( 2025-08-14T21:45:46.1626188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 841, in forward 2025-08-14T21:45:46.1626256Z layer_outputs = encoder_layer( 2025-08-14T21:45:46.1626458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1626538Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1626778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 325, in forward 2025-08-14T21:45:46.1626856Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1626868Z 2025-08-14T21:45:46.1626962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1627144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1627213Z return mod(**inputs) 2025-08-14T21:45:46.1627457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1627519Z outputs = self.model( 2025-08-14T21:45:46.1627768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1627833Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1628081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1628146Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1628346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1628428Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1628669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1628764Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1629011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1629146Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1629150Z 2025-08-14T21:45:46.1629250Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1629431Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1629488Z return mod(**inputs) 2025-08-14T21:45:46.1629759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1629822Z outputs = self.model( 2025-08-14T21:45:46.1630072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1630154Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1630400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1630517Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1630721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1630794Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1631042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1631151Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1631399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1631472Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1631476Z 2025-08-14T21:45:46.1631568Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1631761Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1631818Z return mod(**inputs) 2025-08-14T21:45:46.1632069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1632129Z outputs = self.model( 2025-08-14T21:45:46.1632370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1632444Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1632686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1632751Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1632962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1633033Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1633283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1633370Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1633612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1633698Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1633701Z 2025-08-14T21:45:46.1633773Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1633854Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1633922Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1633991Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1634092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1634274Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1634335Z return mod(**inputs) 2025-08-14T21:45:46.1634585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1634646Z outputs = self.model( 2025-08-14T21:45:46.1634887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1634958Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1635214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1635289Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1635491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1635561Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1635827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1635929Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1636177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1636265Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1636527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1636655Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1636674Z 2025-08-14T21:45:46.1636770Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1636961Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1637020Z return mod(**inputs) 2025-08-14T21:45:46.1637264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1637336Z outputs = self.model( 2025-08-14T21:45:46.1637577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1637643Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1637891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1637958Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1638167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1638238Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1638480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1638576Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1638818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1638907Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1639179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1639282Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1639285Z 2025-08-14T21:45:46.1639389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1639572Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1639630Z return mod(**inputs) 2025-08-14T21:45:46.1639882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1639946Z outputs = self.model( 2025-08-14T21:45:46.1640196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1640262Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1640504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1640578Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1640819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1640896Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1641150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1641237Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1641498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1641592Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1641595Z 2025-08-14T21:45:46.1641688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1641877Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1641936Z return mod(**inputs) 2025-08-14T21:45:46.1642182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1642245Z outputs = self.model( 2025-08-14T21:45:46.1642504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1642577Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1642818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1642886Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1643094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1643163Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1643411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1643510Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1643753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1643901Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1643905Z 2025-08-14T21:45:46.1643996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1644184Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1644245Z return mod(**inputs) 2025-08-14T21:45:46.1644486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1644555Z outputs = self.model( 2025-08-14T21:45:46.1644793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1644858Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1645107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1645172Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1645381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1645451Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1645692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1645800Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1646039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1646118Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1646121Z 2025-08-14T21:45:46.1646212Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1646407Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1646477Z return mod(**inputs) 2025-08-14T21:45:46.1646719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1646781Z outputs = self.model( 2025-08-14T21:45:46.1647048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1647132Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1647385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1647450Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1647653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1647729Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1647972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1648093Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1648335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1648413Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1648418Z 2025-08-14T21:45:46.1648495Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1648566Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1648633Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1648707Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1648797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1648984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1649043Z return mod(**inputs) 2025-08-14T21:45:46.1649287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1649355Z outputs = self.model( 2025-08-14T21:45:46.1649596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1649663Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1649912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1649976Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1650183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1650252Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1650495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1650600Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1650840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1650927Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1651200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1651320Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1651323Z 2025-08-14T21:45:46.1651424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1651604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1651663Z return mod(**inputs) 2025-08-14T21:45:46.1651927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1651992Z outputs = self.model( 2025-08-14T21:45:46.1652242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1652309Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1652565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1652656Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1652860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1652933Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1653180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1653278Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1653529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1653636Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1653899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1654005Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1654009Z 2025-08-14T21:45:46.1654103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1654290Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1654349Z return mod(**inputs) 2025-08-14T21:45:46.1654590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1654660Z outputs = self.model( 2025-08-14T21:45:46.1654902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1654968Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1655215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1655281Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1655487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1655559Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1655799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1655900Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1656137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1656220Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1656223Z 2025-08-14T21:45:46.1656314Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1656495Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1656561Z return mod(**inputs) 2025-08-14T21:45:46.1656802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1656864Z outputs = self.model( 2025-08-14T21:45:46.1657108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1657173Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1657416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1657497Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1657701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1657778Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1658032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1658151Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1658170Z 2025-08-14T21:45:46.1658264Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1658444Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1658511Z return mod(**inputs) 2025-08-14T21:45:46.1658752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1658813Z outputs = self.model( 2025-08-14T21:45:46.1659066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1659150Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1659399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1659466Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1659669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1659750Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1659990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1660103Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1660298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1660362Z return self.act(input) 2025-08-14T21:45:46.1660367Z 2025-08-14T21:45:46.1660469Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1660648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1660708Z return mod(**inputs) 2025-08-14T21:45:46.1660957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1661018Z outputs = self.model( 2025-08-14T21:45:46.1661265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1661331Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1661571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1661642Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1661844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1661916Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1662164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1662237Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1662242Z 2025-08-14T21:45:46.1662341Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1662523Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1662579Z return mod(**inputs) 2025-08-14T21:45:46.1662828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1662889Z outputs = self.model( 2025-08-14T21:45:46.1663151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1663220Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1663460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1663533Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1663762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1663849Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1664097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1664187Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1664436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1664573Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1664592Z 2025-08-14T21:45:46.1664782Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1664979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1665038Z return mod(**inputs) 2025-08-14T21:45:46.1665291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1665354Z outputs = self.model( 2025-08-14T21:45:46.1665598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1665672Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1665915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1665981Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1666196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1666268Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1666518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1666609Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1666851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1666932Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1666936Z 2025-08-14T21:45:46.1667027Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1667217Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1667277Z return mod(**inputs) 2025-08-14T21:45:46.1667519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1667591Z outputs = self.model( 2025-08-14T21:45:46.1667835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1667904Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1668155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1668220Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1668428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1668500Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1668741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1668859Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1669103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1669180Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1669191Z 2025-08-14T21:45:46.1669279Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1669353Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1669448Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1669516Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1669607Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1669794Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1669854Z return mod(**inputs) 2025-08-14T21:45:46.1670096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1670179Z outputs = self.model( 2025-08-14T21:45:46.1670422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1670495Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1670738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1670805Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1671012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1671082Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1671332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1671422Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1671662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1671759Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1672026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1672148Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1672159Z 2025-08-14T21:45:46.1672253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1672436Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1672501Z return mod(**inputs) 2025-08-14T21:45:46.1672747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1672810Z outputs = self.model( 2025-08-14T21:45:46.1673062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1673129Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1673376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1673441Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1673641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1673722Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1673966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1674052Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1674299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1674403Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1674677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1674776Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1674780Z 2025-08-14T21:45:46.1674888Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1675080Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1675154Z return mod(**inputs) 2025-08-14T21:45:46.1675408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1675468Z outputs = self.model( 2025-08-14T21:45:46.1675712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1675785Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1676048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1676112Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1676323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1676394Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1676644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1676728Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1676968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1677046Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1677050Z 2025-08-14T21:45:46.1677143Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1677332Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1677391Z return mod(**inputs) 2025-08-14T21:45:46.1677633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1677702Z outputs = self.model( 2025-08-14T21:45:46.1677943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1678008Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1678258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1678324Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1678534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1678607Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1678851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1678956Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1679200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1679347Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1679351Z 2025-08-14T21:45:46.1679442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1679623Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1679691Z return mod(**inputs) 2025-08-14T21:45:46.1679949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1680013Z outputs = self.model( 2025-08-14T21:45:46.1680262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1680328Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1680595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1680677Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1680884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1680963Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1681210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1681314Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1681561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1681654Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1681657Z 2025-08-14T21:45:46.1681755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1681936Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1681997Z return mod(**inputs) 2025-08-14T21:45:46.1682249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1682310Z outputs = self.model( 2025-08-14T21:45:46.1682556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1682621Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1682862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1682936Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1683135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1683205Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1683454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1683551Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1683798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1683875Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1683879Z 2025-08-14T21:45:46.1683951Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1684029Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1684101Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1684176Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1684267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1684445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1684511Z return mod(**inputs) 2025-08-14T21:45:46.1684902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1684970Z outputs = self.model( 2025-08-14T21:45:46.1685222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1685289Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1685538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1685651Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1685856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1685937Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1686200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1686299Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1686573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1686660Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1686934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1687052Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1687056Z 2025-08-14T21:45:46.1687150Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1687597Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1687656Z return mod(**inputs) 2025-08-14T21:45:46.1687911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1687973Z outputs = self.model( 2025-08-14T21:45:46.1688219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1688295Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1688538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1688604Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1688817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1688891Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1689143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1689240Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1689486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1689582Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1689850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1689955Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1689958Z 2025-08-14T21:45:46.1690052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1690235Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1690305Z return mod(**inputs) 2025-08-14T21:45:46.1690549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1690610Z outputs = self.model( 2025-08-14T21:45:46.1690860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1690927Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1691175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1691238Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1691439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1691518Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1691774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1691881Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1692138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1692214Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1692233Z 2025-08-14T21:45:46.1692336Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1692520Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1692578Z return mod(**inputs) 2025-08-14T21:45:46.1692827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1692887Z outputs = self.model( 2025-08-14T21:45:46.1693137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1693218Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1693459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1693535Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1693734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1693807Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1694052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1694158Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1694161Z 2025-08-14T21:45:46.1694261Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1694441Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1694502Z return mod(**inputs) 2025-08-14T21:45:46.1694750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1694811Z outputs = self.model( 2025-08-14T21:45:46.1695058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1695124Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1695364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1695435Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1695636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1695708Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1695959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1696067Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1696267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1696331Z return self.act(input) 2025-08-14T21:45:46.1696334Z 2025-08-14T21:45:46.1696427Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1696622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1696681Z return mod(**inputs) 2025-08-14T21:45:46.1696929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1696988Z outputs = self.model( 2025-08-14T21:45:46.1697243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1697319Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1697560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1697624Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1697849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1697934Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1698182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1698255Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1698258Z 2025-08-14T21:45:46.1698350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1698538Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1698615Z return mod(**inputs) 2025-08-14T21:45:46.1698863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1698922Z outputs = self.model( 2025-08-14T21:45:46.1699165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1699239Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1699479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1699542Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1699748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1699819Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1700065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 442, in forward 2025-08-14T21:45:46.1700139Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1700143Z 2025-08-14T21:45:46.1700234Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1700421Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1700479Z return mod(**inputs) 2025-08-14T21:45:46.1700719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1700787Z outputs = self.model( 2025-08-14T21:45:46.1701025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1701097Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1701337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1701403Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1701611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1701681Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1701926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1702018Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1702257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1702399Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1702402Z 2025-08-14T21:45:46.1702494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1702695Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1702756Z return mod(**inputs) 2025-08-14T21:45:46.1702997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1703065Z outputs = self.model( 2025-08-14T21:45:46.1703319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1703400Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1703650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1703713Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1703921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1703991Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1704233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1704355Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1704596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1704721Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1704734Z 2025-08-14T21:45:46.1704827Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1705021Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1705086Z return mod(**inputs) 2025-08-14T21:45:46.1705325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1705385Z outputs = self.model( 2025-08-14T21:45:46.1705633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1705699Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1705948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1706014Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1706217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1706298Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1706538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1706628Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1706875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1706955Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1706959Z 2025-08-14T21:45:46.1707041Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1707113Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1707182Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1707259Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1707352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1707532Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1707601Z return mod(**inputs) 2025-08-14T21:45:46.1707842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1707911Z outputs = self.model( 2025-08-14T21:45:46.1708152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1708235Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1708490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1708555Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1708772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1708851Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1709109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1709203Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1709447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1709535Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1709809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1709947Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1709950Z 2025-08-14T21:45:46.1710052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1710236Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1710296Z return mod(**inputs) 2025-08-14T21:45:46.1710549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1710609Z outputs = self.model( 2025-08-14T21:45:46.1710852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1710924Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1711166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1711239Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1711439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1711509Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1711760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1711848Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1712095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1712181Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1712443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1712550Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1712555Z 2025-08-14T21:45:46.1712647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1712834Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1712894Z return mod(**inputs) 2025-08-14T21:45:46.1713137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1713206Z outputs = self.model( 2025-08-14T21:45:46.1713447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1713514Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1713761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1713826Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1714051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1714128Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1714367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1714478Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1714733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1714808Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1714818Z 2025-08-14T21:45:46.1714911Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1715091Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1715156Z return mod(**inputs) 2025-08-14T21:45:46.1715398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1715475Z outputs = self.model( 2025-08-14T21:45:46.1715726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1715793Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1716043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1716109Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1716309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1716386Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1716626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1716725Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1716974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1717111Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1717114Z 2025-08-14T21:45:46.1717215Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1717395Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1717452Z return mod(**inputs) 2025-08-14T21:45:46.1717705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1717764Z outputs = self.model( 2025-08-14T21:45:46.1718013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1718079Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1718324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1718394Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1718596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1718667Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1718916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1719013Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1719259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1719332Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1719335Z 2025-08-14T21:45:46.1719442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1719634Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1719690Z return mod(**inputs) 2025-08-14T21:45:46.1719978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1720040Z outputs = self.model( 2025-08-14T21:45:46.1720300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1720371Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1720611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1720675Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1720882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1720968Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1721219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1721316Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1721558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1721645Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1721648Z 2025-08-14T21:45:46.1721719Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1721798Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1721866Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1721934Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1722034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1722216Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1722276Z return mod(**inputs) 2025-08-14T21:45:46.1722527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1722588Z outputs = self.model( 2025-08-14T21:45:46.1722834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1722909Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1723150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1723223Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1723426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1723497Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1723746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1723844Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1724093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1724182Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1724453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1724581Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1724584Z 2025-08-14T21:45:46.1724676Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1724864Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1724923Z return mod(**inputs) 2025-08-14T21:45:46.1725181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1725252Z outputs = self.model( 2025-08-14T21:45:46.1725494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1725576Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1725826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1725917Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1726127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1726196Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1726435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1726540Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1726799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1726889Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1727163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1727262Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1727266Z 2025-08-14T21:45:46.1727364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1727546Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1727605Z return mod(**inputs) 2025-08-14T21:45:46.1727858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1727920Z outputs = self.model( 2025-08-14T21:45:46.1728169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1728236Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1728476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1728551Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1728754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1728825Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1729073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1729170Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1729419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1729495Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1729498Z 2025-08-14T21:45:46.1729590Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1729780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1729839Z return mod(**inputs) 2025-08-14T21:45:46.1730087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1730147Z outputs = self.model( 2025-08-14T21:45:46.1730387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1730460Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1730720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1730788Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1730995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1731065Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1731325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1731448Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1731452Z 2025-08-14T21:45:46.1731543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1731729Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1731787Z return mod(**inputs) 2025-08-14T21:45:46.1732036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1732114Z outputs = self.model( 2025-08-14T21:45:46.1732356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1732426Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1732670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1732736Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1732944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1733016Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1733266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1733374Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1733571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1733642Z return self.act(input) 2025-08-14T21:45:46.1733646Z 2025-08-14T21:45:46.1733738Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1733928Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1733987Z return mod(**inputs) 2025-08-14T21:45:46.1734229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1734297Z outputs = self.model( 2025-08-14T21:45:46.1734539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1734607Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1734856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1734922Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1735133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1735205Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1735448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1735532Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1735536Z 2025-08-14T21:45:46.1735630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1735811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1735878Z return mod(**inputs) 2025-08-14T21:45:46.1736117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1736200Z outputs = self.model( 2025-08-14T21:45:46.1736445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1736509Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1736775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1736842Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1737066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1737139Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1737378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1737475Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1737716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1737870Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1737880Z 2025-08-14T21:45:46.1737973Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1738155Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1738220Z return mod(**inputs) 2025-08-14T21:45:46.1738463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1738524Z outputs = self.model( 2025-08-14T21:45:46.1738771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1738836Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1739087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1739152Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1739351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1739430Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1739670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1739761Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1740009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1740080Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1740083Z 2025-08-14T21:45:46.1740182Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1740364Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1740423Z return mod(**inputs) 2025-08-14T21:45:46.1740670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1740730Z outputs = self.model( 2025-08-14T21:45:46.1740980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1741046Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1741287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1741358Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1741560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1741630Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1741889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1741982Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1742232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1742325Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1742328Z 2025-08-14T21:45:46.1742417Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1742495Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1742564Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1742633Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1742734Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1742916Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1742983Z return mod(**inputs) 2025-08-14T21:45:46.1743224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1743302Z outputs = self.model( 2025-08-14T21:45:46.1743554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1743620Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1743863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1743936Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1744137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1744213Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1744453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1744544Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1744857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1744957Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1745234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1745358Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1745361Z 2025-08-14T21:45:46.1745453Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1745644Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1745703Z return mod(**inputs) 2025-08-14T21:45:46.1745946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1746016Z outputs = self.model( 2025-08-14T21:45:46.1746260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1746333Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1746573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1746640Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1746847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1746918Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1747167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1747256Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1747517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1747617Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1747885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1748008Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1748033Z 2025-08-14T21:45:46.1748128Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1748311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1748377Z return mod(**inputs) 2025-08-14T21:45:46.1748620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1748681Z outputs = self.model( 2025-08-14T21:45:46.1748931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1749015Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1749264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1749328Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1749528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1749608Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1749849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1749937Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1750183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1750259Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1750263Z 2025-08-14T21:45:46.1750364Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1750545Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1750604Z return mod(**inputs) 2025-08-14T21:45:46.1750855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1750917Z outputs = self.model( 2025-08-14T21:45:46.1751165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1751230Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1751471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1751544Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1751753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1751826Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1752074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 416, in forward 2025-08-14T21:45:46.1752148Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1752151Z 2025-08-14T21:45:46.1752253Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1752435Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1752493Z return mod(**inputs) 2025-08-14T21:45:46.1752741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1752800Z outputs = self.model( 2025-08-14T21:45:46.1753054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1753130Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1753370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1753441Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1753655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1753743Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1753992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1754089Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1754333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1754471Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1754493Z 2025-08-14T21:45:46.1754589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1754779Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1754838Z return mod(**inputs) 2025-08-14T21:45:46.1755080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1755149Z outputs = self.model( 2025-08-14T21:45:46.1755388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1755462Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1755704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1755768Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1755976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1756049Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1756296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1756396Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1756639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1756720Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1756724Z 2025-08-14T21:45:46.1756815Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1756993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1757058Z return mod(**inputs) 2025-08-14T21:45:46.1757302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1757370Z outputs = self.model( 2025-08-14T21:45:46.1757610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1757677Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1757923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1757989Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1758196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1758267Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1758508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1758625Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1758869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1758946Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1758957Z 2025-08-14T21:45:46.1759046Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1759121Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1759214Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1759282Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1759375Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1759562Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1759620Z return mod(**inputs) 2025-08-14T21:45:46.1759861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1759929Z outputs = self.model( 2025-08-14T21:45:46.1760193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1760266Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1760506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1760571Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1760778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1760847Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1761090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1761193Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1761434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1761529Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1761794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1761916Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1761920Z 2025-08-14T21:45:46.1762019Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1762202Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1762268Z return mod(**inputs) 2025-08-14T21:45:46.1762509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1762569Z outputs = self.model( 2025-08-14T21:45:46.1762820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1762887Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1763138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1763205Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1763409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1763489Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1763730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1763827Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1764078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1764182Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1764462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1764559Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1764562Z 2025-08-14T21:45:46.1764670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1764878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1764940Z return mod(**inputs) 2025-08-14T21:45:46.1765191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1765256Z outputs = self.model( 2025-08-14T21:45:46.1765500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1765578Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1765838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1765903Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1766113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1766183Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1766432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1766527Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1766766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1766846Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1766849Z 2025-08-14T21:45:46.1766943Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1767132Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1767190Z return mod(**inputs) 2025-08-14T21:45:46.1767433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1767500Z outputs = self.model( 2025-08-14T21:45:46.1767743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1767809Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1768057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1768122Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1768330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1768402Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1768642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1768757Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1768760Z 2025-08-14T21:45:46.1768855Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1769038Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1769103Z return mod(**inputs) 2025-08-14T21:45:46.1769346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1769414Z outputs = self.model( 2025-08-14T21:45:46.1769655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1769747Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1770003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1770067Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1770292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1770365Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1770623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1770737Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1770931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1770995Z return self.act(input) 2025-08-14T21:45:46.1770998Z 2025-08-14T21:45:46.1771101Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1771298Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1771363Z return mod(**inputs) 2025-08-14T21:45:46.1771603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1771665Z outputs = self.model( 2025-08-14T21:45:46.1771913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1771979Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1772221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1772291Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1772492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1772571Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1772813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1772886Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1772890Z 2025-08-14T21:45:46.1772989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1773169Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1773236Z return mod(**inputs) 2025-08-14T21:45:46.1773476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1773535Z outputs = self.model( 2025-08-14T21:45:46.1773783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1773848Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1774089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1774160Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1774361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1774441Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1774684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1774772Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1775019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1775155Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1775158Z 2025-08-14T21:45:46.1775276Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1775460Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1775520Z return mod(**inputs) 2025-08-14T21:45:46.1775782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1775844Z outputs = self.model( 2025-08-14T21:45:46.1776101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1776173Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1776417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1776489Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1776689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1776764Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1777033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1777122Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1777373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1777447Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1777451Z 2025-08-14T21:45:46.1777542Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1777731Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1777789Z return mod(**inputs) 2025-08-14T21:45:46.1778032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1778101Z outputs = self.model( 2025-08-14T21:45:46.1778346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1778420Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1778663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1778728Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1778940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1779010Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1779262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1779349Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1779594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1779680Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1779683Z 2025-08-14T21:45:46.1779753Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1779822Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1779901Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1779968Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1780067Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1780249Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1780308Z return mod(**inputs) 2025-08-14T21:45:46.1780560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1780620Z outputs = self.model( 2025-08-14T21:45:46.1780878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1780956Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1781199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1781272Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1781489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1781581Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1781826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1781913Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1782151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1782245Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1782537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1782665Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1782669Z 2025-08-14T21:45:46.1782763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1782942Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1783011Z return mod(**inputs) 2025-08-14T21:45:46.1783251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1783318Z outputs = self.model( 2025-08-14T21:45:46.1783556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1783623Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1783870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1783935Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1784135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1784213Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1784453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1784547Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1784927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1785020Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1785299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1785400Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1785403Z 2025-08-14T21:45:46.1785504Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1785693Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1785753Z return mod(**inputs) 2025-08-14T21:45:46.1786009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1786070Z outputs = self.model( 2025-08-14T21:45:46.1786315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1786388Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1786668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1786744Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1786946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1787017Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1787287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1787397Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1787647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1787724Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1787727Z 2025-08-14T21:45:46.1787818Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1788006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1788098Z return mod(**inputs) 2025-08-14T21:45:46.1788340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1788407Z outputs = self.model( 2025-08-14T21:45:46.1788649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1788723Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1788964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1789027Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1789234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1789304Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1789553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1789653Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1789891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1790035Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1790040Z 2025-08-14T21:45:46.1790132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1790311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1790378Z return mod(**inputs) 2025-08-14T21:45:46.1790618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1790686Z outputs = self.model( 2025-08-14T21:45:46.1790926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1790993Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1791241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1791305Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1791514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1791586Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1791823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1791927Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1792165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1792254Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1792268Z 2025-08-14T21:45:46.1792363Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1792541Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1792609Z return mod(**inputs) 2025-08-14T21:45:46.1792869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1792947Z outputs = self.model( 2025-08-14T21:45:46.1793201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1793265Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1793517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1793581Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1793785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1793879Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1794120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1794217Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1794467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1794543Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1794546Z 2025-08-14T21:45:46.1794626Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1794698Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1794769Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1794843Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1794936Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1795117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1795183Z return mod(**inputs) 2025-08-14T21:45:46.1795427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1795494Z outputs = self.model( 2025-08-14T21:45:46.1795736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1795801Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1796046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1796110Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1796309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1796390Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1796634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1796738Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1796979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1797069Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1797342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1797460Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1797463Z 2025-08-14T21:45:46.1797563Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1797762Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1797827Z return mod(**inputs) 2025-08-14T21:45:46.1798079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1798140Z outputs = self.model( 2025-08-14T21:45:46.1798400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1798489Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1798734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1798805Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1799008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1799079Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1799334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1799448Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1799699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1799788Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1800057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1800159Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1800163Z 2025-08-14T21:45:46.1800252Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1800442Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1800501Z return mod(**inputs) 2025-08-14T21:45:46.1800747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1800816Z outputs = self.model( 2025-08-14T21:45:46.1801062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1801130Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1801382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1801449Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1801659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1801730Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1801973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1802075Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1802319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1802392Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1802401Z 2025-08-14T21:45:46.1802494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1802675Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1802743Z return mod(**inputs) 2025-08-14T21:45:46.1802987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1803048Z outputs = self.model( 2025-08-14T21:45:46.1803302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1803383Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1803637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1803702Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1803918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1804001Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1804259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 433, in forward 2025-08-14T21:45:46.1804333Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1804336Z 2025-08-14T21:45:46.1804436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1804617Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1804683Z return mod(**inputs) 2025-08-14T21:45:46.1804924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1805001Z outputs = self.model( 2025-08-14T21:45:46.1805252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1805320Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1805561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1805636Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1805839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1805919Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1806162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1806273Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1806278Z 2025-08-14T21:45:46.1806381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1806563Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1806631Z return mod(**inputs) 2025-08-14T21:45:46.1806873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1806942Z outputs = self.model( 2025-08-14T21:45:46.1807192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1807256Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1807497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1807569Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1807772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1807853Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1808097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1808203Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1808405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1808468Z return self.act(input) 2025-08-14T21:45:46.1808471Z 2025-08-14T21:45:46.1808570Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1808751Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1808810Z return mod(**inputs) 2025-08-14T21:45:46.1809099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1809163Z outputs = self.model( 2025-08-14T21:45:46.1809404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1809476Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1809730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1809837Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1810041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1810110Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1810366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1810441Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1810460Z 2025-08-14T21:45:46.1810562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1810741Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1810800Z return mod(**inputs) 2025-08-14T21:45:46.1811050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1811111Z outputs = self.model( 2025-08-14T21:45:46.1811353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1811426Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1811669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1811740Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1811942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1812015Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1812264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1812355Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1812596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1812744Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1812747Z 2025-08-14T21:45:46.1812843Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1813030Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1813089Z return mod(**inputs) 2025-08-14T21:45:46.1813333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1813402Z outputs = self.model( 2025-08-14T21:45:46.1813645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1813718Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1813957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1814022Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1837367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1837478Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1837762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1837960Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1838220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1838310Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1838317Z 2025-08-14T21:45:46.1838459Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1838668Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1838769Z return mod(**inputs) 2025-08-14T21:45:46.1839023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1839097Z outputs = self.model( 2025-08-14T21:45:46.1839346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1839417Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1839670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1839765Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1839981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1840057Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1840304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1840404Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1840644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1840730Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1840734Z 2025-08-14T21:45:46.1840811Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1840885Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1840962Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1841031Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1841125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1841323Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1841384Z return mod(**inputs) 2025-08-14T21:45:46.1841627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1841695Z outputs = self.model( 2025-08-14T21:45:46.1841935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1842011Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1842252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1842319Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1842531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1842604Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1842851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1842943Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1843182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1843279Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1843546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1843689Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1843700Z 2025-08-14T21:45:46.1843796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1843978Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1844041Z return mod(**inputs) 2025-08-14T21:45:46.1844304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1844383Z outputs = self.model( 2025-08-14T21:45:46.1844634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1844700Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1844947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1845013Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1845216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1845312Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1845552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1845642Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1845888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1845976Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1846250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1846353Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1846357Z 2025-08-14T21:45:46.1846451Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1846640Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1846699Z return mod(**inputs) 2025-08-14T21:45:46.1846946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1847008Z outputs = self.model( 2025-08-14T21:45:46.1847247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1847321Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1847559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1847625Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1847835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1847910Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1848159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1848246Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1848486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1848571Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1848574Z 2025-08-14T21:45:46.1848667Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1848858Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1848918Z return mod(**inputs) 2025-08-14T21:45:46.1849158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1849243Z outputs = self.model( 2025-08-14T21:45:46.1849489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1849556Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1849826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1849895Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1850122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1850193Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1850432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1850540Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1850781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1850945Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1850949Z 2025-08-14T21:45:46.1851044Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1851229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1851300Z return mod(**inputs) 2025-08-14T21:45:46.1851548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1851613Z outputs = self.model( 2025-08-14T21:45:46.1851863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1851933Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1852184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1852254Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1852458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1852542Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1852786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1852895Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1853141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1853221Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1853225Z 2025-08-14T21:45:46.1853323Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1853516Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1853580Z return mod(**inputs) 2025-08-14T21:45:46.1853830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1853894Z outputs = self.model( 2025-08-14T21:45:46.1854140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1854219Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1854462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1854530Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1854744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1854816Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1855080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1855183Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1855424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1855541Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1855545Z 2025-08-14T21:45:46.1855635Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1855712Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1855781Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1855849Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1855947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1856130Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1856188Z return mod(**inputs) 2025-08-14T21:45:46.1856440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1856519Z outputs = self.model( 2025-08-14T21:45:46.1856770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1856837Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1857082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1857155Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1857357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1857428Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1857678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1857773Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1858022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1858111Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1858374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1858501Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1858505Z 2025-08-14T21:45:46.1858596Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1858777Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1858831Z return mod(**inputs) 2025-08-14T21:45:46.1859073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1859142Z outputs = self.model( 2025-08-14T21:45:46.1859388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1859453Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1859705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1859772Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1859980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1860049Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1860290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1860393Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1860653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1860753Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1861018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1861130Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1861147Z 2025-08-14T21:45:46.1861244Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1861422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1861479Z return mod(**inputs) 2025-08-14T21:45:46.1861720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1861777Z outputs = self.model( 2025-08-14T21:45:46.1862018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1862098Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1862337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1862402Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1862605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1862676Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1862910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1863003Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1863241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1863312Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1863316Z 2025-08-14T21:45:46.1863405Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1863592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1863649Z return mod(**inputs) 2025-08-14T21:45:46.1863898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1863961Z outputs = self.model( 2025-08-14T21:45:46.1864251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1864332Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1864574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1864693Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1864913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1864986Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1865235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1865352Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1865357Z 2025-08-14T21:45:46.1865450Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1865641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1865701Z return mod(**inputs) 2025-08-14T21:45:46.1865947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1866008Z outputs = self.model( 2025-08-14T21:45:46.1866267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1866347Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1866591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1866674Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1866887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1866976Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1867225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1867335Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1867530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1867602Z return self.act(input) 2025-08-14T21:45:46.1867621Z 2025-08-14T21:45:46.1867717Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1867908Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1867967Z return mod(**inputs) 2025-08-14T21:45:46.1868207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1868279Z outputs = self.model( 2025-08-14T21:45:46.1868518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1868584Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1868833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1868898Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1869107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1869178Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1869417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1869500Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1869503Z 2025-08-14T21:45:46.1869594Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1869782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1869840Z return mod(**inputs) 2025-08-14T21:45:46.1870080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1870148Z outputs = self.model( 2025-08-14T21:45:46.1870387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1870453Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1870700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1870763Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1870971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1871042Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1871280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 442, in forward 2025-08-14T21:45:46.1871360Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1871364Z 2025-08-14T21:45:46.1871455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1871633Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1871712Z return mod(**inputs) 2025-08-14T21:45:46.1871957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1872017Z outputs = self.model( 2025-08-14T21:45:46.1872276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1872341Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1872600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1872662Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1872864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1872931Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1873168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1873276Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1873514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1873654Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1873660Z 2025-08-14T21:45:46.1873751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1873926Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1873987Z return mod(**inputs) 2025-08-14T21:45:46.1874230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1874287Z outputs = self.model( 2025-08-14T21:45:46.1874534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1874596Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1874839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1874899Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1875100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1875176Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1875413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1875499Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1875739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1875810Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1875814Z 2025-08-14T21:45:46.1875905Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1876084Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1876139Z return mod(**inputs) 2025-08-14T21:45:46.1876385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1876445Z outputs = self.model( 2025-08-14T21:45:46.1876694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1876767Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1877011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1877075Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1877308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1877382Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1877631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1877736Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1877978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1878079Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1878082Z 2025-08-14T21:45:46.1878153Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1878230Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1878298Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1878364Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1878466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1878671Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1878731Z return mod(**inputs) 2025-08-14T21:45:46.1878982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1879043Z outputs = self.model( 2025-08-14T21:45:46.1879294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1879362Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1879605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1879676Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1879878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1879952Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1880205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1880295Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1880545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1880634Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1880903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1881033Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1881036Z 2025-08-14T21:45:46.1881127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1881318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1881380Z return mod(**inputs) 2025-08-14T21:45:46.1881623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1881692Z outputs = self.model( 2025-08-14T21:45:46.1881938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1882006Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1882257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1882321Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1882531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1882602Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1882863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1882964Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1883207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1883315Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1883579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1883696Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1883700Z 2025-08-14T21:45:46.1883796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1883978Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1884037Z return mod(**inputs) 2025-08-14T21:45:46.1884288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1884366Z outputs = self.model( 2025-08-14T21:45:46.1884773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1884852Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1885094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1885170Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1885369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1885439Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1885689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1885778Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1886028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1886101Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1886105Z 2025-08-14T21:45:46.1886197Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1886388Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1886448Z return mod(**inputs) 2025-08-14T21:45:46.1886695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1886756Z outputs = self.model( 2025-08-14T21:45:46.1886997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1887071Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1887316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1887381Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1887593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1887664Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1887913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1888012Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1888255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1888402Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1888406Z 2025-08-14T21:45:46.1888550Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1888743Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1888802Z return mod(**inputs) 2025-08-14T21:45:46.1889045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1889140Z outputs = self.model( 2025-08-14T21:45:46.1889383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1889476Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1889727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1889790Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1890001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1890072Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1890337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1890444Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1890689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1890768Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1890771Z 2025-08-14T21:45:46.1890862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1891041Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1891105Z return mod(**inputs) 2025-08-14T21:45:46.1891348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1891411Z outputs = self.model( 2025-08-14T21:45:46.1891662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1891726Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1891973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1892037Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1892240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1892317Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1892556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1892659Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1892899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1892978Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1892981Z 2025-08-14T21:45:46.1893057Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1893128Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1893196Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1893271Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1893365Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1893551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1893609Z return mod(**inputs) 2025-08-14T21:45:46.1893850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1893916Z outputs = self.model( 2025-08-14T21:45:46.1894173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1894243Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1894493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1894557Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1894778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1894866Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1895104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1895206Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1895446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1895542Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1895834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1895954Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1895958Z 2025-08-14T21:45:46.1896060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1896238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1896298Z return mod(**inputs) 2025-08-14T21:45:46.1896548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1896608Z outputs = self.model( 2025-08-14T21:45:46.1896854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1896920Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1897161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1897231Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1897434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1897506Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1897750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1897844Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1898088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1898176Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1898438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1898544Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1898547Z 2025-08-14T21:45:46.1898639Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1898827Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1898886Z return mod(**inputs) 2025-08-14T21:45:46.1899125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1899195Z outputs = self.model( 2025-08-14T21:45:46.1899435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1899506Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1899767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1899835Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1900041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1900111Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1900364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1900487Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1900730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1900812Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1900815Z 2025-08-14T21:45:46.1900910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1901093Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1901177Z return mod(**inputs) 2025-08-14T21:45:46.1901423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1901484Z outputs = self.model( 2025-08-14T21:45:46.1901737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1901804Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1902055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1902120Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1902324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1902402Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1902645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1902764Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1902768Z 2025-08-14T21:45:46.1902861Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1903043Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1903109Z return mod(**inputs) 2025-08-14T21:45:46.1903357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1903418Z outputs = self.model( 2025-08-14T21:45:46.1903671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1903736Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1903990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1904056Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1904261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1904339Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1904584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1904763Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1904965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1905028Z return self.act(input) 2025-08-14T21:45:46.1905032Z 2025-08-14T21:45:46.1905131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1905311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1905389Z return mod(**inputs) 2025-08-14T21:45:46.1905645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1905706Z outputs = self.model( 2025-08-14T21:45:46.1905973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1906042Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1906301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1906375Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1906579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1906648Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1906897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1906989Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1906993Z 2025-08-14T21:45:46.1907093Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1907276Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1907337Z return mod(**inputs) 2025-08-14T21:45:46.1907587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1907652Z outputs = self.model( 2025-08-14T21:45:46.1907902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1907969Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1908211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1908283Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1908487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1908558Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1908809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1908901Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1909149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1909286Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1909290Z 2025-08-14T21:45:46.1909382Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1909570Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1909630Z return mod(**inputs) 2025-08-14T21:45:46.1909881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1909941Z outputs = self.model( 2025-08-14T21:45:46.1910183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1910258Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1910497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1910561Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1910767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1910837Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1911099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1911193Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1911432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1911530Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1911534Z 2025-08-14T21:45:46.1911645Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1911830Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1911888Z return mod(**inputs) 2025-08-14T21:45:46.1912131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1912199Z outputs = self.model( 2025-08-14T21:45:46.1912442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1912522Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1912770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1912833Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1913039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1913112Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1913350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1913445Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1913682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1913766Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1913771Z 2025-08-14T21:45:46.1913842Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1913911Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1913986Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1914052Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1914144Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1914333Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1914393Z return mod(**inputs) 2025-08-14T21:45:46.1914638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1914699Z outputs = self.model( 2025-08-14T21:45:46.1914939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1915013Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1915254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1915321Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1915529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1915602Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1915843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1915932Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1916170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1916265Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1916543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1916668Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1916679Z 2025-08-14T21:45:46.1916771Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1916968Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1917035Z return mod(**inputs) 2025-08-14T21:45:46.1917303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1917365Z outputs = self.model( 2025-08-14T21:45:46.1917616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1917682Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1917933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1918043Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1918244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1918322Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1918565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1918655Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1918900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1918987Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1919255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1919354Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1919358Z 2025-08-14T21:45:46.1919449Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1919636Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1919694Z return mod(**inputs) 2025-08-14T21:45:46.1919941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1920003Z outputs = self.model( 2025-08-14T21:45:46.1920242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1920315Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1920556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1920620Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1920826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1920899Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1921144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1921233Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1921470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1921552Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1921555Z 2025-08-14T21:45:46.1921647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1921834Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1921893Z return mod(**inputs) 2025-08-14T21:45:46.1922151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1922224Z outputs = self.model( 2025-08-14T21:45:46.1922465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1922531Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1922794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1922877Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1923084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1923156Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1923395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 416, in forward 2025-08-14T21:45:46.1923477Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1923495Z 2025-08-14T21:45:46.1923588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1923774Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1923832Z return mod(**inputs) 2025-08-14T21:45:46.1924074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1924142Z outputs = self.model( 2025-08-14T21:45:46.1924383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1924447Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1924693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1924757Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1924964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1925036Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1925274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1925382Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1925624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1925769Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1925772Z 2025-08-14T21:45:46.1925863Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1926042Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1926108Z return mod(**inputs) 2025-08-14T21:45:46.1926353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1926416Z outputs = self.model( 2025-08-14T21:45:46.1926661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1926727Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1926976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1927042Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1927243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1927320Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1927559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1927679Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1927926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1927999Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1928002Z 2025-08-14T21:45:46.1928118Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1928301Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1928591Z return mod(**inputs) 2025-08-14T21:45:46.1928843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1928905Z outputs = self.model( 2025-08-14T21:45:46.1929152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1929217Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1929461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1929553Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1929755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1929828Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1930075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1930174Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1930419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1930497Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1930500Z 2025-08-14T21:45:46.1930572Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1930653Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1930725Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1930801Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1930894Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1931077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1931147Z return mod(**inputs) 2025-08-14T21:45:46.1931390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1931451Z outputs = self.model( 2025-08-14T21:45:46.1931700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1931767Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1932019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1932085Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1932286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1932364Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1932605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1932703Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1932949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1933037Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1933308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1933448Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1933453Z 2025-08-14T21:45:46.1933548Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1933736Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1933794Z return mod(**inputs) 2025-08-14T21:45:46.1934059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1934136Z outputs = self.model( 2025-08-14T21:45:46.1934377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1934450Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1934690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1934755Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1934964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1935051Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1935296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1935394Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1935636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1935731Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1935995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1936100Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1936104Z 2025-08-14T21:45:46.1936196Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1936378Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1936445Z return mod(**inputs) 2025-08-14T21:45:46.1936688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1936750Z outputs = self.model( 2025-08-14T21:45:46.1936997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1937064Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1937309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1937372Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1937574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1937651Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1937890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1937993Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1938233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1938309Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1938312Z 2025-08-14T21:45:46.1938411Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1938590Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1938646Z return mod(**inputs) 2025-08-14T21:45:46.1938894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1938970Z outputs = self.model( 2025-08-14T21:45:46.1939223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1939289Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1939555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1939629Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1939847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1939917Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1940167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1940276Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1940279Z 2025-08-14T21:45:46.1940379Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1940574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1940631Z return mod(**inputs) 2025-08-14T21:45:46.1940881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1940943Z outputs = self.model( 2025-08-14T21:45:46.1941193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1941261Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1941502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1941572Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1941774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1941846Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1942096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1942206Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1942407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1942473Z return self.act(input) 2025-08-14T21:45:46.1942477Z 2025-08-14T21:45:46.1942568Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1942756Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1942813Z return mod(**inputs) 2025-08-14T21:45:46.1943062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1943123Z outputs = self.model( 2025-08-14T21:45:46.1943364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1943437Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1943679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1943744Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1943954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1944024Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1944272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1944345Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1944349Z 2025-08-14T21:45:46.1944440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1944711Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1944782Z return mod(**inputs) 2025-08-14T21:45:46.1945036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1945114Z outputs = self.model( 2025-08-14T21:45:46.1945362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1945453Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1945704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1945771Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1945984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1946058Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1946328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1946418Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1946660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1946807Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1946812Z 2025-08-14T21:45:46.1946907Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1947097Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1947156Z return mod(**inputs) 2025-08-14T21:45:46.1947399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1947471Z outputs = self.model( 2025-08-14T21:45:46.1947713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1947777Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1948026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1948090Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1948298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1948367Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1948605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1948698Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1948939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1949012Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1949022Z 2025-08-14T21:45:46.1949112Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1949294Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1949360Z return mod(**inputs) 2025-08-14T21:45:46.1949602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1949662Z outputs = self.model( 2025-08-14T21:45:46.1949909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1949973Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1950234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1950304Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1950505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1950583Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1950840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1950946Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1951189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1951265Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1951269Z 2025-08-14T21:45:46.1951348Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1951421Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1951490Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1951567Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1951676Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1951857Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1951924Z return mod(**inputs) 2025-08-14T21:45:46.1952167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1952235Z outputs = self.model( 2025-08-14T21:45:46.1952476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1952542Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1952791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1952855Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1953063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1953137Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1953378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1953474Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1953713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1953804Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1954074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1954196Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1954199Z 2025-08-14T21:45:46.1954300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1954483Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1954541Z return mod(**inputs) 2025-08-14T21:45:46.1954792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1954853Z outputs = self.model( 2025-08-14T21:45:46.1955100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1955167Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1955410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1955482Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1955681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1955766Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1956018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1956106Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1956367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1956471Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1956734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1956839Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1956842Z 2025-08-14T21:45:46.1956933Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1957121Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1957182Z return mod(**inputs) 2025-08-14T21:45:46.1957443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1957512Z outputs = self.model( 2025-08-14T21:45:46.1957755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1957821Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1958076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1958143Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1958354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1958425Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1958667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1958766Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1959010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1959094Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1959097Z 2025-08-14T21:45:46.1959192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1959378Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1959444Z return mod(**inputs) 2025-08-14T21:45:46.1959688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1959747Z outputs = self.model( 2025-08-14T21:45:46.1959998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1960066Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1960315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1960380Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1960585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1960668Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1960909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1961008Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1961256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1961421Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1961427Z 2025-08-14T21:45:46.1961529Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1961710Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1961768Z return mod(**inputs) 2025-08-14T21:45:46.1962031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1962110Z outputs = self.model( 2025-08-14T21:45:46.1962361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1962425Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1962669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1962739Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1962943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1963032Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1963279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1963377Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1963625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1963699Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1963702Z 2025-08-14T21:45:46.1963794Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1963983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1964042Z return mod(**inputs) 2025-08-14T21:45:46.1964290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1964353Z outputs = self.model( 2025-08-14T21:45:46.1964594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1964666Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1964910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1964975Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1965185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1965257Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1965503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1965602Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1965843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1965926Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1965929Z 2025-08-14T21:45:46.1965998Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1966075Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1966144Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1966213Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1966310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1966490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1966547Z return mod(**inputs) 2025-08-14T21:45:46.1966796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1966871Z outputs = self.model( 2025-08-14T21:45:46.1967123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1967186Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1967442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1967515Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1967732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1967803Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1968050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1968146Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1968392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1968498Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1968760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1968886Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1968889Z 2025-08-14T21:45:46.1968982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1969167Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1969225Z return mod(**inputs) 2025-08-14T21:45:46.1969467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1969532Z outputs = self.model( 2025-08-14T21:45:46.1969770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1969836Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1970084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1970148Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1970356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1970428Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1970666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1970768Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1971006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1971100Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1971363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1971459Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1971462Z 2025-08-14T21:45:46.1971561Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1971740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1971799Z return mod(**inputs) 2025-08-14T21:45:46.1972048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1972109Z outputs = self.model( 2025-08-14T21:45:46.1972355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1972420Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1972677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1972755Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1972956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1973044Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1973310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1973406Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1973655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1973727Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1973731Z 2025-08-14T21:45:46.1973823Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1974029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1974088Z return mod(**inputs) 2025-08-14T21:45:46.1974335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1974397Z outputs = self.model( 2025-08-14T21:45:46.1974633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1974708Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1974947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1975012Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1975215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1975285Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1975531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 433, in forward 2025-08-14T21:45:46.1975604Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.1975608Z 2025-08-14T21:45:46.1975700Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1975889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1975949Z return mod(**inputs) 2025-08-14T21:45:46.1976197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1976258Z outputs = self.model( 2025-08-14T21:45:46.1976499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1976570Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1976811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1976879Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1977085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1977158Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1977406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1977516Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1977520Z 2025-08-14T21:45:46.1977611Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1977800Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1977859Z return mod(**inputs) 2025-08-14T21:45:46.1978119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1978182Z outputs = self.model( 2025-08-14T21:45:46.1978422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1978512Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1978752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1978833Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1979042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1979112Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1979357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.1979465Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.1979685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.1979755Z return self.act(input) 2025-08-14T21:45:46.1979759Z 2025-08-14T21:45:46.1979850Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1980038Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1980097Z return mod(**inputs) 2025-08-14T21:45:46.1980338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1980403Z outputs = self.model( 2025-08-14T21:45:46.1980642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1980705Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1980950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1981015Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1981220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1981291Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1981530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.1981611Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.1981615Z 2025-08-14T21:45:46.1981706Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1981886Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1981949Z return mod(**inputs) 2025-08-14T21:45:46.1982191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1982259Z outputs = self.model( 2025-08-14T21:45:46.1982499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1982564Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1982814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1982880Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1983087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1983158Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1983395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1983506Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1983750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1983887Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1983897Z 2025-08-14T21:45:46.1984005Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1984187Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1984270Z return mod(**inputs) 2025-08-14T21:45:46.1984518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1984816Z outputs = self.model( 2025-08-14T21:45:46.1985078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1985147Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1985401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1985505Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1985709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1985791Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1986035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1986128Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1986377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.1986449Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.1986453Z 2025-08-14T21:45:46.1986697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1986880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1986941Z return mod(**inputs) 2025-08-14T21:45:46.1987193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1987259Z outputs = self.model( 2025-08-14T21:45:46.1987511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1987581Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1987824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1987899Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1988102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1988176Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1988427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1988518Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1988770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.1988847Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.1988852Z 2025-08-14T21:45:46.1988924Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1989001Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1989070Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1989145Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.1989236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1989418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1989508Z return mod(**inputs) 2025-08-14T21:45:46.1989756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1989817Z outputs = self.model( 2025-08-14T21:45:46.1990086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1990155Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1990428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1990492Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1990695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1990773Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1991015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1991125Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1991377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1991466Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1991738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.1991860Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.1991863Z 2025-08-14T21:45:46.1991954Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1992140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1992198Z return mod(**inputs) 2025-08-14T21:45:46.1992446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1992508Z outputs = self.model( 2025-08-14T21:45:46.1992755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1992830Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1993078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1993146Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1993357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1993429Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1993679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1993769Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1994015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.1994111Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.1994379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.1994488Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.1994491Z 2025-08-14T21:45:46.1994587Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1994770Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1994838Z return mod(**inputs) 2025-08-14T21:45:46.1995080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1995157Z outputs = self.model( 2025-08-14T21:45:46.1995414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1995478Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1995752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1995819Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1996037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1996115Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1996357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.1996450Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.1996691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.1996782Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.1996785Z 2025-08-14T21:45:46.1996883Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1997065Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1997126Z return mod(**inputs) 2025-08-14T21:45:46.1997382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1997444Z outputs = self.model( 2025-08-14T21:45:46.1997697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.1997760Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.1998004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.1998078Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.1998286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.1998355Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.1998612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.1998710Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.1998962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.1999099Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.1999102Z 2025-08-14T21:45:46.1999194Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.1999385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.1999446Z return mod(**inputs) 2025-08-14T21:45:46.1999699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.1999760Z outputs = self.model( 2025-08-14T21:45:46.2000006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2000081Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2000325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2000390Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2000601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2000672Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2000941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2001044Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2001286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.2001381Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.2001385Z 2025-08-14T21:45:46.2001479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2001691Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2001749Z return mod(**inputs) 2025-08-14T21:45:46.2001987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2002054Z outputs = self.model( 2025-08-14T21:45:46.2002296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2002380Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2002627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2002693Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2002905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2002975Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2003216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2003319Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2003561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.2003644Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.2003649Z 2025-08-14T21:45:46.2003721Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2003792Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2003868Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2003934Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2004027Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2004215Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2004274Z return mod(**inputs) 2025-08-14T21:45:46.2004521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2004582Z outputs = self.model( 2025-08-14T21:45:46.2004822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2004896Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2005137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2005203Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2005412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2005484Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2005733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2005832Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2006074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2006170Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2006449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.2006583Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.2006587Z 2025-08-14T21:45:46.2006681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2006885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2006954Z return mod(**inputs) 2025-08-14T21:45:46.2007303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2007369Z outputs = self.model( 2025-08-14T21:45:46.2007626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2007693Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2007946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2008035Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2008244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2008323Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2008581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2008682Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2008931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2009020Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2009297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.2009395Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.2009400Z 2025-08-14T21:45:46.2009492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2009681Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2009740Z return mod(**inputs) 2025-08-14T21:45:46.2009991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2010052Z outputs = self.model( 2025-08-14T21:45:46.2010296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2010367Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2010607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2010678Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2010882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2010954Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2011201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2011299Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2011539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.2011621Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.2011625Z 2025-08-14T21:45:46.2011717Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2011904Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2011962Z return mod(**inputs) 2025-08-14T21:45:46.2012218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2012289Z outputs = self.model( 2025-08-14T21:45:46.2012529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2012592Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2012854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2012936Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2013145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2013215Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2013453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.2013570Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.2013590Z 2025-08-14T21:45:46.2013685Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2013872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2013932Z return mod(**inputs) 2025-08-14T21:45:46.2014175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2014246Z outputs = self.model( 2025-08-14T21:45:46.2014492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2014559Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2014810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2014875Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2015088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2015162Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2015405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.2015522Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.2015721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.2015792Z return self.act(input) 2025-08-14T21:45:46.2015796Z 2025-08-14T21:45:46.2015886Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2016067Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2016132Z return mod(**inputs) 2025-08-14T21:45:46.2016377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2016439Z outputs = self.model( 2025-08-14T21:45:46.2016689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2016753Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2017005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2017071Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2017274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2017351Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2017596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.2017668Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.2017692Z 2025-08-14T21:45:46.2017789Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2017968Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2018034Z return mod(**inputs) 2025-08-14T21:45:46.2018291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2018367Z outputs = self.model( 2025-08-14T21:45:46.2018615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2018680Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2018925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2018991Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2019198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2019297Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2019549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 442, in forward 2025-08-14T21:45:46.2019622Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.2019634Z 2025-08-14T21:45:46.2019729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2019918Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2019985Z return mod(**inputs) 2025-08-14T21:45:46.2020232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2020294Z outputs = self.model( 2025-08-14T21:45:46.2020552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2020621Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2020879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2020944Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2021153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2021234Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2021483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2021573Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2021826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.2021968Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.2021973Z 2025-08-14T21:45:46.2022074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2022265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2022323Z return mod(**inputs) 2025-08-14T21:45:46.2022583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2022646Z outputs = self.model( 2025-08-14T21:45:46.2022899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2022964Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2023213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2023285Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2023520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2023596Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2023847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2023964Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2024224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.2024316Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.2024320Z 2025-08-14T21:45:46.2024413Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2024610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2024739Z return mod(**inputs) 2025-08-14T21:45:46.2025000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2025082Z outputs = self.model( 2025-08-14T21:45:46.2025331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2025407Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2025657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2025725Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2025985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2026058Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2026315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2026407Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2026659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.2026746Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.2026750Z 2025-08-14T21:45:46.2026824Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2026900Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2026979Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2027049Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2027154Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2027340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2027402Z return mod(**inputs) 2025-08-14T21:45:46.2027660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2027723Z outputs = self.model( 2025-08-14T21:45:46.2027971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2028046Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2028295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2028371Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2028580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2028653Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2028907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2028997Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2029268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2029363Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2029637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.2029783Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.2029787Z 2025-08-14T21:45:46.2029887Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2030090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2030156Z return mod(**inputs) 2025-08-14T21:45:46.2030402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2030469Z outputs = self.model( 2025-08-14T21:45:46.2030715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2030799Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2031054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2031120Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2031335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2031409Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2031656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2031754Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2032000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2032090Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2032371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.2032473Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.2032477Z 2025-08-14T21:45:46.2032577Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2032767Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2032830Z return mod(**inputs) 2025-08-14T21:45:46.2033086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2033148Z outputs = self.model( 2025-08-14T21:45:46.2033402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2033469Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2033718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2033793Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2033999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2034072Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2034326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2034418Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2034683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.2034755Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.2034758Z 2025-08-14T21:45:46.2034850Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2035053Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2035116Z return mod(**inputs) 2025-08-14T21:45:46.2035362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2035424Z outputs = self.model( 2025-08-14T21:45:46.2035677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2035766Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2036012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2036077Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2036288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2036360Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2036624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2036722Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2036963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.2037108Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.2037113Z 2025-08-14T21:45:46.2037206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2037395Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2037454Z return mod(**inputs) 2025-08-14T21:45:46.2037697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2037766Z outputs = self.model( 2025-08-14T21:45:46.2038008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2038074Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2038322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2038387Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2038596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2038666Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2038907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2039011Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2039254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.2039333Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.2039337Z 2025-08-14T21:45:46.2039429Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2039609Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2039676Z return mod(**inputs) 2025-08-14T21:45:46.2039918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2039979Z outputs = self.model( 2025-08-14T21:45:46.2040227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2040292Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2040537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2040616Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2040821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2040900Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2041155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2041270Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2041517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.2041593Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.2041597Z 2025-08-14T21:45:46.2041675Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2041745Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2041813Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2041888Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2041999Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2042179Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2042245Z return mod(**inputs) 2025-08-14T21:45:46.2042488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2042559Z outputs = self.model( 2025-08-14T21:45:46.2042801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2042870Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2043121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2043186Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2043395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2043467Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2043707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2043811Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2044052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2044142Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2044412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.2044530Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.2044534Z 2025-08-14T21:45:46.2044633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2044814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2044873Z return mod(**inputs) 2025-08-14T21:45:46.2045123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2045185Z outputs = self.model( 2025-08-14T21:45:46.2045436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2045502Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2045743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2045814Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2046013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2046109Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2046360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2046456Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2046717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2046822Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2047086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.2047190Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.2047194Z 2025-08-14T21:45:46.2047285Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2047473Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2047551Z return mod(**inputs) 2025-08-14T21:45:46.2047794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2047863Z outputs = self.model( 2025-08-14T21:45:46.2048109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2048175Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2048427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2048491Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2048699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2048769Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2049012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2049115Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2049357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.2049436Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.2049439Z 2025-08-14T21:45:46.2049531Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2049714Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2049776Z return mod(**inputs) 2025-08-14T21:45:46.2050018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2050077Z outputs = self.model( 2025-08-14T21:45:46.2050328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2050394Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2050641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2050706Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2050909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2050987Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2051231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.2051346Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.2051349Z 2025-08-14T21:45:46.2051441Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2051639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2051708Z return mod(**inputs) 2025-08-14T21:45:46.2051952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2052012Z outputs = self.model( 2025-08-14T21:45:46.2052276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2052357Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2052608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2052672Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2052871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2052948Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2053187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.2053312Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.2053511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.2053572Z return self.act(input) 2025-08-14T21:45:46.2053577Z 2025-08-14T21:45:46.2053675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2053856Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2053913Z return mod(**inputs) 2025-08-14T21:45:46.2054161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2054221Z outputs = self.model( 2025-08-14T21:45:46.2054470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2054535Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2054772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2054843Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2055043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2055115Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2055357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.2055429Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.2055433Z 2025-08-14T21:45:46.2055530Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2055709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2055770Z return mod(**inputs) 2025-08-14T21:45:46.2056018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2056080Z outputs = self.model( 2025-08-14T21:45:46.2056325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2056390Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2056630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2056701Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2056901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2056972Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2057233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2057326Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2057572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.2057724Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.2057728Z 2025-08-14T21:45:46.2057822Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2058028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2058085Z return mod(**inputs) 2025-08-14T21:45:46.2058333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2058393Z outputs = self.model( 2025-08-14T21:45:46.2058633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2058722Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2058964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2059030Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2059238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2059310Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2059556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2059645Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2059883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.2059963Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.2059968Z 2025-08-14T21:45:46.2060061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2060246Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2060306Z return mod(**inputs) 2025-08-14T21:45:46.2060547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2060615Z outputs = self.model( 2025-08-14T21:45:46.2060856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2060921Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2061168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2061231Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2061439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2061510Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2061748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2061844Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2062082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.2062160Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.2062171Z 2025-08-14T21:45:46.2062243Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2062314Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2062389Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2062457Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2062549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2062753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2062815Z return mod(**inputs) 2025-08-14T21:45:46.2063056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2063121Z outputs = self.model( 2025-08-14T21:45:46.2063378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2063464Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2063706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2063768Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2063976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2064046Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2064316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2064436Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2064761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2064868Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2065153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.2065280Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.2065292Z 2025-08-14T21:45:46.2065389Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2065579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2065648Z return mod(**inputs) 2025-08-14T21:45:46.2065919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2065979Z outputs = self.model( 2025-08-14T21:45:46.2066232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2066298Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2066613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2066681Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2066892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2066974Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2067229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2067322Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2067586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2067679Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2067968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.2068074Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.2068078Z 2025-08-14T21:45:46.2068175Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2068373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2068435Z return mod(**inputs) 2025-08-14T21:45:46.2068720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2068788Z outputs = self.model( 2025-08-14T21:45:46.2069042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2069119Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2069391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2069476Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2069700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2069774Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2070035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 407, in forward 2025-08-14T21:45:46.2070130Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:45:46.2070400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.2070487Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.2070490Z 2025-08-14T21:45:46.2070587Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2070787Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2070851Z return mod(**inputs) 2025-08-14T21:45:46.2071103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2071173Z outputs = self.model( 2025-08-14T21:45:46.2071426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2071495Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2071757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2071826Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2072043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2072118Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2072371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 416, in forward 2025-08-14T21:45:46.2072455Z hidden_states = residual + hidden_states 2025-08-14T21:45:46.2072458Z 2025-08-14T21:45:46.2072555Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2072743Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2072813Z return mod(**inputs) 2025-08-14T21:45:46.2073068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2073138Z outputs = self.model( 2025-08-14T21:45:46.2073393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2073460Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2073721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2073790Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2074007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2074083Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2074341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2074447Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2074702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 215, in forward 2025-08-14T21:45:46.2074843Z query_states = self.q_proj(hidden_states).view(*q_input_shape).transpose(1, 2) 2025-08-14T21:45:46.2074854Z 2025-08-14T21:45:46.2074963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2075144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2075227Z return mod(**inputs) 2025-08-14T21:45:46.2075467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2075527Z outputs = self.model( 2025-08-14T21:45:46.2075777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2075840Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2076088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2076169Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2076371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2076449Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2076689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2076788Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2077034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 234, in forward 2025-08-14T21:45:46.2077107Z key_states = self.k_proj(current_states) 2025-08-14T21:45:46.2077110Z 2025-08-14T21:45:46.2077207Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2077390Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2077449Z return mod(**inputs) 2025-08-14T21:45:46.2077694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2077756Z outputs = self.model( 2025-08-14T21:45:46.2078002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2078067Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2078305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2078376Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2078573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2078645Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2078892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2078987Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2079231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 235, in forward 2025-08-14T21:45:46.2079308Z value_states = self.v_proj(current_states) 2025-08-14T21:45:46.2079312Z 2025-08-14T21:45:46.2079383Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2079461Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2079529Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2079595Z cudagraph partition due to non gpu ops 2025-08-14T21:45:46.2079693Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2079871Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2079956Z return mod(**inputs) 2025-08-14T21:45:46.2080204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2080264Z outputs = self.model( 2025-08-14T21:45:46.2080527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2080594Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2080853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2080923Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2081124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2081202Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2081446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2081562Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2081807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2081897Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2082167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 81, in sdpa_attention_forward 2025-08-14T21:45:46.2082289Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:45:46.2082292Z 2025-08-14T21:45:46.2082382Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2082569Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2082628Z return mod(**inputs) 2025-08-14T21:45:46.2082869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2082940Z outputs = self.model( 2025-08-14T21:45:46.2083181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2083253Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2083495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2083562Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2083770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2083840Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2084088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2084185Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2084427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 253, in forward 2025-08-14T21:45:46.2084523Z attn_output, attn_weights = attention_interface( 2025-08-14T21:45:46.2084950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/integrations/sdpa_attention.py", line 91, in sdpa_attention_forward 2025-08-14T21:45:46.2085058Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:45:46.2085070Z 2025-08-14T21:45:46.2085166Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2085351Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2085422Z return mod(**inputs) 2025-08-14T21:45:46.2085669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2085769Z outputs = self.model( 2025-08-14T21:45:46.2086028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2086095Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2086382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2086449Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2086682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2086763Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2087009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 424, in forward 2025-08-14T21:45:46.2087104Z hidden_states, cross_attn_weights = self.encoder_attn( 2025-08-14T21:45:46.2087356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 267, in forward 2025-08-14T21:45:46.2087455Z attn_output = self.out_proj(attn_output) 2025-08-14T21:45:46.2087459Z 2025-08-14T21:45:46.2087559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2087739Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2087799Z return mod(**inputs) 2025-08-14T21:45:46.2088051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2088113Z outputs = self.model( 2025-08-14T21:45:46.2088363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2088428Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2088669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2088742Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2088946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2089017Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2089267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.2089377Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.2089381Z 2025-08-14T21:45:46.2089479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2089662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2089721Z return mod(**inputs) 2025-08-14T21:45:46.2089970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2090032Z outputs = self.model( 2025-08-14T21:45:46.2090284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2090349Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2090593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2090666Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2090870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2090940Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2091190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 438, in forward 2025-08-14T21:45:46.2091296Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:45:46.2091511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:45:46.2091577Z return self.act(input) 2025-08-14T21:45:46.2091580Z 2025-08-14T21:45:46.2091674Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2091861Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2091935Z return mod(**inputs) 2025-08-14T21:45:46.2092178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1471, in forward 2025-08-14T21:45:46.2092260Z outputs = self.model( 2025-08-14T21:45:46.2092501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1297, in forward 2025-08-14T21:45:46.2092572Z decoder_outputs = self.decoder( 2025-08-14T21:45:46.2092812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1115, in forward 2025-08-14T21:45:46.2092878Z layer_outputs = decoder_layer( 2025-08-14T21:45:46.2093102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:45:46.2093172Z return super().__call__(*args, **kwargs) 2025-08-14T21:45:46.2093417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 440, in forward 2025-08-14T21:45:46.2093492Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:45:46.2093497Z 2025-08-14T21:45:46.2093589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2093773Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2093830Z return mod(**inputs) 2025-08-14T21:45:46.2094069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1489, in forward 2025-08-14T21:45:46.2094186Z lm_logits = self.lm_head(outputs[0]) + self.final_logits_bias 2025-08-14T21:45:46.2094191Z 2025-08-14T21:45:46.2094282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:45:46.2094467Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:45:46.2094527Z return mod(**inputs) 2025-08-14T21:45:46.2094766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/pegasus/modeling_pegasus.py", line 1494, in forward 2025-08-14T21:45:46.2094929Z masked_lm_loss = loss_fct(lm_logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:45:46.2094933Z 2025-08-14T21:45:56.4838639Z Compilation time (from dynamo_timed): 24.48749468 2025-08-14T21:45:56.4849237Z pass 2025-08-14T21:45:56.4851077Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:45:56.4851971Z TIMING: _recursive_pre_grad_passes:0.01264 _recursive_joint_graph_passes:1.01224 _recursive_post_grad_passes:0.15132 async_compile.wait:0.70683 code_gen:9.97845 inductor_compile:12.71456 backend_compile:19.33105 gc:0.00073 entire_frame_compile:24.48749 total_wall_time:24.48749 2025-08-14T21:45:56.4856084Z STATS: call_* op count: 965 | FakeTensorMode.__torch_dispatch__:33299 | FakeTensor.__torch_dispatch__:11840 | ProxyTorchDispatchMode.__torch_dispatch__:12299 2025-08-14T21:45:56.4858031Z Dynamo produced 1 graphs covering 965 ops with 0 graph breaks (0 unique) 2025-08-14T21:46:00.9882092Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:46:00.9883083Z from pkg_resources import resource_filename 2025-08-14T21:46:01.5203656Z 2025-08-14T21:46:01.5300356Z loading model: 0it [00:00, ?it/s]If you want to use `RobertaLMHeadModel` as a standalone, add `is_decoder=True.` 2025-08-14T21:46:01.5301162Z WARNING:transformers.models.roberta.modeling_roberta:If you want to use `RobertaLMHeadModel` as a standalone, add `is_decoder=True.` 2025-08-14T21:46:02.8216359Z We strongly recommend passing in an `attention_mask` since your input_ids may be padded. See https://huggingface.co/docs/transformers/troubleshooting#incorrect-output-when-padding-tokens-arent-masked. 2025-08-14T21:46:02.8217545Z You may ignore this warning if your `pad_token_id` (0) is identical to the `bos_token_id` (0), `eos_token_id` (2), or the `sep_token_id` (None), and your input is not padded. 2025-08-14T21:46:02.8218979Z WARNING:transformers.modeling_utils:We strongly recommend passing in an `attention_mask` since your input_ids may be padded. See https://huggingface.co/docs/transformers/troubleshooting#incorrect-output-when-padding-tokens-arent-masked. 2025-08-14T21:46:02.8220402Z You may ignore this warning if your `pad_token_id` (0) is identical to the `bos_token_id` (0), `eos_token_id` (2), or the `sep_token_id` (None), and your input is not padded. 2025-08-14T21:46:02.9651905Z 2025-08-14T21:46:02.9655590Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:46:02.9665464Z cpu eval RobertaForCausalLM 2025-08-14T21:46:03.4211320Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:03.6388405Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:03.9025370Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:10.6524852Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6526639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6531051Z return mod(**inputs) 2025-08-14T21:46:10.6533788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6534382Z outputs = self.roberta( 2025-08-14T21:46:10.6534857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 826, in forward 2025-08-14T21:46:10.6535737Z embedding_output = self.embeddings( 2025-08-14T21:46:10.6540232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 89, in forward 2025-08-14T21:46:10.6542092Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:46:10.6545201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1576, in create_position_ids_from_input_ids 2025-08-14T21:46:10.6545792Z mask = input_ids.ne(padding_idx).int() 2025-08-14T21:46:10.6549417Z 2025-08-14T21:46:10.6554077Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6556398Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6560806Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6564757Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6565032Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6565226Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6565433Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6565646Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6565836Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6566051Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6566252Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6566449Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6566669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6567026Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6567342Z return mod(**inputs) 2025-08-14T21:46:10.6567998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6568397Z outputs = self.roberta( 2025-08-14T21:46:10.6568760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 826, in forward 2025-08-14T21:46:10.6569141Z embedding_output = self.embeddings( 2025-08-14T21:46:10.6569569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 89, in forward 2025-08-14T21:46:10.6570117Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:46:10.6570677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1577, in create_position_ids_from_input_ids 2025-08-14T21:46:10.6571216Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:46:10.6571442Z 2025-08-14T21:46:10.6571546Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6571935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6572238Z return mod(**inputs) 2025-08-14T21:46:10.6572580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6572945Z outputs = self.roberta( 2025-08-14T21:46:10.6573293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 826, in forward 2025-08-14T21:46:10.6573663Z embedding_output = self.embeddings( 2025-08-14T21:46:10.6574026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 89, in forward 2025-08-14T21:46:10.6574510Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:46:10.6575055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1577, in create_position_ids_from_input_ids 2025-08-14T21:46:10.6575588Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:46:10.6575807Z 2025-08-14T21:46:10.6575907Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6576243Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6576544Z return mod(**inputs) 2025-08-14T21:46:10.6576887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6577242Z outputs = self.roberta( 2025-08-14T21:46:10.6577589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6577955Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6578311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6578907Z layer_outputs = layer_module( 2025-08-14T21:46:10.6579230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6579572Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6579939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6580314Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6580672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6581036Z return func(*args, **kwargs) 2025-08-14T21:46:10.6581390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6581775Z self_outputs = self.self( 2025-08-14T21:46:10.6582114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6582449Z return func(*args, **kwargs) 2025-08-14T21:46:10.6582827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6583328Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6583600Z 2025-08-14T21:46:10.6583704Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6584036Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6584339Z return mod(**inputs) 2025-08-14T21:46:10.6584906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6585292Z outputs = self.roberta( 2025-08-14T21:46:10.6585717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6586078Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6586442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6586797Z layer_outputs = layer_module( 2025-08-14T21:46:10.6587123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6587462Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6587833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6588203Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6588561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6588911Z return func(*args, **kwargs) 2025-08-14T21:46:10.6589258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6589618Z self_outputs = self.self( 2025-08-14T21:46:10.6589953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6590293Z return func(*args, **kwargs) 2025-08-14T21:46:10.6590636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6590994Z self.key(current_states) 2025-08-14T21:46:10.6591101Z 2025-08-14T21:46:10.6591203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6591535Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6591827Z return mod(**inputs) 2025-08-14T21:46:10.6592169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6592523Z outputs = self.roberta( 2025-08-14T21:46:10.6592861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6593224Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6593585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6593947Z layer_outputs = layer_module( 2025-08-14T21:46:10.6594261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6594596Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6594989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6595360Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6595718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6596060Z return func(*args, **kwargs) 2025-08-14T21:46:10.6596442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6596820Z self_outputs = self.self( 2025-08-14T21:46:10.6597150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6597487Z return func(*args, **kwargs) 2025-08-14T21:46:10.6597859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6598225Z self.value(current_states) 2025-08-14T21:46:10.6598332Z 2025-08-14T21:46:10.6598414Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6598651Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6598988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6599288Z return mod(**inputs) 2025-08-14T21:46:10.6599636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6599990Z outputs = self.roberta( 2025-08-14T21:46:10.6600332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6600693Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6601043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6601404Z layer_outputs = layer_module( 2025-08-14T21:46:10.6601725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6602057Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6602416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6602789Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6603140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6603482Z return func(*args, **kwargs) 2025-08-14T21:46:10.6603825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6604186Z self_outputs = self.self( 2025-08-14T21:46:10.6604517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6604849Z return func(*args, **kwargs) 2025-08-14T21:46:10.6605201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6605613Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6605782Z 2025-08-14T21:46:10.6605885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6606209Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6606509Z return mod(**inputs) 2025-08-14T21:46:10.6606848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6607199Z outputs = self.roberta( 2025-08-14T21:46:10.6607540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6607900Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6608274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6608633Z layer_outputs = layer_module( 2025-08-14T21:46:10.6608954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6609303Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6609671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6610056Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6610405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6610747Z return func(*args, **kwargs) 2025-08-14T21:46:10.6611089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6611522Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6611932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6612307Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6612436Z 2025-08-14T21:46:10.6612535Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6612869Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6613169Z return mod(**inputs) 2025-08-14T21:46:10.6613505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6613868Z outputs = self.roberta( 2025-08-14T21:46:10.6614214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6614577Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6614928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6615294Z layer_outputs = layer_module( 2025-08-14T21:46:10.6615615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6615950Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6616311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6616684Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6617056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6617412Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6617808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6618252Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6618658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6619027Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6619161Z 2025-08-14T21:46:10.6619261Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6619592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6619889Z return mod(**inputs) 2025-08-14T21:46:10.6620224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6620580Z outputs = self.roberta( 2025-08-14T21:46:10.6620945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6621301Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6621658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6622019Z layer_outputs = layer_module( 2025-08-14T21:46:10.6622353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6622697Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6623062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6623430Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6623792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6624150Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6624561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6625082Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6625493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6625899Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6626261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6626582Z return self.act(input) 2025-08-14T21:46:10.6626689Z 2025-08-14T21:46:10.6626786Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6627127Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6627436Z return mod(**inputs) 2025-08-14T21:46:10.6627777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6628147Z outputs = self.roberta( 2025-08-14T21:46:10.6628497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6628869Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6629225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6629595Z layer_outputs = layer_module( 2025-08-14T21:46:10.6629920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6630258Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6630623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6631003Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6631378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6631735Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6632125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6632578Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6632997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6633366Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6633502Z 2025-08-14T21:46:10.6633599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6633954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6634265Z return mod(**inputs) 2025-08-14T21:46:10.6634599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6634958Z outputs = self.roberta( 2025-08-14T21:46:10.6635319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6635710Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6636074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6636440Z layer_outputs = layer_module( 2025-08-14T21:46:10.6636761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6637090Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6637461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6637853Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6638207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6638559Z return func(*args, **kwargs) 2025-08-14T21:46:10.6638923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6639290Z self_outputs = self.self( 2025-08-14T21:46:10.6639623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6639972Z return func(*args, **kwargs) 2025-08-14T21:46:10.6640330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6640829Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6641077Z 2025-08-14T21:46:10.6641176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6641518Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6641823Z return mod(**inputs) 2025-08-14T21:46:10.6642166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6642535Z outputs = self.roberta( 2025-08-14T21:46:10.6642888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6643259Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6643618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6643984Z layer_outputs = layer_module( 2025-08-14T21:46:10.6644315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6644650Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6645027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6645409Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6645772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6646114Z return func(*args, **kwargs) 2025-08-14T21:46:10.6646477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6646846Z self_outputs = self.self( 2025-08-14T21:46:10.6647200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6647543Z return func(*args, **kwargs) 2025-08-14T21:46:10.6647897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6648258Z self.key(current_states) 2025-08-14T21:46:10.6648363Z 2025-08-14T21:46:10.6648475Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6648812Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6649130Z return mod(**inputs) 2025-08-14T21:46:10.6649469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6649820Z outputs = self.roberta( 2025-08-14T21:46:10.6650166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6650529Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6650877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6651262Z layer_outputs = layer_module( 2025-08-14T21:46:10.6651583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6651922Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6652287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6652664Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6653020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6653367Z return func(*args, **kwargs) 2025-08-14T21:46:10.6653714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6654078Z self_outputs = self.self( 2025-08-14T21:46:10.6654410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6654747Z return func(*args, **kwargs) 2025-08-14T21:46:10.6655102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6655470Z self.value(current_states) 2025-08-14T21:46:10.6655578Z 2025-08-14T21:46:10.6655658Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6655871Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6656201Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6656499Z return mod(**inputs) 2025-08-14T21:46:10.6656835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6657198Z outputs = self.roberta( 2025-08-14T21:46:10.6657547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6657910Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6658267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6658632Z layer_outputs = layer_module( 2025-08-14T21:46:10.6658951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6659278Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6659650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6660024Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6660402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6660744Z return func(*args, **kwargs) 2025-08-14T21:46:10.6661094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6661455Z self_outputs = self.self( 2025-08-14T21:46:10.6661804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6662767Z return func(*args, **kwargs) 2025-08-14T21:46:10.6663121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6663539Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6663711Z 2025-08-14T21:46:10.6663807Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6664156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6664485Z return mod(**inputs) 2025-08-14T21:46:10.6664901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6665262Z outputs = self.roberta( 2025-08-14T21:46:10.6665616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6665991Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6666345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6666705Z layer_outputs = layer_module( 2025-08-14T21:46:10.6667025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6667360Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6667723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6668095Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6668450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6668793Z return func(*args, **kwargs) 2025-08-14T21:46:10.6669137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6669554Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6669965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6670329Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6670465Z 2025-08-14T21:46:10.6670559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6670885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6671184Z return mod(**inputs) 2025-08-14T21:46:10.6671515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6671870Z outputs = self.roberta( 2025-08-14T21:46:10.6672213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6672567Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6672922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6673279Z layer_outputs = layer_module( 2025-08-14T21:46:10.6673595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6673941Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6674314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6674687Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6675073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6675429Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6675846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6676284Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6676682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6677069Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6677204Z 2025-08-14T21:46:10.6677299Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6677654Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6677947Z return mod(**inputs) 2025-08-14T21:46:10.6678292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6678655Z outputs = self.roberta( 2025-08-14T21:46:10.6679002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6679355Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6679717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6680079Z layer_outputs = layer_module( 2025-08-14T21:46:10.6680393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6680730Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6681096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6681469Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6681831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6682191Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6682583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6683015Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6683412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6683812Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6684167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6684477Z return self.act(input) 2025-08-14T21:46:10.6684703Z 2025-08-14T21:46:10.6684808Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6685151Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6685457Z return mod(**inputs) 2025-08-14T21:46:10.6685793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6686158Z outputs = self.roberta( 2025-08-14T21:46:10.6686505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6686865Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6687282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6687643Z layer_outputs = layer_module( 2025-08-14T21:46:10.6687958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6688280Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6688687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6689092Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6689459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6689812Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6690198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6690642Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6691077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6691450Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6691583Z 2025-08-14T21:46:10.6691681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6692012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6692306Z return mod(**inputs) 2025-08-14T21:46:10.6692652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6693010Z outputs = self.roberta( 2025-08-14T21:46:10.6693342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6693702Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6694059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6694417Z layer_outputs = layer_module( 2025-08-14T21:46:10.6694730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6695062Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6695431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6695803Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6696148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6696492Z return func(*args, **kwargs) 2025-08-14T21:46:10.6696846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6697200Z self_outputs = self.self( 2025-08-14T21:46:10.6697533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6697872Z return func(*args, **kwargs) 2025-08-14T21:46:10.6698225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6698712Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6698959Z 2025-08-14T21:46:10.6699054Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6699384Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6699677Z return mod(**inputs) 2025-08-14T21:46:10.6700024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6700390Z outputs = self.roberta( 2025-08-14T21:46:10.6700736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6701091Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6701470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6701850Z layer_outputs = layer_module( 2025-08-14T21:46:10.6702169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6702497Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6702865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6703234Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6703579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6703943Z return func(*args, **kwargs) 2025-08-14T21:46:10.6704298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6704661Z self_outputs = self.self( 2025-08-14T21:46:10.6705042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6705388Z return func(*args, **kwargs) 2025-08-14T21:46:10.6705738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6706097Z self.key(current_states) 2025-08-14T21:46:10.6706203Z 2025-08-14T21:46:10.6706298Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6706629Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6706930Z return mod(**inputs) 2025-08-14T21:46:10.6707264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6707625Z outputs = self.roberta( 2025-08-14T21:46:10.6707974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6708342Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6708695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6709056Z layer_outputs = layer_module( 2025-08-14T21:46:10.6709377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6709702Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6710070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6710440Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6710793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6711130Z return func(*args, **kwargs) 2025-08-14T21:46:10.6711480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6711847Z self_outputs = self.self( 2025-08-14T21:46:10.6712168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6712510Z return func(*args, **kwargs) 2025-08-14T21:46:10.6712859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6713244Z self.value(current_states) 2025-08-14T21:46:10.6713355Z 2025-08-14T21:46:10.6713432Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6713656Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6713993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6714308Z return mod(**inputs) 2025-08-14T21:46:10.6714648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6715028Z outputs = self.roberta( 2025-08-14T21:46:10.6715372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6715728Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6716090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6716455Z layer_outputs = layer_module( 2025-08-14T21:46:10.6716817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6717147Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6717518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6717889Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6718235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6718581Z return func(*args, **kwargs) 2025-08-14T21:46:10.6718936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6719297Z self_outputs = self.self( 2025-08-14T21:46:10.6719624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6719971Z return func(*args, **kwargs) 2025-08-14T21:46:10.6720323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6720739Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6720911Z 2025-08-14T21:46:10.6721006Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6721339Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6721636Z return mod(**inputs) 2025-08-14T21:46:10.6721968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6722327Z outputs = self.roberta( 2025-08-14T21:46:10.6722672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6723034Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6723385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6723747Z layer_outputs = layer_module( 2025-08-14T21:46:10.6724069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6724397Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6724766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6725137Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6725489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6725825Z return func(*args, **kwargs) 2025-08-14T21:46:10.6726194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6726610Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6727025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6727411Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6727548Z 2025-08-14T21:46:10.6727663Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6727997Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6728291Z return mod(**inputs) 2025-08-14T21:46:10.6728637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6728999Z outputs = self.roberta( 2025-08-14T21:46:10.6729346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6729729Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6730088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6730451Z layer_outputs = layer_module( 2025-08-14T21:46:10.6730763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6731100Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6731465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6731834Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6732198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6732560Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6732953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6733394Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6733794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6734170Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6734298Z 2025-08-14T21:46:10.6734403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6734723Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6735024Z return mod(**inputs) 2025-08-14T21:46:10.6735366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6735724Z outputs = self.roberta( 2025-08-14T21:46:10.6736063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6736428Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6736783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6737147Z layer_outputs = layer_module( 2025-08-14T21:46:10.6737458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6737792Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6738161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6738525Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6738892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6739286Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6739677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6740104Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6740520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6740937Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6741291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6741599Z return self.act(input) 2025-08-14T21:46:10.6741708Z 2025-08-14T21:46:10.6741803Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6742132Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6742426Z return mod(**inputs) 2025-08-14T21:46:10.6742794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6743154Z outputs = self.roberta( 2025-08-14T21:46:10.6743502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6743860Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6744219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6744584Z layer_outputs = layer_module( 2025-08-14T21:46:10.6744962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6745304Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6745678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6746057Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6746422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6746794Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6747194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6747651Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6748062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6748438Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6748567Z 2025-08-14T21:46:10.6748675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6749000Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6749304Z return mod(**inputs) 2025-08-14T21:46:10.6749643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6750003Z outputs = self.roberta( 2025-08-14T21:46:10.6750342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6750709Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6751066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6751421Z layer_outputs = layer_module( 2025-08-14T21:46:10.6751741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6752076Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6752462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6752833Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6753188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6753552Z return func(*args, **kwargs) 2025-08-14T21:46:10.6753914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6754287Z self_outputs = self.self( 2025-08-14T21:46:10.6754618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6754960Z return func(*args, **kwargs) 2025-08-14T21:46:10.6755299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6755786Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6756053Z 2025-08-14T21:46:10.6756148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6756484Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6756775Z return mod(**inputs) 2025-08-14T21:46:10.6757125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6757486Z outputs = self.roberta( 2025-08-14T21:46:10.6757828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6758186Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6758545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6758906Z layer_outputs = layer_module( 2025-08-14T21:46:10.6759218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6759550Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6759917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6760297Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6760642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6760987Z return func(*args, **kwargs) 2025-08-14T21:46:10.6761339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6761696Z self_outputs = self.self( 2025-08-14T21:46:10.6762033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6762376Z return func(*args, **kwargs) 2025-08-14T21:46:10.6762733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6763089Z self.key(current_states) 2025-08-14T21:46:10.6763202Z 2025-08-14T21:46:10.6763299Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6763631Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6763929Z return mod(**inputs) 2025-08-14T21:46:10.6764263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6764620Z outputs = self.roberta( 2025-08-14T21:46:10.6764962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6765339Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6765701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6766062Z layer_outputs = layer_module( 2025-08-14T21:46:10.6766400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6766730Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6767116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6767489Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6767833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6768174Z return func(*args, **kwargs) 2025-08-14T21:46:10.6768529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6768915Z self_outputs = self.self( 2025-08-14T21:46:10.6769240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6769584Z return func(*args, **kwargs) 2025-08-14T21:46:10.6769939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6770296Z self.value(current_states) 2025-08-14T21:46:10.6770412Z 2025-08-14T21:46:10.6770489Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6770712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6771043Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6771336Z return mod(**inputs) 2025-08-14T21:46:10.6771678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6772037Z outputs = self.roberta( 2025-08-14T21:46:10.6772374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6772734Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6773095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6773456Z layer_outputs = layer_module( 2025-08-14T21:46:10.6773766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6774097Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6774462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6774831Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6775176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6775521Z return func(*args, **kwargs) 2025-08-14T21:46:10.6775875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6776231Z self_outputs = self.self( 2025-08-14T21:46:10.6776562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6776906Z return func(*args, **kwargs) 2025-08-14T21:46:10.6777257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6777668Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6777844Z 2025-08-14T21:46:10.6777941Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6778291Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6778595Z return mod(**inputs) 2025-08-14T21:46:10.6778929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6779286Z outputs = self.roberta( 2025-08-14T21:46:10.6779650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6780067Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6780425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6780783Z layer_outputs = layer_module( 2025-08-14T21:46:10.6781100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6781428Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6781825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6782194Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6782537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6782876Z return func(*args, **kwargs) 2025-08-14T21:46:10.6783232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6783647Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6784049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6784422Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6784548Z 2025-08-14T21:46:10.6784857Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6785200Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6785496Z return mod(**inputs) 2025-08-14T21:46:10.6785839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6786205Z outputs = self.roberta( 2025-08-14T21:46:10.6786543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6786913Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6787273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6787636Z layer_outputs = layer_module( 2025-08-14T21:46:10.6787949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6788285Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6788658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6789027Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6789399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6789761Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6790160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6790591Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6791001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6791371Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6791544Z 2025-08-14T21:46:10.6791651Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6791974Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6792274Z return mod(**inputs) 2025-08-14T21:46:10.6792640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6793027Z outputs = self.roberta( 2025-08-14T21:46:10.6793382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6793757Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6794121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6794477Z layer_outputs = layer_module( 2025-08-14T21:46:10.6794808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6795169Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6795543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6795909Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6796278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6796639Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6797020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6797448Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6797850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6798255Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6798609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6798932Z return self.act(input) 2025-08-14T21:46:10.6799037Z 2025-08-14T21:46:10.6799145Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6799491Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6799785Z return mod(**inputs) 2025-08-14T21:46:10.6800131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6800502Z outputs = self.roberta( 2025-08-14T21:46:10.6800848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6801222Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6801592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6801964Z layer_outputs = layer_module( 2025-08-14T21:46:10.6802287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6802630Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6803008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6803383Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6803756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6804124Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6804538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6804989Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6805415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6805799Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6805946Z 2025-08-14T21:46:10.6806055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6806409Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6806717Z return mod(**inputs) 2025-08-14T21:46:10.6807062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6807427Z outputs = self.roberta( 2025-08-14T21:46:10.6807779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6808173Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6808540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6808903Z layer_outputs = layer_module( 2025-08-14T21:46:10.6809235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6809581Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6809954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6810341Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6810704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6811061Z return func(*args, **kwargs) 2025-08-14T21:46:10.6811419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6811797Z self_outputs = self.self( 2025-08-14T21:46:10.6812146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6812501Z return func(*args, **kwargs) 2025-08-14T21:46:10.6812860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6813368Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6813615Z 2025-08-14T21:46:10.6813721Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6814055Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6814364Z return mod(**inputs) 2025-08-14T21:46:10.6814716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6815093Z outputs = self.roberta( 2025-08-14T21:46:10.6815448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6815817Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6816176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6816540Z layer_outputs = layer_module( 2025-08-14T21:46:10.6816854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6817188Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6817555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6817935Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6818292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6818634Z return func(*args, **kwargs) 2025-08-14T21:46:10.6819007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6819363Z self_outputs = self.self( 2025-08-14T21:46:10.6819714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6820059Z return func(*args, **kwargs) 2025-08-14T21:46:10.6820407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6820769Z self.key(current_states) 2025-08-14T21:46:10.6820881Z 2025-08-14T21:46:10.6820976Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6821309Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6821616Z return mod(**inputs) 2025-08-14T21:46:10.6821955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6822314Z outputs = self.roberta( 2025-08-14T21:46:10.6822651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6823015Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6823374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6823734Z layer_outputs = layer_module( 2025-08-14T21:46:10.6824045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6824380Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6824818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6825202Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6825552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6825901Z return func(*args, **kwargs) 2025-08-14T21:46:10.6826258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6826613Z self_outputs = self.self( 2025-08-14T21:46:10.6826946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6827290Z return func(*args, **kwargs) 2025-08-14T21:46:10.6827647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6828010Z self.value(current_states) 2025-08-14T21:46:10.6828129Z 2025-08-14T21:46:10.6828205Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6828431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6828758Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6829057Z return mod(**inputs) 2025-08-14T21:46:10.6829399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6829754Z outputs = self.roberta( 2025-08-14T21:46:10.6830092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6830454Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6830828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6831185Z layer_outputs = layer_module( 2025-08-14T21:46:10.6831502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6831837Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6832218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6832616Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6832969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6833310Z return func(*args, **kwargs) 2025-08-14T21:46:10.6833659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6834013Z self_outputs = self.self( 2025-08-14T21:46:10.6834346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6834709Z return func(*args, **kwargs) 2025-08-14T21:46:10.6835054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6835476Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6835655Z 2025-08-14T21:46:10.6835754Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6836088Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6836379Z return mod(**inputs) 2025-08-14T21:46:10.6836728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6837090Z outputs = self.roberta( 2025-08-14T21:46:10.6837431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6837797Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6838155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6838517Z layer_outputs = layer_module( 2025-08-14T21:46:10.6838831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6839167Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6839533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6839904Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6840247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6840591Z return func(*args, **kwargs) 2025-08-14T21:46:10.6840944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6841354Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6841764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6842137Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6842268Z 2025-08-14T21:46:10.6842372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6842692Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6842992Z return mod(**inputs) 2025-08-14T21:46:10.6843332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6843690Z outputs = self.roberta( 2025-08-14T21:46:10.6844047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6844415Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6844771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6845139Z layer_outputs = layer_module( 2025-08-14T21:46:10.6845461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6845810Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6846181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6846554Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6846931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6847298Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6847704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6848138Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6848542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6848915Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6849041Z 2025-08-14T21:46:10.6849135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6849467Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6849767Z return mod(**inputs) 2025-08-14T21:46:10.6850105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6850455Z outputs = self.roberta( 2025-08-14T21:46:10.6850805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6851165Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6851516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6851877Z layer_outputs = layer_module( 2025-08-14T21:46:10.6852195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6852523Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6852881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6853251Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6853617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6853976Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6854356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6854787Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6855187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6855577Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6855930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6856239Z return self.act(input) 2025-08-14T21:46:10.6856341Z 2025-08-14T21:46:10.6856442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6856779Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6857082Z return mod(**inputs) 2025-08-14T21:46:10.6857423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6857772Z outputs = self.roberta( 2025-08-14T21:46:10.6858133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6858514Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6858876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6859233Z layer_outputs = layer_module( 2025-08-14T21:46:10.6859554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6859888Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6860259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6860645Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6861014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6861386Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6861768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6862217Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6862631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6862998Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6863124Z 2025-08-14T21:46:10.6863220Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6863555Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6863854Z return mod(**inputs) 2025-08-14T21:46:10.6864191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6864538Z outputs = self.roberta( 2025-08-14T21:46:10.6864973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6865348Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6865697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6866063Z layer_outputs = layer_module( 2025-08-14T21:46:10.6866382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6866718Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6867083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6867462Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6867818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6868157Z return func(*args, **kwargs) 2025-08-14T21:46:10.6868514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6868879Z self_outputs = self.self( 2025-08-14T21:46:10.6869210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6869545Z return func(*args, **kwargs) 2025-08-14T21:46:10.6869916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6870408Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6870650Z 2025-08-14T21:46:10.6870753Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6871092Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6871391Z return mod(**inputs) 2025-08-14T21:46:10.6871750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6872099Z outputs = self.roberta( 2025-08-14T21:46:10.6872444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6872804Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6873162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6873536Z layer_outputs = layer_module( 2025-08-14T21:46:10.6873853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6874186Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6874546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6874919Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6875273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6875614Z return func(*args, **kwargs) 2025-08-14T21:46:10.6875959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6876317Z self_outputs = self.self( 2025-08-14T21:46:10.6876649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6876998Z return func(*args, **kwargs) 2025-08-14T21:46:10.6877343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6877706Z self.key(current_states) 2025-08-14T21:46:10.6877814Z 2025-08-14T21:46:10.6877920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6878243Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6878543Z return mod(**inputs) 2025-08-14T21:46:10.6878883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6879238Z outputs = self.roberta( 2025-08-14T21:46:10.6879577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6879939Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6880295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6880647Z layer_outputs = layer_module( 2025-08-14T21:46:10.6880965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6881302Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6881668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6882033Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6882383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6882724Z return func(*args, **kwargs) 2025-08-14T21:46:10.6883092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6883452Z self_outputs = self.self( 2025-08-14T21:46:10.6883780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6884157Z return func(*args, **kwargs) 2025-08-14T21:46:10.6884502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6885045Z self.value(current_states) 2025-08-14T21:46:10.6885169Z 2025-08-14T21:46:10.6885249Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6885480Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6885826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6886130Z return mod(**inputs) 2025-08-14T21:46:10.6886478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6886875Z outputs = self.roberta( 2025-08-14T21:46:10.6887221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6887585Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6887941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6888298Z layer_outputs = layer_module( 2025-08-14T21:46:10.6888618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6888951Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6889312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6889690Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6890047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6890389Z return func(*args, **kwargs) 2025-08-14T21:46:10.6890738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6891102Z self_outputs = self.self( 2025-08-14T21:46:10.6891438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6891776Z return func(*args, **kwargs) 2025-08-14T21:46:10.6892118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6892533Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6892702Z 2025-08-14T21:46:10.6892809Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6893134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6893435Z return mod(**inputs) 2025-08-14T21:46:10.6893781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6894139Z outputs = self.roberta( 2025-08-14T21:46:10.6894480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6894843Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6895204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6895555Z layer_outputs = layer_module( 2025-08-14T21:46:10.6895871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6896235Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6896611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6896978Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6897353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6897720Z return func(*args, **kwargs) 2025-08-14T21:46:10.6898068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6898473Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6898881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6899255Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6899383Z 2025-08-14T21:46:10.6899498Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6899829Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6900127Z return mod(**inputs) 2025-08-14T21:46:10.6900465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6900817Z outputs = self.roberta( 2025-08-14T21:46:10.6901157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6901514Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6901864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6902213Z layer_outputs = layer_module( 2025-08-14T21:46:10.6902530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6902860Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6903215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6903582Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6903951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6904308Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6904687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6905177Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6905589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6905966Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6906095Z 2025-08-14T21:46:10.6906192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6906525Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6906830Z return mod(**inputs) 2025-08-14T21:46:10.6907170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6907535Z outputs = self.roberta( 2025-08-14T21:46:10.6907885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6908252Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6908606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6908971Z layer_outputs = layer_module( 2025-08-14T21:46:10.6909311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6909648Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6910026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6910426Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6910810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6911161Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6911549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6911981Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6912383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6912790Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6913139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6913452Z return self.act(input) 2025-08-14T21:46:10.6913557Z 2025-08-14T21:46:10.6913651Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6913988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6914286Z return mod(**inputs) 2025-08-14T21:46:10.6914629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6914982Z outputs = self.roberta( 2025-08-14T21:46:10.6915326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6915692Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6916045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6916405Z layer_outputs = layer_module( 2025-08-14T21:46:10.6916726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6917056Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6917420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6917793Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6918159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6918519Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6918904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6919350Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6919760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6920124Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6920263Z 2025-08-14T21:46:10.6920358Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6920688Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6920749Z return mod(**inputs) 2025-08-14T21:46:10.6920996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6921058Z outputs = self.roberta( 2025-08-14T21:46:10.6921311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6921390Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6921629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6921700Z layer_outputs = layer_module( 2025-08-14T21:46:10.6921918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6922009Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6922255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6922333Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6922554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6922629Z return func(*args, **kwargs) 2025-08-14T21:46:10.6922885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6922958Z self_outputs = self.self( 2025-08-14T21:46:10.6923179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6923246Z return func(*args, **kwargs) 2025-08-14T21:46:10.6923495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6923688Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6923691Z 2025-08-14T21:46:10.6923795Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6923979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6924041Z return mod(**inputs) 2025-08-14T21:46:10.6924287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6924349Z outputs = self.roberta( 2025-08-14T21:46:10.6924588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6924663Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6924901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6924974Z layer_outputs = layer_module( 2025-08-14T21:46:10.6925176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6925250Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6925498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6925576Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6925805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6925869Z return func(*args, **kwargs) 2025-08-14T21:46:10.6926110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6926189Z self_outputs = self.self( 2025-08-14T21:46:10.6926411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6926473Z return func(*args, **kwargs) 2025-08-14T21:46:10.6926719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6926783Z self.key(current_states) 2025-08-14T21:46:10.6926786Z 2025-08-14T21:46:10.6926916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6927101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6927160Z return mod(**inputs) 2025-08-14T21:46:10.6927418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6927481Z outputs = self.roberta( 2025-08-14T21:46:10.6927736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6927809Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6928046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6928118Z layer_outputs = layer_module( 2025-08-14T21:46:10.6928318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6928392Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6928656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6928732Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6928963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6929028Z return func(*args, **kwargs) 2025-08-14T21:46:10.6929266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6929339Z self_outputs = self.self( 2025-08-14T21:46:10.6929560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6929624Z return func(*args, **kwargs) 2025-08-14T21:46:10.6929875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6929943Z self.value(current_states) 2025-08-14T21:46:10.6929947Z 2025-08-14T21:46:10.6930029Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6930125Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6930309Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6930378Z return mod(**inputs) 2025-08-14T21:46:10.6930616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6930678Z outputs = self.roberta( 2025-08-14T21:46:10.6930924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6930988Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6931235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6931301Z layer_outputs = layer_module( 2025-08-14T21:46:10.6931504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6931583Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6931823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6931906Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6932129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6932192Z return func(*args, **kwargs) 2025-08-14T21:46:10.6932439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6932501Z self_outputs = self.self( 2025-08-14T21:46:10.6932736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6932810Z return func(*args, **kwargs) 2025-08-14T21:46:10.6933050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6933195Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6933213Z 2025-08-14T21:46:10.6933310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6933491Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6933557Z return mod(**inputs) 2025-08-14T21:46:10.6933795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6933864Z outputs = self.roberta( 2025-08-14T21:46:10.6934107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6934191Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6934434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6934500Z layer_outputs = layer_module( 2025-08-14T21:46:10.6934700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6934781Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6935017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6935097Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6935313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6935376Z return func(*args, **kwargs) 2025-08-14T21:46:10.6935620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6935736Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6935975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6936059Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6936062Z 2025-08-14T21:46:10.6936156Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6936342Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6936401Z return mod(**inputs) 2025-08-14T21:46:10.6936637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6936708Z outputs = self.roberta( 2025-08-14T21:46:10.6936944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6937017Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6937256Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6937320Z layer_outputs = layer_module( 2025-08-14T21:46:10.6937529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6937598Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6937835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6937919Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6938167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6938248Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6938517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6938627Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6938888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6938983Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6938986Z 2025-08-14T21:46:10.6939089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6939270Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6939331Z return mod(**inputs) 2025-08-14T21:46:10.6939576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6939656Z outputs = self.roberta( 2025-08-14T21:46:10.6939894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6939967Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6940208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6940283Z layer_outputs = layer_module( 2025-08-14T21:46:10.6940484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6940555Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6940799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6940875Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6941117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6941187Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6941456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6941574Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6941813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6941915Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6942115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6942180Z return self.act(input) 2025-08-14T21:46:10.6942184Z 2025-08-14T21:46:10.6942286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6942471Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6942532Z return mod(**inputs) 2025-08-14T21:46:10.6942780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6942843Z outputs = self.roberta( 2025-08-14T21:46:10.6943088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6943155Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6943392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6943466Z layer_outputs = layer_module( 2025-08-14T21:46:10.6943667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6943755Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6944005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6944081Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6944339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6944410Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6944699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6944896Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6945145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6945231Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6945234Z 2025-08-14T21:46:10.6945336Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6945558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6945628Z return mod(**inputs) 2025-08-14T21:46:10.6945872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6945935Z outputs = self.roberta( 2025-08-14T21:46:10.6946188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6946253Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6946502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6946568Z layer_outputs = layer_module( 2025-08-14T21:46:10.6946776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6946858Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6947099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6947175Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6947406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6947470Z return func(*args, **kwargs) 2025-08-14T21:46:10.6947714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6947777Z self_outputs = self.self( 2025-08-14T21:46:10.6947997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6948069Z return func(*args, **kwargs) 2025-08-14T21:46:10.6948308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6948508Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6948512Z 2025-08-14T21:46:10.6948608Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6948792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6948862Z return mod(**inputs) 2025-08-14T21:46:10.6949100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6949162Z outputs = self.roberta( 2025-08-14T21:46:10.6949405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6949470Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6949735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6949804Z layer_outputs = layer_module( 2025-08-14T21:46:10.6950005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6950099Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6950339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6950441Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6950659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6950721Z return func(*args, **kwargs) 2025-08-14T21:46:10.6950962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6951028Z self_outputs = self.self( 2025-08-14T21:46:10.6951263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6951333Z return func(*args, **kwargs) 2025-08-14T21:46:10.6951573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6951645Z self.key(current_states) 2025-08-14T21:46:10.6951650Z 2025-08-14T21:46:10.6951743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6951927Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6951996Z return mod(**inputs) 2025-08-14T21:46:10.6952236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6952307Z outputs = self.roberta( 2025-08-14T21:46:10.6952547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6952614Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6952861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6952927Z layer_outputs = layer_module( 2025-08-14T21:46:10.6953130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6953211Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6953449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6953532Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6953749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6953812Z return func(*args, **kwargs) 2025-08-14T21:46:10.6954060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6954122Z self_outputs = self.self( 2025-08-14T21:46:10.6954343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6954412Z return func(*args, **kwargs) 2025-08-14T21:46:10.6954651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6954724Z self.value(current_states) 2025-08-14T21:46:10.6954728Z 2025-08-14T21:46:10.6954802Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6954895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6955084Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6955159Z return mod(**inputs) 2025-08-14T21:46:10.6955400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6955469Z outputs = self.roberta( 2025-08-14T21:46:10.6955721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6955797Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6956050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6956115Z layer_outputs = layer_module( 2025-08-14T21:46:10.6956323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6956394Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6956638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6956729Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6956946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6957015Z return func(*args, **kwargs) 2025-08-14T21:46:10.6957252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6957317Z self_outputs = self.self( 2025-08-14T21:46:10.6957542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6957604Z return func(*args, **kwargs) 2025-08-14T21:46:10.6957846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6957968Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6957973Z 2025-08-14T21:46:10.6958069Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6958255Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6958315Z return mod(**inputs) 2025-08-14T21:46:10.6958560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6958623Z outputs = self.roberta( 2025-08-14T21:46:10.6958859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6958933Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6959171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6959236Z layer_outputs = layer_module( 2025-08-14T21:46:10.6959444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6959517Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6959763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6959837Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6960057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6960128Z return func(*args, **kwargs) 2025-08-14T21:46:10.6960365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6960482Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6960726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6960818Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6960823Z 2025-08-14T21:46:10.6960925Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6961106Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6961166Z return mod(**inputs) 2025-08-14T21:46:10.6961429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6961509Z outputs = self.roberta( 2025-08-14T21:46:10.6961757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6961824Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6962066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6962137Z layer_outputs = layer_module( 2025-08-14T21:46:10.6962344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6962433Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6962677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6962754Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6962995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6963066Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6963333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6963449Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6963688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6963770Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6963773Z 2025-08-14T21:46:10.6963866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6964047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6964113Z return mod(**inputs) 2025-08-14T21:46:10.6964353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6964415Z outputs = self.roberta( 2025-08-14T21:46:10.6964661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6964727Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6964971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6965036Z layer_outputs = layer_module( 2025-08-14T21:46:10.6965238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6965316Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6965553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6965634Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6965869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6965938Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6966212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6966321Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6966574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6966688Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6966885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6966972Z return self.act(input) 2025-08-14T21:46:10.6966976Z 2025-08-14T21:46:10.6967097Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6967280Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6967349Z return mod(**inputs) 2025-08-14T21:46:10.6967586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6967657Z outputs = self.roberta( 2025-08-14T21:46:10.6967896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6967980Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6968226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6968293Z layer_outputs = layer_module( 2025-08-14T21:46:10.6968499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6968580Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6968821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6968904Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6969151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6969221Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6969523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6969653Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6969914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6969993Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6969998Z 2025-08-14T21:46:10.6970106Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6970293Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6970352Z return mod(**inputs) 2025-08-14T21:46:10.6970590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6970658Z outputs = self.roberta( 2025-08-14T21:46:10.6970897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6970970Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6971212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6971279Z layer_outputs = layer_module( 2025-08-14T21:46:10.6971494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6971570Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6971824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6971901Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6972150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6972228Z return func(*args, **kwargs) 2025-08-14T21:46:10.6972471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6972536Z self_outputs = self.self( 2025-08-14T21:46:10.6972787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6972871Z return func(*args, **kwargs) 2025-08-14T21:46:10.6973129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6973328Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6973332Z 2025-08-14T21:46:10.6973428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6973627Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6973707Z return mod(**inputs) 2025-08-14T21:46:10.6973961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6974025Z outputs = self.roberta( 2025-08-14T21:46:10.6974274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6974350Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6974594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6974660Z layer_outputs = layer_module( 2025-08-14T21:46:10.6974875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6974949Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6975202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6975282Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6975512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6975584Z return func(*args, **kwargs) 2025-08-14T21:46:10.6975833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6975900Z self_outputs = self.self( 2025-08-14T21:46:10.6976138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6976200Z return func(*args, **kwargs) 2025-08-14T21:46:10.6976452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.6976519Z self.key(current_states) 2025-08-14T21:46:10.6976523Z 2025-08-14T21:46:10.6976622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6976816Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6976878Z return mod(**inputs) 2025-08-14T21:46:10.6977133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6977196Z outputs = self.roberta( 2025-08-14T21:46:10.6977442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6977516Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6977762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6977826Z layer_outputs = layer_module( 2025-08-14T21:46:10.6978057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6978134Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6978386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6978464Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6978705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6978795Z return func(*args, **kwargs) 2025-08-14T21:46:10.6979045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6979111Z self_outputs = self.self( 2025-08-14T21:46:10.6979351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6979415Z return func(*args, **kwargs) 2025-08-14T21:46:10.6979675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.6979760Z self.value(current_states) 2025-08-14T21:46:10.6979763Z 2025-08-14T21:46:10.6979839Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.6979947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6980131Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6980202Z return mod(**inputs) 2025-08-14T21:46:10.6980448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6980510Z outputs = self.roberta( 2025-08-14T21:46:10.6980761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6980828Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6981073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6981151Z layer_outputs = layer_module( 2025-08-14T21:46:10.6981356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6981438Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6981682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6981760Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6981991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6982055Z return func(*args, **kwargs) 2025-08-14T21:46:10.6982303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6982383Z self_outputs = self.self( 2025-08-14T21:46:10.6982609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6982680Z return func(*args, **kwargs) 2025-08-14T21:46:10.6982928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.6983054Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.6983058Z 2025-08-14T21:46:10.6983162Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6983346Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6983414Z return mod(**inputs) 2025-08-14T21:46:10.6983657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6983736Z outputs = self.roberta( 2025-08-14T21:46:10.6984020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6984086Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6984340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6984413Z layer_outputs = layer_module( 2025-08-14T21:46:10.6984820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6984905Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6985150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6985226Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6985457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6985566Z return func(*args, **kwargs) 2025-08-14T21:46:10.6985817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.6985939Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.6986187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.6986286Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6986290Z 2025-08-14T21:46:10.6986385Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6986568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6986634Z return mod(**inputs) 2025-08-14T21:46:10.6986873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6986943Z outputs = self.roberta( 2025-08-14T21:46:10.6987180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6987246Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6987490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6987558Z layer_outputs = layer_module( 2025-08-14T21:46:10.6987759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6987839Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6988076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6988163Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6988401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6988473Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6988750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6988862Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6989112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.6989188Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6989191Z 2025-08-14T21:46:10.6989286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6989475Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6989535Z return mod(**inputs) 2025-08-14T21:46:10.6989805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6989871Z outputs = self.roberta( 2025-08-14T21:46:10.6990110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6990182Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6990460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6990548Z layer_outputs = layer_module( 2025-08-14T21:46:10.6990762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6990833Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6991078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6991156Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6991410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6991488Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6991756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.6991864Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.6992110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.6992214Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.6992414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.6992477Z return self.act(input) 2025-08-14T21:46:10.6992480Z 2025-08-14T21:46:10.6992576Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6992767Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6992827Z return mod(**inputs) 2025-08-14T21:46:10.6993073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6993137Z outputs = self.roberta( 2025-08-14T21:46:10.6993386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6993463Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6993718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6993787Z layer_outputs = layer_module( 2025-08-14T21:46:10.6994011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6994095Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6994343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.6994418Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.6994653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.6994732Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.6994999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.6995128Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.6995365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.6995562Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.6995566Z 2025-08-14T21:46:10.6995669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6995852Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6995921Z return mod(**inputs) 2025-08-14T21:46:10.6996179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6996261Z outputs = self.roberta( 2025-08-14T21:46:10.6996529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6996596Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.6996841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.6996916Z layer_outputs = layer_module( 2025-08-14T21:46:10.6997124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.6997224Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.6997464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.6997542Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.6997774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6997840Z return func(*args, **kwargs) 2025-08-14T21:46:10.6998078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.6998149Z self_outputs = self.self( 2025-08-14T21:46:10.6998370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.6998441Z return func(*args, **kwargs) 2025-08-14T21:46:10.6998681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.6998871Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.6998875Z 2025-08-14T21:46:10.6998978Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.6999159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.6999228Z return mod(**inputs) 2025-08-14T21:46:10.6999466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.6999527Z outputs = self.roberta( 2025-08-14T21:46:10.6999773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.6999841Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7000083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7000154Z layer_outputs = layer_module( 2025-08-14T21:46:10.7000356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7000434Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7000676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7000750Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7000976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7001039Z return func(*args, **kwargs) 2025-08-14T21:46:10.7001302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7001370Z self_outputs = self.self( 2025-08-14T21:46:10.7001593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7001662Z return func(*args, **kwargs) 2025-08-14T21:46:10.7001920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.7002003Z self.key(current_states) 2025-08-14T21:46:10.7002006Z 2025-08-14T21:46:10.7002110Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7002291Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7002358Z return mod(**inputs) 2025-08-14T21:46:10.7002596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7002660Z outputs = self.roberta( 2025-08-14T21:46:10.7002925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7002990Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7003232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7003303Z layer_outputs = layer_module( 2025-08-14T21:46:10.7003507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7003586Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7003825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7003898Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7004129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7004192Z return func(*args, **kwargs) 2025-08-14T21:46:10.7004441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7004503Z self_outputs = self.self( 2025-08-14T21:46:10.7004725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7004793Z return func(*args, **kwargs) 2025-08-14T21:46:10.7005032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.7005096Z self.value(current_states) 2025-08-14T21:46:10.7005099Z 2025-08-14T21:46:10.7005179Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.7005270Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7005460Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7005519Z return mod(**inputs) 2025-08-14T21:46:10.7005762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7005831Z outputs = self.roberta( 2025-08-14T21:46:10.7006076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7006143Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7006390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7006455Z layer_outputs = layer_module( 2025-08-14T21:46:10.7006664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7006734Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7006988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7007072Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7007289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7007372Z return func(*args, **kwargs) 2025-08-14T21:46:10.7007613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7007694Z self_outputs = self.self( 2025-08-14T21:46:10.7007922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7007984Z return func(*args, **kwargs) 2025-08-14T21:46:10.7008222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.7008351Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.7008369Z 2025-08-14T21:46:10.7008466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7008650Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7008709Z return mod(**inputs) 2025-08-14T21:46:10.7008947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7009018Z outputs = self.roberta( 2025-08-14T21:46:10.7009257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7009329Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7009568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7009631Z layer_outputs = layer_module( 2025-08-14T21:46:10.7009841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7009915Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7010151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7010235Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7010454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7010526Z return func(*args, **kwargs) 2025-08-14T21:46:10.7010764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.7010881Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.7011128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.7011205Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7011208Z 2025-08-14T21:46:10.7011308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7011488Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7011549Z return mod(**inputs) 2025-08-14T21:46:10.7011792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7011855Z outputs = self.roberta( 2025-08-14T21:46:10.7012089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7012161Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7012399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7012493Z layer_outputs = layer_module( 2025-08-14T21:46:10.7012698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7012771Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7013031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7013111Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7013364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7013441Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7013709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.7013826Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.7014066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.7014158Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7014161Z 2025-08-14T21:46:10.7014262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7014445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7014511Z return mod(**inputs) 2025-08-14T21:46:10.7014749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7014811Z outputs = self.roberta( 2025-08-14T21:46:10.7015055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7015120Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7015357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7015429Z layer_outputs = layer_module( 2025-08-14T21:46:10.7015630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7015708Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7015945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7016021Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7016262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7016331Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7016603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.7016712Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.7016951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.7017061Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.7017258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.7017322Z return self.act(input) 2025-08-14T21:46:10.7017334Z 2025-08-14T21:46:10.7017428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7017607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7017673Z return mod(**inputs) 2025-08-14T21:46:10.7017911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7017971Z outputs = self.roberta( 2025-08-14T21:46:10.7018230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7018299Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7018545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7018626Z layer_outputs = layer_module( 2025-08-14T21:46:10.7018832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7018926Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7019163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7019238Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7019479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7019549Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7019841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.7019963Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.7020205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.7020287Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7020291Z 2025-08-14T21:46:10.7020383Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7020571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7020630Z return mod(**inputs) 2025-08-14T21:46:10.7020869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7020939Z outputs = self.roberta( 2025-08-14T21:46:10.7021183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7021249Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7021498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7021562Z layer_outputs = layer_module( 2025-08-14T21:46:10.7021774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7021844Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7022082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7022163Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7022387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7022452Z return func(*args, **kwargs) 2025-08-14T21:46:10.7022701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7022763Z self_outputs = self.self( 2025-08-14T21:46:10.7022993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7023058Z return func(*args, **kwargs) 2025-08-14T21:46:10.7023298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.7023497Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.7023500Z 2025-08-14T21:46:10.7023594Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7023798Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7023862Z return mod(**inputs) 2025-08-14T21:46:10.7024104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7024173Z outputs = self.roberta( 2025-08-14T21:46:10.7024441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7024525Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7024836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7024906Z layer_outputs = layer_module( 2025-08-14T21:46:10.7025114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7025186Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7025429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7025534Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7025754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7025831Z return func(*args, **kwargs) 2025-08-14T21:46:10.7026069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7026136Z self_outputs = self.self( 2025-08-14T21:46:10.7026365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7026429Z return func(*args, **kwargs) 2025-08-14T21:46:10.7026666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.7026742Z self.key(current_states) 2025-08-14T21:46:10.7026747Z 2025-08-14T21:46:10.7026841Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7027029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7027090Z return mod(**inputs) 2025-08-14T21:46:10.7027326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7027399Z outputs = self.roberta( 2025-08-14T21:46:10.7027635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7027700Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7027943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7028006Z layer_outputs = layer_module( 2025-08-14T21:46:10.7028214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7028286Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7028521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7028603Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7028822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7028891Z return func(*args, **kwargs) 2025-08-14T21:46:10.7029127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7029190Z self_outputs = self.self( 2025-08-14T21:46:10.7029412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7029491Z return func(*args, **kwargs) 2025-08-14T21:46:10.7029737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.7029810Z self.value(current_states) 2025-08-14T21:46:10.7029813Z 2025-08-14T21:46:10.7029887Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.7030731Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7030939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7031000Z return mod(**inputs) 2025-08-14T21:46:10.7031245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7031307Z outputs = self.roberta( 2025-08-14T21:46:10.7031549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7031623Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7031889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7031962Z layer_outputs = layer_module( 2025-08-14T21:46:10.7032164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7032235Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7032484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7032558Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7032785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7032846Z return func(*args, **kwargs) 2025-08-14T21:46:10.7033087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7033155Z self_outputs = self.self( 2025-08-14T21:46:10.7033373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7033434Z return func(*args, **kwargs) 2025-08-14T21:46:10.7033682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.7033804Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.7033807Z 2025-08-14T21:46:10.7033908Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7034086Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7034145Z return mod(**inputs) 2025-08-14T21:46:10.7034391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7034453Z outputs = self.roberta( 2025-08-14T21:46:10.7034697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7034761Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7034997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7035071Z layer_outputs = layer_module( 2025-08-14T21:46:10.7035274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7035345Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7035593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7035667Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7035911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7035976Z return func(*args, **kwargs) 2025-08-14T21:46:10.7036220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.7036396Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.7036636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.7036730Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7036741Z 2025-08-14T21:46:10.7036836Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7037019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7037086Z return mod(**inputs) 2025-08-14T21:46:10.7037330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7037412Z outputs = self.roberta( 2025-08-14T21:46:10.7037660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7037729Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7037977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7038046Z layer_outputs = layer_module( 2025-08-14T21:46:10.7038249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7038328Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7038569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7038648Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7038894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7038963Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7039243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.7039352Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.7039594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.7039685Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7039688Z 2025-08-14T21:46:10.7039780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7039969Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7040028Z return mod(**inputs) 2025-08-14T21:46:10.7040274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7040345Z outputs = self.roberta( 2025-08-14T21:46:10.7040587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7040654Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7040902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7040967Z layer_outputs = layer_module( 2025-08-14T21:46:10.7041175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7041246Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7041499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7041584Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7041817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7041894Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7042181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.7042305Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.7042550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.7042652Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.7042845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.7042917Z return self.act(input) 2025-08-14T21:46:10.7042921Z 2025-08-14T21:46:10.7043034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7043221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7043282Z return mod(**inputs) 2025-08-14T21:46:10.7043525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7043599Z outputs = self.roberta( 2025-08-14T21:46:10.7043837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7043912Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7044152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7044219Z layer_outputs = layer_module( 2025-08-14T21:46:10.7044430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7044505Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7044748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7044834Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7045073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7045154Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7045425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.7045548Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.7045797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.7045874Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7045877Z 2025-08-14T21:46:10.7045981Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7046164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7046226Z return mod(**inputs) 2025-08-14T21:46:10.7046473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7046538Z outputs = self.roberta( 2025-08-14T21:46:10.7046777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7046850Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7047089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7047180Z layer_outputs = layer_module( 2025-08-14T21:46:10.7047385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7047457Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7047715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7047792Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7048094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7048220Z return func(*args, **kwargs) 2025-08-14T21:46:10.7048485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7048574Z self_outputs = self.self( 2025-08-14T21:46:10.7048994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7049098Z return func(*args, **kwargs) 2025-08-14T21:46:10.7049395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:10.7049654Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:10.7049658Z 2025-08-14T21:46:10.7049775Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7050029Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7050123Z return mod(**inputs) 2025-08-14T21:46:10.7050417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7050502Z outputs = self.roberta( 2025-08-14T21:46:10.7050764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7050867Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7051151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7051276Z layer_outputs = layer_module( 2025-08-14T21:46:10.7051502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7051596Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7051885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7051969Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7052227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7052357Z return func(*args, **kwargs) 2025-08-14T21:46:10.7052618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7052732Z self_outputs = self.self( 2025-08-14T21:46:10.7052974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7053051Z return func(*args, **kwargs) 2025-08-14T21:46:10.7053371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:10.7053456Z self.key(current_states) 2025-08-14T21:46:10.7053460Z 2025-08-14T21:46:10.7053622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7053826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7053908Z return mod(**inputs) 2025-08-14T21:46:10.7054227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7054322Z outputs = self.roberta( 2025-08-14T21:46:10.7054581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7054692Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7054966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7055087Z layer_outputs = layer_module( 2025-08-14T21:46:10.7055328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7055429Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7055716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7055813Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7056088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7056175Z return func(*args, **kwargs) 2025-08-14T21:46:10.7056453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7056574Z self_outputs = self.self( 2025-08-14T21:46:10.7056814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7056931Z return func(*args, **kwargs) 2025-08-14T21:46:10.7057193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:10.7057266Z self.value(current_states) 2025-08-14T21:46:10.7057270Z 2025-08-14T21:46:10.7057444Z cudagraph partition due to non gpu ops 2025-08-14T21:46:10.7057591Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7057796Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7057912Z return mod(**inputs) 2025-08-14T21:46:10.7058175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7058295Z outputs = self.roberta( 2025-08-14T21:46:10.7058568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7058656Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7058949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7059035Z layer_outputs = layer_module( 2025-08-14T21:46:10.7059273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7059385Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7059659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7059786Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7060028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7060114Z return func(*args, **kwargs) 2025-08-14T21:46:10.7060393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:10.7060497Z self_outputs = self.self( 2025-08-14T21:46:10.7060774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7060864Z return func(*args, **kwargs) 2025-08-14T21:46:10.7061140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:10.7061312Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:10.7061315Z 2025-08-14T21:46:10.7061418Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7061691Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7061792Z return mod(**inputs) 2025-08-14T21:46:10.7062057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7062185Z outputs = self.roberta( 2025-08-14T21:46:10.7062446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7062564Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7062842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7062929Z layer_outputs = layer_module( 2025-08-14T21:46:10.7063195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7063289Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7063550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:10.7063688Z self_attention_outputs = self.attention( 2025-08-14T21:46:10.7063941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:10.7064051Z return func(*args, **kwargs) 2025-08-14T21:46:10.7064308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:10.7064447Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:10.7064797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:10.7064929Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7064932Z 2025-08-14T21:46:10.7065084Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7065290Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7065371Z return mod(**inputs) 2025-08-14T21:46:10.7065666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7065770Z outputs = self.roberta( 2025-08-14T21:46:10.7066092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7066179Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7066439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7077474Z layer_outputs = layer_module( 2025-08-14T21:46:10.7077815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7077901Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7078191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7078279Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7078534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7078609Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7078889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.7079081Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.7079338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:10.7079425Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7079430Z 2025-08-14T21:46:10.7079535Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7079763Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7079867Z return mod(**inputs) 2025-08-14T21:46:10.7080117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7080185Z outputs = self.roberta( 2025-08-14T21:46:10.7080435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7080505Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7080758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7080854Z layer_outputs = layer_module( 2025-08-14T21:46:10.7081062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7081151Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7081395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7081477Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7081723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7081797Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7082078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:10.7082196Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:10.7082440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:10.7082556Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:10.7082756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:10.7082835Z return self.act(input) 2025-08-14T21:46:10.7082840Z 2025-08-14T21:46:10.7082942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7083133Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7083204Z return mod(**inputs) 2025-08-14T21:46:10.7083447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 999, in forward 2025-08-14T21:46:10.7083515Z outputs = self.roberta( 2025-08-14T21:46:10.7083763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:10.7083833Z encoder_outputs = self.encoder( 2025-08-14T21:46:10.7084081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:10.7084150Z layer_outputs = layer_module( 2025-08-14T21:46:10.7084357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:10.7084440Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:10.7084839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:10.7084931Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:10.7085230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:10.7085310Z return forward_fn(*input_tensors) 2025-08-14T21:46:10.7085593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:10.7085758Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:10.7086004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:10.7086114Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:10.7086118Z 2025-08-14T21:46:10.7086218Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7086412Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7086474Z return mod(**inputs) 2025-08-14T21:46:10.7086722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1016, in forward 2025-08-14T21:46:10.7086851Z prediction_scores = self.lm_head(sequence_output) 2025-08-14T21:46:10.7087094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1149, in forward 2025-08-14T21:46:10.7087171Z x = self.dense(features) 2025-08-14T21:46:10.7087176Z 2025-08-14T21:46:10.7087270Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7087453Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7087520Z return mod(**inputs) 2025-08-14T21:46:10.7087760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1016, in forward 2025-08-14T21:46:10.7087849Z prediction_scores = self.lm_head(sequence_output) 2025-08-14T21:46:10.7088100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1154, in forward 2025-08-14T21:46:10.7088165Z x = self.decoder(x) 2025-08-14T21:46:10.7088168Z 2025-08-14T21:46:10.7088267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:10.7088446Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:10.7088505Z return mod(**inputs) 2025-08-14T21:46:10.7088757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1022, in forward 2025-08-14T21:46:10.7088828Z lm_loss = self.loss_function( 2025-08-14T21:46:10.7089060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 67, in ForCausalLMLoss 2025-08-14T21:46:10.7089221Z loss = fixed_cross_entropy(logits, shift_labels, num_items_in_batch, ignore_index, **kwargs) 2025-08-14T21:46:10.7089451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 36, in fixed_cross_entropy 2025-08-14T21:46:10.7089641Z loss = nn.functional.cross_entropy(source, target, ignore_index=ignore_index, reduction=reduction) 2025-08-14T21:46:10.7089646Z 2025-08-14T21:46:18.3774483Z Compilation time (from dynamo_timed): 13.323395745 2025-08-14T21:46:18.3882484Z pass 2025-08-14T21:46:18.3883038Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:18.3883908Z TIMING: _recursive_pre_grad_passes:0.00629 _recursive_joint_graph_passes:0.55619 _recursive_post_grad_passes:0.07263 async_compile.wait:0.67381 code_gen:6.62885 inductor_compile:7.71302 backend_compile:10.59177 gc:0.00083 entire_frame_compile:13.3234 total_wall_time:13.3234 2025-08-14T21:46:18.3885121Z STATS: call_* op count: 303 | FakeTensorMode.__torch_dispatch__:12464 | FakeTensor.__torch_dispatch__:4759 | ProxyTorchDispatchMode.__torch_dispatch__:4539 2025-08-14T21:46:18.3885593Z Dynamo produced 1 graphs covering 303 ops with 0 graph breaks (0 unique) 2025-08-14T21:46:22.4898306Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:46:22.4899274Z from pkg_resources import resource_filename 2025-08-14T21:46:23.0179215Z 2025-08-14T21:46:24.0277117Z loading model: 0it [00:00, ?it/s]We strongly recommend passing in an `attention_mask` since your input_ids may be padded. See https://huggingface.co/docs/transformers/troubleshooting#incorrect-output-when-padding-tokens-arent-masked. 2025-08-14T21:46:24.0278794Z You may ignore this warning if your `pad_token_id` (0) is identical to the `bos_token_id` (0), `eos_token_id` (2), or the `sep_token_id` (None), and your input is not padded. 2025-08-14T21:46:24.0280011Z WARNING:transformers.modeling_utils:We strongly recommend passing in an `attention_mask` since your input_ids may be padded. See https://huggingface.co/docs/transformers/troubleshooting#incorrect-output-when-padding-tokens-arent-masked. 2025-08-14T21:46:24.0281229Z You may ignore this warning if your `pad_token_id` (0) is identical to the `bos_token_id` (0), `eos_token_id` (2), or the `sep_token_id` (None), and your input is not padded. 2025-08-14T21:46:24.1387290Z 2025-08-14T21:46:24.1389362Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:46:24.1401067Z cpu eval RobertaForQuestionAnswering 2025-08-14T21:46:24.4869635Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:24.6384268Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:24.7796668Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:31.4879179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4881184Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4886672Z return mod(**inputs) 2025-08-14T21:46:31.4891425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4895546Z outputs = self.roberta( 2025-08-14T21:46:31.4897823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 826, in forward 2025-08-14T21:46:31.4902119Z embedding_output = self.embeddings( 2025-08-14T21:46:31.4907088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 89, in forward 2025-08-14T21:46:31.4908896Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:46:31.4909623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1576, in create_position_ids_from_input_ids 2025-08-14T21:46:31.4913832Z mask = input_ids.ne(padding_idx).int() 2025-08-14T21:46:31.4915513Z 2025-08-14T21:46:31.4915737Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4915972Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4916166Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4916367Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4916637Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4920382Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4922330Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4922570Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4922854Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4926945Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4931092Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4934714Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4939160Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4939599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4939909Z return mod(**inputs) 2025-08-14T21:46:31.4940289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4940723Z outputs = self.roberta( 2025-08-14T21:46:31.4941081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 826, in forward 2025-08-14T21:46:31.4941510Z embedding_output = self.embeddings( 2025-08-14T21:46:31.4941886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 89, in forward 2025-08-14T21:46:31.4942381Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:46:31.4942930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1577, in create_position_ids_from_input_ids 2025-08-14T21:46:31.4943514Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:46:31.4943746Z 2025-08-14T21:46:31.4943849Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4944192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4944487Z return mod(**inputs) 2025-08-14T21:46:31.4944964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4945354Z outputs = self.roberta( 2025-08-14T21:46:31.4945711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 826, in forward 2025-08-14T21:46:31.4946089Z embedding_output = self.embeddings( 2025-08-14T21:46:31.4946465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 89, in forward 2025-08-14T21:46:31.4946965Z position_ids = create_position_ids_from_input_ids(input_ids, self.padding_idx, past_key_values_length) 2025-08-14T21:46:31.4947510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1577, in create_position_ids_from_input_ids 2025-08-14T21:46:31.4948045Z incremental_indices = (torch.cumsum(mask, dim=1).type_as(mask) + past_key_values_length) * mask 2025-08-14T21:46:31.4948279Z 2025-08-14T21:46:31.4948381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4948758Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4949053Z return mod(**inputs) 2025-08-14T21:46:31.4949398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4949762Z outputs = self.roberta( 2025-08-14T21:46:31.4950108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.4950464Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.4950827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.4951188Z layer_outputs = layer_module( 2025-08-14T21:46:31.4951505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.4951844Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.4952212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.4952586Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.4952969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4953320Z return func(*args, **kwargs) 2025-08-14T21:46:31.4953676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.4954035Z self_outputs = self.self( 2025-08-14T21:46:31.4954389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4954761Z return func(*args, **kwargs) 2025-08-14T21:46:31.4955424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.4955909Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.4956161Z 2025-08-14T21:46:31.4956259Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4956597Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4956922Z return mod(**inputs) 2025-08-14T21:46:31.4957260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4957623Z outputs = self.roberta( 2025-08-14T21:46:31.4957971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.4958335Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.4958689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.4959050Z layer_outputs = layer_module( 2025-08-14T21:46:31.4959369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.4959697Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.4960068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.4960440Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.4960791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4961124Z return func(*args, **kwargs) 2025-08-14T21:46:31.4961475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.4961835Z self_outputs = self.self( 2025-08-14T21:46:31.4962167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4962498Z return func(*args, **kwargs) 2025-08-14T21:46:31.4962847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.4963204Z self.key(current_states) 2025-08-14T21:46:31.4963309Z 2025-08-14T21:46:31.4963402Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4963735Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4964033Z return mod(**inputs) 2025-08-14T21:46:31.4964377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4964730Z outputs = self.roberta( 2025-08-14T21:46:31.4965074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.4965436Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.4965785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.4966144Z layer_outputs = layer_module( 2025-08-14T21:46:31.4966483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.4966827Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.4967200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.4967612Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.4967994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4968353Z return func(*args, **kwargs) 2025-08-14T21:46:31.4968701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.4969061Z self_outputs = self.self( 2025-08-14T21:46:31.4969400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4969743Z return func(*args, **kwargs) 2025-08-14T21:46:31.4970118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.4970487Z self.value(current_states) 2025-08-14T21:46:31.4970598Z 2025-08-14T21:46:31.4970681Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.4970901Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4971241Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4971544Z return mod(**inputs) 2025-08-14T21:46:31.4971889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4972259Z outputs = self.roberta( 2025-08-14T21:46:31.4972610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.4972980Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.4973344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.4973714Z layer_outputs = layer_module( 2025-08-14T21:46:31.4974047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.4974387Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.4974768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.4975153Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.4975516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4975859Z return func(*args, **kwargs) 2025-08-14T21:46:31.4976222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.4976600Z self_outputs = self.self( 2025-08-14T21:46:31.4976935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4977287Z return func(*args, **kwargs) 2025-08-14T21:46:31.4977646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.4978081Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.4978256Z 2025-08-14T21:46:31.4978355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4978696Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4979002Z return mod(**inputs) 2025-08-14T21:46:31.4979375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4979744Z outputs = self.roberta( 2025-08-14T21:46:31.4980106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.4980496Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.4980872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.4981285Z layer_outputs = layer_module( 2025-08-14T21:46:31.4981616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.4981961Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.4982334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.4982730Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.4983085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.4983443Z return func(*args, **kwargs) 2025-08-14T21:46:31.4983786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.4984200Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.4984924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.4985310Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.4985448Z 2025-08-14T21:46:31.4985544Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4985874Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4986174Z return mod(**inputs) 2025-08-14T21:46:31.4986515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4986884Z outputs = self.roberta( 2025-08-14T21:46:31.4987231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.4987586Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.4987947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.4988312Z layer_outputs = layer_module( 2025-08-14T21:46:31.4988632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.4988957Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.4989322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.4989697Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.4990071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.4990427Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.4990822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.4991258Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.4991658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.4992032Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.4992166Z 2025-08-14T21:46:31.4992261Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.4992596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.4992889Z return mod(**inputs) 2025-08-14T21:46:31.4993277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.4993644Z outputs = self.roberta( 2025-08-14T21:46:31.4993987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.4994370Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.4994763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.4995122Z layer_outputs = layer_module( 2025-08-14T21:46:31.4995443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.4995792Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.4996180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.4996595Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.4996968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.4997348Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.4997743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.4998169Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.4998572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.4998967Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.4999319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.4999624Z return self.act(input) 2025-08-14T21:46:31.4999738Z 2025-08-14T21:46:31.4999836Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5000170Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5000469Z return mod(**inputs) 2025-08-14T21:46:31.5000809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5001170Z outputs = self.roberta( 2025-08-14T21:46:31.5001513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5001867Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5002220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5002576Z layer_outputs = layer_module( 2025-08-14T21:46:31.5002892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5003220Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5003608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5004013Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5004446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5004807Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5005193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5005638Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5006041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5006426Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5006562Z 2025-08-14T21:46:31.5006657Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5006984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5007272Z return mod(**inputs) 2025-08-14T21:46:31.5007629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5008009Z outputs = self.roberta( 2025-08-14T21:46:31.5008343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5008703Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5009057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5009418Z layer_outputs = layer_module( 2025-08-14T21:46:31.5009745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5010074Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5010442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5010812Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5011156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5011506Z return func(*args, **kwargs) 2025-08-14T21:46:31.5011854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5012204Z self_outputs = self.self( 2025-08-14T21:46:31.5012535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5012873Z return func(*args, **kwargs) 2025-08-14T21:46:31.5013218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5013692Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5013940Z 2025-08-14T21:46:31.5014035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5014365Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5014659Z return mod(**inputs) 2025-08-14T21:46:31.5014995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5015354Z outputs = self.roberta( 2025-08-14T21:46:31.5015696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5016048Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5016405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5016759Z layer_outputs = layer_module( 2025-08-14T21:46:31.5017074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5017399Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5017759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5018125Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5018468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5018811Z return func(*args, **kwargs) 2025-08-14T21:46:31.5019176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5019543Z self_outputs = self.self( 2025-08-14T21:46:31.5019866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5020207Z return func(*args, **kwargs) 2025-08-14T21:46:31.5020569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5020942Z self.key(current_states) 2025-08-14T21:46:31.5021049Z 2025-08-14T21:46:31.5021143Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5021472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5021771Z return mod(**inputs) 2025-08-14T21:46:31.5022112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5022496Z outputs = self.roberta( 2025-08-14T21:46:31.5022842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5023203Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5023556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5023918Z layer_outputs = layer_module( 2025-08-14T21:46:31.5024235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5024560Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5025004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5025386Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5025746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5026086Z return func(*args, **kwargs) 2025-08-14T21:46:31.5026441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5026809Z self_outputs = self.self( 2025-08-14T21:46:31.5027136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5027484Z return func(*args, **kwargs) 2025-08-14T21:46:31.5027839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5028206Z self.value(current_states) 2025-08-14T21:46:31.5028317Z 2025-08-14T21:46:31.5028393Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5028620Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5028954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5029253Z return mod(**inputs) 2025-08-14T21:46:31.5029592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5029954Z outputs = self.roberta( 2025-08-14T21:46:31.5030308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5030667Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5031027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5031386Z layer_outputs = layer_module( 2025-08-14T21:46:31.5031709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5032061Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5032435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5032808Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5033179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5033523Z return func(*args, **kwargs) 2025-08-14T21:46:31.5033890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5034250Z self_outputs = self.self( 2025-08-14T21:46:31.5034573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5034914Z return func(*args, **kwargs) 2025-08-14T21:46:31.5035266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5035692Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5035868Z 2025-08-14T21:46:31.5035961Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5036292Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5036586Z return mod(**inputs) 2025-08-14T21:46:31.5036924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5037288Z outputs = self.roberta( 2025-08-14T21:46:31.5037630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5037988Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5038338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5038698Z layer_outputs = layer_module( 2025-08-14T21:46:31.5039022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5039346Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5039711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5040077Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5040430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5040760Z return func(*args, **kwargs) 2025-08-14T21:46:31.5041110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5041522Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5041932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5042303Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5042439Z 2025-08-14T21:46:31.5042533Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5042862Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5043153Z return mod(**inputs) 2025-08-14T21:46:31.5043497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5043855Z outputs = self.roberta( 2025-08-14T21:46:31.5044195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5044547Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5044922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5045288Z layer_outputs = layer_module( 2025-08-14T21:46:31.5045601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5045936Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5046320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5046713Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5047074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5047432Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5047822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5048259Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5048675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5049048Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5049174Z 2025-08-14T21:46:31.5049275Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5049597Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5049895Z return mod(**inputs) 2025-08-14T21:46:31.5050238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5050600Z outputs = self.roberta( 2025-08-14T21:46:31.5050935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5051295Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5051652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5052009Z layer_outputs = layer_module( 2025-08-14T21:46:31.5052318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5052649Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5053012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5053377Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5053742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5054100Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5054488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5054918Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5055319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5055713Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5056067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5056376Z return self.act(input) 2025-08-14T21:46:31.5056483Z 2025-08-14T21:46:31.5056578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5056913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5057203Z return mod(**inputs) 2025-08-14T21:46:31.5057548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5057925Z outputs = self.roberta( 2025-08-14T21:46:31.5058272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5058627Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5059002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5059362Z layer_outputs = layer_module( 2025-08-14T21:46:31.5059689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5060022Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5060386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5060755Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5061115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5061490Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5061881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5062326Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5062739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5063114Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5063241Z 2025-08-14T21:46:31.5063344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5063669Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5063968Z return mod(**inputs) 2025-08-14T21:46:31.5064316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5064678Z outputs = self.roberta( 2025-08-14T21:46:31.5065094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5065463Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5065831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5066188Z layer_outputs = layer_module( 2025-08-14T21:46:31.5066509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5066844Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5067211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5067576Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5067935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5068284Z return func(*args, **kwargs) 2025-08-14T21:46:31.5068640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5068993Z self_outputs = self.self( 2025-08-14T21:46:31.5069327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5069674Z return func(*args, **kwargs) 2025-08-14T21:46:31.5070020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5070507Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5070756Z 2025-08-14T21:46:31.5070872Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5071208Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5071498Z return mod(**inputs) 2025-08-14T21:46:31.5071841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5072220Z outputs = self.roberta( 2025-08-14T21:46:31.5072568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5072943Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5073301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5073662Z layer_outputs = layer_module( 2025-08-14T21:46:31.5073973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5074307Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5074692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5075060Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5075403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5075742Z return func(*args, **kwargs) 2025-08-14T21:46:31.5076089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5076439Z self_outputs = self.self( 2025-08-14T21:46:31.5076771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5077115Z return func(*args, **kwargs) 2025-08-14T21:46:31.5077463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5077816Z self.key(current_states) 2025-08-14T21:46:31.5077927Z 2025-08-14T21:46:31.5078022Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5078355Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5078651Z return mod(**inputs) 2025-08-14T21:46:31.5078986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5079345Z outputs = self.roberta( 2025-08-14T21:46:31.5079688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5080041Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5080399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5080755Z layer_outputs = layer_module( 2025-08-14T21:46:31.5081072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5081397Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5081762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5082130Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5082471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5082807Z return func(*args, **kwargs) 2025-08-14T21:46:31.5083153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5083506Z self_outputs = self.self( 2025-08-14T21:46:31.5083857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5084199Z return func(*args, **kwargs) 2025-08-14T21:46:31.5084546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5085076Z self.value(current_states) 2025-08-14T21:46:31.5085193Z 2025-08-14T21:46:31.5085306Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5085554Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5085884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5086173Z return mod(**inputs) 2025-08-14T21:46:31.5086513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5086874Z outputs = self.roberta( 2025-08-14T21:46:31.5087212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5087603Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5087962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5088322Z layer_outputs = layer_module( 2025-08-14T21:46:31.5088632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5088969Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5089336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5089705Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5090047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5090385Z return func(*args, **kwargs) 2025-08-14T21:46:31.5090741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5091096Z self_outputs = self.self( 2025-08-14T21:46:31.5091431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5091772Z return func(*args, **kwargs) 2025-08-14T21:46:31.5092122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5092530Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5092708Z 2025-08-14T21:46:31.5092802Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5093134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5093427Z return mod(**inputs) 2025-08-14T21:46:31.5093765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5094123Z outputs = self.roberta( 2025-08-14T21:46:31.5094468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5094821Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5095182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5095540Z layer_outputs = layer_module( 2025-08-14T21:46:31.5095856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5096181Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5096545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5096940Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5097288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5097629Z return func(*args, **kwargs) 2025-08-14T21:46:31.5097998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5098412Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5098834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5099205Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5099333Z 2025-08-14T21:46:31.5099433Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5099762Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5100056Z return mod(**inputs) 2025-08-14T21:46:31.5100399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5100776Z outputs = self.roberta( 2025-08-14T21:46:31.5101109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5101473Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5101828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5102186Z layer_outputs = layer_module( 2025-08-14T21:46:31.5102495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5102824Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5103189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5103554Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5103921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5104280Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5104665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5105156Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5105579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5105964Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5106094Z 2025-08-14T21:46:31.5106198Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5106535Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5106849Z return mod(**inputs) 2025-08-14T21:46:31.5107204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5107575Z outputs = self.roberta( 2025-08-14T21:46:31.5107935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5108317Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5108677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5109031Z layer_outputs = layer_module( 2025-08-14T21:46:31.5109351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5109685Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5110070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5110439Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5110807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5111180Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5111564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5112013Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5112413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5112805Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5113149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5113487Z return self.act(input) 2025-08-14T21:46:31.5113590Z 2025-08-14T21:46:31.5113697Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5114032Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5114329Z return mod(**inputs) 2025-08-14T21:46:31.5114676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5115045Z outputs = self.roberta( 2025-08-14T21:46:31.5115382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5115749Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5116109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5116472Z layer_outputs = layer_module( 2025-08-14T21:46:31.5116789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5117128Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5117496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5117865Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5118237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5118598Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5118989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5119428Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5119844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5120219Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5120347Z 2025-08-14T21:46:31.5120451Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5120779Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5121084Z return mod(**inputs) 2025-08-14T21:46:31.5121434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5121792Z outputs = self.roberta( 2025-08-14T21:46:31.5122140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5122507Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5122887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5123250Z layer_outputs = layer_module( 2025-08-14T21:46:31.5123571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5123903Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5124279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5124668Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5125027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5125377Z return func(*args, **kwargs) 2025-08-14T21:46:31.5125730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5126095Z self_outputs = self.self( 2025-08-14T21:46:31.5126436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5126810Z return func(*args, **kwargs) 2025-08-14T21:46:31.5127157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5127649Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5127893Z 2025-08-14T21:46:31.5127995Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5128320Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5128618Z return mod(**inputs) 2025-08-14T21:46:31.5128966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5129330Z outputs = self.roberta( 2025-08-14T21:46:31.5129671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5130038Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5130409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5130788Z layer_outputs = layer_module( 2025-08-14T21:46:31.5131122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5131455Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5131821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5132185Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5132533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5132904Z return func(*args, **kwargs) 2025-08-14T21:46:31.5133268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5133633Z self_outputs = self.self( 2025-08-14T21:46:31.5133973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5134324Z return func(*args, **kwargs) 2025-08-14T21:46:31.5134678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5135051Z self.key(current_states) 2025-08-14T21:46:31.5135166Z 2025-08-14T21:46:31.5135262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5135604Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5135904Z return mod(**inputs) 2025-08-14T21:46:31.5136278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5136656Z outputs = self.roberta( 2025-08-14T21:46:31.5137010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5137408Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5137777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5138167Z layer_outputs = layer_module( 2025-08-14T21:46:31.5138489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5138832Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5139212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5139596Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5139968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5140324Z return func(*args, **kwargs) 2025-08-14T21:46:31.5140694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5141065Z self_outputs = self.self( 2025-08-14T21:46:31.5141413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5141766Z return func(*args, **kwargs) 2025-08-14T21:46:31.5142127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5142495Z self.value(current_states) 2025-08-14T21:46:31.5142620Z 2025-08-14T21:46:31.5142698Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5142929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5143271Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5143583Z return mod(**inputs) 2025-08-14T21:46:31.5143942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5144316Z outputs = self.roberta( 2025-08-14T21:46:31.5144667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5145118Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5145499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5145881Z layer_outputs = layer_module( 2025-08-14T21:46:31.5146216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5146577Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5146951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5147325Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5147690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5148043Z return func(*args, **kwargs) 2025-08-14T21:46:31.5148401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5148760Z self_outputs = self.self( 2025-08-14T21:46:31.5149098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5149436Z return func(*args, **kwargs) 2025-08-14T21:46:31.5149802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5150224Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5150400Z 2025-08-14T21:46:31.5150495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5150842Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5151131Z return mod(**inputs) 2025-08-14T21:46:31.5151496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5151857Z outputs = self.roberta( 2025-08-14T21:46:31.5152195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5152555Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5152916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5153295Z layer_outputs = layer_module( 2025-08-14T21:46:31.5153605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5153933Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5154298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5154671Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5155011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5155348Z return func(*args, **kwargs) 2025-08-14T21:46:31.5155697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5156099Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5156510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5156882Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5157010Z 2025-08-14T21:46:31.5157112Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5157434Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5157733Z return mod(**inputs) 2025-08-14T21:46:31.5158077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5158437Z outputs = self.roberta( 2025-08-14T21:46:31.5158770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5159129Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5159484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5159836Z layer_outputs = layer_module( 2025-08-14T21:46:31.5160152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5160483Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5160843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5161208Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5161574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5161932Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5162333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5162775Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5163185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5163558Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5163684Z 2025-08-14T21:46:31.5163796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5164148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5164446Z return mod(**inputs) 2025-08-14T21:46:31.5164791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5165141Z outputs = self.roberta( 2025-08-14T21:46:31.5165481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5165842Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5166205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5166562Z layer_outputs = layer_module( 2025-08-14T21:46:31.5166882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5167212Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5167568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5167935Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5168295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5168649Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5169035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5169476Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5169885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5170278Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5170638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5170958Z return self.act(input) 2025-08-14T21:46:31.5171060Z 2025-08-14T21:46:31.5171164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5171500Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5171814Z return mod(**inputs) 2025-08-14T21:46:31.5172156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5172510Z outputs = self.roberta( 2025-08-14T21:46:31.5172854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5173214Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5173569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5173919Z layer_outputs = layer_module( 2025-08-14T21:46:31.5174236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5174565Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5174928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5175290Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5175672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5176038Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5176418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5176881Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5177322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5177691Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5177817Z 2025-08-14T21:46:31.5177911Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5178240Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5178538Z return mod(**inputs) 2025-08-14T21:46:31.5178883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5179259Z outputs = self.roberta( 2025-08-14T21:46:31.5179607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5179975Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5180324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5180687Z layer_outputs = layer_module( 2025-08-14T21:46:31.5181006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5181335Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5181692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5182061Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5182413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5182742Z return func(*args, **kwargs) 2025-08-14T21:46:31.5183095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5183453Z self_outputs = self.self( 2025-08-14T21:46:31.5183781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5184110Z return func(*args, **kwargs) 2025-08-14T21:46:31.5184459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5185189Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5185448Z 2025-08-14T21:46:31.5185558Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5185941Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5186260Z return mod(**inputs) 2025-08-14T21:46:31.5186616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5186988Z outputs = self.roberta( 2025-08-14T21:46:31.5187337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5187701Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5188059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5188414Z layer_outputs = layer_module( 2025-08-14T21:46:31.5188772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5189107Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5189472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5189836Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5190209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5190583Z return func(*args, **kwargs) 2025-08-14T21:46:31.5190934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5191302Z self_outputs = self.self( 2025-08-14T21:46:31.5191636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5191980Z return func(*args, **kwargs) 2025-08-14T21:46:31.5192331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5192721Z self.key(current_states) 2025-08-14T21:46:31.5192826Z 2025-08-14T21:46:31.5192931Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5193256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5193557Z return mod(**inputs) 2025-08-14T21:46:31.5193902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5194259Z outputs = self.roberta( 2025-08-14T21:46:31.5194594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5194954Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5195307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5195660Z layer_outputs = layer_module( 2025-08-14T21:46:31.5195975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5196303Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5196670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5197032Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5197382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5197720Z return func(*args, **kwargs) 2025-08-14T21:46:31.5198067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5198415Z self_outputs = self.self( 2025-08-14T21:46:31.5198742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5199081Z return func(*args, **kwargs) 2025-08-14T21:46:31.5199422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5199787Z self.value(current_states) 2025-08-14T21:46:31.5199902Z 2025-08-14T21:46:31.5199977Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5200196Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5200514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5200811Z return mod(**inputs) 2025-08-14T21:46:31.5201149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5201499Z outputs = self.roberta( 2025-08-14T21:46:31.5201858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5202221Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5202582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5202957Z layer_outputs = layer_module( 2025-08-14T21:46:31.5203280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5203630Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5203989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5204362Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5204711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5205056Z return func(*args, **kwargs) 2025-08-14T21:46:31.5205441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5205803Z self_outputs = self.self( 2025-08-14T21:46:31.5206134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5206477Z return func(*args, **kwargs) 2025-08-14T21:46:31.5206823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5207240Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5207407Z 2025-08-14T21:46:31.5207508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5207833Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5208141Z return mod(**inputs) 2025-08-14T21:46:31.5208489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5208852Z outputs = self.roberta( 2025-08-14T21:46:31.5209188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5209551Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5209909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5210266Z layer_outputs = layer_module( 2025-08-14T21:46:31.5210577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5210913Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5211278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5211646Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5211997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5212341Z return func(*args, **kwargs) 2025-08-14T21:46:31.5212692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5213102Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5213513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5213884Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5214011Z 2025-08-14T21:46:31.5214103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5214452Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5214752Z return mod(**inputs) 2025-08-14T21:46:31.5215096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5215450Z outputs = self.roberta( 2025-08-14T21:46:31.5215811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5216188Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5216545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5216897Z layer_outputs = layer_module( 2025-08-14T21:46:31.5217214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5217545Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5217903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5218292Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5218661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5219021Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5219408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5219845Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5220254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5220624Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5220752Z 2025-08-14T21:46:31.5220848Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5221181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5221485Z return mod(**inputs) 2025-08-14T21:46:31.5221824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5222186Z outputs = self.roberta( 2025-08-14T21:46:31.5222533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5222898Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5223249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5223610Z layer_outputs = layer_module( 2025-08-14T21:46:31.5223928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5224256Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5224622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5225057Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5225433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5225788Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5226182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5226615Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5227021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5227415Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5227785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5228104Z return self.act(input) 2025-08-14T21:46:31.5228205Z 2025-08-14T21:46:31.5228300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5228664Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5228964Z return mod(**inputs) 2025-08-14T21:46:31.5229330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5229692Z outputs = self.roberta( 2025-08-14T21:46:31.5230044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5230409Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5230764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5231138Z layer_outputs = layer_module( 2025-08-14T21:46:31.5231456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5231786Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5232144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5232516Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5232880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5233237Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5233616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5234055Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5234467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5234831Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5234964Z 2025-08-14T21:46:31.5235059Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5235393Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5235693Z return mod(**inputs) 2025-08-14T21:46:31.5236029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5236387Z outputs = self.roberta( 2025-08-14T21:46:31.5236731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5237094Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5237447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5237810Z layer_outputs = layer_module( 2025-08-14T21:46:31.5238129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5238457Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5238827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5239198Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5239553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5239889Z return func(*args, **kwargs) 2025-08-14T21:46:31.5240241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5240619Z self_outputs = self.self( 2025-08-14T21:46:31.5240948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5241292Z return func(*args, **kwargs) 2025-08-14T21:46:31.5241658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5242148Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5242406Z 2025-08-14T21:46:31.5242501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5242835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5243134Z return mod(**inputs) 2025-08-14T21:46:31.5243479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5243834Z outputs = self.roberta( 2025-08-14T21:46:31.5244195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5244560Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5244913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5245274Z layer_outputs = layer_module( 2025-08-14T21:46:31.5245591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5245923Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5246279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5246650Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5247001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5247344Z return func(*args, **kwargs) 2025-08-14T21:46:31.5247687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5248046Z self_outputs = self.self( 2025-08-14T21:46:31.5248377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5248712Z return func(*args, **kwargs) 2025-08-14T21:46:31.5249060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5249418Z self.key(current_states) 2025-08-14T21:46:31.5249521Z 2025-08-14T21:46:31.5249620Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5249946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5250247Z return mod(**inputs) 2025-08-14T21:46:31.5250591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5250946Z outputs = self.roberta( 2025-08-14T21:46:31.5251291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5251653Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5252009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5252360Z layer_outputs = layer_module( 2025-08-14T21:46:31.5252676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5253006Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5253386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5253763Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5254116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5254457Z return func(*args, **kwargs) 2025-08-14T21:46:31.5254817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5255195Z self_outputs = self.self( 2025-08-14T21:46:31.5255529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5255869Z return func(*args, **kwargs) 2025-08-14T21:46:31.5256216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5256581Z self.value(current_states) 2025-08-14T21:46:31.5256689Z 2025-08-14T21:46:31.5256771Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5257004Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5257333Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5257632Z return mod(**inputs) 2025-08-14T21:46:31.5257980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5258334Z outputs = self.roberta( 2025-08-14T21:46:31.5258680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5259041Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5259393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5259753Z layer_outputs = layer_module( 2025-08-14T21:46:31.5260070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5260404Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5260759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5261126Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5261475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5261813Z return func(*args, **kwargs) 2025-08-14T21:46:31.5262164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5262523Z self_outputs = self.self( 2025-08-14T21:46:31.5262849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5263181Z return func(*args, **kwargs) 2025-08-14T21:46:31.5263529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5263943Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5264114Z 2025-08-14T21:46:31.5264214Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5264538Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5264906Z return mod(**inputs) 2025-08-14T21:46:31.5265254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5265608Z outputs = self.roberta( 2025-08-14T21:46:31.5265955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5266319Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5266698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5267062Z layer_outputs = layer_module( 2025-08-14T21:46:31.5267389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5267749Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5268103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5268502Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5268854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5269195Z return func(*args, **kwargs) 2025-08-14T21:46:31.5269538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5269952Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5270382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5270756Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5270884Z 2025-08-14T21:46:31.5270979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5271313Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5271616Z return mod(**inputs) 2025-08-14T21:46:31.5271956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5272321Z outputs = self.roberta( 2025-08-14T21:46:31.5272668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5273033Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5273388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5273753Z layer_outputs = layer_module( 2025-08-14T21:46:31.5274073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5274406Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5274767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5275140Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5275508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5275864Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5276257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5276695Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5277102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5277468Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5277601Z 2025-08-14T21:46:31.5277696Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5278026Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5278325Z return mod(**inputs) 2025-08-14T21:46:31.5278663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5279026Z outputs = self.roberta( 2025-08-14T21:46:31.5279396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5279753Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5280107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5280461Z layer_outputs = layer_module( 2025-08-14T21:46:31.5280790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5281130Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5281499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5281867Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5282226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5282587Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5282979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5283434Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5283831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5284232Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5284737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5285065Z return self.act(input) 2025-08-14T21:46:31.5285170Z 2025-08-14T21:46:31.5285265Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5285608Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5285924Z return mod(**inputs) 2025-08-14T21:46:31.5286286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5286655Z outputs = self.roberta( 2025-08-14T21:46:31.5287007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5287376Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5287732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5288100Z layer_outputs = layer_module( 2025-08-14T21:46:31.5288423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5288752Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5289124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5289501Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5289873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5290230Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5290623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5291071Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5291487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5291854Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5291987Z 2025-08-14T21:46:31.5292081Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5292415Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5292514Z return mod(**inputs) 2025-08-14T21:46:31.5292770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5292833Z outputs = self.roberta( 2025-08-14T21:46:31.5293089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5293168Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5293431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5293498Z layer_outputs = layer_module( 2025-08-14T21:46:31.5293707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5293781Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5294030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5294131Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5294357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5294430Z return func(*args, **kwargs) 2025-08-14T21:46:31.5294671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5294737Z self_outputs = self.self( 2025-08-14T21:46:31.5294970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5295034Z return func(*args, **kwargs) 2025-08-14T21:46:31.5295281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5295472Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5295477Z 2025-08-14T21:46:31.5295570Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5295762Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5295822Z return mod(**inputs) 2025-08-14T21:46:31.5296072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5296135Z outputs = self.roberta( 2025-08-14T21:46:31.5296377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5296449Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5296685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5296751Z layer_outputs = layer_module( 2025-08-14T21:46:31.5296958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5297027Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5297270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5297346Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5297569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5297637Z return func(*args, **kwargs) 2025-08-14T21:46:31.5297875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5297944Z self_outputs = self.self( 2025-08-14T21:46:31.5298163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5298239Z return func(*args, **kwargs) 2025-08-14T21:46:31.5298488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5298552Z self.key(current_states) 2025-08-14T21:46:31.5298555Z 2025-08-14T21:46:31.5298667Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5298865Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5298943Z return mod(**inputs) 2025-08-14T21:46:31.5299192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5299253Z outputs = self.roberta( 2025-08-14T21:46:31.5299489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5299561Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5299798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5299882Z layer_outputs = layer_module( 2025-08-14T21:46:31.5300089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5300164Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5300411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5300487Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5300706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5300776Z return func(*args, **kwargs) 2025-08-14T21:46:31.5301017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5301088Z self_outputs = self.self( 2025-08-14T21:46:31.5301307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5301368Z return func(*args, **kwargs) 2025-08-14T21:46:31.5301617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5301686Z self.value(current_states) 2025-08-14T21:46:31.5301689Z 2025-08-14T21:46:31.5301763Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5301865Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5302047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5302113Z return mod(**inputs) 2025-08-14T21:46:31.5302356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5302420Z outputs = self.roberta( 2025-08-14T21:46:31.5302665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5302728Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5302968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5303040Z layer_outputs = layer_module( 2025-08-14T21:46:31.5303242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5303318Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5303557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5303628Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5303871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5303936Z return func(*args, **kwargs) 2025-08-14T21:46:31.5304184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5304246Z self_outputs = self.self( 2025-08-14T21:46:31.5304482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5304568Z return func(*args, **kwargs) 2025-08-14T21:46:31.5304864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5304987Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5304998Z 2025-08-14T21:46:31.5305092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5305273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5305372Z return mod(**inputs) 2025-08-14T21:46:31.5305620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5305682Z outputs = self.roberta( 2025-08-14T21:46:31.5305932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5306000Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5306252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5306316Z layer_outputs = layer_module( 2025-08-14T21:46:31.5306520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5306599Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5306839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5306914Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5307145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5307208Z return func(*args, **kwargs) 2025-08-14T21:46:31.5307456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5307574Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5307814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5307897Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5307900Z 2025-08-14T21:46:31.5307993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5308185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5308246Z return mod(**inputs) 2025-08-14T21:46:31.5308492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5308572Z outputs = self.roberta( 2025-08-14T21:46:31.5308815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5308879Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5309125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5309188Z layer_outputs = layer_module( 2025-08-14T21:46:31.5309401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5309471Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5309726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5309813Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5310063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5310135Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5310426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5310535Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5310776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5310850Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5310854Z 2025-08-14T21:46:31.5310947Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5311154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5311214Z return mod(**inputs) 2025-08-14T21:46:31.5311466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5311528Z outputs = self.roberta( 2025-08-14T21:46:31.5311768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5311839Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5312076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5312139Z layer_outputs = layer_module( 2025-08-14T21:46:31.5312349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5312420Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5312660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5312733Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5312968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5313045Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5313312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5313426Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5313666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5313768Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5313972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5314034Z return self.act(input) 2025-08-14T21:46:31.5314038Z 2025-08-14T21:46:31.5314129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5314318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5314380Z return mod(**inputs) 2025-08-14T21:46:31.5314627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5314688Z outputs = self.roberta( 2025-08-14T21:46:31.5314927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5314999Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5315253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5315328Z layer_outputs = layer_module( 2025-08-14T21:46:31.5315528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5315600Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5315862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5315956Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5316193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5316270Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5316535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5316666Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5316927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5317002Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5317005Z 2025-08-14T21:46:31.5317107Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5317287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5317356Z return mod(**inputs) 2025-08-14T21:46:31.5317597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5317657Z outputs = self.roberta( 2025-08-14T21:46:31.5317899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5317965Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5318204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5318276Z layer_outputs = layer_module( 2025-08-14T21:46:31.5318476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5318554Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5318790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5318864Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5319089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5319153Z return func(*args, **kwargs) 2025-08-14T21:46:31.5319390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5319461Z self_outputs = self.self( 2025-08-14T21:46:31.5319679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5319748Z return func(*args, **kwargs) 2025-08-14T21:46:31.5319985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5320175Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5320178Z 2025-08-14T21:46:31.5320279Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5320458Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5320522Z return mod(**inputs) 2025-08-14T21:46:31.5320776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5320839Z outputs = self.roberta( 2025-08-14T21:46:31.5321085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5321151Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5321402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5321490Z layer_outputs = layer_module( 2025-08-14T21:46:31.5321692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5321771Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5322011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5322083Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5322313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5322393Z return func(*args, **kwargs) 2025-08-14T21:46:31.5322637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5322702Z self_outputs = self.self( 2025-08-14T21:46:31.5322921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5322994Z return func(*args, **kwargs) 2025-08-14T21:46:31.5323233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5323295Z self.key(current_states) 2025-08-14T21:46:31.5323298Z 2025-08-14T21:46:31.5323399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5323579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5323647Z return mod(**inputs) 2025-08-14T21:46:31.5323887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5323949Z outputs = self.roberta( 2025-08-14T21:46:31.5324194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5324260Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5324503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5324565Z layer_outputs = layer_module( 2025-08-14T21:46:31.5324765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5324840Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5325078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5325152Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5325378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5325440Z return func(*args, **kwargs) 2025-08-14T21:46:31.5325685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5325747Z self_outputs = self.self( 2025-08-14T21:46:31.5325967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5326033Z return func(*args, **kwargs) 2025-08-14T21:46:31.5326267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5326347Z self.value(current_states) 2025-08-14T21:46:31.5326358Z 2025-08-14T21:46:31.5326434Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5326525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5326709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5326785Z return mod(**inputs) 2025-08-14T21:46:31.5327028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5328007Z outputs = self.roberta( 2025-08-14T21:46:31.5328246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5328311Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5328556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5328623Z layer_outputs = layer_module( 2025-08-14T21:46:31.5328851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5328925Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5329168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5329253Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5329475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5329547Z return func(*args, **kwargs) 2025-08-14T21:46:31.5329785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5329848Z self_outputs = self.self( 2025-08-14T21:46:31.5330076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5330140Z return func(*args, **kwargs) 2025-08-14T21:46:31.5330378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5330506Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5330509Z 2025-08-14T21:46:31.5330603Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5330792Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5330851Z return mod(**inputs) 2025-08-14T21:46:31.5331092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5331161Z outputs = self.roberta( 2025-08-14T21:46:31.5331399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5331471Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5331709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5331772Z layer_outputs = layer_module( 2025-08-14T21:46:31.5331981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5332052Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5332290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5332371Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5332588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5332657Z return func(*args, **kwargs) 2025-08-14T21:46:31.5332908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5333028Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5333271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5333365Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5333368Z 2025-08-14T21:46:31.5333483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5333662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5333721Z return mod(**inputs) 2025-08-14T21:46:31.5333969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5334030Z outputs = self.roberta( 2025-08-14T21:46:31.5334266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5334353Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5334589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5334659Z layer_outputs = layer_module( 2025-08-14T21:46:31.5334859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5334932Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5335175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5335249Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5335483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5335561Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5335827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5335944Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5336179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5336253Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5336258Z 2025-08-14T21:46:31.5336356Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5336535Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5336602Z return mod(**inputs) 2025-08-14T21:46:31.5336841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5336902Z outputs = self.roberta( 2025-08-14T21:46:31.5337144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5337210Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5337445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5337517Z layer_outputs = layer_module( 2025-08-14T21:46:31.5337716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5337792Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5338028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5338101Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5338341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5338427Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5338707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5338815Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5339100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5339224Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5339418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5339479Z return self.act(input) 2025-08-14T21:46:31.5339488Z 2025-08-14T21:46:31.5339580Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5339760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5339827Z return mod(**inputs) 2025-08-14T21:46:31.5340086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5340147Z outputs = self.roberta( 2025-08-14T21:46:31.5340396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5340459Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5340705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5340767Z layer_outputs = layer_module( 2025-08-14T21:46:31.5340969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5341047Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5341287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5341363Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5341606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5341674Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5341951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5342074Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5342314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5342397Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5342400Z 2025-08-14T21:46:31.5342492Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5342682Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5342743Z return mod(**inputs) 2025-08-14T21:46:31.5342984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5343051Z outputs = self.roberta( 2025-08-14T21:46:31.5343289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5343354Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5343601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5343665Z layer_outputs = layer_module( 2025-08-14T21:46:31.5343876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5343947Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5344199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5344286Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5344507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5344592Z return func(*args, **kwargs) 2025-08-14T21:46:31.5344900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5344991Z self_outputs = self.self( 2025-08-14T21:46:31.5345226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5345292Z return func(*args, **kwargs) 2025-08-14T21:46:31.5345534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5345736Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5345754Z 2025-08-14T21:46:31.5345851Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5346043Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5346105Z return mod(**inputs) 2025-08-14T21:46:31.5346351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5346426Z outputs = self.roberta( 2025-08-14T21:46:31.5346669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5346736Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5346989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5347057Z layer_outputs = layer_module( 2025-08-14T21:46:31.5347270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5347340Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5347581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5347666Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5347889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5347959Z return func(*args, **kwargs) 2025-08-14T21:46:31.5348202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5348264Z self_outputs = self.self( 2025-08-14T21:46:31.5348521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5348585Z return func(*args, **kwargs) 2025-08-14T21:46:31.5348826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5348898Z self.key(current_states) 2025-08-14T21:46:31.5348902Z 2025-08-14T21:46:31.5348996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5349188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5349248Z return mod(**inputs) 2025-08-14T21:46:31.5349493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5349561Z outputs = self.roberta( 2025-08-14T21:46:31.5349801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5349888Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5350125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5350189Z layer_outputs = layer_module( 2025-08-14T21:46:31.5350420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5350493Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5350745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5350825Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5351045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5351113Z return func(*args, **kwargs) 2025-08-14T21:46:31.5351352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5351432Z self_outputs = self.self( 2025-08-14T21:46:31.5351660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5351721Z return func(*args, **kwargs) 2025-08-14T21:46:31.5351959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5352034Z self.value(current_states) 2025-08-14T21:46:31.5352038Z 2025-08-14T21:46:31.5352112Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5352212Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5352394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5352455Z return mod(**inputs) 2025-08-14T21:46:31.5352704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5352766Z outputs = self.roberta( 2025-08-14T21:46:31.5353009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5353075Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5353315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5353386Z layer_outputs = layer_module( 2025-08-14T21:46:31.5353588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5353658Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5353900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5353973Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5354202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5354265Z return func(*args, **kwargs) 2025-08-14T21:46:31.5354501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5354572Z self_outputs = self.self( 2025-08-14T21:46:31.5354791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5354855Z return func(*args, **kwargs) 2025-08-14T21:46:31.5355099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5355220Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5355223Z 2025-08-14T21:46:31.5355323Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5355519Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5355580Z return mod(**inputs) 2025-08-14T21:46:31.5355830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5355908Z outputs = self.roberta( 2025-08-14T21:46:31.5356153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5356234Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5356474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5356544Z layer_outputs = layer_module( 2025-08-14T21:46:31.5356745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5356818Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5357080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5357154Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5357383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5357447Z return func(*args, **kwargs) 2025-08-14T21:46:31.5357688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5357813Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5358051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5358133Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5358136Z 2025-08-14T21:46:31.5358232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5358414Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5358481Z return mod(**inputs) 2025-08-14T21:46:31.5358724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5358786Z outputs = self.roberta( 2025-08-14T21:46:31.5359031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5359097Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5359344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5359409Z layer_outputs = layer_module( 2025-08-14T21:46:31.5359612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5359691Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5359927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5360002Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5360243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5360314Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5360588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5360698Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5360935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5361034Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5361038Z 2025-08-14T21:46:31.5361132Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5361318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5361378Z return mod(**inputs) 2025-08-14T21:46:31.5361635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5361719Z outputs = self.roberta( 2025-08-14T21:46:31.5361960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5362026Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5362276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5362341Z layer_outputs = layer_module( 2025-08-14T21:46:31.5362556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5362643Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5362885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5362969Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5363202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5363279Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5363550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5363659Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5363907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5364009Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5364203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5364273Z return self.act(input) 2025-08-14T21:46:31.5364276Z 2025-08-14T21:46:31.5364370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5364557Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5364619Z return mod(**inputs) 2025-08-14T21:46:31.5364861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5364929Z outputs = self.roberta( 2025-08-14T21:46:31.5365170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5365244Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5365483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5365547Z layer_outputs = layer_module( 2025-08-14T21:46:31.5365756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5365828Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5366069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5366148Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5366383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5366457Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5366744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5366866Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5367111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5367201Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5367204Z 2025-08-14T21:46:31.5367306Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5367501Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5367561Z return mod(**inputs) 2025-08-14T21:46:31.5367805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5367866Z outputs = self.roberta( 2025-08-14T21:46:31.5368104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5368192Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5368429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5368500Z layer_outputs = layer_module( 2025-08-14T21:46:31.5368702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5368775Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5369018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5369091Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5369317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5369379Z return func(*args, **kwargs) 2025-08-14T21:46:31.5369619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5369691Z self_outputs = self.self( 2025-08-14T21:46:31.5369910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5369976Z return func(*args, **kwargs) 2025-08-14T21:46:31.5370222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5370412Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5370415Z 2025-08-14T21:46:31.5370516Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5370696Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5370756Z return mod(**inputs) 2025-08-14T21:46:31.5371006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5371067Z outputs = self.roberta( 2025-08-14T21:46:31.5371312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5371380Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5371620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5371693Z layer_outputs = layer_module( 2025-08-14T21:46:31.5371893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5371965Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5372209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5372306Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5372535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5372599Z return func(*args, **kwargs) 2025-08-14T21:46:31.5372849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5372922Z self_outputs = self.self( 2025-08-14T21:46:31.5373158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5373221Z return func(*args, **kwargs) 2025-08-14T21:46:31.5373468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5373531Z self.key(current_states) 2025-08-14T21:46:31.5373534Z 2025-08-14T21:46:31.5373633Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5373815Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5373891Z return mod(**inputs) 2025-08-14T21:46:31.5374142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5374206Z outputs = self.roberta( 2025-08-14T21:46:31.5374453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5374520Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5374759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5374830Z layer_outputs = layer_module( 2025-08-14T21:46:31.5375030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5375103Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5375350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5375424Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5375651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5375714Z return func(*args, **kwargs) 2025-08-14T21:46:31.5375953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5376026Z self_outputs = self.self( 2025-08-14T21:46:31.5376246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5376306Z return func(*args, **kwargs) 2025-08-14T21:46:31.5376550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5376616Z self.value(current_states) 2025-08-14T21:46:31.5376619Z 2025-08-14T21:46:31.5376698Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5376793Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5376975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5377041Z return mod(**inputs) 2025-08-14T21:46:31.5377285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5377354Z outputs = self.roberta( 2025-08-14T21:46:31.5377592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5377657Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5377918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5377986Z layer_outputs = layer_module( 2025-08-14T21:46:31.5378187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5378265Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5378516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5378614Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5378833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5378894Z return func(*args, **kwargs) 2025-08-14T21:46:31.5379136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5379197Z self_outputs = self.self( 2025-08-14T21:46:31.5379417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5379500Z return func(*args, **kwargs) 2025-08-14T21:46:31.5379741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5379871Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5379875Z 2025-08-14T21:46:31.5379970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5380151Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5380218Z return mod(**inputs) 2025-08-14T21:46:31.5380464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5380531Z outputs = self.roberta( 2025-08-14T21:46:31.5380772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5380838Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5381084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5381147Z layer_outputs = layer_module( 2025-08-14T21:46:31.5381349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5381428Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5381667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5381748Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5381970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5382031Z return func(*args, **kwargs) 2025-08-14T21:46:31.5382275Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5382391Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5382638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5382713Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5382717Z 2025-08-14T21:46:31.5382810Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5382997Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5383057Z return mod(**inputs) 2025-08-14T21:46:31.5383301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5383369Z outputs = self.roberta( 2025-08-14T21:46:31.5383623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5383700Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5383937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5384013Z layer_outputs = layer_module( 2025-08-14T21:46:31.5384223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5384310Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5384552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5384828Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5385074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5385156Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5385472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5385587Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5385843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5385921Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5385924Z 2025-08-14T21:46:31.5386028Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5386214Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5386276Z return mod(**inputs) 2025-08-14T21:46:31.5386529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5386593Z outputs = self.roberta( 2025-08-14T21:46:31.5386837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5386913Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5387157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5387231Z layer_outputs = layer_module( 2025-08-14T21:46:31.5387435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5387507Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5387757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5387834Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5388085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5388157Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5388435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5388554Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5388798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5388903Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5389109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5389174Z return self.act(input) 2025-08-14T21:46:31.5389178Z 2025-08-14T21:46:31.5389280Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5389491Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5389568Z return mod(**inputs) 2025-08-14T21:46:31.5389824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5389888Z outputs = self.roberta( 2025-08-14T21:46:31.5390167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5390258Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5390508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5390580Z layer_outputs = layer_module( 2025-08-14T21:46:31.5390789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5390861Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5391117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5391220Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5391473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5391544Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5391824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5391954Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5392199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5392281Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5392284Z 2025-08-14T21:46:31.5392380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5392568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5392636Z return mod(**inputs) 2025-08-14T21:46:31.5392885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5392948Z outputs = self.roberta( 2025-08-14T21:46:31.5393200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5393268Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5393521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5393586Z layer_outputs = layer_module( 2025-08-14T21:46:31.5393792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5393876Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5394121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5394206Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5394437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5394501Z return func(*args, **kwargs) 2025-08-14T21:46:31.5394751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5394815Z self_outputs = self.self( 2025-08-14T21:46:31.5395042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5395110Z return func(*args, **kwargs) 2025-08-14T21:46:31.5395369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5395573Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5395577Z 2025-08-14T21:46:31.5395675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5395878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5395962Z return mod(**inputs) 2025-08-14T21:46:31.5396213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5396285Z outputs = self.roberta( 2025-08-14T21:46:31.5396529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5396596Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5396851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5396934Z layer_outputs = layer_module( 2025-08-14T21:46:31.5397139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5397220Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5397465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5397547Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5397771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5397833Z return func(*args, **kwargs) 2025-08-14T21:46:31.5398084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5398148Z self_outputs = self.self( 2025-08-14T21:46:31.5398376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5398447Z return func(*args, **kwargs) 2025-08-14T21:46:31.5398697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5398768Z self.key(current_states) 2025-08-14T21:46:31.5398773Z 2025-08-14T21:46:31.5398867Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5399046Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5399113Z return mod(**inputs) 2025-08-14T21:46:31.5399352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5399420Z outputs = self.roberta( 2025-08-14T21:46:31.5399658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5399723Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5399965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5400029Z layer_outputs = layer_module( 2025-08-14T21:46:31.5400230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5400309Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5400545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5400624Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5400840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5400901Z return func(*args, **kwargs) 2025-08-14T21:46:31.5401162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5401230Z self_outputs = self.self( 2025-08-14T21:46:31.5401449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5401532Z return func(*args, **kwargs) 2025-08-14T21:46:31.5401786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5401859Z self.value(current_states) 2025-08-14T21:46:31.5401863Z 2025-08-14T21:46:31.5401935Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5402029Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5402219Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5402278Z return mod(**inputs) 2025-08-14T21:46:31.5402527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5402604Z outputs = self.roberta( 2025-08-14T21:46:31.5402841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5402914Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5403152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5403217Z layer_outputs = layer_module( 2025-08-14T21:46:31.5403424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5403494Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5403738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5403814Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5404033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5404100Z return func(*args, **kwargs) 2025-08-14T21:46:31.5404337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5404400Z self_outputs = self.self( 2025-08-14T21:46:31.5404625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5404686Z return func(*args, **kwargs) 2025-08-14T21:46:31.5404929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5405049Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5405052Z 2025-08-14T21:46:31.5405144Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5405333Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5405391Z return mod(**inputs) 2025-08-14T21:46:31.5405639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5405700Z outputs = self.roberta( 2025-08-14T21:46:31.5405938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5406012Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5406249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5406314Z layer_outputs = layer_module( 2025-08-14T21:46:31.5406538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5406614Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5406861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5406936Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5407169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5407256Z return func(*args, **kwargs) 2025-08-14T21:46:31.5407500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5407622Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5407864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5407940Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5407961Z 2025-08-14T21:46:31.5408063Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5408244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5408303Z return mod(**inputs) 2025-08-14T21:46:31.5408556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5408618Z outputs = self.roberta( 2025-08-14T21:46:31.5408860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5408924Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5409159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5409229Z layer_outputs = layer_module( 2025-08-14T21:46:31.5409431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5409502Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5409744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5409821Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5410059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5410129Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5410395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5410511Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5410747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5410831Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5410834Z 2025-08-14T21:46:31.5410924Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5411104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5411171Z return mod(**inputs) 2025-08-14T21:46:31.5411412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5411480Z outputs = self.roberta( 2025-08-14T21:46:31.5411716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5411780Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5412021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5412099Z layer_outputs = layer_module( 2025-08-14T21:46:31.5412303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5412381Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5412632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5412719Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5412979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5413050Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5413325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5413431Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5413670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5413797Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5413993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5414064Z return self.act(input) 2025-08-14T21:46:31.5414068Z 2025-08-14T21:46:31.5414164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5414349Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5414415Z return mod(**inputs) 2025-08-14T21:46:31.5414659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5414729Z outputs = self.roberta( 2025-08-14T21:46:31.5414969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5415036Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5415283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5415348Z layer_outputs = layer_module( 2025-08-14T21:46:31.5415551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5415633Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5415870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5415953Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5416187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5416256Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5416534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5416656Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5416902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5416977Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5416982Z 2025-08-14T21:46:31.5417075Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5417264Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5417324Z return mod(**inputs) 2025-08-14T21:46:31.5417566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5417637Z outputs = self.roberta( 2025-08-14T21:46:31.5417890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5417965Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5418202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5418278Z layer_outputs = layer_module( 2025-08-14T21:46:31.5418491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5418576Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5418819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5418894Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5419112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5419183Z return func(*args, **kwargs) 2025-08-14T21:46:31.5419433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5419496Z self_outputs = self.self( 2025-08-14T21:46:31.5419723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5419786Z return func(*args, **kwargs) 2025-08-14T21:46:31.5420033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 324, in forward 2025-08-14T21:46:31.5420222Z self.query(hidden_states).view(bsz, -1, self.num_attention_heads, self.attention_head_size).transpose(1, 2) 2025-08-14T21:46:31.5420225Z 2025-08-14T21:46:31.5420318Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5420503Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5420563Z return mod(**inputs) 2025-08-14T21:46:31.5420815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5420875Z outputs = self.roberta( 2025-08-14T21:46:31.5421113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5421185Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5421422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5421485Z layer_outputs = layer_module( 2025-08-14T21:46:31.5421692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5421761Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5422006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5422082Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5422300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5422369Z return func(*args, **kwargs) 2025-08-14T21:46:31.5422606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5422669Z self_outputs = self.self( 2025-08-14T21:46:31.5422894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5422954Z return func(*args, **kwargs) 2025-08-14T21:46:31.5423197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 352, in forward 2025-08-14T21:46:31.5423261Z self.key(current_states) 2025-08-14T21:46:31.5423264Z 2025-08-14T21:46:31.5423369Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5423558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5423617Z return mod(**inputs) 2025-08-14T21:46:31.5423876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5423939Z outputs = self.roberta( 2025-08-14T21:46:31.5424191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5424263Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5424498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5424560Z layer_outputs = layer_module( 2025-08-14T21:46:31.5424832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5424932Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5425180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5425254Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5425478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5425550Z return func(*args, **kwargs) 2025-08-14T21:46:31.5425791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5425855Z self_outputs = self.self( 2025-08-14T21:46:31.5426082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5426143Z return func(*args, **kwargs) 2025-08-14T21:46:31.5426394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 357, in forward 2025-08-14T21:46:31.5426460Z self.value(current_states) 2025-08-14T21:46:31.5426463Z 2025-08-14T21:46:31.5426536Z cudagraph partition due to non gpu ops 2025-08-14T21:46:31.5426638Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5426823Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5426892Z return mod(**inputs) 2025-08-14T21:46:31.5427134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5427196Z outputs = self.roberta( 2025-08-14T21:46:31.5427443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5427508Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5427747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5427822Z layer_outputs = layer_module( 2025-08-14T21:46:31.5428027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5428105Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5428346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5428422Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5428651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5428712Z return func(*args, **kwargs) 2025-08-14T21:46:31.5428949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 466, in forward 2025-08-14T21:46:31.5429058Z self_outputs = self.self( 2025-08-14T21:46:31.5429281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5429350Z return func(*args, **kwargs) 2025-08-14T21:46:31.5429600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 388, in forward 2025-08-14T21:46:31.5429725Z attn_output = torch.nn.functional.scaled_dot_product_attention( 2025-08-14T21:46:31.5429741Z 2025-08-14T21:46:31.5429845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5430025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5430093Z return mod(**inputs) 2025-08-14T21:46:31.5430335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5430396Z outputs = self.roberta( 2025-08-14T21:46:31.5430641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5430723Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5430965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5431038Z layer_outputs = layer_module( 2025-08-14T21:46:31.5431242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5431319Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5431557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 539, in forward 2025-08-14T21:46:31.5431631Z self_attention_outputs = self.attention( 2025-08-14T21:46:31.5431864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/utils/deprecation.py", line 172, in wrapped_func 2025-08-14T21:46:31.5431927Z return func(*args, **kwargs) 2025-08-14T21:46:31.5432173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 476, in forward 2025-08-14T21:46:31.5432289Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:46:31.5432529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 412, in forward 2025-08-14T21:46:31.5432612Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5432615Z 2025-08-14T21:46:31.5432706Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5432885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5432952Z return mod(**inputs) 2025-08-14T21:46:31.5433196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5433263Z outputs = self.roberta( 2025-08-14T21:46:31.5433503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5433567Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5433815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5433878Z layer_outputs = layer_module( 2025-08-14T21:46:31.5434091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5434162Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5434400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5434483Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5434733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5434806Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5435080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5435210Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5435455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 492, in forward 2025-08-14T21:46:31.5435547Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5435550Z 2025-08-14T21:46:31.5435644Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5435835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5435896Z return mod(**inputs) 2025-08-14T21:46:31.5436144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5436223Z outputs = self.roberta( 2025-08-14T21:46:31.5436461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5436534Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5436772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5436838Z layer_outputs = layer_module( 2025-08-14T21:46:31.5437048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5437119Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5437361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5437437Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5437674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5437754Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5438021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 577, in feed_forward_chunk 2025-08-14T21:46:31.5438130Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:46:31.5438376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 493, in forward 2025-08-14T21:46:31.5438477Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:46:31.5438678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:46:31.5438741Z return self.act(input) 2025-08-14T21:46:31.5438745Z 2025-08-14T21:46:31.5438839Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5439030Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5439089Z return mod(**inputs) 2025-08-14T21:46:31.5439336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1516, in forward 2025-08-14T21:46:31.5439399Z outputs = self.roberta( 2025-08-14T21:46:31.5439637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 890, in forward 2025-08-14T21:46:31.5439711Z encoder_outputs = self.encoder( 2025-08-14T21:46:31.5439950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 631, in forward 2025-08-14T21:46:31.5440015Z layer_outputs = layer_module( 2025-08-14T21:46:31.5440222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:31.5440310Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:31.5440557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 569, in forward 2025-08-14T21:46:31.5440632Z layer_output = apply_chunking_to_forward( 2025-08-14T21:46:31.5440880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:46:31.5440970Z return forward_fn(*input_tensors) 2025-08-14T21:46:31.5441236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 578, in feed_forward_chunk 2025-08-14T21:46:31.5441363Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:46:31.5441602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 506, in forward 2025-08-14T21:46:31.5441676Z hidden_states = self.dense(hidden_states) 2025-08-14T21:46:31.5441680Z 2025-08-14T21:46:31.5441797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5441979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5442038Z return mod(**inputs) 2025-08-14T21:46:31.5442288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1530, in forward 2025-08-14T21:46:31.5442367Z logits = self.qa_outputs(sequence_output) 2025-08-14T21:46:31.5442370Z 2025-08-14T21:46:31.5442472Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5442650Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5442710Z return mod(**inputs) 2025-08-14T21:46:31.5442958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1548, in forward 2025-08-14T21:46:31.5443054Z start_loss = loss_fct(start_logits, start_positions) 2025-08-14T21:46:31.5443058Z 2025-08-14T21:46:31.5443158Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:31.5443335Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:31.5443395Z return mod(**inputs) 2025-08-14T21:46:31.5443645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/roberta/modeling_roberta.py", line 1549, in forward 2025-08-14T21:46:31.5443730Z end_loss = loss_fct(end_logits, end_positions) 2025-08-14T21:46:31.5443734Z 2025-08-14T21:46:38.2340011Z Compilation time (from dynamo_timed): 12.419432872 2025-08-14T21:46:38.2343417Z pass 2025-08-14T21:46:38.2346822Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:38.2351350Z TIMING: _recursive_pre_grad_passes:0.00609 _recursive_joint_graph_passes:0.54143 _recursive_post_grad_passes:0.07911 async_compile.wait:0.00218 code_gen:5.7792 inductor_compile:6.87897 backend_compile:9.72324 gc:0.00073 entire_frame_compile:12.41943 total_wall_time:12.41943 2025-08-14T21:46:38.2353029Z STATS: call_* op count: 303 | FakeTensorMode.__torch_dispatch__:12465 | FakeTensor.__torch_dispatch__:4777 | ProxyTorchDispatchMode.__torch_dispatch__:4566 2025-08-14T21:46:38.2353594Z Dynamo produced 1 graphs covering 303 ops with 0 graph breaks (0 unique) 2025-08-14T21:46:42.3169662Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:46:42.3171092Z from pkg_resources import resource_filename 2025-08-14T21:46:42.8793298Z 2025-08-14T21:46:43.8312843Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:46:43.8315687Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:46:43.8333211Z cpu eval T5ForConditionalGeneration 2025-08-14T21:46:44.8919620Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:45.2310310Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:45.6148064Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:46:54.0239951Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0241753Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0242127Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0242450Z return mod(**inputs) 2025-08-14T21:46:54.0242822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0243191Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0243551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0243974Z layer_outputs = layer_module( 2025-08-14T21:46:54.0244314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0244673Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0245028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0245397Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0245763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0246130Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0246502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 546, in forward 2025-08-14T21:46:54.0246881Z position_bias = position_bias + causal_mask 2025-08-14T21:46:54.0247021Z 2025-08-14T21:46:54.0247127Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0247456Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0247761Z return mod(**inputs) 2025-08-14T21:46:54.0248098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0248452Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0248801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0249151Z layer_outputs = layer_module( 2025-08-14T21:46:54.0249480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0249828Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0250244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0250593Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0250939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0251305Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0251670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0252019Z return self.weight * hidden_states 2025-08-14T21:46:54.0252144Z 2025-08-14T21:46:54.0252242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0252572Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0252874Z return mod(**inputs) 2025-08-14T21:46:54.0253251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0253594Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0253929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0254268Z layer_outputs = layer_module( 2025-08-14T21:46:54.0254630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0254995Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0255341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0255691Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0256031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0256384Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0256733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0257100Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0257225Z 2025-08-14T21:46:54.0257323Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0257662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0257975Z return mod(**inputs) 2025-08-14T21:46:54.0258298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0258640Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0258976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0259314Z layer_outputs = layer_module( 2025-08-14T21:46:54.0259627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0259965Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0260302Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0260639Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0260984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0261334Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0261701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0262034Z key_states = self.k(current_states) 2025-08-14T21:46:54.0262160Z 2025-08-14T21:46:54.0262257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0262583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0262883Z return mod(**inputs) 2025-08-14T21:46:54.0263196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0263541Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0263878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0264208Z layer_outputs = layer_module( 2025-08-14T21:46:54.0264526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0264988Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0265335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0265678Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0266047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0266406Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0266751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0267155Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0267337Z 2025-08-14T21:46:54.0267455Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0267857Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0268170Z return mod(**inputs) 2025-08-14T21:46:54.0268508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0268870Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0269221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0269571Z layer_outputs = layer_module( 2025-08-14T21:46:54.0269921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0270268Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0270613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0270971Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0271323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0271676Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0272019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0272441Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0272639Z 2025-08-14T21:46:54.0272746Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0273079Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0273386Z return mod(**inputs) 2025-08-14T21:46:54.0273713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0274068Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0274404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0274753Z layer_outputs = layer_module( 2025-08-14T21:46:54.0275079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0275420Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0275771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0276125Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0276479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0276825Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0277182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0277535Z value_states = self.v(current_states) 2025-08-14T21:46:54.0277662Z 2025-08-14T21:46:54.0277769Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0278105Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0278413Z return mod(**inputs) 2025-08-14T21:46:54.0278739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0279084Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0279444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0279796Z layer_outputs = layer_module( 2025-08-14T21:46:54.0280124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0280478Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0280830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0281211Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0281569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0281923Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0282274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0282655Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0282826Z 2025-08-14T21:46:54.0282923Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0283252Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0283578Z return mod(**inputs) 2025-08-14T21:46:54.0283900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0284249Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0284864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0285212Z layer_outputs = layer_module( 2025-08-14T21:46:54.0285542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0285883Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0286230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0286589Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0286942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0287312Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0287652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0288027Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0288177Z 2025-08-14T21:46:54.0288284Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0288607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0288902Z return mod(**inputs) 2025-08-14T21:46:54.0289220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0289567Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0289891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0290238Z layer_outputs = layer_module( 2025-08-14T21:46:54.0290556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0290889Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0291220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0291561Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0291898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0292238Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0292629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0293002Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0293150Z 2025-08-14T21:46:54.0293251Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0293599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0293928Z return mod(**inputs) 2025-08-14T21:46:54.0294245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0294580Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0294917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0295259Z layer_outputs = layer_module( 2025-08-14T21:46:54.0295579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0295982Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0296332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0296691Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0297032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0297385Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0297732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0298080Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0298203Z 2025-08-14T21:46:54.0298300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0298639Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0298949Z return mod(**inputs) 2025-08-14T21:46:54.0299277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0299618Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0299959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0300302Z layer_outputs = layer_module( 2025-08-14T21:46:54.0300613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0300951Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0301297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0301643Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0301986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0302342Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0302696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0303046Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0303180Z 2025-08-14T21:46:54.0303279Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0303625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0303937Z return mod(**inputs) 2025-08-14T21:46:54.0304260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0304647Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0305065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0305485Z layer_outputs = layer_module( 2025-08-14T21:46:54.0305812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0306155Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0306524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0306874Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0307243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0307598Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0307947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0308327Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0308458Z 2025-08-14T21:46:54.0308558Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0308916Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0309219Z return mod(**inputs) 2025-08-14T21:46:54.0309542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0309894Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0310237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0310578Z layer_outputs = layer_module( 2025-08-14T21:46:54.0310901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0311240Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0311584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0311949Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0312304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0312659Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0313000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0313352Z key_states = self.k(current_states) 2025-08-14T21:46:54.0313475Z 2025-08-14T21:46:54.0313581Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0313920Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0314221Z return mod(**inputs) 2025-08-14T21:46:54.0314545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0314894Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0315232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0315588Z layer_outputs = layer_module( 2025-08-14T21:46:54.0315916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0316259Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0316596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0316953Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0317300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0317648Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0317997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0318407Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0318584Z 2025-08-14T21:46:54.0318686Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0319026Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0319340Z return mod(**inputs) 2025-08-14T21:46:54.0319698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0320065Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0320401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0320748Z layer_outputs = layer_module( 2025-08-14T21:46:54.0321077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0321422Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0321764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0322122Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0322462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0322799Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0323138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0323550Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0323742Z 2025-08-14T21:46:54.0323845Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0324169Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0324468Z return mod(**inputs) 2025-08-14T21:46:54.0324788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0325137Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0325490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0325847Z layer_outputs = layer_module( 2025-08-14T21:46:54.0326189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0326530Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0326889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0327233Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0327565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0327912Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0328255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0328663Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0328851Z 2025-08-14T21:46:54.0328946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0329277Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0329579Z return mod(**inputs) 2025-08-14T21:46:54.0329898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0330230Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0330563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0330899Z layer_outputs = layer_module( 2025-08-14T21:46:54.0331225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0331566Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0331918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0332282Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0332621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0332987Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0333328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0333728Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0333924Z 2025-08-14T21:46:54.0334018Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0334348Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0334664Z return mod(**inputs) 2025-08-14T21:46:54.0334974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0335315Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0335652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0336000Z layer_outputs = layer_module( 2025-08-14T21:46:54.0336309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0336641Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0336979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0337315Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0337660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0338007Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0338352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0338691Z value_states = self.v(current_states) 2025-08-14T21:46:54.0338824Z 2025-08-14T21:46:54.0338919Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0339251Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0339543Z return mod(**inputs) 2025-08-14T21:46:54.0339865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0340218Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0340574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0340914Z layer_outputs = layer_module( 2025-08-14T21:46:54.0341232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0341565Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0341900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0342247Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0342586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0342928Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0343265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0343652Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0343814Z 2025-08-14T21:46:54.0343914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0344251Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0344554Z return mod(**inputs) 2025-08-14T21:46:54.0344982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0345363Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0345700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0346061Z layer_outputs = layer_module( 2025-08-14T21:46:54.0346422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0346768Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0347168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0347537Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0347919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0348278Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0348630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0349011Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0349163Z 2025-08-14T21:46:54.0349270Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0349595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0349901Z return mod(**inputs) 2025-08-14T21:46:54.0350230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0350585Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0350918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0351264Z layer_outputs = layer_module( 2025-08-14T21:46:54.0351590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0351930Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0352290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0352645Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0352996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0353342Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0353695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0354081Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0354234Z 2025-08-14T21:46:54.0354339Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0354676Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0354981Z return mod(**inputs) 2025-08-14T21:46:54.0355308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0355651Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0355998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0356347Z layer_outputs = layer_module( 2025-08-14T21:46:54.0356690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0357031Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0357386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0357746Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0358109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0358483Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0358834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0359185Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0359307Z 2025-08-14T21:46:54.0359381Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0359606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0359944Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0360264Z return mod(**inputs) 2025-08-14T21:46:54.0360580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0360920Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0361252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0361588Z layer_outputs = layer_module( 2025-08-14T21:46:54.0361902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0362230Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0362567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0362914Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0363264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0363624Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0363971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0364315Z return self.weight * hidden_states 2025-08-14T21:46:54.0364444Z 2025-08-14T21:46:54.0364540Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0364869Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0365160Z return mod(**inputs) 2025-08-14T21:46:54.0365482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0365833Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0366170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0366524Z layer_outputs = layer_module( 2025-08-14T21:46:54.0366849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0367201Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0367537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0367896Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0368247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0368622Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0368989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0369325Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0369448Z 2025-08-14T21:46:54.0369583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0369910Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0370210Z return mod(**inputs) 2025-08-14T21:46:54.0370540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0370882Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0371224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0371562Z layer_outputs = layer_module( 2025-08-14T21:46:54.0371879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0372206Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0372547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0372921Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0373269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0373641Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0374017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0374363Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0374488Z 2025-08-14T21:46:54.0374592Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0374911Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0375209Z return mod(**inputs) 2025-08-14T21:46:54.0375524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0375859Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0376193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0376526Z layer_outputs = layer_module( 2025-08-14T21:46:54.0376843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0377169Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0377512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0377864Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0378208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0378584Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0378959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0379304Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0379428Z 2025-08-14T21:46:54.0379502Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0380235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0380568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0380873Z return mod(**inputs) 2025-08-14T21:46:54.0381185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0381527Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0381863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0382195Z layer_outputs = layer_module( 2025-08-14T21:46:54.0382533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0382875Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0383218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0383557Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0383921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0384306Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0384896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0385256Z return self.weight * hidden_states 2025-08-14T21:46:54.0385389Z 2025-08-14T21:46:54.0385487Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0385831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0386179Z return mod(**inputs) 2025-08-14T21:46:54.0386510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0386861Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0387214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0387555Z layer_outputs = layer_module( 2025-08-14T21:46:54.0387883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0388226Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0388570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0388926Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0389285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0389646Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0389999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0390351Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0390476Z 2025-08-14T21:46:54.0390582Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0390915Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0391221Z return mod(**inputs) 2025-08-14T21:46:54.0391545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0391897Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0392233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0392580Z layer_outputs = layer_module( 2025-08-14T21:46:54.0392906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0393238Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0393586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0393939Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0394291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0394640Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0394993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0395346Z key_states = self.k(current_states) 2025-08-14T21:46:54.0395469Z 2025-08-14T21:46:54.0395602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0395940Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0396249Z return mod(**inputs) 2025-08-14T21:46:54.0396578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0396951Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0397303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0397674Z layer_outputs = layer_module( 2025-08-14T21:46:54.0398000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0398337Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0398705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0399064Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0399438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0399786Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0400131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0400523Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0400692Z 2025-08-14T21:46:54.0400788Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0401117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0401416Z return mod(**inputs) 2025-08-14T21:46:54.0401728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0402072Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0402407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0402747Z layer_outputs = layer_module( 2025-08-14T21:46:54.0403059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0403394Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0403738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0404085Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0404419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0404764Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0405109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0405513Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0405715Z 2025-08-14T21:46:54.0405812Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0406141Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0406438Z return mod(**inputs) 2025-08-14T21:46:54.0406750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0407093Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0407426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0407755Z layer_outputs = layer_module( 2025-08-14T21:46:54.0408070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0408416Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0408763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0409104Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0409464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0409815Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0410173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0410574Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0410771Z 2025-08-14T21:46:54.0410865Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0411193Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0411483Z return mod(**inputs) 2025-08-14T21:46:54.0411803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0412176Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0412514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0412849Z layer_outputs = layer_module( 2025-08-14T21:46:54.0413168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0413506Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0413840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0414188Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0414534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0414885Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0415223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0415633Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0415828Z 2025-08-14T21:46:54.0415925Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0416258Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0416553Z return mod(**inputs) 2025-08-14T21:46:54.0416871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0417212Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0417539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0417880Z layer_outputs = layer_module( 2025-08-14T21:46:54.0418199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0418533Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0418866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0419216Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0419562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0419903Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0420247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0420591Z value_states = self.v(current_states) 2025-08-14T21:46:54.0420713Z 2025-08-14T21:46:54.0420814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0421152Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0421455Z return mod(**inputs) 2025-08-14T21:46:54.0421778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0422125Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0422471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0422826Z layer_outputs = layer_module( 2025-08-14T21:46:54.0423144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0423469Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0423807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0424156Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0424494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0424921Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0425282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0425668Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0425826Z 2025-08-14T21:46:54.0425924Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0426267Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0426579Z return mod(**inputs) 2025-08-14T21:46:54.0426896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0427232Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0427578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0427932Z layer_outputs = layer_module( 2025-08-14T21:46:54.0428263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0428599Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0428955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0429315Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0429660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0430019Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0430374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0430757Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0430910Z 2025-08-14T21:46:54.0431006Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0431340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0431645Z return mod(**inputs) 2025-08-14T21:46:54.0431967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0432324Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0432670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0433018Z layer_outputs = layer_module( 2025-08-14T21:46:54.0433336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0433679Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0434051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0434414Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0434757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0435113Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0435482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0435871Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0436029Z 2025-08-14T21:46:54.0436128Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0436463Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0436768Z return mod(**inputs) 2025-08-14T21:46:54.0437087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0437457Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0437797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0438139Z layer_outputs = layer_module( 2025-08-14T21:46:54.0438466Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0438821Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0439159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0439496Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0439838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0440179Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0440514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0440859Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0440983Z 2025-08-14T21:46:54.0441078Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0441406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0441701Z return mod(**inputs) 2025-08-14T21:46:54.0442022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0442361Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0442696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0443027Z layer_outputs = layer_module( 2025-08-14T21:46:54.0443343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0443678Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0444013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0444367Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0444720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0445083Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0445431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0445772Z return self.weight * hidden_states 2025-08-14T21:46:54.0445893Z 2025-08-14T21:46:54.0445996Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0446318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0446631Z return mod(**inputs) 2025-08-14T21:46:54.0446954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0447298Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0447653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0447995Z layer_outputs = layer_module( 2025-08-14T21:46:54.0448329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0448660Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0448993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0449346Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0449696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0450088Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0450465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0450810Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0450936Z 2025-08-14T21:46:54.0451041Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0451369Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0451670Z return mod(**inputs) 2025-08-14T21:46:54.0451984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0452322Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0452663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0453003Z layer_outputs = layer_module( 2025-08-14T21:46:54.0453323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0453650Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0453994Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0454350Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0454697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0455075Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0455450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0455792Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0455915Z 2025-08-14T21:46:54.0456011Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0456342Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0456642Z return mod(**inputs) 2025-08-14T21:46:54.0456962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0457300Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0457639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0457980Z layer_outputs = layer_module( 2025-08-14T21:46:54.0458291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0458624Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0458966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0459334Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0459685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0460066Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0460460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0460831Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0460954Z 2025-08-14T21:46:54.0461029Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0461260Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0461593Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0461886Z return mod(**inputs) 2025-08-14T21:46:54.0462203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0462546Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0462900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0463234Z layer_outputs = layer_module( 2025-08-14T21:46:54.0463553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0463885Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0464219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0464597Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0465017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0465408Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0465784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0466168Z return self.weight * hidden_states 2025-08-14T21:46:54.0466290Z 2025-08-14T21:46:54.0466394Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0466722Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0467025Z return mod(**inputs) 2025-08-14T21:46:54.0467360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0467720Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0468063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0468414Z layer_outputs = layer_module( 2025-08-14T21:46:54.0468746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0469085Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0469437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0469792Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0470151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0470507Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0470865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0471217Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0471343Z 2025-08-14T21:46:54.0471448Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0471783Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0472089Z return mod(**inputs) 2025-08-14T21:46:54.0472439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0472790Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0473137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0473502Z layer_outputs = layer_module( 2025-08-14T21:46:54.0473836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0474191Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0474542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0474899Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0475248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0475606Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0475979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0476331Z key_states = self.k(current_states) 2025-08-14T21:46:54.0476456Z 2025-08-14T21:46:54.0476556Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0476898Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0477205Z return mod(**inputs) 2025-08-14T21:46:54.0490850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0491249Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0491604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0491972Z layer_outputs = layer_module( 2025-08-14T21:46:54.0492324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0492668Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0493033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0493399Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0493754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0494111Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0494468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0494865Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0495039Z 2025-08-14T21:46:54.0495151Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0495493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0495804Z return mod(**inputs) 2025-08-14T21:46:54.0496137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0496480Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0496823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0497173Z layer_outputs = layer_module( 2025-08-14T21:46:54.0497497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0497833Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0498180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0498534Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0499026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0499388Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0499736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0500198Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0500427Z 2025-08-14T21:46:54.0500527Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0500867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0501173Z return mod(**inputs) 2025-08-14T21:46:54.0501497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0501840Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0502180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0502575Z layer_outputs = layer_module( 2025-08-14T21:46:54.0502890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0503230Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0503575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0503926Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0504261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0504610Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0505037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0505464Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0505671Z 2025-08-14T21:46:54.0505772Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0506122Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0506441Z return mod(**inputs) 2025-08-14T21:46:54.0506760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0507109Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0507457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0507811Z layer_outputs = layer_module( 2025-08-14T21:46:54.0508132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0508477Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0508833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0509184Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0509542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0509902Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0510261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0510677Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0510881Z 2025-08-14T21:46:54.0510979Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0511323Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0511637Z return mod(**inputs) 2025-08-14T21:46:54.0511979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0512339Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0512688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0513028Z layer_outputs = layer_module( 2025-08-14T21:46:54.0513373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0513732Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0514085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0514432Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0514782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0515142Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0515511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0515858Z value_states = self.v(current_states) 2025-08-14T21:46:54.0515994Z 2025-08-14T21:46:54.0516092Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0516434Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0516735Z return mod(**inputs) 2025-08-14T21:46:54.0517061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0517415Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0517762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0518099Z layer_outputs = layer_module( 2025-08-14T21:46:54.0518425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0518769Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0519117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0519464Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0519817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0520169Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0520511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0520931Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0521088Z 2025-08-14T21:46:54.0521184Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0521516Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0521810Z return mod(**inputs) 2025-08-14T21:46:54.0522129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0522470Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0522799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0523140Z layer_outputs = layer_module( 2025-08-14T21:46:54.0523457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0523786Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0524119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0524460Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0524818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0525167Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0525510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0525965Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0526115Z 2025-08-14T21:46:54.0526217Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0526554Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0526854Z return mod(**inputs) 2025-08-14T21:46:54.0527170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0527507Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0527836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0528198Z layer_outputs = layer_module( 2025-08-14T21:46:54.0528515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0528843Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0529189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0529537Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0529879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0530217Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0530562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0530937Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0531087Z 2025-08-14T21:46:54.0531191Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0531518Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0531817Z return mod(**inputs) 2025-08-14T21:46:54.0532138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0532475Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0532816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0533152Z layer_outputs = layer_module( 2025-08-14T21:46:54.0533468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0533793Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0534137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0534484Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0534819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0535166Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0535510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0535853Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0535972Z 2025-08-14T21:46:54.0536048Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0536273Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0536607Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0536900Z return mod(**inputs) 2025-08-14T21:46:54.0537238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0537583Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0537917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0538246Z layer_outputs = layer_module( 2025-08-14T21:46:54.0538579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0538931Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0539270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0539632Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0539992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0540355Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0540714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0541074Z return self.weight * hidden_states 2025-08-14T21:46:54.0541194Z 2025-08-14T21:46:54.0541300Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0541622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0541921Z return mod(**inputs) 2025-08-14T21:46:54.0542238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0542574Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0542896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0543230Z layer_outputs = layer_module( 2025-08-14T21:46:54.0543546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0543868Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0544207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0544561Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0544986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0545369Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0545754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0546110Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0546238Z 2025-08-14T21:46:54.0546336Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0546674Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0546991Z return mod(**inputs) 2025-08-14T21:46:54.0547312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0547650Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0547987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0548323Z layer_outputs = layer_module( 2025-08-14T21:46:54.0548633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0548962Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0549301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0549655Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0550021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0550400Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0550768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0551108Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0551245Z 2025-08-14T21:46:54.0551339Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0551689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0551981Z return mod(**inputs) 2025-08-14T21:46:54.0552286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0552622Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0552953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0553285Z layer_outputs = layer_module( 2025-08-14T21:46:54.0553613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0553944Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0554287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0554634Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0554988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0555367Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0555740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0556078Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0556209Z 2025-08-14T21:46:54.0556283Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0556503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0556831Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0557125Z return mod(**inputs) 2025-08-14T21:46:54.0557448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0557789Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0558120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0558459Z layer_outputs = layer_module( 2025-08-14T21:46:54.0558779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0559117Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0559495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0559841Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0560181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0560544Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0560909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0561251Z return self.weight * hidden_states 2025-08-14T21:46:54.0561369Z 2025-08-14T21:46:54.0561471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0561793Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0562091Z return mod(**inputs) 2025-08-14T21:46:54.0562407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0562759Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0563094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0563428Z layer_outputs = layer_module( 2025-08-14T21:46:54.0563755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0564078Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0564432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0564775Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0565147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0565496Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0565847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0566216Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0566351Z 2025-08-14T21:46:54.0566445Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0566770Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0567063Z return mod(**inputs) 2025-08-14T21:46:54.0567377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0567714Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0568048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0568383Z layer_outputs = layer_module( 2025-08-14T21:46:54.0568687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0569016Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0569357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0569698Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0570035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0570380Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0570723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0571059Z key_states = self.k(current_states) 2025-08-14T21:46:54.0571176Z 2025-08-14T21:46:54.0571269Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0571596Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0571896Z return mod(**inputs) 2025-08-14T21:46:54.0572206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0572548Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0572880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0573215Z layer_outputs = layer_module( 2025-08-14T21:46:54.0573522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0573852Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0574191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0574524Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0574861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0575217Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0575566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0575952Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0576126Z 2025-08-14T21:46:54.0576232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0576564Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0576881Z return mod(**inputs) 2025-08-14T21:46:54.0577191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0577532Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0577862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0578191Z layer_outputs = layer_module( 2025-08-14T21:46:54.0578508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0578858Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0579202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0579552Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0579897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0580251Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0580592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0581008Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0581212Z 2025-08-14T21:46:54.0581309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0581646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0581946Z return mod(**inputs) 2025-08-14T21:46:54.0582268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0582615Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0582951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0583297Z layer_outputs = layer_module( 2025-08-14T21:46:54.0583620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0583956Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0584294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0584821Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0585193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0585557Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0585953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0586364Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0586557Z 2025-08-14T21:46:54.0586658Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0586985Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0587298Z return mod(**inputs) 2025-08-14T21:46:54.0587635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0588000Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0588388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0588752Z layer_outputs = layer_module( 2025-08-14T21:46:54.0589088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0589456Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0589818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0590208Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0590575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0590934Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0591295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0591727Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0591948Z 2025-08-14T21:46:54.0592053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0592393Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0592704Z return mod(**inputs) 2025-08-14T21:46:54.0593035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0593387Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0593738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0594090Z layer_outputs = layer_module( 2025-08-14T21:46:54.0594419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0594761Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0595118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0595476Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0595823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0596183Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0596533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0596870Z value_states = self.v(current_states) 2025-08-14T21:46:54.0596990Z 2025-08-14T21:46:54.0597082Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0597408Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0597699Z return mod(**inputs) 2025-08-14T21:46:54.0598010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0598339Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0598669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0599001Z layer_outputs = layer_module( 2025-08-14T21:46:54.0599306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0599632Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0599969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0600304Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0600632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0600970Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0601322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0601426Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0601430Z 2025-08-14T21:46:54.0601534Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0601740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0601818Z return mod(**inputs) 2025-08-14T21:46:54.0602039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0602105Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0602319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0602391Z layer_outputs = layer_module( 2025-08-14T21:46:54.0602592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0602687Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0602903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0602977Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0603197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0603274Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0603487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0603591Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0603595Z 2025-08-14T21:46:54.0603693Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0603880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0603938Z return mod(**inputs) 2025-08-14T21:46:54.0604150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0604219Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0604433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0604499Z layer_outputs = layer_module( 2025-08-14T21:46:54.0604699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0604767Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0604982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0605051Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0605262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0605347Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0605557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0605661Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0605664Z 2025-08-14T21:46:54.0605758Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0605942Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0606007Z return mod(**inputs) 2025-08-14T21:46:54.0606227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0606293Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0606531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0606599Z layer_outputs = layer_module( 2025-08-14T21:46:54.0606807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0606879Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0607107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0607202Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0607417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0607500Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0607713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0607787Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0607790Z 2025-08-14T21:46:54.0607893Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0608090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0608147Z return mod(**inputs) 2025-08-14T21:46:54.0608365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0608427Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0608646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0608709Z layer_outputs = layer_module( 2025-08-14T21:46:54.0608905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0608981Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0609189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0609262Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0609477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 609, in forward 2025-08-14T21:46:54.0609594Z hidden_states = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:46:54.0609597Z 2025-08-14T21:46:54.0609676Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0609768Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0609947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0610009Z return mod(**inputs) 2025-08-14T21:46:54.0610221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0610291Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0610507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0610570Z layer_outputs = layer_module( 2025-08-14T21:46:54.0610775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0610845Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0611057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0611146Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0611355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0611443Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0611651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0611717Z return self.weight * hidden_states 2025-08-14T21:46:54.0611720Z 2025-08-14T21:46:54.0611831Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0612013Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0612074Z return mod(**inputs) 2025-08-14T21:46:54.0612300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0612363Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0612599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0612658Z layer_outputs = layer_module( 2025-08-14T21:46:54.0612860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0612936Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0613147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0613255Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0613470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0613574Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0613796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0613868Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0613872Z 2025-08-14T21:46:54.0613968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0614150Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0614209Z return mod(**inputs) 2025-08-14T21:46:54.0614431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0614499Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0614718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0614786Z layer_outputs = layer_module( 2025-08-14T21:46:54.0614989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0615067Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0615281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0615361Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0615577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0615684Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0615896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0615971Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0615974Z 2025-08-14T21:46:54.0616062Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0616247Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0616305Z return mod(**inputs) 2025-08-14T21:46:54.0616523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0616596Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0616813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0616876Z layer_outputs = layer_module( 2025-08-14T21:46:54.0617086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0617169Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0617391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0617471Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0617694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0617818Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0618026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0618102Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0618105Z 2025-08-14T21:46:54.0618176Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0618267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0618454Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0618549Z return mod(**inputs) 2025-08-14T21:46:54.0618765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0618837Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0619051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0619121Z layer_outputs = layer_module( 2025-08-14T21:46:54.0619322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0619392Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0619611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0619683Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0619893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0619994Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0620205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0620282Z return self.weight * hidden_states 2025-08-14T21:46:54.0620285Z 2025-08-14T21:46:54.0620376Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0620560Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0620626Z return mod(**inputs) 2025-08-14T21:46:54.0620840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0620911Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0621128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0621195Z layer_outputs = layer_module( 2025-08-14T21:46:54.0621402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0621471Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0621686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0621764Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0621978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0622058Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0622268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0622337Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0622340Z 2025-08-14T21:46:54.0622460Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0622641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0622703Z return mod(**inputs) 2025-08-14T21:46:54.0622930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0622997Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0623233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0623297Z layer_outputs = layer_module( 2025-08-14T21:46:54.0623497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0623570Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0623782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0623858Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0624093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0624165Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0624382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0624450Z key_states = self.k(current_states) 2025-08-14T21:46:54.0624454Z 2025-08-14T21:46:54.0624547Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0624739Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0624864Z return mod(**inputs) 2025-08-14T21:46:54.0625102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0625169Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0625399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0625474Z layer_outputs = layer_module( 2025-08-14T21:46:54.0625680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0625770Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0625985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0626056Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0626280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0626353Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0626569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0626696Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0626700Z 2025-08-14T21:46:54.0626792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0626980Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0627040Z return mod(**inputs) 2025-08-14T21:46:54.0627257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0627330Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0627545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0627607Z layer_outputs = layer_module( 2025-08-14T21:46:54.0627817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0627907Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0628130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0628202Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0628413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0628510Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0628739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0628888Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0628892Z 2025-08-14T21:46:54.0628984Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0629164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0629233Z return mod(**inputs) 2025-08-14T21:46:54.0629450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0629531Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0629759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0629824Z layer_outputs = layer_module( 2025-08-14T21:46:54.0630033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0630106Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0630321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0630399Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0630614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0630694Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0630908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0631046Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0631050Z 2025-08-14T21:46:54.0631148Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0631328Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0631387Z return mod(**inputs) 2025-08-14T21:46:54.0631612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0631677Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0631901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0631965Z layer_outputs = layer_module( 2025-08-14T21:46:54.0632168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0632246Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0632461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0632534Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0632756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0632830Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0633047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0633184Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0633187Z 2025-08-14T21:46:54.0633293Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0633484Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0633544Z return mod(**inputs) 2025-08-14T21:46:54.0633765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0633843Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0634060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0634147Z layer_outputs = layer_module( 2025-08-14T21:46:54.0634350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0634421Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0634648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0634721Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0634957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0635029Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0635239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0635316Z value_states = self.v(current_states) 2025-08-14T21:46:54.0635321Z 2025-08-14T21:46:54.0635412Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0635594Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0635653Z return mod(**inputs) 2025-08-14T21:46:54.0635865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0635934Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0636149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0636213Z layer_outputs = layer_module( 2025-08-14T21:46:54.0636417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0636487Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0636703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0636773Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0636981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0637062Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0637270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0637367Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0637377Z 2025-08-14T21:46:54.0637468Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0637645Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0637710Z return mod(**inputs) 2025-08-14T21:46:54.0637923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0637990Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0638210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0638273Z layer_outputs = layer_module( 2025-08-14T21:46:54.0638478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0638549Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0638774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0638855Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0639067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0639153Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0639372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0639486Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0639489Z 2025-08-14T21:46:54.0639588Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0639767Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0639824Z return mod(**inputs) 2025-08-14T21:46:54.0640049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0640170Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0640382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0640450Z layer_outputs = layer_module( 2025-08-14T21:46:54.0640651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0640732Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0640944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0641014Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0641232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0641305Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0641525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0641625Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0641628Z 2025-08-14T21:46:54.0641719Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0641909Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0641971Z return mod(**inputs) 2025-08-14T21:46:54.0642184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0642254Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0642470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0642541Z layer_outputs = layer_module( 2025-08-14T21:46:54.0642743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0642817Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0643035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0643105Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0643323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0643396Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0643608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0643682Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0643685Z 2025-08-14T21:46:54.0643758Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0643848Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0644048Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0644110Z return mod(**inputs) 2025-08-14T21:46:54.0644331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0644396Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0644623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0644719Z layer_outputs = layer_module( 2025-08-14T21:46:54.0644926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0644996Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0645224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0645307Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0645531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0645636Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0645858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0645938Z return self.weight * hidden_states 2025-08-14T21:46:54.0645942Z 2025-08-14T21:46:54.0646052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0646240Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0646300Z return mod(**inputs) 2025-08-14T21:46:54.0646515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0646589Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0646810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0646875Z layer_outputs = layer_module( 2025-08-14T21:46:54.0647085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0647156Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0647378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0647462Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0647676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0647791Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0648005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0648076Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0648086Z 2025-08-14T21:46:54.0648183Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0648367Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0648432Z return mod(**inputs) 2025-08-14T21:46:54.0648651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0648718Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0648943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0649008Z layer_outputs = layer_module( 2025-08-14T21:46:54.0649216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0649288Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0650142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0650234Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0650449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0650556Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0650791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0650879Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0650882Z 2025-08-14T21:46:54.0650982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0651164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0651223Z return mod(**inputs) 2025-08-14T21:46:54.0651448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0651530Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0651745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0651817Z layer_outputs = layer_module( 2025-08-14T21:46:54.0652018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0652097Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0652314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0652394Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0652611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0652715Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0652933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0653004Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0653007Z 2025-08-14T21:46:54.0653079Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0653177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0653360Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0653420Z return mod(**inputs) 2025-08-14T21:46:54.0653642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0653706Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0653926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0653990Z layer_outputs = layer_module( 2025-08-14T21:46:54.0654193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0654272Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0654483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0654554Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0654773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0654870Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0655088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0655155Z return self.weight * hidden_states 2025-08-14T21:46:54.0655159Z 2025-08-14T21:46:54.0655252Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0655456Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0655517Z return mod(**inputs) 2025-08-14T21:46:54.0655742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0655808Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0656036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0656123Z layer_outputs = layer_module( 2025-08-14T21:46:54.0656329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0656400Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0656625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0656697Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0656923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0657016Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0657227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0657305Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0657309Z 2025-08-14T21:46:54.0657399Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0657582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0657637Z return mod(**inputs) 2025-08-14T21:46:54.0657852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0657923Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0658138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0658202Z layer_outputs = layer_module( 2025-08-14T21:46:54.0658408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0658479Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0658697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0658769Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0658979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0659061Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0659271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0659339Z key_states = self.k(current_states) 2025-08-14T21:46:54.0659349Z 2025-08-14T21:46:54.0659443Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0659622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0659689Z return mod(**inputs) 2025-08-14T21:46:54.0659902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0659967Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0660189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0660254Z layer_outputs = layer_module( 2025-08-14T21:46:54.0660460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0660531Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0660742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0660834Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0661053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0661126Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0661363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0661507Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0661510Z 2025-08-14T21:46:54.0661605Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0661784Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0661844Z return mod(**inputs) 2025-08-14T21:46:54.0662063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0662129Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0662360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0662429Z layer_outputs = layer_module( 2025-08-14T21:46:54.0662630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0662705Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0662918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0662987Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0663207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0663280Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0663499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0663638Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0663641Z 2025-08-14T21:46:54.0663732Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0663919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0663978Z return mod(**inputs) 2025-08-14T21:46:54.0664194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0664267Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0664480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0664547Z layer_outputs = layer_module( 2025-08-14T21:46:54.0664745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0664918Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0665149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0665221Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0665458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0665535Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0665753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0665901Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0665905Z 2025-08-14T21:46:54.0665999Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0666185Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0666271Z return mod(**inputs) 2025-08-14T21:46:54.0666498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0666584Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0666800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0666880Z layer_outputs = layer_module( 2025-08-14T21:46:54.0669479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0669548Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0669769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0669846Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0670055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0670131Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0670369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0670508Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0670512Z 2025-08-14T21:46:54.0670616Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0670829Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0670888Z return mod(**inputs) 2025-08-14T21:46:54.0671112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0671177Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0671392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0671464Z layer_outputs = layer_module( 2025-08-14T21:46:54.0671668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0671746Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0671959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0672030Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0672244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0672316Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0672526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0672603Z value_states = self.v(current_states) 2025-08-14T21:46:54.0672607Z 2025-08-14T21:46:54.0672699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0672888Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0672948Z return mod(**inputs) 2025-08-14T21:46:54.0673161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0673234Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0673447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0673513Z layer_outputs = layer_module( 2025-08-14T21:46:54.0673719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0673789Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0674008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0674097Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0674309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0674394Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0674626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0674736Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0674795Z 2025-08-14T21:46:54.0674894Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0675084Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0675151Z return mod(**inputs) 2025-08-14T21:46:54.0675379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0675447Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0675703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0675794Z layer_outputs = layer_module( 2025-08-14T21:46:54.0676034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0676115Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0676370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0676459Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0676712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0676793Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0677044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0677147Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0677152Z 2025-08-14T21:46:54.0677256Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0677497Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0677560Z return mod(**inputs) 2025-08-14T21:46:54.0677819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0677890Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0678149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0678221Z layer_outputs = layer_module( 2025-08-14T21:46:54.0678449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0678535Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0678787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0678868Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0679122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0679205Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0679463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0679576Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0679579Z 2025-08-14T21:46:54.0679684Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0679897Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0679963Z return mod(**inputs) 2025-08-14T21:46:54.0680241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0680317Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0680626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0680704Z layer_outputs = layer_module( 2025-08-14T21:46:54.0680949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0681065Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0681318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0681395Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0681651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0681733Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0681986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0682089Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0682094Z 2025-08-14T21:46:54.0682199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0682406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0682479Z return mod(**inputs) 2025-08-14T21:46:54.0682736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0682813Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0683065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0683136Z layer_outputs = layer_module( 2025-08-14T21:46:54.0683375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0683457Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0683715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0683794Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0684047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 609, in forward 2025-08-14T21:46:54.0684191Z hidden_states = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:46:54.0684195Z 2025-08-14T21:46:54.0684279Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0684383Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0684741Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0684818Z return mod(**inputs) 2025-08-14T21:46:54.0685080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0685158Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0685412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0685489Z layer_outputs = layer_module( 2025-08-14T21:46:54.0685717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0685797Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0686060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0686155Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0686416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0686519Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0686827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0686921Z return self.weight * hidden_states 2025-08-14T21:46:54.0686925Z 2025-08-14T21:46:54.0687033Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0687273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0687372Z return mod(**inputs) 2025-08-14T21:46:54.0687631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0687713Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0687962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0688034Z layer_outputs = layer_module( 2025-08-14T21:46:54.0688273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0688377Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0688612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0688696Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0688917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0689036Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0689253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0689326Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0689335Z 2025-08-14T21:46:54.0689429Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0689611Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0689675Z return mod(**inputs) 2025-08-14T21:46:54.0689895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0689957Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0690180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0690243Z layer_outputs = layer_module( 2025-08-14T21:46:54.0690453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0690526Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0690741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0690829Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0691048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0691161Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0691385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0691459Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0691462Z 2025-08-14T21:46:54.0691562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0691748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0691809Z return mod(**inputs) 2025-08-14T21:46:54.0692034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0692098Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0692340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0692409Z layer_outputs = layer_module( 2025-08-14T21:46:54.0692619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0692698Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0692935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0693034Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0693257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0693359Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0693581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0693650Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0693654Z 2025-08-14T21:46:54.0693727Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0693837Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0694019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0694078Z return mod(**inputs) 2025-08-14T21:46:54.0694308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1725, in forward 2025-08-14T21:46:54.0694374Z encoder_outputs = self.encoder( 2025-08-14T21:46:54.0694598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1128, in forward 2025-08-14T21:46:54.0694696Z hidden_states = self.final_layer_norm(hidden_states) 2025-08-14T21:46:54.0694912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0694987Z return self.weight * hidden_states 2025-08-14T21:46:54.0694992Z 2025-08-14T21:46:54.0695088Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0695275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0695332Z return mod(**inputs) 2025-08-14T21:46:54.0695548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0695621Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0695841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0695904Z layer_outputs = layer_module( 2025-08-14T21:46:54.0696117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0696189Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0696415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0696489Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0696704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0696788Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0697005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0697077Z key_states = self.k(current_states) 2025-08-14T21:46:54.0697086Z 2025-08-14T21:46:54.0697179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0697360Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0697427Z return mod(**inputs) 2025-08-14T21:46:54.0697646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0697726Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0697956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0698021Z layer_outputs = layer_module( 2025-08-14T21:46:54.0698251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0698324Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0698559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0698640Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0698857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0698933Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0699158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0699295Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0699299Z 2025-08-14T21:46:54.0699400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0699582Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0699644Z return mod(**inputs) 2025-08-14T21:46:54.0699872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0699939Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0700157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0700224Z layer_outputs = layer_module( 2025-08-14T21:46:54.0700428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0700508Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0700734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0700804Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0701023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0701098Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0701316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0701454Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0701458Z 2025-08-14T21:46:54.0701550Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0701732Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0701793Z return mod(**inputs) 2025-08-14T21:46:54.0702008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0702079Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0702292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0702364Z layer_outputs = layer_module( 2025-08-14T21:46:54.0702566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0702636Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0702854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0702924Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0703158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0703235Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0703445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0703520Z value_states = self.v(current_states) 2025-08-14T21:46:54.0703524Z 2025-08-14T21:46:54.0703630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0703810Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0703893Z return mod(**inputs) 2025-08-14T21:46:54.0704108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0704181Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0704394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0704459Z layer_outputs = layer_module( 2025-08-14T21:46:54.0704689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0704828Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0705053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0705129Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0705342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0705420Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0705632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0705729Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0705733Z 2025-08-14T21:46:54.0705828Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0706018Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0706088Z return mod(**inputs) 2025-08-14T21:46:54.0706316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0706384Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0706628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0706695Z layer_outputs = layer_module( 2025-08-14T21:46:54.0706897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0706973Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0707188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0707266Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0707483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0707559Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0707781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0707878Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0707883Z 2025-08-14T21:46:54.0707982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0708165Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0708224Z return mod(**inputs) 2025-08-14T21:46:54.0708451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0708515Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0708749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0708824Z layer_outputs = layer_module( 2025-08-14T21:46:54.0709026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0709111Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0709325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0709407Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0709625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0709701Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0709913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0710022Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0710040Z 2025-08-14T21:46:54.0710133Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0710317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0710376Z return mod(**inputs) 2025-08-14T21:46:54.0710588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0710660Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0710872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0710943Z layer_outputs = layer_module( 2025-08-14T21:46:54.0711143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0711212Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0711431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0711503Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0711712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0711795Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0712006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0712087Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0712091Z 2025-08-14T21:46:54.0712163Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0712255Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0712440Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0712499Z return mod(**inputs) 2025-08-14T21:46:54.0712713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0712795Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0713008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0713081Z layer_outputs = layer_module( 2025-08-14T21:46:54.0713282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0713353Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0713569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0713651Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0713870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0713972Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0714187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0714263Z return self.weight * hidden_states 2025-08-14T21:46:54.0714267Z 2025-08-14T21:46:54.0714373Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0714554Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0714638Z return mod(**inputs) 2025-08-14T21:46:54.0714853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0714926Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0715139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0715203Z layer_outputs = layer_module( 2025-08-14T21:46:54.0715412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0715497Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0715708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0715796Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0716007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0716120Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0716332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0716402Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0716406Z 2025-08-14T21:46:54.0716505Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0716685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0716751Z return mod(**inputs) 2025-08-14T21:46:54.0716962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0717028Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0717247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0717312Z layer_outputs = layer_module( 2025-08-14T21:46:54.0717513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0717592Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0717804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0717892Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0718106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0718210Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0718433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0718505Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0718510Z 2025-08-14T21:46:54.0718608Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0718786Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0718846Z return mod(**inputs) 2025-08-14T21:46:54.0719064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0719129Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0719357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0719431Z layer_outputs = layer_module( 2025-08-14T21:46:54.0719636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0719713Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0719942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0720037Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0720255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0720356Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0720568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0720646Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0720663Z 2025-08-14T21:46:54.0720756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0720943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0721001Z return mod(**inputs) 2025-08-14T21:46:54.0721215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0721288Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0721503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0721573Z layer_outputs = layer_module( 2025-08-14T21:46:54.0721773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0721843Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0722063Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0722137Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0722348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0722453Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0722664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0722740Z return self.weight * hidden_states 2025-08-14T21:46:54.0722743Z 2025-08-14T21:46:54.0722835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0723014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0723082Z return mod(**inputs) 2025-08-14T21:46:54.0723300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0723367Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0723588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0723651Z layer_outputs = layer_module( 2025-08-14T21:46:54.0723857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0723930Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0724141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0724222Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0724432Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0724514Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0724740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0724812Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0724815Z 2025-08-14T21:46:54.0724915Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0725112Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0725175Z return mod(**inputs) 2025-08-14T21:46:54.0725423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0725489Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0725715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0725778Z layer_outputs = layer_module( 2025-08-14T21:46:54.0725981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0726079Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0726292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0726364Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0726583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0726660Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0726878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0726947Z key_states = self.k(current_states) 2025-08-14T21:46:54.0726951Z 2025-08-14T21:46:54.0727042Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0727228Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0727289Z return mod(**inputs) 2025-08-14T21:46:54.0727512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0727576Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0727790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0727862Z layer_outputs = layer_module( 2025-08-14T21:46:54.0728061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0728133Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0728351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0728422Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0728639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0728713Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0728924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0729045Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0729048Z 2025-08-14T21:46:54.0729138Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0729318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0729381Z return mod(**inputs) 2025-08-14T21:46:54.0729595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0729665Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0729879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0729958Z layer_outputs = layer_module( 2025-08-14T21:46:54.0730168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0730237Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0730495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0730568Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0730802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0730880Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0731094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0731234Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0731244Z 2025-08-14T21:46:54.0731337Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0731528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0731592Z return mod(**inputs) 2025-08-14T21:46:54.0731810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0731874Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0732094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0732158Z layer_outputs = layer_module( 2025-08-14T21:46:54.0732363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0732432Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0732641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0732721Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0732933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0733007Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0733225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0733294Z value_states = self.v(current_states) 2025-08-14T21:46:54.0733299Z 2025-08-14T21:46:54.0733394Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0733569Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0733623Z return mod(**inputs) 2025-08-14T21:46:54.0733839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0733900Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0734112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0734175Z layer_outputs = layer_module( 2025-08-14T21:46:54.0734372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0734451Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0734662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0734734Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0734952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0735024Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0735239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0735353Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0735358Z 2025-08-14T21:46:54.0735454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0735641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0735700Z return mod(**inputs) 2025-08-14T21:46:54.0735928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0736019Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0736236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0736305Z layer_outputs = layer_module( 2025-08-14T21:46:54.0736506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0736576Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0736796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0736885Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0737104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0737179Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0737390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0737494Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0737498Z 2025-08-14T21:46:54.0737592Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0737771Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0737838Z return mod(**inputs) 2025-08-14T21:46:54.0738054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0738126Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0738339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0738402Z layer_outputs = layer_module( 2025-08-14T21:46:54.0738611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0738683Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0738896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0738974Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0739186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0739267Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0739478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0739576Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0739579Z 2025-08-14T21:46:54.0739681Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0739863Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0739929Z return mod(**inputs) 2025-08-14T21:46:54.0740144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0740207Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0740429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0740491Z layer_outputs = layer_module( 2025-08-14T21:46:54.0740705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0740786Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0740997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0741074Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0741300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0741388Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0741607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0741677Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0741681Z 2025-08-14T21:46:54.0741759Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0741852Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0742034Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0742113Z return mod(**inputs) 2025-08-14T21:46:54.0742326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0742389Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0742606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0742666Z layer_outputs = layer_module( 2025-08-14T21:46:54.0742870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0742937Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0743148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0743229Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0743440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 634, in forward 2025-08-14T21:46:54.0743537Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0743755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0743825Z return self.weight * hidden_states 2025-08-14T21:46:54.0743828Z 2025-08-14T21:46:54.0743929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0744105Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0744164Z return mod(**inputs) 2025-08-14T21:46:54.0744383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0744448Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0744672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0744743Z layer_outputs = layer_module( 2025-08-14T21:46:54.0745025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0745110Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0745338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0745416Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0745646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0745726Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0745960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0746032Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0746053Z 2025-08-14T21:46:54.0746149Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0746338Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0746396Z return mod(**inputs) 2025-08-14T21:46:54.0746635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0746710Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0746947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0747017Z layer_outputs = layer_module( 2025-08-14T21:46:54.0747215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0747285Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0747506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0747597Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0747810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0747896Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0748109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0748190Z key_states = self.k(current_states) 2025-08-14T21:46:54.0748194Z 2025-08-14T21:46:54.0748290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0748471Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0748539Z return mod(**inputs) 2025-08-14T21:46:54.0748757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0748833Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0749051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0749118Z layer_outputs = layer_module( 2025-08-14T21:46:54.0749328Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0749400Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0749614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0749697Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0749911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0749996Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0750210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0750330Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0750333Z 2025-08-14T21:46:54.0750436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0750618Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0750685Z return mod(**inputs) 2025-08-14T21:46:54.0750903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0750972Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0751195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0751260Z layer_outputs = layer_module( 2025-08-14T21:46:54.0751464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0751555Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0751771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0751846Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0752073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0752152Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0752389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0752528Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0752532Z 2025-08-14T21:46:54.0752630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0752814Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0752874Z return mod(**inputs) 2025-08-14T21:46:54.0753114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0753179Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0753397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0753467Z layer_outputs = layer_module( 2025-08-14T21:46:54.0753667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0753743Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0753956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0754027Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0754246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0754322Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0754533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0754609Z value_states = self.v(current_states) 2025-08-14T21:46:54.0754613Z 2025-08-14T21:46:54.0754707Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0754895Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0754955Z return mod(**inputs) 2025-08-14T21:46:54.0755168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0755238Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0755451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0755517Z layer_outputs = layer_module( 2025-08-14T21:46:54.0755726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0755795Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0756016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0756087Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0756299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0756380Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0756590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0756694Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0756698Z 2025-08-14T21:46:54.0756803Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0756986Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0757052Z return mod(**inputs) 2025-08-14T21:46:54.0757266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0757346Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0757569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0757646Z layer_outputs = layer_module( 2025-08-14T21:46:54.0757849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0757914Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0758120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0758196Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0758423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0758497Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0758715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0758809Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0758814Z 2025-08-14T21:46:54.0758912Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0759090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0759148Z return mod(**inputs) 2025-08-14T21:46:54.0759366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0759430Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0759650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0759715Z layer_outputs = layer_module( 2025-08-14T21:46:54.0759915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0759990Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0760201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0760272Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0760489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0760563Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0760778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0760873Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0760878Z 2025-08-14T21:46:54.0760969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0761156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0761214Z return mod(**inputs) 2025-08-14T21:46:54.0761437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0761503Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0761717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0761787Z layer_outputs = layer_module( 2025-08-14T21:46:54.0761986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0762056Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0762289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0762363Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0762580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0762668Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0762884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0762992Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0762996Z 2025-08-14T21:46:54.0763070Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0763164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0763351Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0763410Z return mod(**inputs) 2025-08-14T21:46:54.0763635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0763715Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0763931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0764006Z layer_outputs = layer_module( 2025-08-14T21:46:54.0764205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0764282Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0764488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0764570Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0764794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0764883Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0765099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0765175Z return self.weight * hidden_states 2025-08-14T21:46:54.0765178Z 2025-08-14T21:46:54.0765271Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0765457Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0765517Z return mod(**inputs) 2025-08-14T21:46:54.0765733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0765807Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0766022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0766085Z layer_outputs = layer_module( 2025-08-14T21:46:54.0766295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0766367Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0766585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0766669Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0766880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0766999Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0767212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0767292Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0767295Z 2025-08-14T21:46:54.0767387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0767592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0767663Z return mod(**inputs) 2025-08-14T21:46:54.0767879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0767944Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0768182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0768263Z layer_outputs = layer_module( 2025-08-14T21:46:54.0768474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0768544Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0768755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0768846Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0769061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0769190Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0769404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0769475Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0769479Z 2025-08-14T21:46:54.0769581Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0769760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0769820Z return mod(**inputs) 2025-08-14T21:46:54.0770041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0770106Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0770329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0770394Z layer_outputs = layer_module( 2025-08-14T21:46:54.0770597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0770676Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0770891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0770972Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0771190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0771293Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0771511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0771585Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0771589Z 2025-08-14T21:46:54.0771663Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0771765Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0771943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0772010Z return mod(**inputs) 2025-08-14T21:46:54.0772226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0772292Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0772513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0772576Z layer_outputs = layer_module( 2025-08-14T21:46:54.0772775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0772868Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0773085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0773165Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0773394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0773492Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0773726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0773795Z return self.weight * hidden_states 2025-08-14T21:46:54.0773798Z 2025-08-14T21:46:54.0773890Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0774076Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0774134Z return mod(**inputs) 2025-08-14T21:46:54.0774357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0774440Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0774657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0774731Z layer_outputs = layer_module( 2025-08-14T21:46:54.0774935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0775015Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0775231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0775305Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0775526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0775603Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0775815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0775894Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0775897Z 2025-08-14T21:46:54.0775989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0776183Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0776246Z return mod(**inputs) 2025-08-14T21:46:54.0776461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0776537Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0776752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0776817Z layer_outputs = layer_module( 2025-08-14T21:46:54.0777031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0777104Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0777325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0777400Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0777615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0777701Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0777915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0777992Z key_states = self.k(current_states) 2025-08-14T21:46:54.0777995Z 2025-08-14T21:46:54.0778091Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0778289Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0778358Z return mod(**inputs) 2025-08-14T21:46:54.0778577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0778641Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0778881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0778960Z layer_outputs = layer_module( 2025-08-14T21:46:54.0779172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0779242Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0779457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0779536Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0779751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0779837Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0780057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0780179Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0780182Z 2025-08-14T21:46:54.0780285Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0780466Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0780525Z return mod(**inputs) 2025-08-14T21:46:54.0780748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0780815Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0781038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0781104Z layer_outputs = layer_module( 2025-08-14T21:46:54.0781305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0781384Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0781597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0781670Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0781890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0781964Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0782183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0782325Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0782330Z 2025-08-14T21:46:54.0782424Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0782612Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0782672Z return mod(**inputs) 2025-08-14T21:46:54.0782895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0782961Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0783179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0783250Z layer_outputs = layer_module( 2025-08-14T21:46:54.0783452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0783523Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0783757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0783831Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0784051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0784126Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0784348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0784439Z value_states = self.v(current_states) 2025-08-14T21:46:54.0784442Z 2025-08-14T21:46:54.0784535Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0784938Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0785014Z return mod(**inputs) 2025-08-14T21:46:54.0785247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0785327Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0785626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0785698Z layer_outputs = layer_module( 2025-08-14T21:46:54.0785937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0786020Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0786332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0786408Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0786632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0786713Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0786944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0787044Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0787048Z 2025-08-14T21:46:54.0787146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0787322Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0787397Z return mod(**inputs) 2025-08-14T21:46:54.0787654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0787730Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0787982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0788054Z layer_outputs = layer_module( 2025-08-14T21:46:54.0788284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0788376Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0788631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0788720Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0788975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0789058Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0789318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0789429Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0789433Z 2025-08-14T21:46:54.0789547Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0789752Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0789818Z return mod(**inputs) 2025-08-14T21:46:54.0790105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0790184Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0790443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0790545Z layer_outputs = layer_module( 2025-08-14T21:46:54.0790781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0790907Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0791165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0791247Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0791503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0791590Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0791875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0791987Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0791991Z 2025-08-14T21:46:54.0792099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0792312Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0792381Z return mod(**inputs) 2025-08-14T21:46:54.0792637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0792720Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0792960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0793032Z layer_outputs = layer_module( 2025-08-14T21:46:54.0793233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0793306Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0793525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0793596Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0793806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0793889Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0794099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0794177Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0794180Z 2025-08-14T21:46:54.0794274Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0794453Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0794520Z return mod(**inputs) 2025-08-14T21:46:54.0794734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0794806Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0795021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0795084Z layer_outputs = layer_module( 2025-08-14T21:46:54.0795289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0795359Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0795570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0795648Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0795874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 609, in forward 2025-08-14T21:46:54.0796003Z hidden_states = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:46:54.0796007Z 2025-08-14T21:46:54.0796079Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0796188Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0796376Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0796448Z return mod(**inputs) 2025-08-14T21:46:54.0796664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0796737Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0796951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0797021Z layer_outputs = layer_module( 2025-08-14T21:46:54.0797224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0797311Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0797534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0797607Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0797824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 634, in forward 2025-08-14T21:46:54.0797921Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0798131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0798206Z return self.weight * hidden_states 2025-08-14T21:46:54.0798209Z 2025-08-14T21:46:54.0798299Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0798480Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0798546Z return mod(**inputs) 2025-08-14T21:46:54.0798761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0798832Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0799047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0799114Z layer_outputs = layer_module( 2025-08-14T21:46:54.0799320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0799389Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0799600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0799678Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0799890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0799975Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0800186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0800256Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0800261Z 2025-08-14T21:46:54.0800359Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0800539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0800605Z return mod(**inputs) 2025-08-14T21:46:54.0800819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0800883Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0801120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0801186Z layer_outputs = layer_module( 2025-08-14T21:46:54.0801386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0801463Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0801689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0801786Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0801999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0802076Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0802295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0802364Z key_states = self.k(current_states) 2025-08-14T21:46:54.0802369Z 2025-08-14T21:46:54.0802483Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0802662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0802722Z return mod(**inputs) 2025-08-14T21:46:54.0802943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0803011Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0803224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0803298Z layer_outputs = layer_module( 2025-08-14T21:46:54.0803498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0803578Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0803792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0803866Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0804083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0804159Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0804370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0804498Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0804501Z 2025-08-14T21:46:54.0804593Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0804778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0804836Z return mod(**inputs) 2025-08-14T21:46:54.0805052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0805129Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0805343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0805414Z layer_outputs = layer_module( 2025-08-14T21:46:54.0805616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0805689Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0805906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0805977Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0806189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0806271Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0806496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0806646Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0806649Z 2025-08-14T21:46:54.0806742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0806943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0807012Z return mod(**inputs) 2025-08-14T21:46:54.0807254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0807320Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0807545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0807608Z layer_outputs = layer_module( 2025-08-14T21:46:54.0807818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0807905Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0808118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0808197Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0808408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0808493Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0808704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0808773Z value_states = self.v(current_states) 2025-08-14T21:46:54.0808776Z 2025-08-14T21:46:54.0808876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0809055Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0809115Z return mod(**inputs) 2025-08-14T21:46:54.0809339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0809404Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0809626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0809691Z layer_outputs = layer_module( 2025-08-14T21:46:54.0809890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0809968Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0810178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0810249Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0810467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0810543Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0810761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0810856Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0810861Z 2025-08-14T21:46:54.0810952Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0811140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0811198Z return mod(**inputs) 2025-08-14T21:46:54.0811417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0811482Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0811694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0811780Z layer_outputs = layer_module( 2025-08-14T21:46:54.0811984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0812055Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0812290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0812364Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0812601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0812676Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0812886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0812991Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0812994Z 2025-08-14T21:46:54.0813087Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0813288Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0813347Z return mod(**inputs) 2025-08-14T21:46:54.0813563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0813636Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0813850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0813915Z layer_outputs = layer_module( 2025-08-14T21:46:54.0814124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0814194Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0814413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0814484Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0814696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0814778Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0814990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0815088Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0815100Z 2025-08-14T21:46:54.0815192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0815373Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0815439Z return mod(**inputs) 2025-08-14T21:46:54.0815653Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0815723Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0815948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0816011Z layer_outputs = layer_module( 2025-08-14T21:46:54.0816217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0816288Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0816500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0816579Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0816792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0816865Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0817123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0817195Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0817199Z 2025-08-14T21:46:54.0817279Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0817372Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0817566Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0817638Z return mod(**inputs) 2025-08-14T21:46:54.0817853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0817935Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0818162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0818226Z layer_outputs = layer_module( 2025-08-14T21:46:54.0818434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0818502Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0818732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0818821Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0819033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0819126Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0819341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0819409Z return self.weight * hidden_states 2025-08-14T21:46:54.0819412Z 2025-08-14T21:46:54.0819510Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0819687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0819745Z return mod(**inputs) 2025-08-14T21:46:54.0819968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0820036Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0820258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0820323Z layer_outputs = layer_module( 2025-08-14T21:46:54.0820523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0820601Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0820813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0820902Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0821115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0821223Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0821445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0821518Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0821521Z 2025-08-14T21:46:54.0821614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0821802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0821863Z return mod(**inputs) 2025-08-14T21:46:54.0822086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0822150Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0822365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0822449Z layer_outputs = layer_module( 2025-08-14T21:46:54.0822657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0822728Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0822950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0823044Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0823283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0823390Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0823600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0823682Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0823685Z 2025-08-14T21:46:54.0823777Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0823979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0824036Z return mod(**inputs) 2025-08-14T21:46:54.0824251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0824323Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0824588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0824656Z layer_outputs = layer_module( 2025-08-14T21:46:54.0824941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0825019Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0825251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0825339Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0825560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0825678Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0825897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0825972Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0825985Z 2025-08-14T21:46:54.0826062Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0826169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0826359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0826419Z return mod(**inputs) 2025-08-14T21:46:54.0826635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0826713Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0826931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0827004Z layer_outputs = layer_module( 2025-08-14T21:46:54.0827207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0827281Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0827512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0827587Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0827806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0827914Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0828153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0828235Z return self.weight * hidden_states 2025-08-14T21:46:54.0828239Z 2025-08-14T21:46:54.0828334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0828519Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0828610Z return mod(**inputs) 2025-08-14T21:46:54.0828838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0828920Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0829151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0829216Z layer_outputs = layer_module( 2025-08-14T21:46:54.0829430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0829504Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0829741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0829822Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0830045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0830130Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0830351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0830423Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0830427Z 2025-08-14T21:46:54.0830528Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0830714Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0830777Z return mod(**inputs) 2025-08-14T21:46:54.0831007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0831076Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0831308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0831376Z layer_outputs = layer_module( 2025-08-14T21:46:54.0831582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0831665Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0831885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0831957Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0832183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0832260Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0832489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0832559Z key_states = self.k(current_states) 2025-08-14T21:46:54.0832563Z 2025-08-14T21:46:54.0832657Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0832850Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0832912Z return mod(**inputs) 2025-08-14T21:46:54.0833140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0833206Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0833428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0833501Z layer_outputs = layer_module( 2025-08-14T21:46:54.0833728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0833805Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0834032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0834118Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0834346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0834439Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0834657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0834787Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0834791Z 2025-08-14T21:46:54.0834887Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0835083Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0835159Z return mod(**inputs) 2025-08-14T21:46:54.0835384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0835459Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0835685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0835753Z layer_outputs = layer_module( 2025-08-14T21:46:54.0835966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0836038Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0836267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0836342Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0836564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0836647Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0836869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0837013Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0837023Z 2025-08-14T21:46:54.0837117Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0837303Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0837369Z return mod(**inputs) 2025-08-14T21:46:54.0837593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0837660Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0837901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0837964Z layer_outputs = layer_module( 2025-08-14T21:46:54.0838172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0838243Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0838460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0838541Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0838756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0838829Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0839049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0839117Z value_states = self.v(current_states) 2025-08-14T21:46:54.0839135Z 2025-08-14T21:46:54.0839239Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0839420Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0839478Z return mod(**inputs) 2025-08-14T21:46:54.0839717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0839798Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0840020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0840084Z layer_outputs = layer_module( 2025-08-14T21:46:54.0840287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0840363Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0840576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0840664Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0840888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0840964Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0841190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0841293Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0841297Z 2025-08-14T21:46:54.0841391Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0841581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0841642Z return mod(**inputs) 2025-08-14T21:46:54.0841861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0841937Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0842154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0842228Z layer_outputs = layer_module( 2025-08-14T21:46:54.0842434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0842509Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0842737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0842811Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0843035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0843110Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0843325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0843435Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0843439Z 2025-08-14T21:46:54.0843534Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0843719Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0843789Z return mod(**inputs) 2025-08-14T21:46:54.0844012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0844089Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0844309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0844375Z layer_outputs = layer_module( 2025-08-14T21:46:54.0844587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0844680Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0844895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0844975Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0845203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0845304Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0845526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0845625Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0845629Z 2025-08-14T21:46:54.0845733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0845921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0845991Z return mod(**inputs) 2025-08-14T21:46:54.0846229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0846296Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0846539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0846603Z layer_outputs = layer_module( 2025-08-14T21:46:54.0846807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0846886Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0847102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0847181Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0847396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0847471Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0847693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0847762Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0847765Z 2025-08-14T21:46:54.0847847Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0847940Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0848125Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0848193Z return mod(**inputs) 2025-08-14T21:46:54.0848411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0848476Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0848705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0848771Z layer_outputs = layer_module( 2025-08-14T21:46:54.0848980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0849050Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0849265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0849345Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0849561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 634, in forward 2025-08-14T21:46:54.0849657Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0849878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0849947Z return self.weight * hidden_states 2025-08-14T21:46:54.0849950Z 2025-08-14T21:46:54.0850073Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0850254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0850315Z return mod(**inputs) 2025-08-14T21:46:54.0850548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0850615Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0850852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0850922Z layer_outputs = layer_module( 2025-08-14T21:46:54.0851122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0851198Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0851410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0851498Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0851716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0851792Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0852011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0852080Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0852083Z 2025-08-14T21:46:54.0852174Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0852361Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0852419Z return mod(**inputs) 2025-08-14T21:46:54.0852635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0852707Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0852926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0852996Z layer_outputs = layer_module( 2025-08-14T21:46:54.0853201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0853271Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0853489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0853562Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0853776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0853860Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0854072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0854150Z key_states = self.k(current_states) 2025-08-14T21:46:54.0854153Z 2025-08-14T21:46:54.0854245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0854426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0854492Z return mod(**inputs) 2025-08-14T21:46:54.0854710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0854783Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0855000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0855063Z layer_outputs = layer_module( 2025-08-14T21:46:54.0855272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0855342Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0855571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0855654Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0855867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0855965Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0856179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0856315Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0856318Z 2025-08-14T21:46:54.0856418Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0856599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0856666Z return mod(**inputs) 2025-08-14T21:46:54.0856887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0856969Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0857194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0857259Z layer_outputs = layer_module( 2025-08-14T21:46:54.0857462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0857541Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0857756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0857834Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0858049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0858128Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0858351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0858489Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0858493Z 2025-08-14T21:46:54.0858594Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0858778Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0858838Z return mod(**inputs) 2025-08-14T21:46:54.0859065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0859131Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0859348Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0859419Z layer_outputs = layer_module( 2025-08-14T21:46:54.0859623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0859704Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0859922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0859995Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0860218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0860296Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0860509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0860588Z value_states = self.v(current_states) 2025-08-14T21:46:54.0860591Z 2025-08-14T21:46:54.0860682Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0860886Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0860947Z return mod(**inputs) 2025-08-14T21:46:54.0861162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0861236Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0861461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0862311Z layer_outputs = layer_module( 2025-08-14T21:46:54.0862524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0862593Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0862819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0862890Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0863104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0863206Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0863419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0863526Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0863530Z 2025-08-14T21:46:54.0863625Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0863807Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0863874Z return mod(**inputs) 2025-08-14T21:46:54.0864091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0864159Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0864385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0864452Z layer_outputs = layer_module( 2025-08-14T21:46:54.0864664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0864734Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0865031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0865120Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0865344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0865430Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0865660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0865758Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0865764Z 2025-08-14T21:46:54.0865866Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0866047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0866108Z return mod(**inputs) 2025-08-14T21:46:54.0866336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0866405Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0866631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0866694Z layer_outputs = layer_module( 2025-08-14T21:46:54.0866897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0866973Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0867205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0867282Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0867508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0867588Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0867822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0867940Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0867943Z 2025-08-14T21:46:54.0868033Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0868222Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0868280Z return mod(**inputs) 2025-08-14T21:46:54.0868505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0868594Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0868810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0868883Z layer_outputs = layer_module( 2025-08-14T21:46:54.0869085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0869156Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0869379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0869450Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0869668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0869744Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0869955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0870035Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0870038Z 2025-08-14T21:46:54.0870131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0870319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0870380Z return mod(**inputs) 2025-08-14T21:46:54.0870598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0870672Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0870886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0870952Z layer_outputs = layer_module( 2025-08-14T21:46:54.0871163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0871236Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0871462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0871535Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0871748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 647, in forward 2025-08-14T21:46:54.0871882Z layer_output = hidden_states + self.dropout(attention_output[0]) 2025-08-14T21:46:54.0871887Z 2025-08-14T21:46:54.0871960Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0872054Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0872241Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0872301Z return mod(**inputs) 2025-08-14T21:46:54.0872536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0872604Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0872820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0872890Z layer_outputs = layer_module( 2025-08-14T21:46:54.0873104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0873190Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0873414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0873496Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0873717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0873806Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0874021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0874113Z return self.weight * hidden_states 2025-08-14T21:46:54.0874117Z 2025-08-14T21:46:54.0874210Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0874399Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0874459Z return mod(**inputs) 2025-08-14T21:46:54.0874677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0874752Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0874971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0875035Z layer_outputs = layer_module( 2025-08-14T21:46:54.0875248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0875320Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0875542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0875624Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0875839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0875958Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0876175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0876254Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0876258Z 2025-08-14T21:46:54.0876347Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0876526Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0876593Z return mod(**inputs) 2025-08-14T21:46:54.0876813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0876878Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0877104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0877169Z layer_outputs = layer_module( 2025-08-14T21:46:54.0877381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0877451Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0877666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0877752Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0877981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0878090Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0878306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0878377Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0878397Z 2025-08-14T21:46:54.0878498Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0878696Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0878755Z return mod(**inputs) 2025-08-14T21:46:54.0878976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0879041Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0879261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0879325Z layer_outputs = layer_module( 2025-08-14T21:46:54.0879543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0879620Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0879835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0879913Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0880134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0880239Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0880457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0880529Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0880532Z 2025-08-14T21:46:54.0880605Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0880706Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0880885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0880944Z return mod(**inputs) 2025-08-14T21:46:54.0881166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0881233Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0881454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0881519Z layer_outputs = layer_module( 2025-08-14T21:46:54.0881719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0881796Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0882008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0882089Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0882300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0882395Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0882614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0882685Z return self.weight * hidden_states 2025-08-14T21:46:54.0882689Z 2025-08-14T21:46:54.0882780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0882965Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0883024Z return mod(**inputs) 2025-08-14T21:46:54.0883245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0883325Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0883543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0883613Z layer_outputs = layer_module( 2025-08-14T21:46:54.0883831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0883902Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0884142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0884215Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0884436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0884511Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0884918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0885030Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0885034Z 2025-08-14T21:46:54.0885138Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0885344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0885411Z return mod(**inputs) 2025-08-14T21:46:54.0885646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0885730Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0885964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0886034Z layer_outputs = layer_module( 2025-08-14T21:46:54.0886279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0886354Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0886592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0886669Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0886901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0886985Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0887204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0887284Z key_states = self.k(current_states) 2025-08-14T21:46:54.0887287Z 2025-08-14T21:46:54.0887387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0887583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0887656Z return mod(**inputs) 2025-08-14T21:46:54.0887893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0887966Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0888209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0888279Z layer_outputs = layer_module( 2025-08-14T21:46:54.0888502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0888579Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0888810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0888895Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0889125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0889242Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0889485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0889612Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0889616Z 2025-08-14T21:46:54.0889745Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0889940Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0890040Z return mod(**inputs) 2025-08-14T21:46:54.0890282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0890354Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0890595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0890664Z layer_outputs = layer_module( 2025-08-14T21:46:54.0890884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0890996Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0891228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0891304Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0891543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0891623Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0891858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0892010Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0892014Z 2025-08-14T21:46:54.0892112Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0892311Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0892377Z return mod(**inputs) 2025-08-14T21:46:54.0892618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0892691Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0892923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0892999Z layer_outputs = layer_module( 2025-08-14T21:46:54.0893216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0893292Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0893531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0893608Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0893843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0893922Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0894153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0894236Z value_states = self.v(current_states) 2025-08-14T21:46:54.0894242Z 2025-08-14T21:46:54.0894340Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0894538Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0894605Z return mod(**inputs) 2025-08-14T21:46:54.0894818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0894889Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0895117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0895185Z layer_outputs = layer_module( 2025-08-14T21:46:54.0895394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0895464Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0895697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0895784Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0895998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0896080Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0896294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0896396Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0896412Z 2025-08-14T21:46:54.0896513Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0896691Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0896760Z return mod(**inputs) 2025-08-14T21:46:54.0896975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0897042Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0897266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0897332Z layer_outputs = layer_module( 2025-08-14T21:46:54.0897531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0897609Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0897822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0897903Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0898115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0898188Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0898407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0898503Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0898507Z 2025-08-14T21:46:54.0898607Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0898785Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0898845Z return mod(**inputs) 2025-08-14T21:46:54.0899068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0899134Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0899347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0899417Z layer_outputs = layer_module( 2025-08-14T21:46:54.0899618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0899700Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0899909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0899981Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0900198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0900271Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0900496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0900602Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0900605Z 2025-08-14T21:46:54.0900699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0900902Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0900963Z return mod(**inputs) 2025-08-14T21:46:54.0901193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0901268Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0901484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0901556Z layer_outputs = layer_module( 2025-08-14T21:46:54.0901763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0901882Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0902102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0902175Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0902387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0902469Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0902683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0902759Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0902762Z 2025-08-14T21:46:54.0902834Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0902926Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0903118Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0903178Z return mod(**inputs) 2025-08-14T21:46:54.0903393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0903466Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0903682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0903754Z layer_outputs = layer_module( 2025-08-14T21:46:54.0903956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0904027Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0904247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0904319Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0904536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 634, in forward 2025-08-14T21:46:54.0904633Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0904899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0904983Z return self.weight * hidden_states 2025-08-14T21:46:54.0904987Z 2025-08-14T21:46:54.0905079Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0905260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0905326Z return mod(**inputs) 2025-08-14T21:46:54.0905542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0905615Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0905849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0905920Z layer_outputs = layer_module( 2025-08-14T21:46:54.0906134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0906206Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0906433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0906533Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0906749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0906835Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0907046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0907117Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0907120Z 2025-08-14T21:46:54.0907222Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0907418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0907485Z return mod(**inputs) 2025-08-14T21:46:54.0907701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0907765Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0907986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0908049Z layer_outputs = layer_module( 2025-08-14T21:46:54.0908249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0908325Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0908537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0908616Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0908824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0908901Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0909120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0909189Z key_states = self.k(current_states) 2025-08-14T21:46:54.0909192Z 2025-08-14T21:46:54.0909290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0909468Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0909526Z return mod(**inputs) 2025-08-14T21:46:54.0909748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0909815Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0910029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0910101Z layer_outputs = layer_module( 2025-08-14T21:46:54.0910301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0910382Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0910596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0910667Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0910886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0910962Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0911186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0911316Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0911319Z 2025-08-14T21:46:54.0911413Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0911603Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0911690Z return mod(**inputs) 2025-08-14T21:46:54.0911906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0911999Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0912223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0912295Z layer_outputs = layer_module( 2025-08-14T21:46:54.0912502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0912573Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0912809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0912880Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0913093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0913176Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0913387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0913533Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0913537Z 2025-08-14T21:46:54.0913629Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0913808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0913875Z return mod(**inputs) 2025-08-14T21:46:54.0914091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0914157Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0914375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0914442Z layer_outputs = layer_module( 2025-08-14T21:46:54.0914650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0914723Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0914933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0915012Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0915222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0915308Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0915522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0915591Z value_states = self.v(current_states) 2025-08-14T21:46:54.0915594Z 2025-08-14T21:46:54.0915694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0915872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0915932Z return mod(**inputs) 2025-08-14T21:46:54.0916155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0916221Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0916442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0916505Z layer_outputs = layer_module( 2025-08-14T21:46:54.0916719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0916801Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0917017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0917110Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0917325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0917416Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0917633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0917730Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0917734Z 2025-08-14T21:46:54.0917826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0918012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0918086Z return mod(**inputs) 2025-08-14T21:46:54.0918311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0918375Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0918594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0918667Z layer_outputs = layer_module( 2025-08-14T21:46:54.0918868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0918937Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0919155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0919226Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0919448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0919524Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0919736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0919839Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0919844Z 2025-08-14T21:46:54.0919937Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0920124Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0920183Z return mod(**inputs) 2025-08-14T21:46:54.0920398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0920471Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0920685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0920750Z layer_outputs = layer_module( 2025-08-14T21:46:54.0920962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0921031Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0921254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0921326Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0921539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0921624Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0921837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0921947Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0921958Z 2025-08-14T21:46:54.0922054Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0922238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0922304Z return mod(**inputs) 2025-08-14T21:46:54.0922540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0922621Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0922845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0922909Z layer_outputs = layer_module( 2025-08-14T21:46:54.0923116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0923187Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0923402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0923496Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0923708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0923785Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0924002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0924074Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0924077Z 2025-08-14T21:46:54.0924155Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0924248Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0924427Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0924495Z return mod(**inputs) 2025-08-14T21:46:54.0924710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0924784Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0925006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0925072Z layer_outputs = layer_module( 2025-08-14T21:46:54.0925284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0925359Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0925573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0925663Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0925876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0925973Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0926186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0926255Z return self.weight * hidden_states 2025-08-14T21:46:54.0926258Z 2025-08-14T21:46:54.0926356Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0926536Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0926596Z return mod(**inputs) 2025-08-14T21:46:54.0926818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0926883Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0927103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0927167Z layer_outputs = layer_module( 2025-08-14T21:46:54.0927383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0927468Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0927683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0927778Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0928005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0928127Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0928347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0928420Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0928424Z 2025-08-14T21:46:54.0928514Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0928701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0928780Z return mod(**inputs) 2025-08-14T21:46:54.0929007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0929075Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0929292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0929367Z layer_outputs = layer_module( 2025-08-14T21:46:54.0929570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0929641Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0929862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0929944Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0930167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0930274Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0930490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0930573Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0930576Z 2025-08-14T21:46:54.0930670Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0930860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0930920Z return mod(**inputs) 2025-08-14T21:46:54.0931136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0931209Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0931429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0931495Z layer_outputs = layer_module( 2025-08-14T21:46:54.0931707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0931779Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0932000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0932083Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0932296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0932409Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0932623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0932696Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0932729Z 2025-08-14T21:46:54.0932824Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0933003Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0933069Z return mod(**inputs) 2025-08-14T21:46:54.0933299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0933382Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0933605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0933668Z layer_outputs = layer_module( 2025-08-14T21:46:54.0933877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0933948Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0934160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0934264Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0934477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 343, in forward 2025-08-14T21:46:54.0934594Z hidden_states = hidden_states + self.dropout(forwarded_states) 2025-08-14T21:46:54.0934604Z 2025-08-14T21:46:54.0934676Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0934769Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0934955Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0935014Z return mod(**inputs) 2025-08-14T21:46:54.0935232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0935304Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0935521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0935593Z layer_outputs = layer_module( 2025-08-14T21:46:54.0935795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0935866Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0936087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0936163Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0936375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 598, in forward 2025-08-14T21:46:54.0936480Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0936690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0936766Z return self.weight * hidden_states 2025-08-14T21:46:54.0936771Z 2025-08-14T21:46:54.0936864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0937042Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0937110Z return mod(**inputs) 2025-08-14T21:46:54.0937326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0937393Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0937616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0937681Z layer_outputs = layer_module( 2025-08-14T21:46:54.0937889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0937960Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0938186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0938272Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0938485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0938583Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0938800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0938890Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0938894Z 2025-08-14T21:46:54.0938991Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0939169Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0939229Z return mod(**inputs) 2025-08-14T21:46:54.0939451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0939533Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0939755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0939820Z layer_outputs = layer_module( 2025-08-14T21:46:54.0940024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0940105Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0940316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0940388Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0940605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0940680Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0940901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0940972Z key_states = self.k(current_states) 2025-08-14T21:46:54.0940975Z 2025-08-14T21:46:54.0941068Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0941255Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0941314Z return mod(**inputs) 2025-08-14T21:46:54.0941536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0941601Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0941814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0941883Z layer_outputs = layer_module( 2025-08-14T21:46:54.0942085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0942158Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0942378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0942450Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0942668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0942741Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0942952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0943076Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0943080Z 2025-08-14T21:46:54.0943173Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0943357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0943433Z return mod(**inputs) 2025-08-14T21:46:54.0943650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0943723Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0943952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0944017Z layer_outputs = layer_module( 2025-08-14T21:46:54.0944246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0944316Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0944537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0944611Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0944900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0945009Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0945234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0945380Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0945392Z 2025-08-14T21:46:54.0945489Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0945678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0945747Z return mod(**inputs) 2025-08-14T21:46:54.0945976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0946042Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0946286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0946352Z layer_outputs = layer_module( 2025-08-14T21:46:54.0946565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0946637Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0946856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0946934Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0947202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0947280Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0947511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0947582Z value_states = self.v(current_states) 2025-08-14T21:46:54.0947585Z 2025-08-14T21:46:54.0947688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0947882Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0947944Z return mod(**inputs) 2025-08-14T21:46:54.0948175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0948243Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0948467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0948542Z layer_outputs = layer_module( 2025-08-14T21:46:54.0948750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0948831Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0949057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0949143Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0949377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0949453Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0949692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0949796Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0949813Z 2025-08-14T21:46:54.0949911Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0950104Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0950165Z return mod(**inputs) 2025-08-14T21:46:54.0950387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0950461Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0950686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0950785Z layer_outputs = layer_module( 2025-08-14T21:46:54.0950992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0951068Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0951297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0951374Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0951598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0951681Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0951904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0952013Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0952017Z 2025-08-14T21:46:54.0952113Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0952302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0952372Z return mod(**inputs) 2025-08-14T21:46:54.0952600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0952676Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0952900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0952965Z layer_outputs = layer_module( 2025-08-14T21:46:54.0953181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0953252Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0953476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0953557Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0953778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0953862Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0954084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0954185Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0954188Z 2025-08-14T21:46:54.0954288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0954476Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0954542Z return mod(**inputs) 2025-08-14T21:46:54.0954783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0954853Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0955083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0955149Z layer_outputs = layer_module( 2025-08-14T21:46:54.0955373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0955475Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0955699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 681, in forward 2025-08-14T21:46:54.0955783Z self_attention_outputs = self.layer[0]( 2025-08-14T21:46:54.0956006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 599, in forward 2025-08-14T21:46:54.0956084Z attention_output = self.SelfAttention( 2025-08-14T21:46:54.0956313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0956400Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0956404Z 2025-08-14T21:46:54.0956479Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0956583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0956770Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0956839Z return mod(**inputs) 2025-08-14T21:46:54.0957060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0957126Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0957356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0957420Z layer_outputs = layer_module( 2025-08-14T21:46:54.0957631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0957712Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0957931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0958013Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0958230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 634, in forward 2025-08-14T21:46:54.0958332Z normed_hidden_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0958559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0958629Z return self.weight * hidden_states 2025-08-14T21:46:54.0958632Z 2025-08-14T21:46:54.0958733Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0958921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0958983Z return mod(**inputs) 2025-08-14T21:46:54.0959210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0959276Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0959497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0959581Z layer_outputs = layer_module( 2025-08-14T21:46:54.0959780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0959858Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0960068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0960139Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0960370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0960450Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0960674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 490, in forward 2025-08-14T21:46:54.0960758Z query_states = self.q(hidden_states) 2025-08-14T21:46:54.0960761Z 2025-08-14T21:46:54.0960868Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0961054Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0961111Z return mod(**inputs) 2025-08-14T21:46:54.0961326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0961399Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0961617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0961703Z layer_outputs = layer_module( 2025-08-14T21:46:54.0961905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0961976Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0962196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0962270Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0962483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0962568Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0962781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 510, in forward 2025-08-14T21:46:54.0962860Z key_states = self.k(current_states) 2025-08-14T21:46:54.0962863Z 2025-08-14T21:46:54.0962961Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0963140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0963205Z return mod(**inputs) 2025-08-14T21:46:54.0963423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0963495Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0963712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0963776Z layer_outputs = layer_module( 2025-08-14T21:46:54.0963986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0964056Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0964269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0964349Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0964563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0964646Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0964858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 526, in forward 2025-08-14T21:46:54.0964976Z scores = torch.matmul(query_states, key_states.transpose(3, 2)) 2025-08-14T21:46:54.0964981Z 2025-08-14T21:46:54.0965082Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0965261Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0965320Z return mod(**inputs) 2025-08-14T21:46:54.0965544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0965623Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0965847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0965913Z layer_outputs = layer_module( 2025-08-14T21:46:54.0966129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0966207Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0966438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0966515Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0966729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0966804Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0967025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 558, in forward 2025-08-14T21:46:54.0967180Z attn_weights = nn.functional.softmax(scores.float(), dim=-1).type_as(scores) 2025-08-14T21:46:54.0967185Z 2025-08-14T21:46:54.0967277Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0967463Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0967522Z return mod(**inputs) 2025-08-14T21:46:54.0967743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0967808Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0968018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0968090Z layer_outputs = layer_module( 2025-08-14T21:46:54.0968292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0968371Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0968582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0968653Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0968870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0968945Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0969154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 511, in forward 2025-08-14T21:46:54.0969230Z value_states = self.v(current_states) 2025-08-14T21:46:54.0969233Z 2025-08-14T21:46:54.0969324Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0969508Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0969568Z return mod(**inputs) 2025-08-14T21:46:54.0969782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0969855Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0970068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0970133Z layer_outputs = layer_module( 2025-08-14T21:46:54.0970340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0970408Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0970625Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0970695Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0970918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0971004Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0971217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0971321Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0971324Z 2025-08-14T21:46:54.0971433Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0971636Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0971701Z return mod(**inputs) 2025-08-14T21:46:54.0971916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0971980Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0972202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0972266Z layer_outputs = layer_module( 2025-08-14T21:46:54.0972495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0972566Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0972779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0972862Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0973077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0973152Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0973374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 565, in forward 2025-08-14T21:46:54.0973471Z attn_output = torch.matmul(attn_weights, value_states) 2025-08-14T21:46:54.0973474Z 2025-08-14T21:46:54.0973575Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0973758Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0973818Z return mod(**inputs) 2025-08-14T21:46:54.0974040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0974107Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0974330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0974397Z layer_outputs = layer_module( 2025-08-14T21:46:54.0974599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0974677Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0974889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0974963Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0975184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0975260Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0975483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 567, in forward 2025-08-14T21:46:54.0975581Z attn_output = attn_output.transpose(1, 2).contiguous() 2025-08-14T21:46:54.0975585Z 2025-08-14T21:46:54.0975679Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0975870Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0975930Z return mod(**inputs) 2025-08-14T21:46:54.0976154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0976220Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0976450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0976527Z layer_outputs = layer_module( 2025-08-14T21:46:54.0976730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0976813Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0977035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 705, in forward 2025-08-14T21:46:54.0977126Z cross_attention_outputs = self.layer[1]( 2025-08-14T21:46:54.0977342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 635, in forward 2025-08-14T21:46:54.0977419Z attention_output = self.EncDecAttention( 2025-08-14T21:46:54.0977635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 569, in forward 2025-08-14T21:46:54.0977714Z attn_output = self.o(attn_output) 2025-08-14T21:46:54.0977732Z 2025-08-14T21:46:54.0977807Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0977907Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0978099Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0978164Z return mod(**inputs) 2025-08-14T21:46:54.0978391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0978461Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0978682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0978757Z layer_outputs = layer_module( 2025-08-14T21:46:54.0978966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0979041Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0979269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0979355Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0979582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 341, in forward 2025-08-14T21:46:54.0979674Z forwarded_states = self.layer_norm(hidden_states) 2025-08-14T21:46:54.0979894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 261, in forward 2025-08-14T21:46:54.0979974Z return self.weight * hidden_states 2025-08-14T21:46:54.0979977Z 2025-08-14T21:46:54.0980074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0980265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0980327Z return mod(**inputs) 2025-08-14T21:46:54.0980549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0980630Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0980852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0980921Z layer_outputs = layer_module( 2025-08-14T21:46:54.0981135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0981211Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0981436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0981523Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0981740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0981876Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0982090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 287, in forward 2025-08-14T21:46:54.0982169Z hidden_states = self.wi(hidden_states) 2025-08-14T21:46:54.0982173Z 2025-08-14T21:46:54.0982278Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0982462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0982546Z return mod(**inputs) 2025-08-14T21:46:54.0982763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0982830Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0983053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0983117Z layer_outputs = layer_module( 2025-08-14T21:46:54.0983327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0983411Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0983624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0983714Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0983927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0984033Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0984250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 288, in forward 2025-08-14T21:46:54.0984323Z hidden_states = self.act(hidden_states) 2025-08-14T21:46:54.0984327Z 2025-08-14T21:46:54.0984425Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0984727Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0984854Z return mod(**inputs) 2025-08-14T21:46:54.0985119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1762, in forward 2025-08-14T21:46:54.0985195Z decoder_outputs = self.decoder( 2025-08-14T21:46:54.0985462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1092, in forward 2025-08-14T21:46:54.0985540Z layer_outputs = layer_module( 2025-08-14T21:46:54.0985774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:46:54.0985863Z return super().__call__(*args, **kwargs) 2025-08-14T21:46:54.0986111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 731, in forward 2025-08-14T21:46:54.0986199Z hidden_states = self.layer[-1](hidden_states) 2025-08-14T21:46:54.0986441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 342, in forward 2025-08-14T21:46:54.0986554Z forwarded_states = self.DenseReluDense(forwarded_states) 2025-08-14T21:46:54.0986832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 296, in forward 2025-08-14T21:46:54.0986906Z hidden_states = self.wo(hidden_states) 2025-08-14T21:46:54.0986911Z 2025-08-14T21:46:54.0986983Z cudagraph partition due to non gpu ops 2025-08-14T21:46:54.0987084Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0987262Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0987329Z return mod(**inputs) 2025-08-14T21:46:54.0987545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1789, in forward 2025-08-14T21:46:54.0987691Z sequence_output = sequence_output * (self.model_dim**-0.5) 2025-08-14T21:46:54.0987696Z 2025-08-14T21:46:54.0987796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0988001Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0988069Z return mod(**inputs) 2025-08-14T21:46:54.0988357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1791, in forward 2025-08-14T21:46:54.0988505Z lm_logits = self.lm_head(sequence_output) 2025-08-14T21:46:54.0988509Z 2025-08-14T21:46:54.0988623Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0988830Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0988898Z return mod(**inputs) 2025-08-14T21:46:54.0989162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1798, in forward 2025-08-14T21:46:54.0989312Z loss = loss_fct(lm_logits.view(-1, lm_logits.size(-1)), labels.view(-1)) 2025-08-14T21:46:54.0989338Z 2025-08-14T21:46:54.0989457Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0989661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0989730Z return mod(**inputs) 2025-08-14T21:46:54.0989991Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1798, in forward 2025-08-14T21:46:54.0990131Z loss = loss_fct(lm_logits.view(-1, lm_logits.size(-1)), labels.view(-1)) 2025-08-14T21:46:54.0990135Z 2025-08-14T21:46:54.0990239Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:46:54.0990453Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:46:54.0990519Z return mod(**inputs) 2025-08-14T21:46:54.0990785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/t5/modeling_t5.py", line 1798, in forward 2025-08-14T21:46:54.0990923Z loss = loss_fct(lm_logits.view(-1, lm_logits.size(-1)), labels.view(-1)) 2025-08-14T21:46:54.0990927Z 2025-08-14T21:47:02.5264075Z Compilation time (from dynamo_timed): 15.675272897 2025-08-14T21:47:02.5398474Z pass 2025-08-14T21:47:02.5402681Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:02.5407202Z TIMING: _recursive_pre_grad_passes:0.00979 _recursive_joint_graph_passes:0.51628 _recursive_post_grad_passes:0.16871 async_compile.wait:0.6744 code_gen:8.10424 inductor_compile:9.53599 backend_compile:13.1548 gc:0.0002 entire_frame_compile:15.67527 total_wall_time:15.67527 2025-08-14T21:47:02.5408674Z STATS: call_* op count: 810 | FakeTensorMode.__torch_dispatch__:20429 | FakeTensor.__torch_dispatch__:5656 | ProxyTorchDispatchMode.__torch_dispatch__:7292 2025-08-14T21:47:02.5409146Z Dynamo produced 1 graphs covering 810 ops with 0 graph breaks (0 unique) 2025-08-14T21:47:06.7149070Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:47:06.7149930Z from pkg_resources import resource_filename 2025-08-14T21:47:07.2753937Z 2025-08-14T21:47:08.2601337Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:47:08.2602447Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:47:08.2615366Z cpu eval T5Small 2025-08-14T21:47:09.4428463Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:09.7455447Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:10.1232233Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:19.7325031Z Compilation time (from dynamo_timed): 8.333625682 2025-08-14T21:47:19.7453630Z pass 2025-08-14T21:47:19.7457671Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:19.7459496Z TIMING: _recursive_pre_grad_passes:0.00995 async_compile.wait:0.00543 backend_compile:5.79409 gc:0.00185 entire_frame_compile:8.33363 total_wall_time:8.33363 2025-08-14T21:47:19.7460111Z STATS: call_* op count: 810 | FakeTensorMode.__torch_dispatch__:2289 | FakeTensor.__torch_dispatch__:17 2025-08-14T21:47:19.7460482Z Dynamo produced 1 graphs covering 810 ops with 0 graph breaks (0 unique) 2025-08-14T21:47:23.7981081Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:47:23.7982401Z from pkg_resources import resource_filename 2025-08-14T21:47:24.3315652Z 2025-08-14T21:47:26.7328996Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:47:26.7333138Z loading model: 0it [00:02, ?it/s] 2025-08-14T21:47:26.7345997Z cpu eval TrOCRForCausalLM 2025-08-14T21:47:26.8695687Z WARNING:common:fp64 golden ref were not generated for TrOCRForCausalLM. Setting accuracy check to cosine 2025-08-14T21:47:26.8974918Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:27.0983609Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:27.2801100Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:34.0422763Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0426912Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0428917Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0429266Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0433970Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0440187Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0442382Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0447043Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0451473Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0453768Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0454143Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0458729Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0462956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0464956Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0465402Z return mod(**inputs) 2025-08-14T21:47:34.0470623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0472559Z outputs = self.model.decoder( 2025-08-14T21:47:34.0477263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0480917Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0484919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0487722Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0491604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0495364Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0495995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0496901Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0499929Z 2025-08-14T21:47:34.0500432Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0503692Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0507505Z return mod(**inputs) 2025-08-14T21:47:34.0511437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0515331Z outputs = self.model.decoder( 2025-08-14T21:47:34.0519808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0521400Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0521917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0526052Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0528217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0529011Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0529412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0529790Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0529933Z 2025-08-14T21:47:34.0530037Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0530385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0530690Z return mod(**inputs) 2025-08-14T21:47:34.0531038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0531406Z outputs = self.model.decoder( 2025-08-14T21:47:34.0531764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0532121Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0532450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0532791Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0533156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0533536Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0533915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0534293Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0534425Z 2025-08-14T21:47:34.0534502Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0534701Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0534890Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0535107Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0535439Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0535744Z return mod(**inputs) 2025-08-14T21:47:34.0536079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0536435Z outputs = self.model.decoder( 2025-08-14T21:47:34.0536793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0537178Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0537503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0537833Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0538226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0538615Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0538982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0539344Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0539475Z 2025-08-14T21:47:34.0539599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0539967Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0540261Z return mod(**inputs) 2025-08-14T21:47:34.0540599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0540956Z outputs = self.model.decoder( 2025-08-14T21:47:34.0541309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0541661Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0542028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0542374Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0542732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0543141Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0543317Z 2025-08-14T21:47:34.0543417Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0543752Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0544051Z return mod(**inputs) 2025-08-14T21:47:34.0544389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0544849Z outputs = self.model.decoder( 2025-08-14T21:47:34.0545204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0545562Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0545882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0546217Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0546567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0546962Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0547323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0547640Z return self.act(input) 2025-08-14T21:47:34.0547742Z 2025-08-14T21:47:34.0547838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0548169Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0548465Z return mod(**inputs) 2025-08-14T21:47:34.0548788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0549142Z outputs = self.model.decoder( 2025-08-14T21:47:34.0549484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0549835Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0550142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0550472Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0550822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0551196Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0551332Z 2025-08-14T21:47:34.0551425Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0551749Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0552045Z return mod(**inputs) 2025-08-14T21:47:34.0552387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0552764Z outputs = self.model.decoder( 2025-08-14T21:47:34.0553108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0553453Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0553767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0554095Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0554451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0554845Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0555225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0555628Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0555785Z 2025-08-14T21:47:34.0555890Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0556214Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0556516Z return mod(**inputs) 2025-08-14T21:47:34.0556851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0557203Z outputs = self.model.decoder( 2025-08-14T21:47:34.0557557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0557913Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0558232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0558561Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0558924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0559307Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0559689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0560051Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0560181Z 2025-08-14T21:47:34.0560274Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0560610Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0560907Z return mod(**inputs) 2025-08-14T21:47:34.0561243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0561600Z outputs = self.model.decoder( 2025-08-14T21:47:34.0561950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0562302Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0562622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0562957Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0563311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0563691Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0564115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0564486Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0564617Z 2025-08-14T21:47:34.0564694Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0564895Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0565108Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0565317Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0565666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0565963Z return mod(**inputs) 2025-08-14T21:47:34.0566293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0566636Z outputs = self.model.decoder( 2025-08-14T21:47:34.0566979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0567355Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0567668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0568003Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0568367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0568752Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0569123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0569487Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0569613Z 2025-08-14T21:47:34.0569717Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0570047Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0570340Z return mod(**inputs) 2025-08-14T21:47:34.0570677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0571033Z outputs = self.model.decoder( 2025-08-14T21:47:34.0571379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0571737Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0572060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0572392Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0572744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0573145Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0573303Z 2025-08-14T21:47:34.0573409Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0573740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0574033Z return mod(**inputs) 2025-08-14T21:47:34.0574366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0574727Z outputs = self.model.decoder( 2025-08-14T21:47:34.0575068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0575426Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0575743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0576075Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0576426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0576842Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0577206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0577519Z return self.act(input) 2025-08-14T21:47:34.0577628Z 2025-08-14T21:47:34.0577739Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0578069Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0578388Z return mod(**inputs) 2025-08-14T21:47:34.0578713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0579069Z outputs = self.model.decoder( 2025-08-14T21:47:34.0579422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0579781Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0580114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0580474Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0580841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0581208Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0581343Z 2025-08-14T21:47:34.0581440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0581777Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0582081Z return mod(**inputs) 2025-08-14T21:47:34.0582414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0582782Z outputs = self.model.decoder( 2025-08-14T21:47:34.0583129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0583478Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0583797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0584126Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0584482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0585155Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0585536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0585927Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0586080Z 2025-08-14T21:47:34.0586181Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0586506Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0586811Z return mod(**inputs) 2025-08-14T21:47:34.0587140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0587487Z outputs = self.model.decoder( 2025-08-14T21:47:34.0587833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0588189Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0588506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0588840Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0589195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0589575Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0589987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0590356Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0590488Z 2025-08-14T21:47:34.0590581Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0590938Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0591318Z return mod(**inputs) 2025-08-14T21:47:34.0591648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0592002Z outputs = self.model.decoder( 2025-08-14T21:47:34.0592344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0592700Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0593022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0593395Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0593752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0594137Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0594519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0594893Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0595027Z 2025-08-14T21:47:34.0595102Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0595301Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0595497Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0595710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0596044Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0596350Z return mod(**inputs) 2025-08-14T21:47:34.0596685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0597039Z outputs = self.model.decoder( 2025-08-14T21:47:34.0597394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0597754Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0598070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0598406Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0598769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0599152Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0599525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0599901Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0600030Z 2025-08-14T21:47:34.0600133Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0600464Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0600767Z return mod(**inputs) 2025-08-14T21:47:34.0601102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0601464Z outputs = self.model.decoder( 2025-08-14T21:47:34.0601812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0602173Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0602518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0602859Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0603205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0603601Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0603758Z 2025-08-14T21:47:34.0603876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0604213Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0604511Z return mod(**inputs) 2025-08-14T21:47:34.0604841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0605192Z outputs = self.model.decoder( 2025-08-14T21:47:34.0605529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0605881Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0606234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0606562Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0606916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0607309Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0607662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0607966Z return self.act(input) 2025-08-14T21:47:34.0608074Z 2025-08-14T21:47:34.0608166Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0608492Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0608787Z return mod(**inputs) 2025-08-14T21:47:34.0609111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0609466Z outputs = self.model.decoder( 2025-08-14T21:47:34.0609812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0610159Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0610473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0610803Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0611153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0611505Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0611635Z 2025-08-14T21:47:34.0611727Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0612051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0612342Z return mod(**inputs) 2025-08-14T21:47:34.0612674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0613025Z outputs = self.model.decoder( 2025-08-14T21:47:34.0613367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0613712Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0614025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0614356Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0614702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0615081Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0615471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0615866Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0616018Z 2025-08-14T21:47:34.0616111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0616455Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0616775Z return mod(**inputs) 2025-08-14T21:47:34.0617108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0617457Z outputs = self.model.decoder( 2025-08-14T21:47:34.0617805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0618160Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0618471Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0618820Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0619177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0619552Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0619914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0620278Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0620399Z 2025-08-14T21:47:34.0620501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0620825Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0621115Z return mod(**inputs) 2025-08-14T21:47:34.0621442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0621796Z outputs = self.model.decoder( 2025-08-14T21:47:34.0622130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0622486Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0622803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0623133Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0623476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0623851Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0624223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0624581Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0624718Z 2025-08-14T21:47:34.0624856Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0625057Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0625248Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0625454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0625788Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0626090Z return mod(**inputs) 2025-08-14T21:47:34.0626415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0626773Z outputs = self.model.decoder( 2025-08-14T21:47:34.0627121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0627475Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0627809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0628144Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0628499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0628872Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0629251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0629633Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0629756Z 2025-08-14T21:47:34.0629857Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0630175Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0630471Z return mod(**inputs) 2025-08-14T21:47:34.0630804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0631176Z outputs = self.model.decoder( 2025-08-14T21:47:34.0631515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0631868Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0632189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0632514Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0632870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0633269Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0633425Z 2025-08-14T21:47:34.0633524Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0633843Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0634143Z return mod(**inputs) 2025-08-14T21:47:34.0634475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0634824Z outputs = self.model.decoder( 2025-08-14T21:47:34.0635165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0635518Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0635834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0636158Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0636510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0636900Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0637252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0637557Z return self.act(input) 2025-08-14T21:47:34.0637662Z 2025-08-14T21:47:34.0637755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0638081Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0638371Z return mod(**inputs) 2025-08-14T21:47:34.0638703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0639061Z outputs = self.model.decoder( 2025-08-14T21:47:34.0639405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0639751Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0640065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0640410Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0640763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0641123Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0641252Z 2025-08-14T21:47:34.0641344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0641687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0641997Z return mod(**inputs) 2025-08-14T21:47:34.0642326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0642678Z outputs = self.model.decoder( 2025-08-14T21:47:34.0643021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0643366Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0643686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0644034Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0644378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0644756Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0645126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0645518Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0645669Z 2025-08-14T21:47:34.0645762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0646090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0646386Z return mod(**inputs) 2025-08-14T21:47:34.0646709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0647064Z outputs = self.model.decoder( 2025-08-14T21:47:34.0647409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0647756Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0648065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0648400Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0648756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0649132Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0649495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0649856Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0649981Z 2025-08-14T21:47:34.0650080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0650398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0650695Z return mod(**inputs) 2025-08-14T21:47:34.0651029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0651380Z outputs = self.model.decoder( 2025-08-14T21:47:34.0651718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0652068Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0652385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0652712Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0653075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0653453Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0653823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0654200Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0654339Z 2025-08-14T21:47:34.0654440Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0654637Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0654831Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0655040Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0655368Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0655668Z return mod(**inputs) 2025-08-14T21:47:34.0655997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0656373Z outputs = self.model.decoder( 2025-08-14T21:47:34.0656724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0657077Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0657390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0657723Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0658078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0658449Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0658824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0659187Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0659315Z 2025-08-14T21:47:34.0659418Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0659740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0660038Z return mod(**inputs) 2025-08-14T21:47:34.0660371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0660723Z outputs = self.model.decoder( 2025-08-14T21:47:34.0661072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0661428Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0661750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0662075Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0662437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0662837Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0662995Z 2025-08-14T21:47:34.0663097Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0663422Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0663721Z return mod(**inputs) 2025-08-14T21:47:34.0664056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0664403Z outputs = self.model.decoder( 2025-08-14T21:47:34.0664812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0665179Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0665526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0665855Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0666215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0666613Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0666977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0667315Z return self.act(input) 2025-08-14T21:47:34.0667425Z 2025-08-14T21:47:34.0667517Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0667842Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0668133Z return mod(**inputs) 2025-08-14T21:47:34.0668465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0668820Z outputs = self.model.decoder( 2025-08-14T21:47:34.0669188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0669532Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0669848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0670179Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0670527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0670888Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0671020Z 2025-08-14T21:47:34.0671114Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0671445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0671735Z return mod(**inputs) 2025-08-14T21:47:34.0672065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0672418Z outputs = self.model.decoder( 2025-08-14T21:47:34.0672754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0673106Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0673420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0673747Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0674095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0674470Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0674842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0675229Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0675383Z 2025-08-14T21:47:34.0675478Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0675806Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0676103Z return mod(**inputs) 2025-08-14T21:47:34.0676429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0676782Z outputs = self.model.decoder( 2025-08-14T21:47:34.0677131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0677481Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0677792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0678123Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0678494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0678868Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0679263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0679632Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0679774Z 2025-08-14T21:47:34.0679874Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0680191Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0680484Z return mod(**inputs) 2025-08-14T21:47:34.0680811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0681162Z outputs = self.model.decoder( 2025-08-14T21:47:34.0681498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0681870Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0682185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0682509Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0682865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0683242Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0683615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0683970Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0684106Z 2025-08-14T21:47:34.0684179Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0684374Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0684558Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0684885Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0685221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0685525Z return mod(**inputs) 2025-08-14T21:47:34.0685856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0686222Z outputs = self.model.decoder( 2025-08-14T21:47:34.0686575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0686929Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0687254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0687590Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0687951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0688327Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0688706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0689070Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0689195Z 2025-08-14T21:47:34.0689295Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0689614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0689914Z return mod(**inputs) 2025-08-14T21:47:34.0690248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0690600Z outputs = self.model.decoder( 2025-08-14T21:47:34.0690989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0691347Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0691672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0692000Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0692386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0692810Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0692966Z 2025-08-14T21:47:34.0693059Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0693389Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0693687Z return mod(**inputs) 2025-08-14T21:47:34.0694016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0694389Z outputs = self.model.decoder( 2025-08-14T21:47:34.0694737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0695096Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0695419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0695747Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0696105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0696499Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0696845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0697157Z return self.act(input) 2025-08-14T21:47:34.0697264Z 2025-08-14T21:47:34.0697360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0697689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0697978Z return mod(**inputs) 2025-08-14T21:47:34.0698308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0698659Z outputs = self.model.decoder( 2025-08-14T21:47:34.0698997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0699349Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0699666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0699993Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0700340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0700700Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0700823Z 2025-08-14T21:47:34.0700923Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0701248Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0701538Z return mod(**inputs) 2025-08-14T21:47:34.0701864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0702215Z outputs = self.model.decoder( 2025-08-14T21:47:34.0702551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0702898Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0703211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0703556Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0703903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0704278Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0704675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0705116Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0705303Z 2025-08-14T21:47:34.0705396Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0705725Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0706023Z return mod(**inputs) 2025-08-14T21:47:34.0706346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0706699Z outputs = self.model.decoder( 2025-08-14T21:47:34.0707046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0707422Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0707738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0708074Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0708437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0708817Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0709200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0709567Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0709694Z 2025-08-14T21:47:34.0709797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0710125Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0710430Z return mod(**inputs) 2025-08-14T21:47:34.0710771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0711122Z outputs = self.model.decoder( 2025-08-14T21:47:34.0711474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0711834Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0712159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0712488Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0712863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0713435Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0713905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0714305Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0714513Z 2025-08-14T21:47:34.0714588Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0714819Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0715019Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0715240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0715580Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0715880Z return mod(**inputs) 2025-08-14T21:47:34.0716222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0716585Z outputs = self.model.decoder( 2025-08-14T21:47:34.0717009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0717373Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0717700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0718039Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0718451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0718853Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0719240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0719611Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0719741Z 2025-08-14T21:47:34.0719837Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0720178Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0720504Z return mod(**inputs) 2025-08-14T21:47:34.0720847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0721206Z outputs = self.model.decoder( 2025-08-14T21:47:34.0721562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0721925Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0722244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0722585Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0722948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0723357Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0723518Z 2025-08-14T21:47:34.0723617Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0724072Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0724383Z return mod(**inputs) 2025-08-14T21:47:34.0724814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0725220Z outputs = self.model.decoder( 2025-08-14T21:47:34.0725618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0725980Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0726294Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0726634Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0726999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0727404Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0727756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0728076Z return self.act(input) 2025-08-14T21:47:34.0728180Z 2025-08-14T21:47:34.0728283Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0728614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0728919Z return mod(**inputs) 2025-08-14T21:47:34.0729257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0729621Z outputs = self.model.decoder( 2025-08-14T21:47:34.0729966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0730342Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0730670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0731006Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0731381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0731755Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0731900Z 2025-08-14T21:47:34.0732008Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0732344Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0732662Z return mod(**inputs) 2025-08-14T21:47:34.0732999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0733359Z outputs = self.model.decoder( 2025-08-14T21:47:34.0733703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0734075Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0734386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0744274Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0744884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0745324Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0745735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0746148Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0746330Z 2025-08-14T21:47:34.0746433Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0746786Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0747103Z return mod(**inputs) 2025-08-14T21:47:34.0747440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0747812Z outputs = self.model.decoder( 2025-08-14T21:47:34.0748171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0748533Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0748867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0749210Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0749572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0749949Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0750334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0750699Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0750826Z 2025-08-14T21:47:34.0750931Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0751256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0751560Z return mod(**inputs) 2025-08-14T21:47:34.0751894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0752244Z outputs = self.model.decoder( 2025-08-14T21:47:34.0752592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0752946Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0753330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0753663Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0754025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0754430Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0754840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0755205Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0755342Z 2025-08-14T21:47:34.0755420Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0755619Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0755807Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0756026Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0756363Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0756689Z return mod(**inputs) 2025-08-14T21:47:34.0757017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0757376Z outputs = self.model.decoder( 2025-08-14T21:47:34.0757725Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0758077Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0758396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0758727Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0759080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0759452Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0759827Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0760193Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0760318Z 2025-08-14T21:47:34.0760414Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0760743Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0761045Z return mod(**inputs) 2025-08-14T21:47:34.0761379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0761728Z outputs = self.model.decoder( 2025-08-14T21:47:34.0762079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0762433Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0762747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0763082Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0763443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0763933Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0764095Z 2025-08-14T21:47:34.0764192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0764527Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0764832Z return mod(**inputs) 2025-08-14T21:47:34.0765172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0765529Z outputs = self.model.decoder( 2025-08-14T21:47:34.0765900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0766262Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0766577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0766910Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0767283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0767696Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0768047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0768366Z return self.act(input) 2025-08-14T21:47:34.0768467Z 2025-08-14T21:47:34.0768570Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0768903Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0769199Z return mod(**inputs) 2025-08-14T21:47:34.0769552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0769908Z outputs = self.model.decoder( 2025-08-14T21:47:34.0770249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0770606Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0770925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0771258Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0771608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0771970Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0772096Z 2025-08-14T21:47:34.0772199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0772528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0772830Z return mod(**inputs) 2025-08-14T21:47:34.0773149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0773506Z outputs = self.model.decoder( 2025-08-14T21:47:34.0773853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0774204Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0774515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0774844Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0775203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0775571Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0775946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0776332Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0776484Z 2025-08-14T21:47:34.0776584Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0776905Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0777204Z return mod(**inputs) 2025-08-14T21:47:34.0777532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0777885Z outputs = self.model.decoder( 2025-08-14T21:47:34.0778226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0778595Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0778919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0779244Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0779614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0779998Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0780391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0780746Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0780876Z 2025-08-14T21:47:34.0780970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0781299Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0781589Z return mod(**inputs) 2025-08-14T21:47:34.0781920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0782293Z outputs = self.model.decoder( 2025-08-14T21:47:34.0782644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0782993Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0783310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0783644Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0783997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0784370Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0785054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0785452Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0785590Z 2025-08-14T21:47:34.0785668Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0785878Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0786089Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0786311Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0786641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0786950Z return mod(**inputs) 2025-08-14T21:47:34.0787290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0787644Z outputs = self.model.decoder( 2025-08-14T21:47:34.0787996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0788349Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0788674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0789006Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0789365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0789749Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0790119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0790485Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0790617Z 2025-08-14T21:47:34.0790711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0791042Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0791333Z return mod(**inputs) 2025-08-14T21:47:34.0791723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0792084Z outputs = self.model.decoder( 2025-08-14T21:47:34.0792433Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0792782Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0793124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0793499Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0793847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0794244Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0794411Z 2025-08-14T21:47:34.0794507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0794834Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0795156Z return mod(**inputs) 2025-08-14T21:47:34.0795497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0795855Z outputs = self.model.decoder( 2025-08-14T21:47:34.0796201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0796558Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0796875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0797207Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0797558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0797957Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0798316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0798634Z return self.act(input) 2025-08-14T21:47:34.0798737Z 2025-08-14T21:47:34.0798830Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0799164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0799466Z return mod(**inputs) 2025-08-14T21:47:34.0799802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0800156Z outputs = self.model.decoder( 2025-08-14T21:47:34.0800503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0800850Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0801166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0801499Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0801857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0802216Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0802347Z 2025-08-14T21:47:34.0802441Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0802771Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0803063Z return mod(**inputs) 2025-08-14T21:47:34.0803393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0803749Z outputs = self.model.decoder( 2025-08-14T21:47:34.0804096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0804456Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0804776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0805105Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0805467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0805846Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0806245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0806635Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0806783Z 2025-08-14T21:47:34.0806876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0807206Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0807506Z return mod(**inputs) 2025-08-14T21:47:34.0807852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0808197Z outputs = self.model.decoder( 2025-08-14T21:47:34.0808543Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0808892Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0809206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0809539Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0809891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0810263Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0810627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0810988Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0811110Z 2025-08-14T21:47:34.0811211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0811537Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0811829Z return mod(**inputs) 2025-08-14T21:47:34.0812158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0812511Z outputs = self.model.decoder( 2025-08-14T21:47:34.0812848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0813200Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0813514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0813846Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0814192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0814565Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0814938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0815293Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0815430Z 2025-08-14T21:47:34.0815503Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0815694Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0815885Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0816088Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0816416Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0816715Z return mod(**inputs) 2025-08-14T21:47:34.0817052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0817409Z outputs = self.model.decoder( 2025-08-14T21:47:34.0817757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0818131Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0818441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0818789Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0819145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0819519Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0819897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0820262Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0820401Z 2025-08-14T21:47:34.0820501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0820819Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0821113Z return mod(**inputs) 2025-08-14T21:47:34.0821441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0821792Z outputs = self.model.decoder( 2025-08-14T21:47:34.0822127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0822475Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0822787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0823107Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0823459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0823852Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0824009Z 2025-08-14T21:47:34.0824109Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0824427Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0824723Z return mod(**inputs) 2025-08-14T21:47:34.0825139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0825488Z outputs = self.model.decoder( 2025-08-14T21:47:34.0825836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0826187Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0826508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0826834Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0827187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0827584Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0827940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0828249Z return self.act(input) 2025-08-14T21:47:34.0828358Z 2025-08-14T21:47:34.0828450Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0828780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0829070Z return mod(**inputs) 2025-08-14T21:47:34.0829419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0829777Z outputs = self.model.decoder( 2025-08-14T21:47:34.0830122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0830465Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0830795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0831142Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0831489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0831850Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0831981Z 2025-08-14T21:47:34.0832074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0832397Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0832686Z return mod(**inputs) 2025-08-14T21:47:34.0833029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0833379Z outputs = self.model.decoder( 2025-08-14T21:47:34.0833717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0834067Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0834382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0834707Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0835050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0835424Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0835946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0836345Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0836496Z 2025-08-14T21:47:34.0836589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0836921Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0837222Z return mod(**inputs) 2025-08-14T21:47:34.0837552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0837912Z outputs = self.model.decoder( 2025-08-14T21:47:34.0838260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0838614Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0838930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0839268Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0839626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0840002Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0840368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0840733Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0840858Z 2025-08-14T21:47:34.0840960Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0841282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0841582Z return mod(**inputs) 2025-08-14T21:47:34.0841911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0842284Z outputs = self.model.decoder( 2025-08-14T21:47:34.0842626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0842980Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0843327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0843656Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0844025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0844399Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0844769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0845125Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0845258Z 2025-08-14T21:47:34.0845329Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0845524Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0845723Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0845933Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0846258Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0846554Z return mod(**inputs) 2025-08-14T21:47:34.0846878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0847228Z outputs = self.model.decoder( 2025-08-14T21:47:34.0847575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0847920Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0848233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0848562Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0848919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0849287Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0849659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0850019Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0850144Z 2025-08-14T21:47:34.0850244Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0850561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0850855Z return mod(**inputs) 2025-08-14T21:47:34.0851181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0851523Z outputs = self.model.decoder( 2025-08-14T21:47:34.0851863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0852210Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0852524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0852848Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0853198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0853589Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0853741Z 2025-08-14T21:47:34.0853838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0854156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0854451Z return mod(**inputs) 2025-08-14T21:47:34.0854791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0855143Z outputs = self.model.decoder( 2025-08-14T21:47:34.0855493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0855845Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0856179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0856523Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0856879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0857275Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0857622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0857938Z return self.act(input) 2025-08-14T21:47:34.0858062Z 2025-08-14T21:47:34.0858154Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0858479Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0858768Z return mod(**inputs) 2025-08-14T21:47:34.0859098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0859453Z outputs = self.model.decoder( 2025-08-14T21:47:34.0859790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0860140Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0860451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0860778Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0861126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0861487Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0861611Z 2025-08-14T21:47:34.0861709Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0862031Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0862317Z return mod(**inputs) 2025-08-14T21:47:34.0862648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0862997Z outputs = self.model.decoder( 2025-08-14T21:47:34.0863332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0863683Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0863999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0864331Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0864676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0865118Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0865495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 199, in forward 2025-08-14T21:47:34.0865886Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:47:34.0866036Z 2025-08-14T21:47:34.0866130Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0866455Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0866753Z return mod(**inputs) 2025-08-14T21:47:34.0867077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0867451Z outputs = self.model.decoder( 2025-08-14T21:47:34.0867802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0868155Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0868478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0868811Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0869178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0869268Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0869494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 218, in forward 2025-08-14T21:47:34.0869576Z key_states = self.k_proj(current_states) 2025-08-14T21:47:34.0869580Z 2025-08-14T21:47:34.0869672Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0869878Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0869937Z return mod(**inputs) 2025-08-14T21:47:34.0870166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0870238Z outputs = self.model.decoder( 2025-08-14T21:47:34.0870465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0870536Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0870736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0870806Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0871039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0871128Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0871355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 219, in forward 2025-08-14T21:47:34.0871439Z value_states = self.v_proj(current_states) 2025-08-14T21:47:34.0871442Z 2025-08-14T21:47:34.0871515Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0871590Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0871659Z cudagraph partition due to non gpu ops 2025-08-14T21:47:34.0871751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0871936Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0871995Z return mod(**inputs) 2025-08-14T21:47:34.0872223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0872294Z outputs = self.model.decoder( 2025-08-14T21:47:34.0872523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0872594Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0872795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0872868Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0873105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 367, in forward 2025-08-14T21:47:34.0873192Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:47:34.0873425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 291, in forward 2025-08-14T21:47:34.0873500Z attn_output = self.out_proj(attn_output) 2025-08-14T21:47:34.0873503Z 2025-08-14T21:47:34.0873610Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0873801Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0873860Z return mod(**inputs) 2025-08-14T21:47:34.0874087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0874174Z outputs = self.model.decoder( 2025-08-14T21:47:34.0874404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0874491Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0874691Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0874764Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0874998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0875108Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0875128Z 2025-08-14T21:47:34.0875226Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0875406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0875465Z return mod(**inputs) 2025-08-14T21:47:34.0875700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0875768Z outputs = self.model.decoder( 2025-08-14T21:47:34.0875995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0876067Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0876265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0876342Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0876571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 401, in forward 2025-08-14T21:47:34.0876678Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:47:34.0876877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:47:34.0876940Z return self.act(input) 2025-08-14T21:47:34.0876944Z 2025-08-14T21:47:34.0877036Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0877223Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0877281Z return mod(**inputs) 2025-08-14T21:47:34.0877516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 823, in forward 2025-08-14T21:47:34.0877579Z outputs = self.model.decoder( 2025-08-14T21:47:34.0877808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 644, in forward 2025-08-14T21:47:34.0877881Z layer_outputs = decoder_layer( 2025-08-14T21:47:34.0878080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:47:34.0878158Z return super().__call__(*args, **kwargs) 2025-08-14T21:47:34.0878387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 403, in forward 2025-08-14T21:47:34.0878461Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:47:34.0878465Z 2025-08-14T21:47:34.0878562Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0878742Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0878801Z return mod(**inputs) 2025-08-14T21:47:34.0879059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 839, in forward 2025-08-14T21:47:34.0879147Z logits = self.output_projection(outputs[0]) 2025-08-14T21:47:34.0879151Z 2025-08-14T21:47:34.0879247Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:47:34.0879425Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:47:34.0879497Z return mod(**inputs) 2025-08-14T21:47:34.0879737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/trocr/modeling_trocr.py", line 844, in forward 2025-08-14T21:47:34.0879885Z loss = loss_fct(logits.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:47:34.0879889Z 2025-08-14T21:47:41.5104627Z Compilation time (from dynamo_timed): 13.171461664 2025-08-14T21:47:41.5133420Z pass 2025-08-14T21:47:41.5135667Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:41.5139951Z TIMING: _recursive_pre_grad_passes:0.00651 _recursive_joint_graph_passes:0.62256 _recursive_post_grad_passes:0.07073 async_compile.wait:0.75565 code_gen:6.92727 inductor_compile:7.98367 backend_compile:11.10122 gc:0.00029 entire_frame_compile:13.17146 total_wall_time:13.17146 2025-08-14T21:47:41.5141385Z STATS: call_* op count: 443 | FakeTensorMode.__torch_dispatch__:14347 | FakeTensor.__torch_dispatch__:4678 | ProxyTorchDispatchMode.__torch_dispatch__:5467 2025-08-14T21:47:41.5141855Z Dynamo produced 1 graphs covering 443 ops with 0 graph breaks (0 unique) 2025-08-14T21:47:45.7043811Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:47:45.7044651Z from pkg_resources import resource_filename 2025-08-14T21:47:46.3088765Z 2025-08-14T21:47:52.1617186Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:47:52.1621468Z loading model: 0it [00:05, ?it/s] 2025-08-14T21:47:52.1638852Z cpu eval XGLMForCausalLM 2025-08-14T21:47:52.5572511Z WARNING:common:fp64 golden ref were not generated for XGLMForCausalLM. Setting accuracy check to cosine 2025-08-14T21:47:52.6326903Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:53.0545304Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:47:53.4675772Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:48:06.4910103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.4913915Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.4915944Z return mod(**inputs) 2025-08-14T21:48:06.4916509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.4921210Z outputs = self.model( 2025-08-14T21:48:06.4926423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.4927680Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.4931634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.4933226Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.4933692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.4934164Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.4938502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.4943056Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.4947255Z 2025-08-14T21:48:06.4952121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.4956588Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.4960770Z return mod(**inputs) 2025-08-14T21:48:06.4965761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.4969886Z outputs = self.model( 2025-08-14T21:48:06.4974570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.4975064Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.4975412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.4975763Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.4976134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.4976699Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.4977087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.4977460Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.4977593Z 2025-08-14T21:48:06.4977699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.4978052Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.4978372Z return mod(**inputs) 2025-08-14T21:48:06.4978719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.4979066Z outputs = self.model( 2025-08-14T21:48:06.4979402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.4979771Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.4980098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.4980453Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.4980821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.4981223Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.4981590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.4981979Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.4982142Z 2025-08-14T21:48:06.4982241Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.4982578Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.4982879Z return mod(**inputs) 2025-08-14T21:48:06.4983208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.4983555Z outputs = self.model( 2025-08-14T21:48:06.4983880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.4984234Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.4984559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.4985133Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.4985485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.4985876Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.4986313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.4986731Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.4986909Z 2025-08-14T21:48:06.4987008Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.4987341Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.4987672Z return mod(**inputs) 2025-08-14T21:48:06.4987997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.4988385Z outputs = self.model( 2025-08-14T21:48:06.4988711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.4989061Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.4989375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.4989711Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.4990093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.4990458Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.4990829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.4991197Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.4991330Z 2025-08-14T21:48:06.4991431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.4991753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.4992053Z return mod(**inputs) 2025-08-14T21:48:06.4992379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.4992729Z outputs = self.model( 2025-08-14T21:48:06.4993053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.4993404Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.4993724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.4994050Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.4994404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.4994776Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.4995143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.4995506Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.4995653Z 2025-08-14T21:48:06.4995746Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.4996075Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.4996367Z return mod(**inputs) 2025-08-14T21:48:06.4996690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.4997040Z outputs = self.model( 2025-08-14T21:48:06.4997368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.4997712Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.4998032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.4998362Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.4998710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.4999075Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.4999493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.4999901Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5000067Z 2025-08-14T21:48:06.5000163Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5000520Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5000842Z return mod(**inputs) 2025-08-14T21:48:06.5001171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5001511Z outputs = self.model( 2025-08-14T21:48:06.5001842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5002194Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5002507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5002854Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5003208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5003579Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5003939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5004300Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5004424Z 2025-08-14T21:48:06.5004525Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5004855Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5005153Z return mod(**inputs) 2025-08-14T21:48:06.5005481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5005831Z outputs = self.model( 2025-08-14T21:48:06.5006155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5006504Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5006826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5007160Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5007506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5007907Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5008065Z 2025-08-14T21:48:06.5008167Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5008497Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5008795Z return mod(**inputs) 2025-08-14T21:48:06.5009132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5009478Z outputs = self.model( 2025-08-14T21:48:06.5009801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5010153Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5010476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5010809Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5011156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5011548Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5011921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5012237Z return self.act(input) 2025-08-14T21:48:06.5012345Z 2025-08-14T21:48:06.5012439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5012768Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5013084Z return mod(**inputs) 2025-08-14T21:48:06.5013406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5013767Z outputs = self.model( 2025-08-14T21:48:06.5014093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5014435Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5014775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5015117Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5015491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5015848Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5015984Z 2025-08-14T21:48:06.5016078Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5016421Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5016728Z return mod(**inputs) 2025-08-14T21:48:06.5017056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5017404Z outputs = self.model( 2025-08-14T21:48:06.5017736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5018081Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5018414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5018750Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5019113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5019477Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5019844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5020232Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5020381Z 2025-08-14T21:48:06.5020474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5020802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5021100Z return mod(**inputs) 2025-08-14T21:48:06.5021428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5021766Z outputs = self.model( 2025-08-14T21:48:06.5022093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5022437Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5022750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5023084Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5023434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5023805Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5024162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5024519Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5024664Z 2025-08-14T21:48:06.5024835Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5025178Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5025470Z return mod(**inputs) 2025-08-14T21:48:06.5025819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5026168Z outputs = self.model( 2025-08-14T21:48:06.5026509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5026862Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5027189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5027520Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5027869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5028259Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5028624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5029006Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5029157Z 2025-08-14T21:48:06.5029252Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5029583Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5029884Z return mod(**inputs) 2025-08-14T21:48:06.5030200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5030539Z outputs = self.model( 2025-08-14T21:48:06.5030865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5031214Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5031525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5031853Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5032201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5032566Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5032932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5033333Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5033505Z 2025-08-14T21:48:06.5033606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5033928Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5034227Z return mod(**inputs) 2025-08-14T21:48:06.5034555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5034900Z outputs = self.model( 2025-08-14T21:48:06.5035218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5035568Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5035887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5036211Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5036564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5036933Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5037329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5037692Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5037829Z 2025-08-14T21:48:06.5037926Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5038256Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5038577Z return mod(**inputs) 2025-08-14T21:48:06.5038906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5039272Z outputs = self.model( 2025-08-14T21:48:06.5039602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5039946Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5040262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5040593Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5040964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5041334Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5041701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5042070Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5042208Z 2025-08-14T21:48:06.5042301Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5042627Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5042925Z return mod(**inputs) 2025-08-14T21:48:06.5043250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5043586Z outputs = self.model( 2025-08-14T21:48:06.5043916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5044268Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5044578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5044912Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5045261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5045629Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5045988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5046382Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5046545Z 2025-08-14T21:48:06.5046646Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5046973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5047262Z return mod(**inputs) 2025-08-14T21:48:06.5047585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5047928Z outputs = self.model( 2025-08-14T21:48:06.5048247Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5048594Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5048910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5049239Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5049579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5049964Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5050335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5050685Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5050820Z 2025-08-14T21:48:06.5050927Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5051259Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5051581Z return mod(**inputs) 2025-08-14T21:48:06.5051908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5052262Z outputs = self.model( 2025-08-14T21:48:06.5052599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5052957Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5053279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5053652Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5054003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5054392Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5054556Z 2025-08-14T21:48:06.5054652Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5054981Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5055279Z return mod(**inputs) 2025-08-14T21:48:06.5055597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5055940Z outputs = self.model( 2025-08-14T21:48:06.5056264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5056607Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5056925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5057253Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5057603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5057987Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5058341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5058656Z return self.act(input) 2025-08-14T21:48:06.5058756Z 2025-08-14T21:48:06.5058858Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5059181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5059482Z return mod(**inputs) 2025-08-14T21:48:06.5059810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5060148Z outputs = self.model( 2025-08-14T21:48:06.5060475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5060825Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5061143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5061469Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5061818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5062173Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5062296Z 2025-08-14T21:48:06.5062388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5062730Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5063032Z return mod(**inputs) 2025-08-14T21:48:06.5063356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5063710Z outputs = self.model( 2025-08-14T21:48:06.5064038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5064408Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5064723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5065130Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5065483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5065862Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5066248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5066636Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5066786Z 2025-08-14T21:48:06.5066891Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5067224Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5067521Z return mod(**inputs) 2025-08-14T21:48:06.5067856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5068205Z outputs = self.model( 2025-08-14T21:48:06.5068527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5068881Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5069203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5069540Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5069882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5070253Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5070620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5070971Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5071103Z 2025-08-14T21:48:06.5071195Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5071521Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5071819Z return mod(**inputs) 2025-08-14T21:48:06.5072136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5072485Z outputs = self.model( 2025-08-14T21:48:06.5072813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5073152Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5073469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5073802Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5074150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5074515Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5074883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5075282Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5075437Z 2025-08-14T21:48:06.5075538Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5075860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5076163Z return mod(**inputs) 2025-08-14T21:48:06.5077382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5077768Z outputs = self.model( 2025-08-14T21:48:06.5078099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5078450Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5078769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5079094Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5079448Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5079838Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5080198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5080608Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5080788Z 2025-08-14T21:48:06.5080886Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5081217Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5081508Z return mod(**inputs) 2025-08-14T21:48:06.5081835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5082180Z outputs = self.model( 2025-08-14T21:48:06.5082508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5082852Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5083168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5083499Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5083844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5084216Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5084587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5085116Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5085249Z 2025-08-14T21:48:06.5085345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5085683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5085989Z return mod(**inputs) 2025-08-14T21:48:06.5086320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5086663Z outputs = self.model( 2025-08-14T21:48:06.5086995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5087350Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5087666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5088005Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5088361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5088733Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5089129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5089506Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5089644Z 2025-08-14T21:48:06.5089744Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5090083Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5090385Z return mod(**inputs) 2025-08-14T21:48:06.5090752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5091099Z outputs = self.model( 2025-08-14T21:48:06.5091423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5091772Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5092090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5092414Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5092795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5093166Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5093535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5093932Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5094100Z 2025-08-14T21:48:06.5094192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5094522Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5094818Z return mod(**inputs) 2025-08-14T21:48:06.5095135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5095479Z outputs = self.model( 2025-08-14T21:48:06.5095805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5096145Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5096464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5096792Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5097143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5097504Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5097870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5098228Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5098350Z 2025-08-14T21:48:06.5098950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5099278Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5099581Z return mod(**inputs) 2025-08-14T21:48:06.5099911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5100253Z outputs = self.model( 2025-08-14T21:48:06.5100580Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5100930Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5101250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5101574Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5101929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5102337Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5102499Z 2025-08-14T21:48:06.5102600Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5102919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5103214Z return mod(**inputs) 2025-08-14T21:48:06.5103559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5103912Z outputs = self.model( 2025-08-14T21:48:06.5104237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5104584Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5104957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5105290Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5105647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5106060Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5106413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5106732Z return self.act(input) 2025-08-14T21:48:06.5106845Z 2025-08-14T21:48:06.5106942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5107274Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5107569Z return mod(**inputs) 2025-08-14T21:48:06.5107901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5108245Z outputs = self.model( 2025-08-14T21:48:06.5108565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5108917Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5109239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5109567Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5109915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5110274Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5110400Z 2025-08-14T21:48:06.5110503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5110832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5111123Z return mod(**inputs) 2025-08-14T21:48:06.5111447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5111792Z outputs = self.model( 2025-08-14T21:48:06.5112113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5112461Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5112780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5113109Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5113454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5113810Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5113934Z 2025-08-14T21:48:06.5114038Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5114363Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5114664Z return mod(**inputs) 2025-08-14T21:48:06.5115007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5115352Z outputs = self.model( 2025-08-14T21:48:06.5115672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5116035Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5116358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5116696Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5117047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5117421Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5117787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5118167Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5118345Z 2025-08-14T21:48:06.5118438Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5118768Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5119068Z return mod(**inputs) 2025-08-14T21:48:06.5119388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5119738Z outputs = self.model( 2025-08-14T21:48:06.5120064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5120405Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5120724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5121056Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5121410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5121778Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5122147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5122504Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5122628Z 2025-08-14T21:48:06.5122729Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5123053Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5123352Z return mod(**inputs) 2025-08-14T21:48:06.5123677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5124012Z outputs = self.model( 2025-08-14T21:48:06.5124339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5124689Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5125004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5125327Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5125675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5126047Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5126405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5126790Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5126947Z 2025-08-14T21:48:06.5127040Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5127379Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5127675Z return mod(**inputs) 2025-08-14T21:48:06.5128012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5128364Z outputs = self.model( 2025-08-14T21:48:06.5128707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5129071Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5129387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5129715Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5130056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5130427Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5130796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5131212Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5131384Z 2025-08-14T21:48:06.5131476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5131808Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5132109Z return mod(**inputs) 2025-08-14T21:48:06.5132428Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5132777Z outputs = self.model( 2025-08-14T21:48:06.5133107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5133455Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5133766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5134099Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5134450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5134812Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5135181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5135541Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5135671Z 2025-08-14T21:48:06.5135771Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5136092Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5136388Z return mod(**inputs) 2025-08-14T21:48:06.5136713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5137060Z outputs = self.model( 2025-08-14T21:48:06.5137379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5137724Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5138042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5138364Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5138718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5139084Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5139450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5139809Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5139953Z 2025-08-14T21:48:06.5140062Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5140394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5140684Z return mod(**inputs) 2025-08-14T21:48:06.5141032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5141380Z outputs = self.model( 2025-08-14T21:48:06.5141719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5142059Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5142374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5142703Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5143050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5143413Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5143800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5144201Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5144368Z 2025-08-14T21:48:06.5144464Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5144866Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5145177Z return mod(**inputs) 2025-08-14T21:48:06.5145508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5145847Z outputs = self.model( 2025-08-14T21:48:06.5146178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5146534Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5146850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5147183Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5147540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5147914Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5148279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5148640Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5148764Z 2025-08-14T21:48:06.5148865Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5149192Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5149481Z return mod(**inputs) 2025-08-14T21:48:06.5149811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5150156Z outputs = self.model( 2025-08-14T21:48:06.5150478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5150828Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5151143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5151474Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5151817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5152209Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5152365Z 2025-08-14T21:48:06.5152466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5152811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5153113Z return mod(**inputs) 2025-08-14T21:48:06.5153440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5153785Z outputs = self.model( 2025-08-14T21:48:06.5154123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5154487Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5154804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5155133Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5155477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5155865Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5156220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5156542Z return self.act(input) 2025-08-14T21:48:06.5156649Z 2025-08-14T21:48:06.5156742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5157073Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5157371Z return mod(**inputs) 2025-08-14T21:48:06.5157692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5158031Z outputs = self.model( 2025-08-14T21:48:06.5158358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5158695Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5159012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5159343Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5159692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5160040Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5160171Z 2025-08-14T21:48:06.5160266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5160595Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5160887Z return mod(**inputs) 2025-08-14T21:48:06.5161215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5161559Z outputs = self.model( 2025-08-14T21:48:06.5161885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5162231Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5162555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5162886Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5163242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5163610Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5163986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5164376Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5164527Z 2025-08-14T21:48:06.5164621Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5164952Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5165249Z return mod(**inputs) 2025-08-14T21:48:06.5165593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5165937Z outputs = self.model( 2025-08-14T21:48:06.5166265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5166627Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5166950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5167301Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5167651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5168022Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5168382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5168740Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5168880Z 2025-08-14T21:48:06.5168982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5169309Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5169602Z return mod(**inputs) 2025-08-14T21:48:06.5169927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5170274Z outputs = self.model( 2025-08-14T21:48:06.5170594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5170940Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5171255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5171582Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5171922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5172296Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5172662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5173040Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5173196Z 2025-08-14T21:48:06.5173288Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5173616Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5173912Z return mod(**inputs) 2025-08-14T21:48:06.5174229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5174574Z outputs = self.model( 2025-08-14T21:48:06.5174898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5175247Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5175554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5175884Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5176233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5176597Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5176967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5177372Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5177540Z 2025-08-14T21:48:06.5177641Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5177975Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5178275Z return mod(**inputs) 2025-08-14T21:48:06.5178600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5178938Z outputs = self.model( 2025-08-14T21:48:06.5179282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5179654Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5179977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5180307Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5180662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5181037Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5181412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5181790Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5181926Z 2025-08-14T21:48:06.5182020Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5182353Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5182647Z return mod(**inputs) 2025-08-14T21:48:06.5182978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5183326Z outputs = self.model( 2025-08-14T21:48:06.5183657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5183999Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5184320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5184806Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5185164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5185538Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5185911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5186284Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5186421Z 2025-08-14T21:48:06.5186514Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5186842Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5187135Z return mod(**inputs) 2025-08-14T21:48:06.5187463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5187803Z outputs = self.model( 2025-08-14T21:48:06.5188132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5188483Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5188796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5189132Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5189482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5189852Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5190213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5190613Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5190817Z 2025-08-14T21:48:06.5190923Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5191257Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5191554Z return mod(**inputs) 2025-08-14T21:48:06.5191910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5192263Z outputs = self.model( 2025-08-14T21:48:06.5192611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5192961Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5193279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5193611Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5193958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5194354Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5194726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5195080Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5195214Z 2025-08-14T21:48:06.5195310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5195647Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5195948Z return mod(**inputs) 2025-08-14T21:48:06.5196268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5196611Z outputs = self.model( 2025-08-14T21:48:06.5196936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5197280Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5197599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5197928Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5198279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5198664Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5198833Z 2025-08-14T21:48:06.5198930Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5199260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5199559Z return mod(**inputs) 2025-08-14T21:48:06.5199878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5200229Z outputs = self.model( 2025-08-14T21:48:06.5200556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5200903Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5201222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5201553Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5201904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5202288Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5202644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5202957Z return self.act(input) 2025-08-14T21:48:06.5203059Z 2025-08-14T21:48:06.5203152Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5203496Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5203800Z return mod(**inputs) 2025-08-14T21:48:06.5204125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5204462Z outputs = self.model( 2025-08-14T21:48:06.5204804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5205168Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5205477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5205811Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5206165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5206523Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5206649Z 2025-08-14T21:48:06.5206762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5207092Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5207389Z return mod(**inputs) 2025-08-14T21:48:06.5207713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5208052Z outputs = self.model( 2025-08-14T21:48:06.5208381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5208735Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5209045Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5209376Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5209728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5210085Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5210207Z 2025-08-14T21:48:06.5210302Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5210630Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5210931Z return mod(**inputs) 2025-08-14T21:48:06.5211249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5211594Z outputs = self.model( 2025-08-14T21:48:06.5211919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5212271Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5212577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5212909Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5213259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5213628Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5213989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5214374Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5214524Z 2025-08-14T21:48:06.5214624Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5214945Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5215245Z return mod(**inputs) 2025-08-14T21:48:06.5215572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5215918Z outputs = self.model( 2025-08-14T21:48:06.5216253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5216606Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5216920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5217256Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5217612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5218004Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5218369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5218715Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5218844Z 2025-08-14T21:48:06.5218936Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5219266Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5219585Z return mod(**inputs) 2025-08-14T21:48:06.5219905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5220250Z outputs = self.model( 2025-08-14T21:48:06.5220581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5220927Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5221248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5221583Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5221936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5222304Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5222677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5223071Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5223223Z 2025-08-14T21:48:06.5223319Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5223651Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5223954Z return mod(**inputs) 2025-08-14T21:48:06.5224283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5224624Z outputs = self.model( 2025-08-14T21:48:06.5225021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5225381Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5225698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5226036Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5226396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5226777Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5227146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5227564Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5227748Z 2025-08-14T21:48:06.5227846Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5228181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5228480Z return mod(**inputs) 2025-08-14T21:48:06.5228837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5229183Z outputs = self.model( 2025-08-14T21:48:06.5229500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5229849Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5230181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5230532Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5230877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5231248Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5231613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5231975Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5232106Z 2025-08-14T21:48:06.5232225Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5232558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5232859Z return mod(**inputs) 2025-08-14T21:48:06.5233182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5233530Z outputs = self.model( 2025-08-14T21:48:06.5233858Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5234207Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5234518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5234849Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5235203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5235569Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5235938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5236312Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5236450Z 2025-08-14T21:48:06.5236550Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5236872Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5237172Z return mod(**inputs) 2025-08-14T21:48:06.5237500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5237847Z outputs = self.model( 2025-08-14T21:48:06.5238172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5238525Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5238845Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5239170Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5239525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5239900Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5240271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5240662Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5240832Z 2025-08-14T21:48:06.5240926Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5241257Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5241565Z return mod(**inputs) 2025-08-14T21:48:06.5241901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5242247Z outputs = self.model( 2025-08-14T21:48:06.5242591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5242936Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5243269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5243602Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5243950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5244314Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5244686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5245065Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5245190Z 2025-08-14T21:48:06.5245282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5245614Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5245914Z return mod(**inputs) 2025-08-14T21:48:06.5246245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5246591Z outputs = self.model( 2025-08-14T21:48:06.5246925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5247281Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5247599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5247938Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5248295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5248692Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5248850Z 2025-08-14T21:48:06.5248946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5249277Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5249578Z return mod(**inputs) 2025-08-14T21:48:06.5249907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5250246Z outputs = self.model( 2025-08-14T21:48:06.5250576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5250928Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5251244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5251580Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5251934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5252335Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5252687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5253002Z return self.act(input) 2025-08-14T21:48:06.5253103Z 2025-08-14T21:48:06.5253206Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5253530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5253832Z return mod(**inputs) 2025-08-14T21:48:06.5254174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5254523Z outputs = self.model( 2025-08-14T21:48:06.5254842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5255192Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5255524Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5255883Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5256242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5256610Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5256738Z 2025-08-14T21:48:06.5256840Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5257171Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5257482Z return mod(**inputs) 2025-08-14T21:48:06.5257830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5258171Z outputs = self.model( 2025-08-14T21:48:06.5258490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5258833Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5259148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5259470Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5259819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5260188Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5260559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5260938Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5261093Z 2025-08-14T21:48:06.5261186Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5261514Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5261802Z return mod(**inputs) 2025-08-14T21:48:06.5262127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5262472Z outputs = self.model( 2025-08-14T21:48:06.5262796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5263136Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5263452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5263783Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5264131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5264501Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5264961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5265332Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5265457Z 2025-08-14T21:48:06.5265551Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5265884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5266191Z return mod(**inputs) 2025-08-14T21:48:06.5266518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5266856Z outputs = self.model( 2025-08-14T21:48:06.5267203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5267564Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5267877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5268225Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5268587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5268977Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5269338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5269724Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5269873Z 2025-08-14T21:48:06.5269974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5270304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5270612Z return mod(**inputs) 2025-08-14T21:48:06.5270941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5271291Z outputs = self.model( 2025-08-14T21:48:06.5271614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5271966Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5272284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5272616Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5272961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5273338Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5273710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5274109Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5274292Z 2025-08-14T21:48:06.5274388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5274716Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5275016Z return mod(**inputs) 2025-08-14T21:48:06.5275336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5275683Z outputs = self.model( 2025-08-14T21:48:06.5276012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5276360Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5276676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5277014Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5277367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5277734Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5278104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5278467Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5278597Z 2025-08-14T21:48:06.5278698Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5279020Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5279321Z return mod(**inputs) 2025-08-14T21:48:06.5279661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5280034Z outputs = self.model( 2025-08-14T21:48:06.5280361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5280707Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5281036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5281390Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5281738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5282105Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5282470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5282833Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5282993Z 2025-08-14T21:48:06.5283085Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5283412Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5283699Z return mod(**inputs) 2025-08-14T21:48:06.5284025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5284367Z outputs = self.model( 2025-08-14T21:48:06.5284837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5285188Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5285511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5285843Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5286194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5286568Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5286940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5287342Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5287509Z 2025-08-14T21:48:06.5287606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5287939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5288239Z return mod(**inputs) 2025-08-14T21:48:06.5288565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5288905Z outputs = self.model( 2025-08-14T21:48:06.5289237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5289599Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5289910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5290245Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5290597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5290968Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5291331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5291691Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5291816Z 2025-08-14T21:48:06.5291920Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5292275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5292581Z return mod(**inputs) 2025-08-14T21:48:06.5292904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5293246Z outputs = self.model( 2025-08-14T21:48:06.5293585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5293940Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5294288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5294625Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5294968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5295357Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5295515Z 2025-08-14T21:48:06.5295619Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5295963Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5296258Z return mod(**inputs) 2025-08-14T21:48:06.5296584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5296930Z outputs = self.model( 2025-08-14T21:48:06.5297250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5297597Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5297911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5298232Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5298579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5298967Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5299318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5299621Z return self.act(input) 2025-08-14T21:48:06.5299727Z 2025-08-14T21:48:06.5299821Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5300147Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5300439Z return mod(**inputs) 2025-08-14T21:48:06.5300762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5301100Z outputs = self.model( 2025-08-14T21:48:06.5301426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5301763Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5302081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5302411Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5302761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5303109Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5303239Z 2025-08-14T21:48:06.5303334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5303659Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5303947Z return mod(**inputs) 2025-08-14T21:48:06.5304268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5304609Z outputs = self.model( 2025-08-14T21:48:06.5305003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5305355Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5305673Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5306004Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5306361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5306736Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5306873Z 2025-08-14T21:48:06.5306968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5307301Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5307595Z return mod(**inputs) 2025-08-14T21:48:06.5307922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5308273Z outputs = self.model( 2025-08-14T21:48:06.5308611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5308962Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5309282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5309616Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5309960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5310335Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5310706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5311092Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5311240Z 2025-08-14T21:48:06.5311334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5311663Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5311963Z return mod(**inputs) 2025-08-14T21:48:06.5312281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5312630Z outputs = self.model( 2025-08-14T21:48:06.5312956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5313025Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5313235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5313308Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5313535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5313633Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5313859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5313939Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5313943Z 2025-08-14T21:48:06.5314038Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5314219Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5314288Z return mod(**inputs) 2025-08-14T21:48:06.5314514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5314575Z outputs = self.model( 2025-08-14T21:48:06.5314808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5314874Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5315097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5315172Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5315399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5315516Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5315754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5315864Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5315868Z 2025-08-14T21:48:06.5315963Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5316146Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5316212Z return mod(**inputs) 2025-08-14T21:48:06.5316436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5316514Z outputs = self.model( 2025-08-14T21:48:06.5316745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5316811Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5317019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5317093Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5317315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5317410Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5317632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5317757Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5317768Z 2025-08-14T21:48:06.5317862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5318045Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5318112Z return mod(**inputs) 2025-08-14T21:48:06.5318338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5318400Z outputs = self.model( 2025-08-14T21:48:06.5318633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5318697Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5318908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5318979Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5319205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5319304Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5319527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5319608Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5319621Z 2025-08-14T21:48:06.5319714Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5319894Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5319961Z return mod(**inputs) 2025-08-14T21:48:06.5320184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5320245Z outputs = self.model( 2025-08-14T21:48:06.5320493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5320560Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5320766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5320837Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5321082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5321191Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5321416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5321504Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5321507Z 2025-08-14T21:48:06.5321608Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5321795Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5321875Z return mod(**inputs) 2025-08-14T21:48:06.5322101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5322161Z outputs = self.model( 2025-08-14T21:48:06.5322394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5322459Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5322660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5322739Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5322964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5323056Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5323282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5323399Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5323402Z 2025-08-14T21:48:06.5323503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5323687Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5323757Z return mod(**inputs) 2025-08-14T21:48:06.5323985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5324045Z outputs = self.model( 2025-08-14T21:48:06.5324280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5324345Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5324547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5324627Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5324852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5324946Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5325170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5325246Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5325249Z 2025-08-14T21:48:06.5325351Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5325533Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5325597Z return mod(**inputs) 2025-08-14T21:48:06.5325823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5325896Z outputs = self.model( 2025-08-14T21:48:06.5326133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5326201Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5326418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5326499Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5326740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5326855Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5326858Z 2025-08-14T21:48:06.5326950Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5327132Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5327198Z return mod(**inputs) 2025-08-14T21:48:06.5327424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5327506Z outputs = self.model( 2025-08-14T21:48:06.5327729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5327796Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5328002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5328074Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5328295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5328412Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5328606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5328680Z return self.act(input) 2025-08-14T21:48:06.5328684Z 2025-08-14T21:48:06.5328776Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5328957Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5329023Z return mod(**inputs) 2025-08-14T21:48:06.5329246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5329308Z outputs = self.model( 2025-08-14T21:48:06.5329538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5329603Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5329812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5329882Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5330105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5330189Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5330192Z 2025-08-14T21:48:06.5330284Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5330472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5330532Z return mod(**inputs) 2025-08-14T21:48:06.5330758Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5330826Z outputs = self.model( 2025-08-14T21:48:06.5331048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5331112Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5331339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5331413Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5331645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5331734Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5331975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5332102Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5332106Z 2025-08-14T21:48:06.5332198Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5332385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5332444Z return mod(**inputs) 2025-08-14T21:48:06.5332670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5332765Z outputs = self.model( 2025-08-14T21:48:06.5332990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5333055Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5333265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5333336Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5333566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5333654Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5333875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5333956Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5333959Z 2025-08-14T21:48:06.5334052Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5334234Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5334302Z return mod(**inputs) 2025-08-14T21:48:06.5334526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5334593Z outputs = self.model( 2025-08-14T21:48:06.5334817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5334882Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5335086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5335156Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5335388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5335478Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5335703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5335813Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5335816Z 2025-08-14T21:48:06.5335911Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5336095Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5336162Z return mod(**inputs) 2025-08-14T21:48:06.5336386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5336455Z outputs = self.model( 2025-08-14T21:48:06.5336678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5336759Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5336973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5337043Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5337279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5337376Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5337620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5337752Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5337755Z 2025-08-14T21:48:06.5337846Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5338028Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5338094Z return mod(**inputs) 2025-08-14T21:48:06.5338318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5338399Z outputs = self.model( 2025-08-14T21:48:06.5338624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5338690Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5338898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5338971Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5339190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5339283Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5339510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5339595Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5339598Z 2025-08-14T21:48:06.5339688Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5339867Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5339933Z return mod(**inputs) 2025-08-14T21:48:06.5340158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5340228Z outputs = self.model( 2025-08-14T21:48:06.5340454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5340519Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5340724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5340794Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5341016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5341113Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5341337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5341431Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5341436Z 2025-08-14T21:48:06.5341526Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5341705Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5341770Z return mod(**inputs) 2025-08-14T21:48:06.5341992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5342052Z outputs = self.model( 2025-08-14T21:48:06.5342308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5342378Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5342587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5342673Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5342898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5343011Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5343238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5343359Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5343362Z 2025-08-14T21:48:06.5343453Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5343635Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5343719Z return mod(**inputs) 2025-08-14T21:48:06.5343944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5344005Z outputs = self.model( 2025-08-14T21:48:06.5344235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5344303Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5344511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5344582Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5344868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5344970Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5345196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5345282Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5345285Z 2025-08-14T21:48:06.5345380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5345567Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5345639Z return mod(**inputs) 2025-08-14T21:48:06.5345879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5345942Z outputs = self.model( 2025-08-14T21:48:06.5346187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5346253Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5346473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5346549Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5346779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5346900Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5346905Z 2025-08-14T21:48:06.5347001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5347197Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5347257Z return mod(**inputs) 2025-08-14T21:48:06.5347491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5347563Z outputs = self.model( 2025-08-14T21:48:06.5347796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5347880Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5348093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5348163Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5348408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5348516Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5348727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5348799Z return self.act(input) 2025-08-14T21:48:06.5348802Z 2025-08-14T21:48:06.5348895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5349080Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5349147Z return mod(**inputs) 2025-08-14T21:48:06.5349376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5349455Z outputs = self.model( 2025-08-14T21:48:06.5349680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5349746Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5349956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5350028Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5350260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5350334Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5350337Z 2025-08-14T21:48:06.5350429Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5350620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5350680Z return mod(**inputs) 2025-08-14T21:48:06.5350904Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5350972Z outputs = self.model( 2025-08-14T21:48:06.5351199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5351272Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5351474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5351544Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5351772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5351845Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5351850Z 2025-08-14T21:48:06.5351943Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5352128Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5352186Z return mod(**inputs) 2025-08-14T21:48:06.5352418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5352477Z outputs = self.model( 2025-08-14T21:48:06.5352703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5352775Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5352975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5353051Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5353288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5353379Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5353610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5353711Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5353728Z 2025-08-14T21:48:06.5353822Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5354024Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5354082Z return mod(**inputs) 2025-08-14T21:48:06.5354310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5354370Z outputs = self.model( 2025-08-14T21:48:06.5354596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5354668Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5354883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5354953Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5355184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5355270Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5355503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5355576Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5355580Z 2025-08-14T21:48:06.5355671Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5355858Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5355918Z return mod(**inputs) 2025-08-14T21:48:06.5356149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5356208Z outputs = self.model( 2025-08-14T21:48:06.5356431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5356503Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5356702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5356773Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5357005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5357092Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5357323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5357425Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5357428Z 2025-08-14T21:48:06.5357520Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5357710Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5357770Z return mod(**inputs) 2025-08-14T21:48:06.5358000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5358061Z outputs = self.model( 2025-08-14T21:48:06.5358282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5358355Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5358554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5358640Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5358874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5358960Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5359205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5359329Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5359346Z 2025-08-14T21:48:06.5359440Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5359628Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5359687Z return mod(**inputs) 2025-08-14T21:48:06.5359915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5359976Z outputs = self.model( 2025-08-14T21:48:06.5360199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5360295Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5360502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5360576Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5360815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5360905Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5361143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5361221Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5361225Z 2025-08-14T21:48:06.5361318Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5361511Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5361571Z return mod(**inputs) 2025-08-14T21:48:06.5361802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5361871Z outputs = self.model( 2025-08-14T21:48:06.5362103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5362178Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5362382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5362453Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5362690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5362779Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5363018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5363105Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5363108Z 2025-08-14T21:48:06.5363202Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5363397Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5363459Z return mod(**inputs) 2025-08-14T21:48:06.5363689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5363758Z outputs = self.model( 2025-08-14T21:48:06.5363988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5364061Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5364283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5364357Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5364592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5364691Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5364926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5365058Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5365062Z 2025-08-14T21:48:06.5365153Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5365340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5365399Z return mod(**inputs) 2025-08-14T21:48:06.5365624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5365707Z outputs = self.model( 2025-08-14T21:48:06.5365931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5366002Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5366200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5366272Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5366498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5366585Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5366805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5366885Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5366890Z 2025-08-14T21:48:06.5366984Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5367172Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5367230Z return mod(**inputs) 2025-08-14T21:48:06.5367455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5367523Z outputs = self.model( 2025-08-14T21:48:06.5367746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5367818Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5368016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5368086Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5368315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5368424Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5368427Z 2025-08-14T21:48:06.5368517Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5368709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5368767Z return mod(**inputs) 2025-08-14T21:48:06.5368999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5369060Z outputs = self.model( 2025-08-14T21:48:06.5369284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5369355Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5369554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5369644Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5369874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5369978Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5370193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5370270Z return self.act(input) 2025-08-14T21:48:06.5370274Z 2025-08-14T21:48:06.5370367Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5370558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5370616Z return mod(**inputs) 2025-08-14T21:48:06.5370849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5370908Z outputs = self.model( 2025-08-14T21:48:06.5371133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5371223Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5371425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5371499Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5371733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5371808Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5371812Z 2025-08-14T21:48:06.5371913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5372094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5372154Z return mod(**inputs) 2025-08-14T21:48:06.5372392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5372455Z outputs = self.model( 2025-08-14T21:48:06.5372687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5372751Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5372952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5373031Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5373255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5373344Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5373574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5373678Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5373683Z 2025-08-14T21:48:06.5373784Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5373964Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5374023Z return mod(**inputs) 2025-08-14T21:48:06.5374257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5374321Z outputs = self.model( 2025-08-14T21:48:06.5374546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5374617Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5374817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5374895Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5375134Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5375224Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5375456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5375559Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5375564Z 2025-08-14T21:48:06.5375667Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5375861Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5375920Z return mod(**inputs) 2025-08-14T21:48:06.5376151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5376211Z outputs = self.model( 2025-08-14T21:48:06.5376436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5376525Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5376728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5376806Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5377033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5377124Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5377359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5377463Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5377466Z 2025-08-14T21:48:06.5377567Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5377748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5377809Z return mod(**inputs) 2025-08-14T21:48:06.5378042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5378102Z outputs = self.model( 2025-08-14T21:48:06.5378327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5378401Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5378605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5378680Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5378906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5378994Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5379226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5379351Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5379354Z 2025-08-14T21:48:06.5379453Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5379634Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5379693Z return mod(**inputs) 2025-08-14T21:48:06.5379926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5379985Z outputs = self.model( 2025-08-14T21:48:06.5380210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5380281Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5380481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5380570Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5380798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5380884Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5381128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5381228Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5381232Z 2025-08-14T21:48:06.5381323Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5381512Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5381570Z return mod(**inputs) 2025-08-14T21:48:06.5381801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5381863Z outputs = self.model( 2025-08-14T21:48:06.5382102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5382175Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5382374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5382451Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5382674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5382763Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5382992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5383077Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5383081Z 2025-08-14T21:48:06.5383173Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5383362Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5383422Z return mod(**inputs) 2025-08-14T21:48:06.5383655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5383717Z outputs = self.model( 2025-08-14T21:48:06.5383942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5384019Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5384219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5384287Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5384517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5384715Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5385001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5385120Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5385124Z 2025-08-14T21:48:06.5385220Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5385412Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5385473Z return mod(**inputs) 2025-08-14T21:48:06.5385706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5385767Z outputs = self.model( 2025-08-14T21:48:06.5385992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5386067Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5386300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5386376Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5386609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5386716Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5386951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5387064Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5387067Z 2025-08-14T21:48:06.5387161Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5387351Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5387412Z return mod(**inputs) 2025-08-14T21:48:06.5387646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5387730Z outputs = self.model( 2025-08-14T21:48:06.5387955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5388027Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5388229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5388303Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5388535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5388643Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5388646Z 2025-08-14T21:48:06.5388747Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5388933Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5388995Z return mod(**inputs) 2025-08-14T21:48:06.5389230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5389290Z outputs = self.model( 2025-08-14T21:48:06.5389526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5389593Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5389794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5389873Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5390096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5390201Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5390406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5390470Z return self.act(input) 2025-08-14T21:48:06.5390473Z 2025-08-14T21:48:06.5390572Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5390758Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5390817Z return mod(**inputs) 2025-08-14T21:48:06.5391050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5391112Z outputs = self.model( 2025-08-14T21:48:06.5391337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5391409Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5391610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5391702Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5391931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5392005Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5392008Z 2025-08-14T21:48:06.5392122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5392306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5392390Z return mod(**inputs) 2025-08-14T21:48:06.5392616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5392676Z outputs = self.model( 2025-08-14T21:48:06.5392908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5392974Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5393175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5393267Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5393488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5393566Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5393569Z 2025-08-14T21:48:06.5393662Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5393845Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5393911Z return mod(**inputs) 2025-08-14T21:48:06.5394135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5394196Z outputs = self.model( 2025-08-14T21:48:06.5394424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5394490Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5394695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5394765Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5394986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5395082Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5395306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5395413Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5395417Z 2025-08-14T21:48:06.5395508Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5395690Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5395758Z return mod(**inputs) 2025-08-14T21:48:06.5395981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5396041Z outputs = self.model( 2025-08-14T21:48:06.5396273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5396340Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5396547Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5396618Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5396841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5396935Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5397168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5397252Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5397255Z 2025-08-14T21:48:06.5397350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5397544Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5397611Z return mod(**inputs) 2025-08-14T21:48:06.5397854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5397914Z outputs = self.model( 2025-08-14T21:48:06.5398146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5398210Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5398419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5398491Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5398730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5398827Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5399056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5399158Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5399169Z 2025-08-14T21:48:06.5399262Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5399441Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5399508Z return mod(**inputs) 2025-08-14T21:48:06.5399729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5399789Z outputs = self.model( 2025-08-14T21:48:06.5400022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5400085Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5400293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5400363Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5400584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5400678Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5400899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5401017Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5401027Z 2025-08-14T21:48:06.5401122Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5401304Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5401370Z return mod(**inputs) 2025-08-14T21:48:06.5401594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5401656Z outputs = self.model( 2025-08-14T21:48:06.5401886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5401953Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5402161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5402231Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5402452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5402563Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5402791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5402868Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5402872Z 2025-08-14T21:48:06.5402991Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5403175Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5403257Z return mod(**inputs) 2025-08-14T21:48:06.5403479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5403540Z outputs = self.model( 2025-08-14T21:48:06.5403768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5403832Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5404034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5404127Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5404347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5404444Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5404668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5404754Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5404757Z 2025-08-14T21:48:06.5404854Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5405032Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5405099Z return mod(**inputs) 2025-08-14T21:48:06.5405324Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5405384Z outputs = self.model( 2025-08-14T21:48:06.5405615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5405680Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5405878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5405955Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5406177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5406269Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5406491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5406606Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5406612Z 2025-08-14T21:48:06.5406710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5406888Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5406953Z return mod(**inputs) 2025-08-14T21:48:06.5407176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5407237Z outputs = self.model( 2025-08-14T21:48:06.5407464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5407529Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5407729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5407805Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5408042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5408140Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5408363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5408449Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5408467Z 2025-08-14T21:48:06.5408569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5408747Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5408812Z return mod(**inputs) 2025-08-14T21:48:06.5409036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5409095Z outputs = self.model( 2025-08-14T21:48:06.5409326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5409408Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5409610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5409690Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5409915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5410034Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5410037Z 2025-08-14T21:48:06.5410129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5410310Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5410378Z return mod(**inputs) 2025-08-14T21:48:06.5410604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5410666Z outputs = self.model( 2025-08-14T21:48:06.5410895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5410960Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5411168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5411239Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5411464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5411580Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5411773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5411844Z return self.act(input) 2025-08-14T21:48:06.5411848Z 2025-08-14T21:48:06.5411943Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5412124Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5412190Z return mod(**inputs) 2025-08-14T21:48:06.5412413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5412475Z outputs = self.model( 2025-08-14T21:48:06.5412705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5412771Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5412976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5413046Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5413266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5413362Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5413368Z 2025-08-14T21:48:06.5413463Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5413644Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5413709Z return mod(**inputs) 2025-08-14T21:48:06.5413949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5414033Z outputs = self.model( 2025-08-14T21:48:06.5414263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5414328Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5414540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5414609Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5414847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5414952Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5415177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5415284Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5415289Z 2025-08-14T21:48:06.5415381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5415562Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5415627Z return mod(**inputs) 2025-08-14T21:48:06.5415853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5415919Z outputs = self.model( 2025-08-14T21:48:06.5416145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5416210Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5416418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5416490Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5416715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5416812Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5417037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5417116Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5417119Z 2025-08-14T21:48:06.5417211Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5417391Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5417459Z return mod(**inputs) 2025-08-14T21:48:06.5417682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5417749Z outputs = self.model( 2025-08-14T21:48:06.5417974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5418040Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5418250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5418319Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5418544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5418638Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5418878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5418989Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5418992Z 2025-08-14T21:48:06.5419084Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5419277Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5419345Z return mod(**inputs) 2025-08-14T21:48:06.5419587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5419654Z outputs = self.model( 2025-08-14T21:48:06.5419876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5419940Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5420146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5420233Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5420454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5420548Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5420769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5420895Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5420899Z 2025-08-14T21:48:06.5420989Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5421168Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5421234Z return mod(**inputs) 2025-08-14T21:48:06.5421459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5421528Z outputs = self.model( 2025-08-14T21:48:06.5421749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5421813Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5422019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5422090Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5422311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5422404Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5422626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5422708Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5422712Z 2025-08-14T21:48:06.5422805Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5422988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5423055Z return mod(**inputs) 2025-08-14T21:48:06.5423279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5423340Z outputs = self.model( 2025-08-14T21:48:06.5423570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5423635Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5423837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5423909Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5424160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5424257Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5424485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5424580Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5424583Z 2025-08-14T21:48:06.5424689Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5424952Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5425025Z return mod(**inputs) 2025-08-14T21:48:06.5425259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5425322Z outputs = self.model( 2025-08-14T21:48:06.5425560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5425629Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5425860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5425933Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5426159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5426258Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5426483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5426607Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5426611Z 2025-08-14T21:48:06.5426707Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5426891Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5426962Z return mod(**inputs) 2025-08-14T21:48:06.5427190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5427254Z outputs = self.model( 2025-08-14T21:48:06.5427489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5427558Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5427767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5427840Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5428065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5428161Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5428386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5428462Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5428472Z 2025-08-14T21:48:06.5428565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5428745Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5428812Z return mod(**inputs) 2025-08-14T21:48:06.5429037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5429098Z outputs = self.model( 2025-08-14T21:48:06.5429330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5429395Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5429604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5429676Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5429915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5430034Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5430038Z 2025-08-14T21:48:06.5430129Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5430325Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5430408Z return mod(**inputs) 2025-08-14T21:48:06.5430634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5430704Z outputs = self.model( 2025-08-14T21:48:06.5430926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5430992Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5431201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5431287Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5431518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5431626Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5431820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5431892Z return self.act(input) 2025-08-14T21:48:06.5431895Z 2025-08-14T21:48:06.5431988Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5432168Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5432233Z return mod(**inputs) 2025-08-14T21:48:06.5432460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5432529Z outputs = self.model( 2025-08-14T21:48:06.5432753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5432817Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5433026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5433097Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5433321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5433402Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5433406Z 2025-08-14T21:48:06.5433497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5433685Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5433743Z return mod(**inputs) 2025-08-14T21:48:06.5433969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5434039Z outputs = self.model( 2025-08-14T21:48:06.5434264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5434337Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5434538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5434611Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5434843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5434913Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5434917Z 2025-08-14T21:48:06.5435008Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5435211Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5435275Z return mod(**inputs) 2025-08-14T21:48:06.5435508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5435569Z outputs = self.model( 2025-08-14T21:48:06.5435806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5435895Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5436095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5436165Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5436397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5436485Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5436717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5436834Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5436838Z 2025-08-14T21:48:06.5436930Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5437122Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5437183Z return mod(**inputs) 2025-08-14T21:48:06.5437419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5437480Z outputs = self.model( 2025-08-14T21:48:06.5437707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5437780Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5437983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5438057Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5438292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5438382Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5438617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5438693Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5438696Z 2025-08-14T21:48:06.5438790Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5438983Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5439041Z return mod(**inputs) 2025-08-14T21:48:06.5439273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5439336Z outputs = self.model( 2025-08-14T21:48:06.5439558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5439631Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5439834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5439905Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5440138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5440225Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5440457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5440558Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5440562Z 2025-08-14T21:48:06.5440668Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5440860Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5440917Z return mod(**inputs) 2025-08-14T21:48:06.5441157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5441227Z outputs = self.model( 2025-08-14T21:48:06.5441470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5441543Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5441743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5441813Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5442048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5442160Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5442396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5442517Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5442521Z 2025-08-14T21:48:06.5442613Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5442802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5442862Z return mod(**inputs) 2025-08-14T21:48:06.5443089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5443156Z outputs = self.model( 2025-08-14T21:48:06.5443383Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5443458Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5443661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5443732Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5443966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5444055Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5444287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5444365Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5444368Z 2025-08-14T21:48:06.5444459Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5444647Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5444708Z return mod(**inputs) 2025-08-14T21:48:06.5444936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5445005Z outputs = self.model( 2025-08-14T21:48:06.5445232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5445303Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5445506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5445576Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5445810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5445897Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5446139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5446237Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5446240Z 2025-08-14T21:48:06.5446330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5446516Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5446587Z return mod(**inputs) 2025-08-14T21:48:06.5446815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5446900Z outputs = self.model( 2025-08-14T21:48:06.5447126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5447198Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5447398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5447469Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5447716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5447805Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5448029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5448153Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5448159Z 2025-08-14T21:48:06.5448250Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5448440Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5448499Z return mod(**inputs) 2025-08-14T21:48:06.5448722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5448790Z outputs = self.model( 2025-08-14T21:48:06.5449012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5449086Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5449285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5449356Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5449585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5449673Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5449895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5449977Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5449981Z 2025-08-14T21:48:06.5450071Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5450262Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5450323Z return mod(**inputs) 2025-08-14T21:48:06.5450546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5450615Z outputs = self.model( 2025-08-14T21:48:06.5450838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5450905Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5451114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5451183Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5451413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5451540Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5451546Z 2025-08-14T21:48:06.5451639Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5451828Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5451893Z return mod(**inputs) 2025-08-14T21:48:06.5452138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5452214Z outputs = self.model( 2025-08-14T21:48:06.5452441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5452514Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5452712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5452784Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5453014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5453134Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5453334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5453397Z return self.act(input) 2025-08-14T21:48:06.5453402Z 2025-08-14T21:48:06.5453497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5453690Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5453751Z return mod(**inputs) 2025-08-14T21:48:06.5453987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5454051Z outputs = self.model( 2025-08-14T21:48:06.5454277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5454352Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5454557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5454629Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5454863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5454938Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5454943Z 2025-08-14T21:48:06.5455045Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5455229Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5455290Z return mod(**inputs) 2025-08-14T21:48:06.5455523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5455587Z outputs = self.model( 2025-08-14T21:48:06.5455814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5455890Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5456092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5456173Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5456399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5456491Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5456724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5456828Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5456831Z 2025-08-14T21:48:06.5456932Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5457128Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5457191Z return mod(**inputs) 2025-08-14T21:48:06.5457421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5457480Z outputs = self.model( 2025-08-14T21:48:06.5457714Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5457800Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5458001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5458077Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5458299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5458388Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5458635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5458709Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5458712Z 2025-08-14T21:48:06.5458810Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5458991Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5459050Z return mod(**inputs) 2025-08-14T21:48:06.5459282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5459341Z outputs = self.model( 2025-08-14T21:48:06.5459568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5459639Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5459841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5459919Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5460144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5460232Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5460463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5460564Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5460568Z 2025-08-14T21:48:06.5460666Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5460844Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5460904Z return mod(**inputs) 2025-08-14T21:48:06.5461138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5461200Z outputs = self.model( 2025-08-14T21:48:06.5461423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5461496Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5461696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5461777Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5461999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5462086Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5462316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5462465Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5462470Z 2025-08-14T21:48:06.5462565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5462751Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5462808Z return mod(**inputs) 2025-08-14T21:48:06.5463053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5463137Z outputs = self.model( 2025-08-14T21:48:06.5463359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5463433Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5463634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5463713Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5463937Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5464042Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5464276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5464355Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5464358Z 2025-08-14T21:48:06.5464452Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5464667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5464821Z return mod(**inputs) 2025-08-14T21:48:06.5465189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5465266Z outputs = self.model( 2025-08-14T21:48:06.5465541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5465628Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5465864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5465946Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5466197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5466285Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5466514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5466602Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5466606Z 2025-08-14T21:48:06.5466698Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5466884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5466943Z return mod(**inputs) 2025-08-14T21:48:06.5467208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5467279Z outputs = self.model( 2025-08-14T21:48:06.5467552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5467634Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5467871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5467952Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5468244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5468343Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5468641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5468780Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5468784Z 2025-08-14T21:48:06.5468891Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5469126Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5469198Z return mod(**inputs) 2025-08-14T21:48:06.5469503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5469573Z outputs = self.model( 2025-08-14T21:48:06.5469856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5469938Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5470172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5470256Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5470553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5470639Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5470867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5470942Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5470945Z 2025-08-14T21:48:06.5471035Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5471221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5471279Z return mod(**inputs) 2025-08-14T21:48:06.5471508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5471567Z outputs = self.model( 2025-08-14T21:48:06.5471790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5471863Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5472062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5472132Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5472357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5472464Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5472468Z 2025-08-14T21:48:06.5472565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5472745Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5472801Z return mod(**inputs) 2025-08-14T21:48:06.5473032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5473092Z outputs = self.model( 2025-08-14T21:48:06.5473313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5473382Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5473582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5473663Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5473882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5473986Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5474187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5474262Z return self.act(input) 2025-08-14T21:48:06.5474268Z 2025-08-14T21:48:06.5474368Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5474548Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5474606Z return mod(**inputs) 2025-08-14T21:48:06.5474851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5474927Z outputs = self.model( 2025-08-14T21:48:06.5475149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5475221Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5475417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5475496Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5475718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5475806Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5475810Z 2025-08-14T21:48:06.5475910Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5476090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5476147Z return mod(**inputs) 2025-08-14T21:48:06.5476378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5476440Z outputs = self.model( 2025-08-14T21:48:06.5476670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5476735Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5476936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5477015Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5477238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5477318Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5477321Z 2025-08-14T21:48:06.5477415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5477597Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5477663Z return mod(**inputs) 2025-08-14T21:48:06.5477886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5477945Z outputs = self.model( 2025-08-14T21:48:06.5478177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5478243Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5478453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5478524Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5478746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5478844Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5479067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5479167Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5479178Z 2025-08-14T21:48:06.5479268Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5479447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5479510Z return mod(**inputs) 2025-08-14T21:48:06.5479751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5479813Z outputs = self.model( 2025-08-14T21:48:06.5480043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5480106Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5480329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5480415Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5480638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5480732Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5480954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5481030Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5481053Z 2025-08-14T21:48:06.5481146Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5481327Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5481393Z return mod(**inputs) 2025-08-14T21:48:06.5481618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5481679Z outputs = self.model( 2025-08-14T21:48:06.5481907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5481972Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5482179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5482250Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5482473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5482570Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5482791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5482890Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5482894Z 2025-08-14T21:48:06.5482993Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5483171Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5483235Z return mod(**inputs) 2025-08-14T21:48:06.5483456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5483515Z outputs = self.model( 2025-08-14T21:48:06.5483742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5483808Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5484005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5484083Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5484303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5484396Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5484747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5484876Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5484879Z 2025-08-14T21:48:06.5484980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5485194Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5485263Z return mod(**inputs) 2025-08-14T21:48:06.5485488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5485548Z outputs = self.model( 2025-08-14T21:48:06.5485804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5485893Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5486096Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5486174Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5486396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5486489Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5486713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5486812Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5486815Z 2025-08-14T21:48:06.5486914Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5487094Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5487160Z return mod(**inputs) 2025-08-14T21:48:06.5487382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5487441Z outputs = self.model( 2025-08-14T21:48:06.5487670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5487735Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5487936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5488016Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5488237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5488331Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5488553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5488639Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5488642Z 2025-08-14T21:48:06.5488739Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5488918Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5488983Z return mod(**inputs) 2025-08-14T21:48:06.5489206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5489268Z outputs = self.model( 2025-08-14T21:48:06.5489498Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5489562Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5489763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5489840Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5490062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5490155Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5490376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5490489Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5490492Z 2025-08-14T21:48:06.5490604Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5490787Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5490847Z return mod(**inputs) 2025-08-14T21:48:06.5491079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5491153Z outputs = self.model( 2025-08-14T21:48:06.5491386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5491467Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5491667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5491745Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5491966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5492062Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5492300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5492373Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5492376Z 2025-08-14T21:48:06.5492474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5492653Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5492712Z return mod(**inputs) 2025-08-14T21:48:06.5492942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5493001Z outputs = self.model( 2025-08-14T21:48:06.5493232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5493295Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5493496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5493574Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5493795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5493909Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5493913Z 2025-08-14T21:48:06.5494004Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5494183Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5494248Z return mod(**inputs) 2025-08-14T21:48:06.5494470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5494529Z outputs = self.model( 2025-08-14T21:48:06.5494761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5494827Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5495033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5495102Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5495325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5495441Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5495632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5495695Z return self.act(input) 2025-08-14T21:48:06.5495705Z 2025-08-14T21:48:06.5495798Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5495993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5496064Z return mod(**inputs) 2025-08-14T21:48:06.5496293Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5496353Z outputs = self.model( 2025-08-14T21:48:06.5496601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5496682Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5496892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5496963Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5497189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5497268Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5497272Z 2025-08-14T21:48:06.5497365Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5497561Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5497626Z return mod(**inputs) 2025-08-14T21:48:06.5497849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5497917Z outputs = self.model( 2025-08-14T21:48:06.5498141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5498207Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5498412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5498481Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5498704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5498800Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5499025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5499130Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5499133Z 2025-08-14T21:48:06.5499227Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5499407Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5499475Z return mod(**inputs) 2025-08-14T21:48:06.5499698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5499763Z outputs = self.model( 2025-08-14T21:48:06.5499986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5500052Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5500260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5500331Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5500557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5500654Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5500879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5500958Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5500961Z 2025-08-14T21:48:06.5501053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5501231Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5501298Z return mod(**inputs) 2025-08-14T21:48:06.5501538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5501607Z outputs = self.model( 2025-08-14T21:48:06.5501835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5501915Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5502121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5502206Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5502430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5502523Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5502748Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5502855Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5502871Z 2025-08-14T21:48:06.5502965Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5503143Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5503209Z return mod(**inputs) 2025-08-14T21:48:06.5503436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5503499Z outputs = self.model( 2025-08-14T21:48:06.5503729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5503794Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5504004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5504073Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5504298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5504396Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5504618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5504851Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5504858Z 2025-08-14T21:48:06.5504957Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5505141Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5505209Z return mod(**inputs) 2025-08-14T21:48:06.5505437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5505500Z outputs = self.model( 2025-08-14T21:48:06.5505737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5505812Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5506024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5506097Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5506323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5506421Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5506647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5506733Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5506737Z 2025-08-14T21:48:06.5506829Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5507039Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5507111Z return mod(**inputs) 2025-08-14T21:48:06.5507338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5507401Z outputs = self.model( 2025-08-14T21:48:06.5507645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5507726Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5507936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5508006Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5508229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5508325Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5508553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5508656Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5508667Z 2025-08-14T21:48:06.5508758Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5508941Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5509006Z return mod(**inputs) 2025-08-14T21:48:06.5509234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5509294Z outputs = self.model( 2025-08-14T21:48:06.5509527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5509593Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5509804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5509875Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5510101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5510196Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5510423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5510539Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5510549Z 2025-08-14T21:48:06.5510640Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5510821Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5510887Z return mod(**inputs) 2025-08-14T21:48:06.5511112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5511173Z outputs = self.model( 2025-08-14T21:48:06.5511406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5511469Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5511679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5511747Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5511972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5512062Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5512286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5512358Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5512362Z 2025-08-14T21:48:06.5512475Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5512661Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5512726Z return mod(**inputs) 2025-08-14T21:48:06.5512963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5513024Z outputs = self.model( 2025-08-14T21:48:06.5513274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5513340Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5513538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5513615Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5513839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5513966Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5513969Z 2025-08-14T21:48:06.5514059Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5514239Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5514306Z return mod(**inputs) 2025-08-14T21:48:06.5514530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5514599Z outputs = self.model( 2025-08-14T21:48:06.5514823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5514888Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5515092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5515163Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5515388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5515500Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5515695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5515764Z return self.act(input) 2025-08-14T21:48:06.5515769Z 2025-08-14T21:48:06.5515859Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5516040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5516107Z return mod(**inputs) 2025-08-14T21:48:06.5516332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5516400Z outputs = self.model( 2025-08-14T21:48:06.5516626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5516690Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5516895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5516965Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5517191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5517274Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5517278Z 2025-08-14T21:48:06.5517368Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5517556Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5517614Z return mod(**inputs) 2025-08-14T21:48:06.5517852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5517924Z outputs = self.model( 2025-08-14T21:48:06.5518147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5518211Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5518441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5518514Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5518760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5518832Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5518835Z 2025-08-14T21:48:06.5518927Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5519114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5519171Z return mod(**inputs) 2025-08-14T21:48:06.5519403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5519479Z outputs = self.model( 2025-08-14T21:48:06.5519703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5519775Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5519974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5520047Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5520276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5520363Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5520590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5520693Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5520696Z 2025-08-14T21:48:06.5520787Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5520974Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5521034Z return mod(**inputs) 2025-08-14T21:48:06.5521265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5521328Z outputs = self.model( 2025-08-14T21:48:06.5521550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5521623Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5521822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5521893Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5522125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5522213Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5522442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5522514Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5522518Z 2025-08-14T21:48:06.5522611Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5522798Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5522857Z return mod(**inputs) 2025-08-14T21:48:06.5523083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5523150Z outputs = self.model( 2025-08-14T21:48:06.5523387Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5523462Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5523662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5523745Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5523978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5524084Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5524313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5524410Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5524413Z 2025-08-14T21:48:06.5524503Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5524688Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5524771Z return mod(**inputs) 2025-08-14T21:48:06.5524997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5525061Z outputs = self.model( 2025-08-14T21:48:06.5525286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5525360Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5525560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5525628Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5525857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5525943Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5526165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5526293Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5526296Z 2025-08-14T21:48:06.5526388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5526574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5526634Z return mod(**inputs) 2025-08-14T21:48:06.5526857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5526924Z outputs = self.model( 2025-08-14T21:48:06.5527147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5527218Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5527418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5527490Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5527719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5527807Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5528035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5528121Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5528124Z 2025-08-14T21:48:06.5528216Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5528402Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5528461Z return mod(**inputs) 2025-08-14T21:48:06.5528700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5528772Z outputs = self.model( 2025-08-14T21:48:06.5528998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5529070Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5529285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5529370Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5529600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5529686Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5529907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5530000Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5530004Z 2025-08-14T21:48:06.5530113Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5530302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5530362Z return mod(**inputs) 2025-08-14T21:48:06.5530588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5530657Z outputs = self.model( 2025-08-14T21:48:06.5530882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5530955Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5531155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5531225Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5531454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5531543Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5531765Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5531886Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5531891Z 2025-08-14T21:48:06.5531982Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5532172Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5532230Z return mod(**inputs) 2025-08-14T21:48:06.5532455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5532522Z outputs = self.model( 2025-08-14T21:48:06.5532743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5532808Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5533017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5533087Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5533319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5533406Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5533628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5533709Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5533712Z 2025-08-14T21:48:06.5533802Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5533987Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5534060Z return mod(**inputs) 2025-08-14T21:48:06.5534290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5534357Z outputs = self.model( 2025-08-14T21:48:06.5534584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5534662Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5534885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5534956Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5535187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5535292Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5535295Z 2025-08-14T21:48:06.5535386Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5535575Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5535648Z return mod(**inputs) 2025-08-14T21:48:06.5535877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5535939Z outputs = self.model( 2025-08-14T21:48:06.5536162Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5536236Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5536435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5536504Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5536732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5536837Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5537039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5537102Z return self.act(input) 2025-08-14T21:48:06.5537105Z 2025-08-14T21:48:06.5537197Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5537380Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5537439Z return mod(**inputs) 2025-08-14T21:48:06.5537661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5537729Z outputs = self.model( 2025-08-14T21:48:06.5537949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5538022Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5538222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5538293Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5538519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5538590Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5538594Z 2025-08-14T21:48:06.5538691Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5538868Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5538926Z return mod(**inputs) 2025-08-14T21:48:06.5539153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5539213Z outputs = self.model( 2025-08-14T21:48:06.5539431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5539519Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5539720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5539796Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5540032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5540122Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5540366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5540465Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5540469Z 2025-08-14T21:48:06.5540568Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5540747Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5540808Z return mod(**inputs) 2025-08-14T21:48:06.5541056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5541115Z outputs = self.model( 2025-08-14T21:48:06.5541341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5541412Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5541614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5541691Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5541917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5542004Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5542240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5542312Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5542315Z 2025-08-14T21:48:06.5542406Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5542592Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5542651Z return mod(**inputs) 2025-08-14T21:48:06.5542883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5542944Z outputs = self.model( 2025-08-14T21:48:06.5543171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5543243Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5543445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5543521Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5543747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5543834Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5544071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5544171Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5544175Z 2025-08-14T21:48:06.5544266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5544455Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5544513Z return mod(**inputs) 2025-08-14T21:48:06.5544809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5544880Z outputs = self.model( 2025-08-14T21:48:06.5545136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5545216Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5545421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5545511Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5545744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5545856Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5546088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5546208Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5546212Z 2025-08-14T21:48:06.5546304Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5546492Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5546584Z return mod(**inputs) 2025-08-14T21:48:06.5546819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5546881Z outputs = self.model( 2025-08-14T21:48:06.5547107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5547184Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5547385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5547455Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5547686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5547773Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5548006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5548083Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5548086Z 2025-08-14T21:48:06.5548176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5548367Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5548427Z return mod(**inputs) 2025-08-14T21:48:06.5548660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5548720Z outputs = self.model( 2025-08-14T21:48:06.5548945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5549018Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5549222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5549293Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5549527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5549613Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5549843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5549929Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5549932Z 2025-08-14T21:48:06.5550021Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5550210Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5550268Z return mod(**inputs) 2025-08-14T21:48:06.5550513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5550578Z outputs = self.model( 2025-08-14T21:48:06.5550798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5550867Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5551079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5551164Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5551393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5551479Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5551704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5551819Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5551836Z 2025-08-14T21:48:06.5551927Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5552117Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5552176Z return mod(**inputs) 2025-08-14T21:48:06.5552403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5552470Z outputs = self.model( 2025-08-14T21:48:06.5552693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5552765Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5552966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5553036Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5553269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5553356Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5553586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5553661Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5553664Z 2025-08-14T21:48:06.5553756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5553944Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5554000Z return mod(**inputs) 2025-08-14T21:48:06.5554224Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5554292Z outputs = self.model( 2025-08-14T21:48:06.5554516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5554591Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5554791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5554860Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5555091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5555199Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5555202Z 2025-08-14T21:48:06.5555303Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5555482Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5555541Z return mod(**inputs) 2025-08-14T21:48:06.5555778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5555855Z outputs = self.model( 2025-08-14T21:48:06.5556085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5556157Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5556372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5556453Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5556697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5556805Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5557008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5557072Z return self.act(input) 2025-08-14T21:48:06.5557076Z 2025-08-14T21:48:06.5557168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5557358Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5557432Z return mod(**inputs) 2025-08-14T21:48:06.5557664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5557725Z outputs = self.model( 2025-08-14T21:48:06.5557952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5558027Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5558229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5558306Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5558529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5558609Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5558614Z 2025-08-14T21:48:06.5558712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5558893Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5558951Z return mod(**inputs) 2025-08-14T21:48:06.5559184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5559245Z outputs = self.model( 2025-08-14T21:48:06.5559476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5559570Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5559770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5559846Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5560069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5560143Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5560154Z 2025-08-14T21:48:06.5560244Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5560425Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5560490Z return mod(**inputs) 2025-08-14T21:48:06.5560717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5560776Z outputs = self.model( 2025-08-14T21:48:06.5561003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5561067Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5561270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5561353Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5561582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5561678Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5561918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5562032Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5562043Z 2025-08-14T21:48:06.5562135Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5562313Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5562378Z return mod(**inputs) 2025-08-14T21:48:06.5562599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5562660Z outputs = self.model( 2025-08-14T21:48:06.5562906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5562971Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5563175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5563247Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5563470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5563564Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5563784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5563854Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5563857Z 2025-08-14T21:48:06.5563956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5564136Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5564201Z return mod(**inputs) 2025-08-14T21:48:06.5564425Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5564484Z outputs = self.model( 2025-08-14T21:48:06.5564712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5564776Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5564975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5565052Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5565273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5565367Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5565591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5565687Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5565691Z 2025-08-14T21:48:06.5565792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5565968Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5566034Z return mod(**inputs) 2025-08-14T21:48:06.5566255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5566314Z outputs = self.model( 2025-08-14T21:48:06.5566544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5566607Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5566820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5566902Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5567124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5567236Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5567467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5567603Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5567606Z 2025-08-14T21:48:06.5567705Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5567885Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5567951Z return mod(**inputs) 2025-08-14T21:48:06.5568175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5568251Z outputs = self.model( 2025-08-14T21:48:06.5568486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5568552Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5568754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5568835Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5569059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5569154Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5569376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5569454Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5569459Z 2025-08-14T21:48:06.5569560Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5569738Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5569797Z return mod(**inputs) 2025-08-14T21:48:06.5570036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5570099Z outputs = self.model( 2025-08-14T21:48:06.5570337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5570404Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5570611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5570691Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5570923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5571021Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5571248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5571338Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5571341Z 2025-08-14T21:48:06.5571442Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5571625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5571685Z return mod(**inputs) 2025-08-14T21:48:06.5585005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5585155Z outputs = self.model( 2025-08-14T21:48:06.5585548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5585640Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5585868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5585960Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5586239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5586379Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5586630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5586750Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5586755Z 2025-08-14T21:48:06.5586860Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5587068Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5587164Z return mod(**inputs) 2025-08-14T21:48:06.5587411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5587479Z outputs = self.model( 2025-08-14T21:48:06.5587717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5587798Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5588010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5588089Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5588334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5588425Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5588661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5588738Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5588742Z 2025-08-14T21:48:06.5588838Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5589035Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5589097Z return mod(**inputs) 2025-08-14T21:48:06.5589333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5589397Z outputs = self.model( 2025-08-14T21:48:06.5589623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5589700Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5589903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5589980Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5590211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5590322Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5590326Z 2025-08-14T21:48:06.5590428Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5590612Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5590674Z return mod(**inputs) 2025-08-14T21:48:06.5590903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5590963Z outputs = self.model( 2025-08-14T21:48:06.5591194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5591275Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5591478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5591556Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5591791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5591904Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5592120Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5592185Z return self.act(input) 2025-08-14T21:48:06.5592189Z 2025-08-14T21:48:06.5592290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5592472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5592532Z return mod(**inputs) 2025-08-14T21:48:06.5592762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5592852Z outputs = self.model( 2025-08-14T21:48:06.5593077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5593152Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5593354Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5593433Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5593661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5593734Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5593738Z 2025-08-14T21:48:06.5593840Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5594025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5594094Z return mod(**inputs) 2025-08-14T21:48:06.5594319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5594379Z outputs = self.model( 2025-08-14T21:48:06.5594615Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5594681Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5594881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5594961Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5595187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5595283Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5595512Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5595619Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5595622Z 2025-08-14T21:48:06.5595723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5595910Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5595976Z return mod(**inputs) 2025-08-14T21:48:06.5596204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5596264Z outputs = self.model( 2025-08-14T21:48:06.5596496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5596560Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5596774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5596856Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5597082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5597177Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5597414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5597503Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5597506Z 2025-08-14T21:48:06.5597606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5597787Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5597846Z return mod(**inputs) 2025-08-14T21:48:06.5598075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5598137Z outputs = self.model( 2025-08-14T21:48:06.5598385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5598450Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5598652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5598730Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5598954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5599050Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5599272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5599373Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5599377Z 2025-08-14T21:48:06.5599504Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5599686Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5599748Z return mod(**inputs) 2025-08-14T21:48:06.5599980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5600043Z outputs = self.model( 2025-08-14T21:48:06.5600276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5600345Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5600544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5600624Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5600849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5600939Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5601172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5601295Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5601299Z 2025-08-14T21:48:06.5601402Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5601586Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5601647Z return mod(**inputs) 2025-08-14T21:48:06.5601880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5601943Z outputs = self.model( 2025-08-14T21:48:06.5602172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5602250Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5602454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5602533Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5602772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5602861Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5603111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5603190Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5603193Z 2025-08-14T21:48:06.5603291Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5603472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5603531Z return mod(**inputs) 2025-08-14T21:48:06.5603764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5603840Z outputs = self.model( 2025-08-14T21:48:06.5604066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5604143Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5604343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5604415Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5604643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5604731Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5604954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5605049Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5605054Z 2025-08-14T21:48:06.5605145Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5605332Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5605389Z return mod(**inputs) 2025-08-14T21:48:06.5605613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5605683Z outputs = self.model( 2025-08-14T21:48:06.5605905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5605969Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5606174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5606244Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5606472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5606562Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5606783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5606906Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5606910Z 2025-08-14T21:48:06.5607001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5607188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5607246Z return mod(**inputs) 2025-08-14T21:48:06.5607470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5607538Z outputs = self.model( 2025-08-14T21:48:06.5607775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5607847Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5608057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5608127Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5608374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5608477Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5608701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5608781Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5608785Z 2025-08-14T21:48:06.5608876Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5609064Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5609138Z return mod(**inputs) 2025-08-14T21:48:06.5609364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5609433Z outputs = self.model( 2025-08-14T21:48:06.5609658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5609725Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5609933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5610002Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5610233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5610340Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5610343Z 2025-08-14T21:48:06.5610436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5610625Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5610683Z return mod(**inputs) 2025-08-14T21:48:06.5610911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5610979Z outputs = self.model( 2025-08-14T21:48:06.5611204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5611276Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5611475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5611545Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5611776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5611883Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5612083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5612146Z return self.act(input) 2025-08-14T21:48:06.5612149Z 2025-08-14T21:48:06.5612242Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5612426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5612486Z return mod(**inputs) 2025-08-14T21:48:06.5612710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5612776Z outputs = self.model( 2025-08-14T21:48:06.5613001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5613091Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5613300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5613372Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5613623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5613700Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5613728Z 2025-08-14T21:48:06.5613832Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5614019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5614080Z return mod(**inputs) 2025-08-14T21:48:06.5614319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5614381Z outputs = self.model( 2025-08-14T21:48:06.5614613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5614706Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5614915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5614996Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5615227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5615304Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5615307Z 2025-08-14T21:48:06.5615408Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5615593Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5615655Z return mod(**inputs) 2025-08-14T21:48:06.5615903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5615964Z outputs = self.model( 2025-08-14T21:48:06.5616195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5616259Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5616461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5616538Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5616763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5616858Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5617081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5617182Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5617185Z 2025-08-14T21:48:06.5617286Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5617468Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5617527Z return mod(**inputs) 2025-08-14T21:48:06.5617764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5617826Z outputs = self.model( 2025-08-14T21:48:06.5618057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5618122Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5618320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5618397Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5618634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5618726Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5618955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5619027Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5619031Z 2025-08-14T21:48:06.5619141Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5619337Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5619395Z return mod(**inputs) 2025-08-14T21:48:06.5619626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5619686Z outputs = self.model( 2025-08-14T21:48:06.5619914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5619978Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5620195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5620274Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5620501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5620588Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5620822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5620920Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5620923Z 2025-08-14T21:48:06.5621021Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5621200Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5621259Z return mod(**inputs) 2025-08-14T21:48:06.5621495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5621556Z outputs = self.model( 2025-08-14T21:48:06.5621786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5621850Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5622048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5622126Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5622349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5622434Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5622665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5622787Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5622791Z 2025-08-14T21:48:06.5622892Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5623071Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5623131Z return mod(**inputs) 2025-08-14T21:48:06.5623362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5623423Z outputs = self.model( 2025-08-14T21:48:06.5623644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5623717Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5623916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5624009Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5624233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5624320Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5624560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5624642Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5624660Z 2025-08-14T21:48:06.5624825Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5625015Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5625073Z return mod(**inputs) 2025-08-14T21:48:06.5625308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5625369Z outputs = self.model( 2025-08-14T21:48:06.5625598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5625691Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5625892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5625973Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5626197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5626286Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5626518Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5626605Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5626608Z 2025-08-14T21:48:06.5626703Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5626900Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5626961Z return mod(**inputs) 2025-08-14T21:48:06.5627184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5627252Z outputs = self.model( 2025-08-14T21:48:06.5627475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5627550Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5627749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5627820Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5628049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5628137Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5628365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5628481Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5628485Z 2025-08-14T21:48:06.5628577Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5628767Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5628826Z return mod(**inputs) 2025-08-14T21:48:06.5629049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5629118Z outputs = self.model( 2025-08-14T21:48:06.5629342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5629413Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5629628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5629701Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5629927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5630029Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5630257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5630346Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5630349Z 2025-08-14T21:48:06.5630439Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5630624Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5630681Z return mod(**inputs) 2025-08-14T21:48:06.5630908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5630990Z outputs = self.model( 2025-08-14T21:48:06.5631212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5631283Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5631480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5631552Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5631779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5631884Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5631887Z 2025-08-14T21:48:06.5631985Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5632164Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5632222Z return mod(**inputs) 2025-08-14T21:48:06.5632453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5632510Z outputs = self.model( 2025-08-14T21:48:06.5632733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5632802Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5632999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5633075Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5633296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5633399Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5633598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5633661Z return self.act(input) 2025-08-14T21:48:06.5633665Z 2025-08-14T21:48:06.5633756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5633941Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5633999Z return mod(**inputs) 2025-08-14T21:48:06.5634228Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5634288Z outputs = self.model( 2025-08-14T21:48:06.5634508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5634579Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5634775Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5634858Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5635091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5635163Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5635166Z 2025-08-14T21:48:06.5635264Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5635464Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5635539Z return mod(**inputs) 2025-08-14T21:48:06.5635768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5635827Z outputs = self.model( 2025-08-14T21:48:06.5636059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5636124Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5636327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5636421Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5636642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5636731Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5636961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5637061Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5637065Z 2025-08-14T21:48:06.5637161Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5637340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5637397Z return mod(**inputs) 2025-08-14T21:48:06.5637627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5637689Z outputs = self.model( 2025-08-14T21:48:06.5637919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5637984Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5638184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5638261Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5638483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5638570Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5638799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5638870Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5638874Z 2025-08-14T21:48:06.5638974Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5639154Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5639214Z return mod(**inputs) 2025-08-14T21:48:06.5639442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5639503Z outputs = self.model( 2025-08-14T21:48:06.5639727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5639799Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5639996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5640070Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5640305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5640397Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5640628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5640726Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5640765Z 2025-08-14T21:48:06.5640864Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5641062Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5641121Z return mod(**inputs) 2025-08-14T21:48:06.5641350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5641410Z outputs = self.model( 2025-08-14T21:48:06.5641633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5641706Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5641924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5642001Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5642233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5642318Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5642560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5642679Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5642683Z 2025-08-14T21:48:06.5642780Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5642962Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5643022Z return mod(**inputs) 2025-08-14T21:48:06.5643261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5643320Z outputs = self.model( 2025-08-14T21:48:06.5643548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5643618Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5643822Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5643899Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5644127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5644213Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5644447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5644526Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5644529Z 2025-08-14T21:48:06.5644627Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5644810Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5644868Z return mod(**inputs) 2025-08-14T21:48:06.5645101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5645163Z outputs = self.model( 2025-08-14T21:48:06.5645390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5645460Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5645662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5645754Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5645979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5646065Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5646306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5646392Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5646409Z 2025-08-14T21:48:06.5646501Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5646689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5646749Z return mod(**inputs) 2025-08-14T21:48:06.5646984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5647047Z outputs = self.model( 2025-08-14T21:48:06.5647276Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5648033Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5648234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5648310Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5648532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5648620Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5648850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5648962Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5648965Z 2025-08-14T21:48:06.5649055Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5649243Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5649303Z return mod(**inputs) 2025-08-14T21:48:06.5649533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5649596Z outputs = self.model( 2025-08-14T21:48:06.5649820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5649894Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5650094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5650163Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5650391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5650479Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5650708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5650780Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5650783Z 2025-08-14T21:48:06.5650874Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5651063Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5651121Z return mod(**inputs) 2025-08-14T21:48:06.5651352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5651411Z outputs = self.model( 2025-08-14T21:48:06.5651633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5651705Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5651916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5651991Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5652221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5652342Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5652346Z 2025-08-14T21:48:06.5652462Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5652640Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5652698Z return mod(**inputs) 2025-08-14T21:48:06.5652928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5652988Z outputs = self.model( 2025-08-14T21:48:06.5653216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5653297Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5653496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5653573Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5653799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5653906Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5654104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5654166Z return self.act(input) 2025-08-14T21:48:06.5654170Z 2025-08-14T21:48:06.5654267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5654447Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5654508Z return mod(**inputs) 2025-08-14T21:48:06.5654739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5654798Z outputs = self.model( 2025-08-14T21:48:06.5655022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5655092Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5655292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5655367Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5655589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5655662Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5655665Z 2025-08-14T21:48:06.5655765Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5655947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5656013Z return mod(**inputs) 2025-08-14T21:48:06.5656236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5656296Z outputs = self.model( 2025-08-14T21:48:06.5656526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5656594Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5656791Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5656868Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5657092Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 366, in forward 2025-08-14T21:48:06.5657195Z hidden_states = residual + hidden_states 2025-08-14T21:48:06.5657200Z 2025-08-14T21:48:06.5657292Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5657471Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5657538Z return mod(**inputs) 2025-08-14T21:48:06.5657777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5657852Z outputs = self.model( 2025-08-14T21:48:06.5658087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5658150Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5658355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5658421Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5658647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5658757Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5658979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5659086Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5659089Z 2025-08-14T21:48:06.5659179Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5659357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5659419Z return mod(**inputs) 2025-08-14T21:48:06.5659641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5659701Z outputs = self.model( 2025-08-14T21:48:06.5659934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5659999Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5660203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5660271Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5660496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5660590Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5660811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 175, in forward 2025-08-14T21:48:06.5660888Z key_states = self.k_proj(current_states) 2025-08-14T21:48:06.5660891Z 2025-08-14T21:48:06.5660983Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5661160Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5661224Z return mod(**inputs) 2025-08-14T21:48:06.5661451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5661511Z outputs = self.model( 2025-08-14T21:48:06.5661742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5661806Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5662013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5662081Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5662301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5662395Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5662632Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 156, in forward 2025-08-14T21:48:06.5662734Z query_states = self.q_proj(hidden_states) * self.scaling 2025-08-14T21:48:06.5662744Z 2025-08-14T21:48:06.5662836Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5663030Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5663096Z return mod(**inputs) 2025-08-14T21:48:06.5663319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5663396Z outputs = self.model( 2025-08-14T21:48:06.5663630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5663695Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5663905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5663977Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5664215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5664308Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5664529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 197, in forward 2025-08-14T21:48:06.5664653Z attn_weights = torch.bmm(query_states, key_states.transpose(1, 2)) 2025-08-14T21:48:06.5664663Z 2025-08-14T21:48:06.5664819Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5665008Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5665074Z return mod(**inputs) 2025-08-14T21:48:06.5665300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5665361Z outputs = self.model( 2025-08-14T21:48:06.5665590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5665656Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5665864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5665935Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5666161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5666257Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5666481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 176, in forward 2025-08-14T21:48:06.5666559Z value_states = self.v_proj(current_states) 2025-08-14T21:48:06.5666562Z 2025-08-14T21:48:06.5666665Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5666846Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5666916Z return mod(**inputs) 2025-08-14T21:48:06.5667143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5667204Z outputs = self.model( 2025-08-14T21:48:06.5667437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5667503Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5667704Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5667782Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5668007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5668116Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5668345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 243, in forward 2025-08-14T21:48:06.5668432Z attn_output = torch.bmm(attn_probs, value_states) 2025-08-14T21:48:06.5668435Z 2025-08-14T21:48:06.5668550Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5668732Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5668815Z return mod(**inputs) 2025-08-14T21:48:06.5669041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5669101Z outputs = self.model( 2025-08-14T21:48:06.5669333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5669397Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5669598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5669691Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5669913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5670009Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5670230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 256, in forward 2025-08-14T21:48:06.5670345Z attn_output = attn_output.reshape(bsz, tgt_len, self.embed_dim) 2025-08-14T21:48:06.5670348Z 2025-08-14T21:48:06.5670450Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5670631Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5670695Z return mod(**inputs) 2025-08-14T21:48:06.5670919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5670980Z outputs = self.model( 2025-08-14T21:48:06.5671206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5671270Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5671472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5671551Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5671773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 330, in forward 2025-08-14T21:48:06.5671865Z hidden_states, self_attn_weights = self.self_attn( 2025-08-14T21:48:06.5672087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 258, in forward 2025-08-14T21:48:06.5672163Z attn_output = self.out_proj(attn_output) 2025-08-14T21:48:06.5672167Z 2025-08-14T21:48:06.5672266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5672445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5672509Z return mod(**inputs) 2025-08-14T21:48:06.5672734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5672796Z outputs = self.model( 2025-08-14T21:48:06.5673024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5673088Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5673287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5673363Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5673599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5673716Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5673720Z 2025-08-14T21:48:06.5673812Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5674004Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5674071Z return mod(**inputs) 2025-08-14T21:48:06.5674315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5674375Z outputs = self.model( 2025-08-14T21:48:06.5674606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5674669Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5674882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5674976Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5675199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 362, in forward 2025-08-14T21:48:06.5675310Z hidden_states = self.activation_fn(self.fc1(hidden_states)) 2025-08-14T21:48:06.5675504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:06.5675574Z return self.act(input) 2025-08-14T21:48:06.5675577Z 2025-08-14T21:48:06.5675669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5675849Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5675914Z return mod(**inputs) 2025-08-14T21:48:06.5676135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 664, in forward 2025-08-14T21:48:06.5676197Z outputs = self.model( 2025-08-14T21:48:06.5676426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 552, in forward 2025-08-14T21:48:06.5676490Z layer_outputs = decoder_layer( 2025-08-14T21:48:06.5676695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:48:06.5676764Z return super().__call__(*args, **kwargs) 2025-08-14T21:48:06.5676986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 364, in forward 2025-08-14T21:48:06.5677064Z hidden_states = self.fc2(hidden_states) 2025-08-14T21:48:06.5677067Z 2025-08-14T21:48:06.5677159Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5677343Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5677401Z return mod(**inputs) 2025-08-14T21:48:06.5677623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 681, in forward 2025-08-14T21:48:06.5677704Z logits = self.lm_head(outputs[0]) 2025-08-14T21:48:06.5677707Z 2025-08-14T21:48:06.5677797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:06.5677974Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:06.5678039Z return mod(**inputs) 2025-08-14T21:48:06.5678264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xglm/modeling_xglm.py", line 685, in forward 2025-08-14T21:48:06.5678337Z loss = self.loss_function( 2025-08-14T21:48:06.5678558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 67, in ForCausalLMLoss 2025-08-14T21:48:06.5678716Z loss = fixed_cross_entropy(logits, shift_labels, num_items_in_batch, ignore_index, **kwargs) 2025-08-14T21:48:06.5678964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/loss/loss_utils.py", line 36, in fixed_cross_entropy 2025-08-14T21:48:06.5679147Z loss = nn.functional.cross_entropy(source, target, ignore_index=ignore_index, reduction=reduction) 2025-08-14T21:48:06.5679150Z 2025-08-14T21:48:17.1114429Z Compilation time (from dynamo_timed): 22.325687672 2025-08-14T21:48:17.1226077Z pass 2025-08-14T21:48:17.1228314Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:48:17.1229265Z TIMING: _recursive_pre_grad_passes:0.01117 _recursive_joint_graph_passes:0.71749 _recursive_post_grad_passes:0.25777 async_compile.wait:0.79236 code_gen:9.95972 inductor_compile:12.76364 backend_compile:18.1734 gc:0.00032 entire_frame_compile:22.32569 total_wall_time:22.32569 2025-08-14T21:48:17.1233331Z STATS: call_* op count: 921 | FakeTensorMode.__torch_dispatch__:29112 | FakeTensor.__torch_dispatch__:10687 | ProxyTorchDispatchMode.__torch_dispatch__:10816 2025-08-14T21:48:17.1237483Z Dynamo produced 1 graphs covering 921 ops with 0 graph breaks (0 unique) 2025-08-14T21:48:21.6900837Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:48:21.6901790Z from pkg_resources import resource_filename 2025-08-14T21:48:22.2709099Z 2025-08-14T21:48:25.2281542Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:48:25.2285800Z loading model: 0it [00:02, ?it/s] 2025-08-14T21:48:25.2308651Z cpu eval XLNetLMHeadModel 2025-08-14T21:48:27.4267112Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:48:28.1933165Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:48:28.9461333Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:48:47.8669208Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8669740Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8673864Z return mod(**inputs) 2025-08-14T21:48:47.8678841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8679436Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8679946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1307, in forward 2025-08-14T21:48:47.8685388Z word_emb_k = self.word_embedding(input_ids) 2025-08-14T21:48:47.8685573Z 2025-08-14T21:48:47.8685798Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8686182Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8686510Z return mod(**inputs) 2025-08-14T21:48:47.8686881Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8689558Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8690179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1334, in forward 2025-08-14T21:48:47.8690749Z pos_emb = self.relative_positional_encoding(qlen, klen, bsz=bsz) 2025-08-14T21:48:47.8691220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1157, in relative_positional_encoding 2025-08-14T21:48:47.8691785Z pos_emb = self.positional_embedding(fwd_pos_seq, inv_freq, bsz) 2025-08-14T21:48:47.8692241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1115, in positional_embedding 2025-08-14T21:48:47.8692916Z pos_emb = torch.cat([torch.sin(sinusoid_inp), torch.cos(sinusoid_inp)], dim=-1) 2025-08-14T21:48:47.8693120Z 2025-08-14T21:48:47.8693232Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8693574Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8693891Z return mod(**inputs) 2025-08-14T21:48:47.8694998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8695438Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8695821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1334, in forward 2025-08-14T21:48:47.8696229Z pos_emb = self.relative_positional_encoding(qlen, klen, bsz=bsz) 2025-08-14T21:48:47.8696687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1157, in relative_positional_encoding 2025-08-14T21:48:47.8697179Z pos_emb = self.positional_embedding(fwd_pos_seq, inv_freq, bsz) 2025-08-14T21:48:47.8697611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1115, in positional_embedding 2025-08-14T21:48:47.8698075Z pos_emb = torch.cat([torch.sin(sinusoid_inp), torch.cos(sinusoid_inp)], dim=-1) 2025-08-14T21:48:47.8698269Z 2025-08-14T21:48:47.8698377Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8698708Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8699016Z return mod(**inputs) 2025-08-14T21:48:47.8699362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8699747Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8700113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8700472Z outputs = layer_module( 2025-08-14T21:48:47.8700813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8701163Z outputs = self.rel_attn( 2025-08-14T21:48:47.8701508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8701885Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8702274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8702680Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8702840Z 2025-08-14T21:48:47.8702943Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8703286Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8703590Z return mod(**inputs) 2025-08-14T21:48:47.8703920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8704286Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8704655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8705116Z outputs = layer_module( 2025-08-14T21:48:47.8705458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8705812Z outputs = self.rel_attn( 2025-08-14T21:48:47.8706156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8706523Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8706940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8707359Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8707516Z 2025-08-14T21:48:47.8707619Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8707981Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8708286Z return mod(**inputs) 2025-08-14T21:48:47.8708650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8709010Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8709377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8709732Z outputs = layer_module( 2025-08-14T21:48:47.8710069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8710435Z outputs = self.rel_attn( 2025-08-14T21:48:47.8710774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8711148Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8711527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8711942Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8712102Z 2025-08-14T21:48:47.8712199Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8712535Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8712837Z return mod(**inputs) 2025-08-14T21:48:47.8713173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8713544Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8713901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8714257Z outputs = layer_module( 2025-08-14T21:48:47.8714595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8714951Z outputs = self.rel_attn( 2025-08-14T21:48:47.8715281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8715652Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8716033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8716442Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8716596Z 2025-08-14T21:48:47.8716695Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8717025Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8717324Z return mod(**inputs) 2025-08-14T21:48:47.8717651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8718017Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8718382Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8718734Z outputs = layer_module( 2025-08-14T21:48:47.8719062Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8719415Z outputs = self.rel_attn( 2025-08-14T21:48:47.8719766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8720146Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8720526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8720947Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8721102Z 2025-08-14T21:48:47.8721205Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8721542Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8721870Z return mod(**inputs) 2025-08-14T21:48:47.8722198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8722564Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8722927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8723300Z outputs = layer_module( 2025-08-14T21:48:47.8723636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8723986Z outputs = self.rel_attn( 2025-08-14T21:48:47.8724331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8724691Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8725073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8725479Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8725630Z 2025-08-14T21:48:47.8725732Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8726051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8726351Z return mod(**inputs) 2025-08-14T21:48:47.8726686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8727044Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8727409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8727765Z outputs = layer_module( 2025-08-14T21:48:47.8728101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8728444Z outputs = self.rel_attn( 2025-08-14T21:48:47.8728779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8729145Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8729527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8729925Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8730083Z 2025-08-14T21:48:47.8730177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8730502Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8730792Z return mod(**inputs) 2025-08-14T21:48:47.8731126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8731501Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8731867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8732210Z outputs = layer_module( 2025-08-14T21:48:47.8732562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8732917Z outputs = self.rel_attn( 2025-08-14T21:48:47.8733245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8733611Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8734027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8734452Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8734607Z 2025-08-14T21:48:47.8734704Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8735040Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8735344Z return mod(**inputs) 2025-08-14T21:48:47.8735684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8736063Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8736424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8736775Z outputs = layer_module( 2025-08-14T21:48:47.8737107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8737465Z outputs = self.rel_attn( 2025-08-14T21:48:47.8737802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8738171Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8738546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8738952Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8739104Z 2025-08-14T21:48:47.8739210Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8739538Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8739829Z return mod(**inputs) 2025-08-14T21:48:47.8740165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8740531Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8740890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8741239Z outputs = layer_module( 2025-08-14T21:48:47.8741577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8741928Z outputs = self.rel_attn( 2025-08-14T21:48:47.8742260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8742634Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8743013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8743410Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8743566Z 2025-08-14T21:48:47.8743662Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8743990Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8744287Z return mod(**inputs) 2025-08-14T21:48:47.8744636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8745103Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8745503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8745876Z outputs = layer_module( 2025-08-14T21:48:47.8746206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8746556Z outputs = self.rel_attn( 2025-08-14T21:48:47.8746910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8747294Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8747681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8748088Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8748241Z 2025-08-14T21:48:47.8748344Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8748666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8748987Z return mod(**inputs) 2025-08-14T21:48:47.8749319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8749680Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8750032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8750382Z outputs = layer_module( 2025-08-14T21:48:47.8750720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8751064Z outputs = self.rel_attn( 2025-08-14T21:48:47.8751399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8751768Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8752157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8752553Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8752710Z 2025-08-14T21:48:47.8752805Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8753139Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8753427Z return mod(**inputs) 2025-08-14T21:48:47.8753761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8754121Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8754481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8754826Z outputs = layer_module( 2025-08-14T21:48:47.8755165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8755519Z outputs = self.rel_attn( 2025-08-14T21:48:47.8755855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8756214Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8756593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8757004Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8757154Z 2025-08-14T21:48:47.8757249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8757576Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8757871Z return mod(**inputs) 2025-08-14T21:48:47.8758217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8758577Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8758940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8759289Z outputs = layer_module( 2025-08-14T21:48:47.8759629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8760001Z outputs = self.rel_attn( 2025-08-14T21:48:47.8760333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8760703Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8761077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8761484Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8761641Z 2025-08-14T21:48:47.8761755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8762090Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8762388Z return mod(**inputs) 2025-08-14T21:48:47.8762729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8763098Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8763459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8763818Z outputs = layer_module( 2025-08-14T21:48:47.8764158Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8764513Z outputs = self.rel_attn( 2025-08-14T21:48:47.8764854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8765230Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8765619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8766049Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8766206Z 2025-08-14T21:48:47.8766305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8766641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8766952Z return mod(**inputs) 2025-08-14T21:48:47.8767286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8767662Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8768034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8768394Z outputs = layer_module( 2025-08-14T21:48:47.8768731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8769089Z outputs = self.rel_attn( 2025-08-14T21:48:47.8769438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8769817Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8770218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8770631Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8770784Z 2025-08-14T21:48:47.8770891Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8771241Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8771543Z return mod(**inputs) 2025-08-14T21:48:47.8771874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8772236Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8772611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8772984Z outputs = layer_module( 2025-08-14T21:48:47.8773321Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8773662Z outputs = self.rel_attn( 2025-08-14T21:48:47.8774000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8774370Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8774764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8775180Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8775338Z 2025-08-14T21:48:47.8775432Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8775760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8776058Z return mod(**inputs) 2025-08-14T21:48:47.8776384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8777166Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8777530Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8777873Z outputs = layer_module( 2025-08-14T21:48:47.8778211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8778563Z outputs = self.rel_attn( 2025-08-14T21:48:47.8778916Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8779281Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8779664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8780074Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8780230Z 2025-08-14T21:48:47.8780331Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8780650Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8780946Z return mod(**inputs) 2025-08-14T21:48:47.8781281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8781648Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8782012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8782362Z outputs = layer_module( 2025-08-14T21:48:47.8782695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8783043Z outputs = self.rel_attn( 2025-08-14T21:48:47.8783381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8783747Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8784130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8784573Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8785016Z 2025-08-14T21:48:47.8785128Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8785473Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8785772Z return mod(**inputs) 2025-08-14T21:48:47.8786146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8786535Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8786939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8787292Z outputs = layer_module( 2025-08-14T21:48:47.8787637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8787997Z outputs = self.rel_attn( 2025-08-14T21:48:47.8788337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8788747Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8789138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8789557Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8789714Z 2025-08-14T21:48:47.8789811Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8790150Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8790475Z return mod(**inputs) 2025-08-14T21:48:47.8790816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8791186Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8791560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8791922Z outputs = layer_module( 2025-08-14T21:48:47.8792257Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8792623Z outputs = self.rel_attn( 2025-08-14T21:48:47.8792967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8793346Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8793729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8794147Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8794300Z 2025-08-14T21:48:47.8794405Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8794748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8795048Z return mod(**inputs) 2025-08-14T21:48:47.8795392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8795772Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8796138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8796496Z outputs = layer_module( 2025-08-14T21:48:47.8796834Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8797191Z outputs = self.rel_attn( 2025-08-14T21:48:47.8797532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8797909Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8798318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8798739Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8798895Z 2025-08-14T21:48:47.8798992Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8799350Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8799670Z return mod(**inputs) 2025-08-14T21:48:47.8800016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8800385Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8800747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8801094Z outputs = layer_module( 2025-08-14T21:48:47.8801421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8801790Z outputs = self.rel_attn( 2025-08-14T21:48:47.8802130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8802491Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8802879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8803288Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8803439Z 2025-08-14T21:48:47.8803541Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8803861Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8804163Z return mod(**inputs) 2025-08-14T21:48:47.8804500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8804866Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8805219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8805569Z outputs = layer_module( 2025-08-14T21:48:47.8805905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8806249Z outputs = self.rel_attn( 2025-08-14T21:48:47.8806590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8806962Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8807342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8807741Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8807899Z 2025-08-14T21:48:47.8807997Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8808329Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8808632Z return mod(**inputs) 2025-08-14T21:48:47.8808961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8809329Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8809692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8810036Z outputs = layer_module( 2025-08-14T21:48:47.8810373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8810726Z outputs = self.rel_attn( 2025-08-14T21:48:47.8811081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.8811457Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.8811604Z 2025-08-14T21:48:47.8811699Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8812059Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8812351Z return mod(**inputs) 2025-08-14T21:48:47.8812705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8813073Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8813434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8813774Z outputs = layer_module( 2025-08-14T21:48:47.8814109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8814477Z outputs = self.rel_attn( 2025-08-14T21:48:47.8814812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.8815185Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.8815335Z 2025-08-14T21:48:47.8815431Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8815760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8816053Z return mod(**inputs) 2025-08-14T21:48:47.8816385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8816745Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8817105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8817445Z outputs = layer_module( 2025-08-14T21:48:47.8817781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8818131Z outputs = self.rel_attn( 2025-08-14T21:48:47.8818461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8818817Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8819185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.8819610Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.8819785Z 2025-08-14T21:48:47.8819881Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8820212Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8820506Z return mod(**inputs) 2025-08-14T21:48:47.8820841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8821203Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8821566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1334, in forward 2025-08-14T21:48:47.8821973Z pos_emb = self.relative_positional_encoding(qlen, klen, bsz=bsz) 2025-08-14T21:48:47.8822420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1157, in relative_positional_encoding 2025-08-14T21:48:47.8822875Z pos_emb = self.positional_embedding(fwd_pos_seq, inv_freq, bsz) 2025-08-14T21:48:47.8823309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1115, in positional_embedding 2025-08-14T21:48:47.8823790Z pos_emb = torch.cat([torch.sin(sinusoid_inp), torch.cos(sinusoid_inp)], dim=-1) 2025-08-14T21:48:47.8823980Z 2025-08-14T21:48:47.8824077Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8824408Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8824708Z return mod(**inputs) 2025-08-14T21:48:47.8825141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8825524Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8825892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8826251Z outputs = layer_module( 2025-08-14T21:48:47.8826585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8826944Z outputs = self.rel_attn( 2025-08-14T21:48:47.8827288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.8827729Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.8827905Z 2025-08-14T21:48:47.8827999Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8828330Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8828628Z return mod(**inputs) 2025-08-14T21:48:47.8828964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8829323Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8829683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8830033Z outputs = layer_module( 2025-08-14T21:48:47.8830360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8830713Z outputs = self.rel_attn( 2025-08-14T21:48:47.8831049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8831404Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8831766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.8832190Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.8832360Z 2025-08-14T21:48:47.8832461Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8832789Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8833077Z return mod(**inputs) 2025-08-14T21:48:47.8833410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8833772Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8834129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8834475Z outputs = layer_module( 2025-08-14T21:48:47.8834811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8835160Z outputs = self.rel_attn( 2025-08-14T21:48:47.8835487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.8835866Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.8836005Z 2025-08-14T21:48:47.8836105Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8836423Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8836735Z return mod(**inputs) 2025-08-14T21:48:47.8837074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8837438Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8837808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8838159Z outputs = layer_module( 2025-08-14T21:48:47.8838511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8838864Z outputs = self.rel_attn( 2025-08-14T21:48:47.8839195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8839552Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8839921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.8840347Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.8840518Z 2025-08-14T21:48:47.8840614Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8840955Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8841255Z return mod(**inputs) 2025-08-14T21:48:47.8841583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8841944Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8842305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8842646Z outputs = layer_module( 2025-08-14T21:48:47.8842986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8843339Z outputs = self.rel_attn( 2025-08-14T21:48:47.8843674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8844036Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8844422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8844831Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8844983Z 2025-08-14T21:48:47.8845085Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8845406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8845704Z return mod(**inputs) 2025-08-14T21:48:47.8846037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8846399Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8846764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8847112Z outputs = layer_module( 2025-08-14T21:48:47.8847445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8847790Z outputs = self.rel_attn( 2025-08-14T21:48:47.8848128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8848498Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8848873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8849279Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8849437Z 2025-08-14T21:48:47.8849547Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8849877Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8850164Z return mod(**inputs) 2025-08-14T21:48:47.8850513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8850880Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8851271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8851614Z outputs = layer_module( 2025-08-14T21:48:47.8851946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8852432Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8852915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8853304Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8853666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8854029Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8854375Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.8854737Z output = self.layer_1(output) 2025-08-14T21:48:47.8854853Z 2025-08-14T21:48:47.8854959Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8855298Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8855594Z return mod(**inputs) 2025-08-14T21:48:47.8855932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8856302Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8856662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8857018Z outputs = layer_module( 2025-08-14T21:48:47.8857355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8857841Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8858319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8858682Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8859039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8859399Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8859756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.8860134Z output = self.activation_function(output) 2025-08-14T21:48:47.8860475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.8860794Z return self.act(input) 2025-08-14T21:48:47.8860908Z 2025-08-14T21:48:47.8861007Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8861349Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8861659Z return mod(**inputs) 2025-08-14T21:48:47.8861997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8862377Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8862768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8863130Z outputs = layer_module( 2025-08-14T21:48:47.8863485Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8863981Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8864481Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8864906Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8865268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8865626Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8865977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.8866347Z output = self.layer_2(output) 2025-08-14T21:48:47.8866472Z 2025-08-14T21:48:47.8866569Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8866906Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8867199Z return mod(**inputs) 2025-08-14T21:48:47.8867535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8867905Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8868271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8868614Z outputs = layer_module( 2025-08-14T21:48:47.8868951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8869307Z outputs = self.rel_attn( 2025-08-14T21:48:47.8869650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.8870018Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.8870165Z 2025-08-14T21:48:47.8870263Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8870589Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8870882Z return mod(**inputs) 2025-08-14T21:48:47.8871219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8871586Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8871948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8872292Z outputs = layer_module( 2025-08-14T21:48:47.8872630Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8872983Z outputs = self.rel_attn( 2025-08-14T21:48:47.8873318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.8873699Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.8873853Z 2025-08-14T21:48:47.8873949Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8874275Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8874568Z return mod(**inputs) 2025-08-14T21:48:47.8874900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8875262Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8875642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8875990Z outputs = layer_module( 2025-08-14T21:48:47.8876327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8876678Z outputs = self.rel_attn( 2025-08-14T21:48:47.8877029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8877417Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8877785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.8878213Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.8878386Z 2025-08-14T21:48:47.8878480Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8878813Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8879132Z return mod(**inputs) 2025-08-14T21:48:47.8879462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8879827Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8880193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8880547Z outputs = layer_module( 2025-08-14T21:48:47.8880882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8881234Z outputs = self.rel_attn( 2025-08-14T21:48:47.8881575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.8881984Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.8882162Z 2025-08-14T21:48:47.8882258Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8882591Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8882892Z return mod(**inputs) 2025-08-14T21:48:47.8883219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8883591Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8883955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8884306Z outputs = layer_module( 2025-08-14T21:48:47.8884804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8885169Z outputs = self.rel_attn( 2025-08-14T21:48:47.8885516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8885870Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8886241Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.8886672Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.8886846Z 2025-08-14T21:48:47.8886953Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8887279Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8887582Z return mod(**inputs) 2025-08-14T21:48:47.8887918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8888290Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8888690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8889046Z outputs = layer_module( 2025-08-14T21:48:47.8889380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8889721Z outputs = self.rel_attn( 2025-08-14T21:48:47.8890087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.8890502Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.8890646Z 2025-08-14T21:48:47.8890750Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8891079Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8891383Z return mod(**inputs) 2025-08-14T21:48:47.8891723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8892111Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8892467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8892814Z outputs = layer_module( 2025-08-14T21:48:47.8893150Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8893498Z outputs = self.rel_attn( 2025-08-14T21:48:47.8893837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8894192Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8894556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.8894969Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.8895138Z 2025-08-14T21:48:47.8895234Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8895568Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8895857Z return mod(**inputs) 2025-08-14T21:48:47.8896192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8896559Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8896922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8897264Z outputs = layer_module( 2025-08-14T21:48:47.8897598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8897947Z outputs = self.rel_attn( 2025-08-14T21:48:47.8898284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8898648Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8899033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8899439Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8899591Z 2025-08-14T21:48:47.8899686Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8900012Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8900306Z return mod(**inputs) 2025-08-14T21:48:47.8900638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8900994Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8901370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8901724Z outputs = layer_module( 2025-08-14T21:48:47.8902052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8902401Z outputs = self.rel_attn( 2025-08-14T21:48:47.8902756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8903142Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8903517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8903923Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8904082Z 2025-08-14T21:48:47.8904177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8904506Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8904858Z return mod(**inputs) 2025-08-14T21:48:47.8905217Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8905583Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8905939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8906291Z outputs = layer_module( 2025-08-14T21:48:47.8906638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8907122Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8907600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8907971Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8908330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8908690Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8909031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.8909385Z output = self.layer_1(output) 2025-08-14T21:48:47.8909499Z 2025-08-14T21:48:47.8909601Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8909924Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8910225Z return mod(**inputs) 2025-08-14T21:48:47.8910556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8910917Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8911272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8911626Z outputs = layer_module( 2025-08-14T21:48:47.8911961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8912439Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8912915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8913277Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8913629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8913973Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8914336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.8914715Z output = self.activation_function(output) 2025-08-14T21:48:47.8915055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.8915375Z return self.act(input) 2025-08-14T21:48:47.8915488Z 2025-08-14T21:48:47.8915603Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8915935Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8916246Z return mod(**inputs) 2025-08-14T21:48:47.8916582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8916952Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8917318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8917664Z outputs = layer_module( 2025-08-14T21:48:47.8918002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8918495Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8918975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8919331Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8919686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8920041Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8920379Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.8920734Z output = self.layer_2(output) 2025-08-14T21:48:47.8920855Z 2025-08-14T21:48:47.8920953Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8921284Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8921577Z return mod(**inputs) 2025-08-14T21:48:47.8921910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8922280Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8922644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8922988Z outputs = layer_module( 2025-08-14T21:48:47.8923323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8923675Z outputs = self.rel_attn( 2025-08-14T21:48:47.8924006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.8924386Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.8924532Z 2025-08-14T21:48:47.8924626Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8924953Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8925244Z return mod(**inputs) 2025-08-14T21:48:47.8925574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8925942Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8926298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8926646Z outputs = layer_module( 2025-08-14T21:48:47.8926978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8927342Z outputs = self.rel_attn( 2025-08-14T21:48:47.8927680Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.8928063Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.8928203Z 2025-08-14T21:48:47.8928304Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8928660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8928969Z return mod(**inputs) 2025-08-14T21:48:47.8929306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8929673Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8930033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8930383Z outputs = layer_module( 2025-08-14T21:48:47.8930719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8931088Z outputs = self.rel_attn( 2025-08-14T21:48:47.8931419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8931775Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8932141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.8932559Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.8932739Z 2025-08-14T21:48:47.8932833Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8933161Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8933458Z return mod(**inputs) 2025-08-14T21:48:47.8933783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8934149Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8934509Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8934858Z outputs = layer_module( 2025-08-14T21:48:47.8935184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8935536Z outputs = self.rel_attn( 2025-08-14T21:48:47.8935874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.8936274Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.8936455Z 2025-08-14T21:48:47.8936549Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8936881Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8937184Z return mod(**inputs) 2025-08-14T21:48:47.8937507Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8937870Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8938237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8938590Z outputs = layer_module( 2025-08-14T21:48:47.8938920Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8939269Z outputs = self.rel_attn( 2025-08-14T21:48:47.8939607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8939955Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8940346Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.8940772Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.8940942Z 2025-08-14T21:48:47.8941044Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8941382Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8941699Z return mod(**inputs) 2025-08-14T21:48:47.8942041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8942408Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8942780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8943140Z outputs = layer_module( 2025-08-14T21:48:47.8943484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8943847Z outputs = self.rel_attn( 2025-08-14T21:48:47.8944186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.8944571Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.8944716Z 2025-08-14T21:48:47.8944882Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8945221Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8945522Z return mod(**inputs) 2025-08-14T21:48:47.8945861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8946228Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8946598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8946955Z outputs = layer_module( 2025-08-14T21:48:47.8947295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8947640Z outputs = self.rel_attn( 2025-08-14T21:48:47.8947983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8948345Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8948707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.8949131Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.8949303Z 2025-08-14T21:48:47.8949400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8949734Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8950029Z return mod(**inputs) 2025-08-14T21:48:47.8950364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8950736Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8951105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8951455Z outputs = layer_module( 2025-08-14T21:48:47.8951795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8952149Z outputs = self.rel_attn( 2025-08-14T21:48:47.8952482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8952857Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8953267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8953682Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8953835Z 2025-08-14T21:48:47.8953931Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8954279Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8954586Z return mod(**inputs) 2025-08-14T21:48:47.8954941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8955315Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8955684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8956042Z outputs = layer_module( 2025-08-14T21:48:47.8956377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8956746Z outputs = self.rel_attn( 2025-08-14T21:48:47.8957084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.8957452Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.8957832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.8958243Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.8958395Z 2025-08-14T21:48:47.8958497Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8958821Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8959121Z return mod(**inputs) 2025-08-14T21:48:47.8959458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8959826Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8960181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8960528Z outputs = layer_module( 2025-08-14T21:48:47.8960866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8961346Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8961825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8962188Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8962545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8962897Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8963246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.8963597Z output = self.layer_1(output) 2025-08-14T21:48:47.8963710Z 2025-08-14T21:48:47.8963812Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8964137Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8964435Z return mod(**inputs) 2025-08-14T21:48:47.8964768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8965130Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8965487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8965835Z outputs = layer_module( 2025-08-14T21:48:47.8966188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8966658Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8967155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8967521Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8967896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8968244Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8968587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.8968951Z output = self.activation_function(output) 2025-08-14T21:48:47.8969281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.8969607Z return self.act(input) 2025-08-14T21:48:47.8969715Z 2025-08-14T21:48:47.8969811Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8970139Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8970431Z return mod(**inputs) 2025-08-14T21:48:47.8970764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8971130Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8971494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8971846Z outputs = layer_module( 2025-08-14T21:48:47.8972182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.8972659Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.8973135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.8973498Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.8973857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.8974214Z output_x = self.ff(output_x) 2025-08-14T21:48:47.8974553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.8974904Z output = self.layer_2(output) 2025-08-14T21:48:47.8975017Z 2025-08-14T21:48:47.8975119Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8975449Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8975738Z return mod(**inputs) 2025-08-14T21:48:47.8976073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8976438Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8976792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8977142Z outputs = layer_module( 2025-08-14T21:48:47.8977478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8977829Z outputs = self.rel_attn( 2025-08-14T21:48:47.8978159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.8978536Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.8978676Z 2025-08-14T21:48:47.8978808Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8979141Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8979427Z return mod(**inputs) 2025-08-14T21:48:47.8979753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8980129Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8980486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8980855Z outputs = layer_module( 2025-08-14T21:48:47.8981188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8981538Z outputs = self.rel_attn( 2025-08-14T21:48:47.8981873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.8982252Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.8982415Z 2025-08-14T21:48:47.8982518Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8982839Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8983140Z return mod(**inputs) 2025-08-14T21:48:47.8983474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8983841Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8984197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8984546Z outputs = layer_module( 2025-08-14T21:48:47.8985105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8985457Z outputs = self.rel_attn( 2025-08-14T21:48:47.8985802Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8986167Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8986538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.8986960Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.8987148Z 2025-08-14T21:48:47.8987245Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8987579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8987880Z return mod(**inputs) 2025-08-14T21:48:47.8988207Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8988576Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8988943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8989288Z outputs = layer_module( 2025-08-14T21:48:47.8989623Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8989975Z outputs = self.rel_attn( 2025-08-14T21:48:47.8990315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.8990721Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.8990903Z 2025-08-14T21:48:47.8990999Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8991329Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8991632Z return mod(**inputs) 2025-08-14T21:48:47.8991996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8992365Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8992727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8993069Z outputs = layer_module( 2025-08-14T21:48:47.8993430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8993806Z outputs = self.rel_attn( 2025-08-14T21:48:47.8994145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.8994493Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.8994860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.8995279Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.8995476Z 2025-08-14T21:48:47.8995583Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8995913Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8996215Z return mod(**inputs) 2025-08-14T21:48:47.8996557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.8996924Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.8997291Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.8997646Z outputs = layer_module( 2025-08-14T21:48:47.8997985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.8998333Z outputs = self.rel_attn( 2025-08-14T21:48:47.8998676Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.8999068Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.8999213Z 2025-08-14T21:48:47.8999311Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.8999648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.8999956Z return mod(**inputs) 2025-08-14T21:48:47.9000299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9000666Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9001035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9001395Z outputs = layer_module( 2025-08-14T21:48:47.9001734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9002096Z outputs = self.rel_attn( 2025-08-14T21:48:47.9002440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9002803Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9003171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9003597Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9003763Z 2025-08-14T21:48:47.9003871Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9004207Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9004504Z return mod(**inputs) 2025-08-14T21:48:47.9004862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9005233Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9005585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9005933Z outputs = layer_module( 2025-08-14T21:48:47.9006282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9006652Z outputs = self.rel_attn( 2025-08-14T21:48:47.9006989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9007369Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9007763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9008180Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9008336Z 2025-08-14T21:48:47.9008452Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9008782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9009078Z return mod(**inputs) 2025-08-14T21:48:47.9009409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9009777Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9010140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9010493Z outputs = layer_module( 2025-08-14T21:48:47.9010821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9011170Z outputs = self.rel_attn( 2025-08-14T21:48:47.9011508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9011871Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9012255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9012660Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9012810Z 2025-08-14T21:48:47.9012913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9013233Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9013529Z return mod(**inputs) 2025-08-14T21:48:47.9013859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9014223Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9014582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9014931Z outputs = layer_module( 2025-08-14T21:48:47.9015264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9015737Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9016225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9016588Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9016943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9017293Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9017639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9018011Z output = self.layer_1(output) 2025-08-14T21:48:47.9018128Z 2025-08-14T21:48:47.9018230Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9018553Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9018850Z return mod(**inputs) 2025-08-14T21:48:47.9019202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9019587Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9019959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9020315Z outputs = layer_module( 2025-08-14T21:48:47.9020731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9021287Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9021918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9022285Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9022649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9023016Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9023358Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9023735Z output = self.activation_function(output) 2025-08-14T21:48:47.9024069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9024395Z return self.act(input) 2025-08-14T21:48:47.9024500Z 2025-08-14T21:48:47.9024597Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9024998Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9025304Z return mod(**inputs) 2025-08-14T21:48:47.9025633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9026009Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9026374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9026731Z outputs = layer_module( 2025-08-14T21:48:47.9027064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9027545Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9028032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9028401Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9028757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9029117Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9029467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9029819Z output = self.layer_2(output) 2025-08-14T21:48:47.9029940Z 2025-08-14T21:48:47.9030036Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9030367Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9030664Z return mod(**inputs) 2025-08-14T21:48:47.9031009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9031379Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9031743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9032094Z outputs = layer_module( 2025-08-14T21:48:47.9032439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9032808Z outputs = self.rel_attn( 2025-08-14T21:48:47.9033148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9033520Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9033668Z 2025-08-14T21:48:47.9033762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9034092Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9034393Z return mod(**inputs) 2025-08-14T21:48:47.9034740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9035110Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9035478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9035828Z outputs = layer_module( 2025-08-14T21:48:47.9036167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9036521Z outputs = self.rel_attn( 2025-08-14T21:48:47.9036874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9037252Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9037401Z 2025-08-14T21:48:47.9037495Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9037826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9038128Z return mod(**inputs) 2025-08-14T21:48:47.9038461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9038832Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9039201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9039550Z outputs = layer_module( 2025-08-14T21:48:47.9039890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9040241Z outputs = self.rel_attn( 2025-08-14T21:48:47.9040577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9040927Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9041298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9041727Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9041900Z 2025-08-14T21:48:47.9041998Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9042331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9042634Z return mod(**inputs) 2025-08-14T21:48:47.9042969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9043330Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9043697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9044066Z outputs = layer_module( 2025-08-14T21:48:47.9044401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9044745Z outputs = self.rel_attn( 2025-08-14T21:48:47.9045082Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9045506Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9045699Z 2025-08-14T21:48:47.9045792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9046124Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9046425Z return mod(**inputs) 2025-08-14T21:48:47.9046663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9046745Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9046982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9047062Z outputs = layer_module( 2025-08-14T21:48:47.9047301Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9047366Z outputs = self.rel_attn( 2025-08-14T21:48:47.9047605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9047674Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9047923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9048050Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9048054Z 2025-08-14T21:48:47.9048147Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9048332Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9048402Z return mod(**inputs) 2025-08-14T21:48:47.9048635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9048720Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9048952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9049015Z outputs = layer_module( 2025-08-14T21:48:47.9049255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9049318Z outputs = self.rel_attn( 2025-08-14T21:48:47.9049557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9049654Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9049658Z 2025-08-14T21:48:47.9049753Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9049942Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9050002Z return mod(**inputs) 2025-08-14T21:48:47.9050238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9050323Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9050553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9050622Z outputs = layer_module( 2025-08-14T21:48:47.9050851Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9050914Z outputs = self.rel_attn( 2025-08-14T21:48:47.9051168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9051239Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9051497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9051627Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9051631Z 2025-08-14T21:48:47.9051743Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9051931Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9051990Z return mod(**inputs) 2025-08-14T21:48:47.9052223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9052306Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9052539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9052624Z outputs = layer_module( 2025-08-14T21:48:47.9052854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9052917Z outputs = self.rel_attn( 2025-08-14T21:48:47.9053153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9053237Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9053488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9053597Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9053601Z 2025-08-14T21:48:47.9053693Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9053883Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9053942Z return mod(**inputs) 2025-08-14T21:48:47.9054174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9054254Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9054486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9054554Z outputs = layer_module( 2025-08-14T21:48:47.9054783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9054846Z outputs = self.rel_attn( 2025-08-14T21:48:47.9055080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9055162Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9055413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9055525Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9055528Z 2025-08-14T21:48:47.9055620Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9055809Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9055870Z return mod(**inputs) 2025-08-14T21:48:47.9056101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9056182Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9056411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9056478Z outputs = layer_module( 2025-08-14T21:48:47.9056724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9056924Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9057190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9057265Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9057533Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9057601Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9057831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9057904Z output = self.layer_1(output) 2025-08-14T21:48:47.9057907Z 2025-08-14T21:48:47.9058001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9058184Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9058267Z return mod(**inputs) 2025-08-14T21:48:47.9058499Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9058583Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9058817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9058879Z outputs = layer_module( 2025-08-14T21:48:47.9059115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9059307Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9059557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9059630Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9059864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9059937Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9060172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9060254Z output = self.activation_function(output) 2025-08-14T21:48:47.9060459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9060522Z return self.act(input) 2025-08-14T21:48:47.9060525Z 2025-08-14T21:48:47.9060626Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9060811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9060871Z return mod(**inputs) 2025-08-14T21:48:47.9061112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9061185Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9061419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9061486Z outputs = layer_module( 2025-08-14T21:48:47.9061718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9061914Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9062154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9062223Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9062480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9062550Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9062787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9062869Z output = self.layer_2(output) 2025-08-14T21:48:47.9062872Z 2025-08-14T21:48:47.9062968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9063173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9063232Z return mod(**inputs) 2025-08-14T21:48:47.9063463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9063546Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9063781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9063868Z outputs = layer_module( 2025-08-14T21:48:47.9064099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9064163Z outputs = self.rel_attn( 2025-08-14T21:48:47.9064404Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9064499Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9064502Z 2025-08-14T21:48:47.9064602Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9064858Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9064925Z return mod(**inputs) 2025-08-14T21:48:47.9065169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9065249Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9065483Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9065554Z outputs = layer_module( 2025-08-14T21:48:47.9065786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9065861Z outputs = self.rel_attn( 2025-08-14T21:48:47.9066093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9066188Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9066191Z 2025-08-14T21:48:47.9066295Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9066478Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9066548Z return mod(**inputs) 2025-08-14T21:48:47.9066782Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9066859Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9067103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9067165Z outputs = layer_module( 2025-08-14T21:48:47.9067399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9067473Z outputs = self.rel_attn( 2025-08-14T21:48:47.9067703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9067782Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9068051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9068176Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9068179Z 2025-08-14T21:48:47.9068280Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9068458Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9068541Z return mod(**inputs) 2025-08-14T21:48:47.9068777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9068867Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9069104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9069164Z outputs = layer_module( 2025-08-14T21:48:47.9069390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9069461Z outputs = self.rel_attn( 2025-08-14T21:48:47.9069712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9069845Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9069848Z 2025-08-14T21:48:47.9069942Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9070123Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9070192Z return mod(**inputs) 2025-08-14T21:48:47.9070423Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9070496Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9070732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9070794Z outputs = layer_module( 2025-08-14T21:48:47.9071035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9071095Z outputs = self.rel_attn( 2025-08-14T21:48:47.9071322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9071420Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9071667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9071794Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9071797Z 2025-08-14T21:48:47.9071889Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9072068Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9072134Z return mod(**inputs) 2025-08-14T21:48:47.9072368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9072442Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9072679Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9072739Z outputs = layer_module( 2025-08-14T21:48:47.9072972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9073036Z outputs = self.rel_attn( 2025-08-14T21:48:47.9073263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9073362Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9073365Z 2025-08-14T21:48:47.9073454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9073657Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9073721Z return mod(**inputs) 2025-08-14T21:48:47.9073952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9074034Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9074277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9074353Z outputs = layer_module( 2025-08-14T21:48:47.9074589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9074650Z outputs = self.rel_attn( 2025-08-14T21:48:47.9074884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9074950Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9075196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9075333Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9075337Z 2025-08-14T21:48:47.9075429Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9075617Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9075676Z return mod(**inputs) 2025-08-14T21:48:47.9075909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9075990Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9076221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9076281Z outputs = layer_module( 2025-08-14T21:48:47.9076517Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9076579Z outputs = self.rel_attn( 2025-08-14T21:48:47.9076816Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9076899Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9077148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9077258Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9077261Z 2025-08-14T21:48:47.9077352Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9077539Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9077597Z return mod(**inputs) 2025-08-14T21:48:47.9077828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9077913Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9078142Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9078202Z outputs = layer_module( 2025-08-14T21:48:47.9078440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9078502Z outputs = self.rel_attn( 2025-08-14T21:48:47.9078738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9078819Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9079066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9079190Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9079194Z 2025-08-14T21:48:47.9079290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9079470Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9079536Z return mod(**inputs) 2025-08-14T21:48:47.9079794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9079893Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9080124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9080186Z outputs = layer_module( 2025-08-14T21:48:47.9080422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9080615Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9080879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9080949Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9081183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9081257Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9081488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9081554Z output = self.layer_1(output) 2025-08-14T21:48:47.9081564Z 2025-08-14T21:48:47.9081655Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9081837Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9081902Z return mod(**inputs) 2025-08-14T21:48:47.9082137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9082213Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9082452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9082513Z outputs = layer_module( 2025-08-14T21:48:47.9082752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9082942Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9083182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9083263Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9083496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9083563Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9083800Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9083881Z output = self.activation_function(output) 2025-08-14T21:48:47.9084086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9084150Z return self.act(input) 2025-08-14T21:48:47.9084154Z 2025-08-14T21:48:47.9084247Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9084437Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9084495Z return mod(**inputs) 2025-08-14T21:48:47.9084887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9085008Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9085244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9085313Z outputs = layer_module( 2025-08-14T21:48:47.9085569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9085762Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9086035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9086105Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9086343Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9086408Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9086638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9086735Z output = self.layer_2(output) 2025-08-14T21:48:47.9086738Z 2025-08-14T21:48:47.9086832Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9087022Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9087083Z return mod(**inputs) 2025-08-14T21:48:47.9087313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9087395Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9087624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9087686Z outputs = layer_module( 2025-08-14T21:48:47.9087924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9087987Z outputs = self.rel_attn( 2025-08-14T21:48:47.9088223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9088314Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9088318Z 2025-08-14T21:48:47.9088412Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9088603Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9088663Z return mod(**inputs) 2025-08-14T21:48:47.9088902Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9088977Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9089208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9089278Z outputs = layer_module( 2025-08-14T21:48:47.9089506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9089568Z outputs = self.rel_attn( 2025-08-14T21:48:47.9089803Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9089898Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9089902Z 2025-08-14T21:48:47.9090001Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9090181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9090239Z return mod(**inputs) 2025-08-14T21:48:47.9090479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9090569Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9090814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9090875Z outputs = layer_module( 2025-08-14T21:48:47.9091123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9091195Z outputs = self.rel_attn( 2025-08-14T21:48:47.9091445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9091511Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9091767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9091888Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9091891Z 2025-08-14T21:48:47.9091991Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9092190Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9092249Z return mod(**inputs) 2025-08-14T21:48:47.9092490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9092567Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9092801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9092870Z outputs = layer_module( 2025-08-14T21:48:47.9093102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9093170Z outputs = self.rel_attn( 2025-08-14T21:48:47.9093402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9093527Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9093530Z 2025-08-14T21:48:47.9093631Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9093811Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9093889Z return mod(**inputs) 2025-08-14T21:48:47.9094124Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9094199Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9094438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9094499Z outputs = layer_module( 2025-08-14T21:48:47.9094728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9094797Z outputs = self.rel_attn( 2025-08-14T21:48:47.9095029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9095100Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9095347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9095467Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9095472Z 2025-08-14T21:48:47.9095571Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9095753Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9095818Z return mod(**inputs) 2025-08-14T21:48:47.9096051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9096143Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9096385Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9096445Z outputs = layer_module( 2025-08-14T21:48:47.9096719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9096791Z outputs = self.rel_attn( 2025-08-14T21:48:47.9097040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9097140Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9097144Z 2025-08-14T21:48:47.9097236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9097418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9097483Z return mod(**inputs) 2025-08-14T21:48:47.9097719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9097818Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9098049Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9098111Z outputs = layer_module( 2025-08-14T21:48:47.9098350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9098412Z outputs = self.rel_attn( 2025-08-14T21:48:47.9098640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9098713Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9098957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9099078Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9099083Z 2025-08-14T21:48:47.9099176Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9099357Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9099427Z return mod(**inputs) 2025-08-14T21:48:47.9099660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9099743Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9099975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9100033Z outputs = layer_module( 2025-08-14T21:48:47.9100268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9100329Z outputs = self.rel_attn( 2025-08-14T21:48:47.9100561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9100649Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9100897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9101004Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9101009Z 2025-08-14T21:48:47.9101099Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9101281Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9101345Z return mod(**inputs) 2025-08-14T21:48:47.9101576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9101648Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9101908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9101972Z outputs = layer_module( 2025-08-14T21:48:47.9102210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9102288Z outputs = self.rel_attn( 2025-08-14T21:48:47.9102520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9102625Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9102873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9102981Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9102984Z 2025-08-14T21:48:47.9103076Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9103257Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9103340Z return mod(**inputs) 2025-08-14T21:48:47.9103573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9103648Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9103888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9103950Z outputs = layer_module( 2025-08-14T21:48:47.9104186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9104374Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9104616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9104693Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9104981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9105058Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9105292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9105359Z output = self.layer_1(output) 2025-08-14T21:48:47.9105362Z 2025-08-14T21:48:47.9105463Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9105645Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9105704Z return mod(**inputs) 2025-08-14T21:48:47.9105944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9106022Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9106265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9106327Z outputs = layer_module( 2025-08-14T21:48:47.9106560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9106762Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9107006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9107085Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9107319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9107387Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9107645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9107729Z output = self.activation_function(output) 2025-08-14T21:48:47.9107926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9108014Z return self.act(input) 2025-08-14T21:48:47.9108017Z 2025-08-14T21:48:47.9108112Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9108317Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9108377Z return mod(**inputs) 2025-08-14T21:48:47.9108608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9108691Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9108924Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9109008Z outputs = layer_module( 2025-08-14T21:48:47.9109240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9109428Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9109675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9109746Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9109979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9110050Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9110280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9110350Z output = self.layer_2(output) 2025-08-14T21:48:47.9110355Z 2025-08-14T21:48:47.9110446Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9110627Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9110695Z return mod(**inputs) 2025-08-14T21:48:47.9110929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9111013Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9111245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9111332Z outputs = layer_module( 2025-08-14T21:48:47.9111568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9111630Z outputs = self.rel_attn( 2025-08-14T21:48:47.9111862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9111962Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9111965Z 2025-08-14T21:48:47.9112058Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9112249Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9112308Z return mod(**inputs) 2025-08-14T21:48:47.9112542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9112623Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9112855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9112922Z outputs = layer_module( 2025-08-14T21:48:47.9113662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9113732Z outputs = self.rel_attn( 2025-08-14T21:48:47.9113974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9114068Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9114090Z 2025-08-14T21:48:47.9114186Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9114394Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9114455Z return mod(**inputs) 2025-08-14T21:48:47.9114696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9114770Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9115002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9115089Z outputs = layer_module( 2025-08-14T21:48:47.9115323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9115385Z outputs = self.rel_attn( 2025-08-14T21:48:47.9115624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9115691Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9115951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9116072Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9116075Z 2025-08-14T21:48:47.9116169Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9116359Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9116417Z return mod(**inputs) 2025-08-14T21:48:47.9116659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9116732Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9116971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9117038Z outputs = layer_module( 2025-08-14T21:48:47.9117270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9117331Z outputs = self.rel_attn( 2025-08-14T21:48:47.9117569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9117691Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9117694Z 2025-08-14T21:48:47.9117796Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9117979Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9118038Z return mod(**inputs) 2025-08-14T21:48:47.9118278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9118354Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9118595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9118655Z outputs = layer_module( 2025-08-14T21:48:47.9118888Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9118956Z outputs = self.rel_attn( 2025-08-14T21:48:47.9119185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9119267Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9119525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9119643Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9119647Z 2025-08-14T21:48:47.9119763Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9119960Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9120019Z return mod(**inputs) 2025-08-14T21:48:47.9120260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9120333Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9120568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9120631Z outputs = layer_module( 2025-08-14T21:48:47.9120884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9120955Z outputs = self.rel_attn( 2025-08-14T21:48:47.9121186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9121280Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9121285Z 2025-08-14T21:48:47.9121385Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9121567Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9121635Z return mod(**inputs) 2025-08-14T21:48:47.9121867Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9121942Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9122184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9122247Z outputs = layer_module( 2025-08-14T21:48:47.9122484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9122546Z outputs = self.rel_attn( 2025-08-14T21:48:47.9122778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9122852Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9123097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9123209Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9123213Z 2025-08-14T21:48:47.9123314Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9123496Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9123564Z return mod(**inputs) 2025-08-14T21:48:47.9123796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9123873Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9124114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9124176Z outputs = layer_module( 2025-08-14T21:48:47.9124405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9124475Z outputs = self.rel_attn( 2025-08-14T21:48:47.9124703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9124824Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9125078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9125179Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9125183Z 2025-08-14T21:48:47.9125297Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9125480Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9125564Z return mod(**inputs) 2025-08-14T21:48:47.9125796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9125869Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9126107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9126169Z outputs = layer_module( 2025-08-14T21:48:47.9126400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9126487Z outputs = self.rel_attn( 2025-08-14T21:48:47.9126716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9126804Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9127055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9127158Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9127161Z 2025-08-14T21:48:47.9127263Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9127445Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9127512Z return mod(**inputs) 2025-08-14T21:48:47.9127744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9127820Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9128058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9128120Z outputs = layer_module( 2025-08-14T21:48:47.9128349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9128549Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9128787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9128861Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9129091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9129156Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9129391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9129455Z output = self.layer_1(output) 2025-08-14T21:48:47.9129458Z 2025-08-14T21:48:47.9129558Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9129739Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9129797Z return mod(**inputs) 2025-08-14T21:48:47.9130033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9130106Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9130337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9130419Z outputs = layer_module( 2025-08-14T21:48:47.9130650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9130845Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9131100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9131186Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9131429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9131495Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9131734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9131814Z output = self.activation_function(output) 2025-08-14T21:48:47.9132013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9132101Z return self.act(input) 2025-08-14T21:48:47.9132105Z 2025-08-14T21:48:47.9132196Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9132377Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9132442Z return mod(**inputs) 2025-08-14T21:48:47.9132675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9132756Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9132987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9133046Z outputs = layer_module( 2025-08-14T21:48:47.9133287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9133479Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9133726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9133796Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9134027Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9134102Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9134333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9134399Z output = self.layer_2(output) 2025-08-14T21:48:47.9134409Z 2025-08-14T21:48:47.9134502Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9134683Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9134752Z return mod(**inputs) 2025-08-14T21:48:47.9134985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9135059Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9135300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9135363Z outputs = layer_module( 2025-08-14T21:48:47.9135601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9135664Z outputs = self.rel_attn( 2025-08-14T21:48:47.9135896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9136011Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9136015Z 2025-08-14T21:48:47.9136111Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9136294Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9136361Z return mod(**inputs) 2025-08-14T21:48:47.9136608Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9136705Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9136933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9136994Z outputs = layer_module( 2025-08-14T21:48:47.9137229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9137293Z outputs = self.rel_attn( 2025-08-14T21:48:47.9137521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9137640Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9137644Z 2025-08-14T21:48:47.9137736Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9137924Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9137983Z return mod(**inputs) 2025-08-14T21:48:47.9138219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9138303Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9138538Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9138605Z outputs = layer_module( 2025-08-14T21:48:47.9138839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9138903Z outputs = self.rel_attn( 2025-08-14T21:48:47.9139145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9139211Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9139461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9139590Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9139593Z 2025-08-14T21:48:47.9139687Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9139876Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9139935Z return mod(**inputs) 2025-08-14T21:48:47.9140173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9140257Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9140491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9140558Z outputs = layer_module( 2025-08-14T21:48:47.9140792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9140854Z outputs = self.rel_attn( 2025-08-14T21:48:47.9141095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9141218Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9141221Z 2025-08-14T21:48:47.9141315Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9141504Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9141580Z return mod(**inputs) 2025-08-14T21:48:47.9141821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9141894Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9142143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9142212Z outputs = layer_module( 2025-08-14T21:48:47.9142465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9142534Z outputs = self.rel_attn( 2025-08-14T21:48:47.9142762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9142827Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9143078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9143212Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9143215Z 2025-08-14T21:48:47.9143308Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9143498Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9143557Z return mod(**inputs) 2025-08-14T21:48:47.9143799Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9143871Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9144101Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9144170Z outputs = layer_module( 2025-08-14T21:48:47.9144401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9144473Z outputs = self.rel_attn( 2025-08-14T21:48:47.9144700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9144857Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9144863Z 2025-08-14T21:48:47.9144970Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9145151Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9145212Z return mod(**inputs) 2025-08-14T21:48:47.9145452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9145528Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9145770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9145831Z outputs = layer_module( 2025-08-14T21:48:47.9146066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9146137Z outputs = self.rel_attn( 2025-08-14T21:48:47.9146370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9146436Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9146690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9146803Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9146806Z 2025-08-14T21:48:47.9146907Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9147088Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9147148Z return mod(**inputs) 2025-08-14T21:48:47.9147405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9147482Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9153674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9153996Z outputs = layer_module( 2025-08-14T21:48:47.9154303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9154418Z outputs = self.rel_attn( 2025-08-14T21:48:47.9154667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9154759Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9155030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9155175Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9155181Z 2025-08-14T21:48:47.9155294Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9155490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9155589Z return mod(**inputs) 2025-08-14T21:48:47.9155831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9155921Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9156156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9156221Z outputs = layer_module( 2025-08-14T21:48:47.9156476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9156542Z outputs = self.rel_attn( 2025-08-14T21:48:47.9156783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9156863Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9157127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9157233Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9157238Z 2025-08-14T21:48:47.9157334Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9157528Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9157588Z return mod(**inputs) 2025-08-14T21:48:47.9157828Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9157906Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9158138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9158208Z outputs = layer_module( 2025-08-14T21:48:47.9158437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9158635Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9158887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9158961Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9159197Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9159265Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9159491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9159566Z output = self.layer_1(output) 2025-08-14T21:48:47.9159570Z 2025-08-14T21:48:47.9159669Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9159940Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9160004Z return mod(**inputs) 2025-08-14T21:48:47.9160260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9160347Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9160574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9160636Z outputs = layer_module( 2025-08-14T21:48:47.9160871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9161085Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9161335Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9161411Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9161642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9161720Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9161949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9162038Z output = self.activation_function(output) 2025-08-14T21:48:47.9162235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9162300Z return self.act(input) 2025-08-14T21:48:47.9162305Z 2025-08-14T21:48:47.9162410Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9162591Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9162653Z return mod(**inputs) 2025-08-14T21:48:47.9162894Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9162973Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9163208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9163269Z outputs = layer_module( 2025-08-14T21:48:47.9163497Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9163691Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9163930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9164005Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9164237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9164303Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9164542Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9164609Z output = self.layer_2(output) 2025-08-14T21:48:47.9164612Z 2025-08-14T21:48:47.9164712Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9164890Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9164949Z return mod(**inputs) 2025-08-14T21:48:47.9165184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9165261Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9165490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9165605Z outputs = layer_module( 2025-08-14T21:48:47.9165838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9165928Z outputs = self.rel_attn( 2025-08-14T21:48:47.9166159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9166253Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9166256Z 2025-08-14T21:48:47.9166356Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9166536Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9166626Z return mod(**inputs) 2025-08-14T21:48:47.9166868Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9166943Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9167185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9167245Z outputs = layer_module( 2025-08-14T21:48:47.9167476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9167545Z outputs = self.rel_attn( 2025-08-14T21:48:47.9167779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9167881Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9167885Z 2025-08-14T21:48:47.9167980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9168162Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9168229Z return mod(**inputs) 2025-08-14T21:48:47.9168465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9168540Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9168780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9168840Z outputs = layer_module( 2025-08-14T21:48:47.9169079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9169141Z outputs = self.rel_attn( 2025-08-14T21:48:47.9169372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9169448Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9169698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9169835Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9169839Z 2025-08-14T21:48:47.9169931Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9170112Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9170177Z return mod(**inputs) 2025-08-14T21:48:47.9170411Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9170486Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9170726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9170790Z outputs = layer_module( 2025-08-14T21:48:47.9171029Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9171090Z outputs = self.rel_attn( 2025-08-14T21:48:47.9171353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9171503Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9171507Z 2025-08-14T21:48:47.9171599Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9171788Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9171847Z return mod(**inputs) 2025-08-14T21:48:47.9172081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9172180Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9172409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9172469Z outputs = layer_module( 2025-08-14T21:48:47.9172713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9172777Z outputs = self.rel_attn( 2025-08-14T21:48:47.9173019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9173086Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9173338Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9173468Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9173472Z 2025-08-14T21:48:47.9173565Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9173748Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9173815Z return mod(**inputs) 2025-08-14T21:48:47.9174054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9174136Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9174371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9174431Z outputs = layer_module( 2025-08-14T21:48:47.9174672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9174733Z outputs = self.rel_attn( 2025-08-14T21:48:47.9174970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9175065Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9175069Z 2025-08-14T21:48:47.9175162Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9175351Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9175415Z return mod(**inputs) 2025-08-14T21:48:47.9175649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9175732Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9175967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9176033Z outputs = layer_module( 2025-08-14T21:48:47.9176266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9176329Z outputs = self.rel_attn( 2025-08-14T21:48:47.9176569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9176633Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9176921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9177040Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9177059Z 2025-08-14T21:48:47.9177153Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9177340Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9177399Z return mod(**inputs) 2025-08-14T21:48:47.9177634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9177716Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9177962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9178026Z outputs = layer_module( 2025-08-14T21:48:47.9178258Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9178320Z outputs = self.rel_attn( 2025-08-14T21:48:47.9178559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9178640Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9178889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9178999Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9179003Z 2025-08-14T21:48:47.9179094Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9179282Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9179340Z return mod(**inputs) 2025-08-14T21:48:47.9179571Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9179655Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9179886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9179955Z outputs = layer_module( 2025-08-14T21:48:47.9180182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9180245Z outputs = self.rel_attn( 2025-08-14T21:48:47.9180479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9180559Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9180810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9180918Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9180921Z 2025-08-14T21:48:47.9181016Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9181205Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9181265Z return mod(**inputs) 2025-08-14T21:48:47.9181493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9181575Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9181805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9181870Z outputs = layer_module( 2025-08-14T21:48:47.9182099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9182291Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9182566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9182654Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9182889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9182954Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9183182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9183256Z output = self.layer_1(output) 2025-08-14T21:48:47.9183260Z 2025-08-14T21:48:47.9183374Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9183555Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9183622Z return mod(**inputs) 2025-08-14T21:48:47.9183855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9183938Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9184170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9184233Z outputs = layer_module( 2025-08-14T21:48:47.9184470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9184989Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9185351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9185437Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9185675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9185754Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9185989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9186080Z output = self.activation_function(output) 2025-08-14T21:48:47.9186292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9186361Z return self.act(input) 2025-08-14T21:48:47.9186365Z 2025-08-14T21:48:47.9186473Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9186660Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9186725Z return mod(**inputs) 2025-08-14T21:48:47.9186970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9187051Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9187288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9187361Z outputs = layer_module( 2025-08-14T21:48:47.9187594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9187795Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9188039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9188114Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9188360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9188428Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9188755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9188823Z output = self.layer_2(output) 2025-08-14T21:48:47.9188859Z 2025-08-14T21:48:47.9188956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9189148Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9189210Z return mod(**inputs) 2025-08-14T21:48:47.9189442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9189527Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9189759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9189854Z outputs = layer_module( 2025-08-14T21:48:47.9190084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9190150Z outputs = self.rel_attn( 2025-08-14T21:48:47.9190391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9190484Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9190487Z 2025-08-14T21:48:47.9190589Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9190770Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9190829Z return mod(**inputs) 2025-08-14T21:48:47.9191068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9191144Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9191374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9191446Z outputs = layer_module( 2025-08-14T21:48:47.9191678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9191750Z outputs = self.rel_attn( 2025-08-14T21:48:47.9191979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9192071Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9192075Z 2025-08-14T21:48:47.9192175Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9192355Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9192422Z return mod(**inputs) 2025-08-14T21:48:47.9192654Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9192726Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9192964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9193024Z outputs = layer_module( 2025-08-14T21:48:47.9193254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9193323Z outputs = self.rel_attn( 2025-08-14T21:48:47.9193549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9193619Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9193865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9193987Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9193991Z 2025-08-14T21:48:47.9194089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9194305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9194390Z return mod(**inputs) 2025-08-14T21:48:47.9194628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9194703Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9194952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9195012Z outputs = layer_module( 2025-08-14T21:48:47.9195248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9195336Z outputs = self.rel_attn( 2025-08-14T21:48:47.9195565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9195697Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9195701Z 2025-08-14T21:48:47.9195792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9195973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9196039Z return mod(**inputs) 2025-08-14T21:48:47.9196271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9196344Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9196584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9196647Z outputs = layer_module( 2025-08-14T21:48:47.9196885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9196946Z outputs = self.rel_attn( 2025-08-14T21:48:47.9197178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9197254Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9197500Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9197627Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9197630Z 2025-08-14T21:48:47.9197723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9197902Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9197972Z return mod(**inputs) 2025-08-14T21:48:47.9198204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9198277Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9198516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9198579Z outputs = layer_module( 2025-08-14T21:48:47.9198817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9198879Z outputs = self.rel_attn( 2025-08-14T21:48:47.9199109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9199211Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9199214Z 2025-08-14T21:48:47.9199305Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9199493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9199551Z return mod(**inputs) 2025-08-14T21:48:47.9199819Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9199903Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9200152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9200212Z outputs = layer_module( 2025-08-14T21:48:47.9200450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9200513Z outputs = self.rel_attn( 2025-08-14T21:48:47.9200752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9200834Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9201081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9201204Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9201210Z 2025-08-14T21:48:47.9201302Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9201490Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9201550Z return mod(**inputs) 2025-08-14T21:48:47.9201783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9201865Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9202095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9202155Z outputs = layer_module( 2025-08-14T21:48:47.9202391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9202452Z outputs = self.rel_attn( 2025-08-14T21:48:47.9202688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9202768Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9203016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9203126Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9203130Z 2025-08-14T21:48:47.9203220Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9203406Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9203465Z return mod(**inputs) 2025-08-14T21:48:47.9203698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9203776Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9204011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9204070Z outputs = layer_module( 2025-08-14T21:48:47.9204312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9204374Z outputs = self.rel_attn( 2025-08-14T21:48:47.9204609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9204687Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9204932Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9205042Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9205045Z 2025-08-14T21:48:47.9205137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9205345Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9205412Z return mod(**inputs) 2025-08-14T21:48:47.9205645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9205744Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9205974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9206035Z outputs = layer_module( 2025-08-14T21:48:47.9206271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9206476Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9206722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9206792Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9207024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9207098Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9207326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9207390Z output = self.layer_1(output) 2025-08-14T21:48:47.9207401Z 2025-08-14T21:48:47.9207494Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9207674Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9207741Z return mod(**inputs) 2025-08-14T21:48:47.9207973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9208049Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9208288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9208352Z outputs = layer_module( 2025-08-14T21:48:47.9208589Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9208779Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9209017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9209095Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9209330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9209394Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9209633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9209712Z output = self.activation_function(output) 2025-08-14T21:48:47.9209917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9209980Z return self.act(input) 2025-08-14T21:48:47.9209983Z 2025-08-14T21:48:47.9210076Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9210263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9210321Z return mod(**inputs) 2025-08-14T21:48:47.9210560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9210636Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9210865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9210952Z outputs = layer_module( 2025-08-14T21:48:47.9211208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9211414Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9211663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9211733Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9211972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9212054Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9212286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9212360Z output = self.layer_2(output) 2025-08-14T21:48:47.9212363Z 2025-08-14T21:48:47.9212458Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9212650Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9212712Z return mod(**inputs) 2025-08-14T21:48:47.9212944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9213038Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9213269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9213329Z outputs = layer_module( 2025-08-14T21:48:47.9213569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9213632Z outputs = self.rel_attn( 2025-08-14T21:48:47.9213874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9213965Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9213969Z 2025-08-14T21:48:47.9214061Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9214248Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9214307Z return mod(**inputs) 2025-08-14T21:48:47.9214548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9214623Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9214854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9214924Z outputs = layer_module( 2025-08-14T21:48:47.9215154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9215220Z outputs = self.rel_attn( 2025-08-14T21:48:47.9215459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9215553Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9215556Z 2025-08-14T21:48:47.9215657Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9215835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9215894Z return mod(**inputs) 2025-08-14T21:48:47.9216135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9216212Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9216449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9216509Z outputs = layer_module( 2025-08-14T21:48:47.9216768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9216850Z outputs = self.rel_attn( 2025-08-14T21:48:47.9217083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9217151Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9217405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9217525Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9217544Z 2025-08-14T21:48:47.9217646Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9217828Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9217888Z return mod(**inputs) 2025-08-14T21:48:47.9218130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9218208Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9218444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9218511Z outputs = layer_module( 2025-08-14T21:48:47.9218744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9218811Z outputs = self.rel_attn( 2025-08-14T21:48:47.9219041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9219163Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9219166Z 2025-08-14T21:48:47.9219266Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9219451Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9219519Z return mod(**inputs) 2025-08-14T21:48:47.9219751Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9219826Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9220066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9220125Z outputs = layer_module( 2025-08-14T21:48:47.9220356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9220426Z outputs = self.rel_attn( 2025-08-14T21:48:47.9220656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9220729Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9220980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9221099Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9221102Z 2025-08-14T21:48:47.9221202Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9221383Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9221448Z return mod(**inputs) 2025-08-14T21:48:47.9221681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9221757Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9221995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9222053Z outputs = layer_module( 2025-08-14T21:48:47.9222316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9222401Z outputs = self.rel_attn( 2025-08-14T21:48:47.9222634Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9222732Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9222735Z 2025-08-14T21:48:47.9222826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9223009Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9223077Z return mod(**inputs) 2025-08-14T21:48:47.9223327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9223409Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9223648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9223709Z outputs = layer_module( 2025-08-14T21:48:47.9223951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9224015Z outputs = self.rel_attn( 2025-08-14T21:48:47.9224248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9224319Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9224567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9224690Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9224693Z 2025-08-14T21:48:47.9224858Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9225048Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9225116Z return mod(**inputs) 2025-08-14T21:48:47.9225352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9225436Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9225665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9225727Z outputs = layer_module( 2025-08-14T21:48:47.9225966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9226032Z outputs = self.rel_attn( 2025-08-14T21:48:47.9226262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9226352Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9226603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9226719Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9226723Z 2025-08-14T21:48:47.9226815Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9226997Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9227068Z return mod(**inputs) 2025-08-14T21:48:47.9227300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9227375Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9227614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9227676Z outputs = layer_module( 2025-08-14T21:48:47.9227948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9228015Z outputs = self.rel_attn( 2025-08-14T21:48:47.9228265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9228354Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9228603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9228711Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9228714Z 2025-08-14T21:48:47.9228808Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9229006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9229073Z return mod(**inputs) 2025-08-14T21:48:47.9229306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9229380Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9229620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9229680Z outputs = layer_module( 2025-08-14T21:48:47.9229913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9230101Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9230339Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9230418Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9230649Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9230724Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9230955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9231023Z output = self.layer_1(output) 2025-08-14T21:48:47.9231026Z 2025-08-14T21:48:47.9231126Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9231306Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9231366Z return mod(**inputs) 2025-08-14T21:48:47.9231602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9231679Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9231917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9231978Z outputs = layer_module( 2025-08-14T21:48:47.9232208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9232406Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9232643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9232719Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9232951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9233017Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9233255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9233336Z output = self.activation_function(output) 2025-08-14T21:48:47.9233573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9233644Z return self.act(input) 2025-08-14T21:48:47.9233662Z 2025-08-14T21:48:47.9233756Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9233941Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9234000Z return mod(**inputs) 2025-08-14T21:48:47.9234230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9234311Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9234539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9234622Z outputs = layer_module( 2025-08-14T21:48:47.9234861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9235052Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9235306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9235374Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9235611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9235682Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9235919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9235993Z output = self.layer_2(output) 2025-08-14T21:48:47.9235997Z 2025-08-14T21:48:47.9236090Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9236276Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9236345Z return mod(**inputs) 2025-08-14T21:48:47.9236586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9236668Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9236907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9236967Z outputs = layer_module( 2025-08-14T21:48:47.9237213Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9237275Z outputs = self.rel_attn( 2025-08-14T21:48:47.9237515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9237610Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9237613Z 2025-08-14T21:48:47.9237710Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9237902Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9237963Z return mod(**inputs) 2025-08-14T21:48:47.9238202Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9238284Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9238523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9238592Z outputs = layer_module( 2025-08-14T21:48:47.9238831Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9238895Z outputs = self.rel_attn( 2025-08-14T21:48:47.9239157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9239267Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9239286Z 2025-08-14T21:48:47.9239380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9239572Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9239630Z return mod(**inputs) 2025-08-14T21:48:47.9239870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9239943Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9240179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9240268Z outputs = layer_module( 2025-08-14T21:48:47.9240502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9240567Z outputs = self.rel_attn( 2025-08-14T21:48:47.9240806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9240875Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9241131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9241253Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9241256Z 2025-08-14T21:48:47.9241350Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9241537Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9241599Z return mod(**inputs) 2025-08-14T21:48:47.9241838Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9241911Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9242143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9242214Z outputs = layer_module( 2025-08-14T21:48:47.9242443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9242505Z outputs = self.rel_attn( 2025-08-14T21:48:47.9242743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9242864Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9242869Z 2025-08-14T21:48:47.9242969Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9243150Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9243208Z return mod(**inputs) 2025-08-14T21:48:47.9243450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9243527Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9243766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9243828Z outputs = layer_module( 2025-08-14T21:48:47.9244058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9244129Z outputs = self.rel_attn( 2025-08-14T21:48:47.9244359Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9244427Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9244682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9244834Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9244838Z 2025-08-14T21:48:47.9244956Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9245140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9245200Z return mod(**inputs) 2025-08-14T21:48:47.9245441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9245515Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9245753Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9245832Z outputs = layer_module( 2025-08-14T21:48:47.9246069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9246138Z outputs = self.rel_attn( 2025-08-14T21:48:47.9246381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9246474Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9246477Z 2025-08-14T21:48:47.9246578Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9246763Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9246828Z return mod(**inputs) 2025-08-14T21:48:47.9247069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9247147Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9247394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9247453Z outputs = layer_module( 2025-08-14T21:48:47.9247700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9247763Z outputs = self.rel_attn( 2025-08-14T21:48:47.9248002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9248074Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9248329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9248443Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9248446Z 2025-08-14T21:48:47.9248547Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9248733Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9248797Z return mod(**inputs) 2025-08-14T21:48:47.9249041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9249116Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9249363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9249423Z outputs = layer_module( 2025-08-14T21:48:47.9249660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9249729Z outputs = self.rel_attn( 2025-08-14T21:48:47.9249963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9250050Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9250306Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9250454Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9250458Z 2025-08-14T21:48:47.9250559Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9250760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9250826Z return mod(**inputs) 2025-08-14T21:48:47.9251057Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9251129Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9251365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9251452Z outputs = layer_module( 2025-08-14T21:48:47.9251682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9251749Z outputs = self.rel_attn( 2025-08-14T21:48:47.9251980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9252068Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9252318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9252419Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9252422Z 2025-08-14T21:48:47.9252523Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9252703Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9252771Z return mod(**inputs) 2025-08-14T21:48:47.9253002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9253076Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9253316Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9253378Z outputs = layer_module( 2025-08-14T21:48:47.9253607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9253801Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9254041Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9254119Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9254352Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9254418Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9254660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9254727Z output = self.layer_1(output) 2025-08-14T21:48:47.9254732Z 2025-08-14T21:48:47.9254832Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9255015Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9255076Z return mod(**inputs) 2025-08-14T21:48:47.9255313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9255386Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9255614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9255684Z outputs = layer_module( 2025-08-14T21:48:47.9255914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9256139Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9256395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9256465Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9256703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9256768Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9257003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9257102Z output = self.activation_function(output) 2025-08-14T21:48:47.9257297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9257369Z return self.act(input) 2025-08-14T21:48:47.9257373Z 2025-08-14T21:48:47.9257468Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9257649Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9257715Z return mod(**inputs) 2025-08-14T21:48:47.9257947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9258027Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9258259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9258320Z outputs = layer_module( 2025-08-14T21:48:47.9258559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9258748Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9258996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9259067Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9259296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9259367Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9259595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9259661Z output = self.layer_2(output) 2025-08-14T21:48:47.9259671Z 2025-08-14T21:48:47.9259764Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9259946Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9260011Z return mod(**inputs) 2025-08-14T21:48:47.9260245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9260321Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9260562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9260624Z outputs = layer_module( 2025-08-14T21:48:47.9260863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9260926Z outputs = self.rel_attn( 2025-08-14T21:48:47.9261153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9261249Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9261252Z 2025-08-14T21:48:47.9261345Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9261524Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9261622Z return mod(**inputs) 2025-08-14T21:48:47.9261857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9261954Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9262188Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9262251Z outputs = layer_module( 2025-08-14T21:48:47.9262490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9262553Z outputs = self.rel_attn( 2025-08-14T21:48:47.9262797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9262895Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9262899Z 2025-08-14T21:48:47.9262994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9263183Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9263245Z return mod(**inputs) 2025-08-14T21:48:47.9263474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9263554Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9263784Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9263850Z outputs = layer_module( 2025-08-14T21:48:47.9264077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9264142Z outputs = self.rel_attn( 2025-08-14T21:48:47.9264377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9264447Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9264693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9264894Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9264899Z 2025-08-14T21:48:47.9264994Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9265183Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9265244Z return mod(**inputs) 2025-08-14T21:48:47.9265478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9265564Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9265798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9265871Z outputs = layer_module( 2025-08-14T21:48:47.9266103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9266169Z outputs = self.rel_attn( 2025-08-14T21:48:47.9266409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9266532Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9266535Z 2025-08-14T21:48:47.9266628Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9266817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9266878Z return mod(**inputs) 2025-08-14T21:48:47.9267118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9267193Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9267458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9267548Z outputs = layer_module( 2025-08-14T21:48:47.9267779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9267849Z outputs = self.rel_attn( 2025-08-14T21:48:47.9268075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9268142Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9268395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9268532Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9268535Z 2025-08-14T21:48:47.9268630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9268826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9268888Z return mod(**inputs) 2025-08-14T21:48:47.9269129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9269202Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9269435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9269505Z outputs = layer_module( 2025-08-14T21:48:47.9269738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9269807Z outputs = self.rel_attn( 2025-08-14T21:48:47.9270039Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9270132Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9270136Z 2025-08-14T21:48:47.9270236Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9270418Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9270477Z return mod(**inputs) 2025-08-14T21:48:47.9270721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9270794Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9271032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9271095Z outputs = layer_module( 2025-08-14T21:48:47.9271327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9271398Z outputs = self.rel_attn( 2025-08-14T21:48:47.9271631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9271699Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9271951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9272063Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9272067Z 2025-08-14T21:48:47.9272168Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9272347Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9272408Z return mod(**inputs) 2025-08-14T21:48:47.9272650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9272722Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9272993Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9273078Z outputs = layer_module( 2025-08-14T21:48:47.9273309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9273380Z outputs = self.rel_attn( 2025-08-14T21:48:47.9273613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9273693Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9273950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9274079Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9274083Z 2025-08-14T21:48:47.9274182Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9274362Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9274421Z return mod(**inputs) 2025-08-14T21:48:47.9274659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9274734Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9274969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9275028Z outputs = layer_module( 2025-08-14T21:48:47.9275259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9275331Z outputs = self.rel_attn( 2025-08-14T21:48:47.9275560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9275639Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9275898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9276000Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9276003Z 2025-08-14T21:48:47.9276103Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9276283Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9276341Z return mod(**inputs) 2025-08-14T21:48:47.9276579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9276655Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9276890Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9276949Z outputs = layer_module( 2025-08-14T21:48:47.9277178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9277377Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9277618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9277688Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9277926Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9277991Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9278229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9278295Z output = self.layer_1(output) 2025-08-14T21:48:47.9278298Z 2025-08-14T21:48:47.9278407Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9278613Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9278690Z return mod(**inputs) 2025-08-14T21:48:47.9278934Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9279009Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9279245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9279313Z outputs = layer_module( 2025-08-14T21:48:47.9279549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9279756Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9280006Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9280078Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9280318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9280384Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9280613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9280702Z output = self.activation_function(output) 2025-08-14T21:48:47.9280900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9280972Z return self.act(input) 2025-08-14T21:48:47.9280976Z 2025-08-14T21:48:47.9281067Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9281247Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9281316Z return mod(**inputs) 2025-08-14T21:48:47.9281550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9281626Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9281866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9281926Z outputs = layer_module( 2025-08-14T21:48:47.9282165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9282351Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9282591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9282669Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9282905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9282979Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9283210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9283275Z output = self.layer_2(output) 2025-08-14T21:48:47.9283278Z 2025-08-14T21:48:47.9283378Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9283559Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9283617Z return mod(**inputs) 2025-08-14T21:48:47.9283857Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9283932Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9284203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9284266Z outputs = layer_module( 2025-08-14T21:48:47.9284515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9284703Z outputs = self.rel_attn( 2025-08-14T21:48:47.9284947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9285045Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9285049Z 2025-08-14T21:48:47.9285143Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9285362Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9285432Z return mod(**inputs) 2025-08-14T21:48:47.9285666Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9285743Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9285983Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9286043Z outputs = layer_module( 2025-08-14T21:48:47.9286279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9286343Z outputs = self.rel_attn( 2025-08-14T21:48:47.9286572Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9286672Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9286677Z 2025-08-14T21:48:47.9286768Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9286954Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9287017Z return mod(**inputs) 2025-08-14T21:48:47.9287248Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9287332Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9287566Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9287626Z outputs = layer_module( 2025-08-14T21:48:47.9287863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9287927Z outputs = self.rel_attn( 2025-08-14T21:48:47.9288165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9288233Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9288482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9288612Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9288617Z 2025-08-14T21:48:47.9288711Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9288889Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9288957Z return mod(**inputs) 2025-08-14T21:48:47.9289186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9289267Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9289494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9289555Z outputs = layer_module( 2025-08-14T21:48:47.9289789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9289892Z outputs = self.rel_attn( 2025-08-14T21:48:47.9290128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9290273Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9290277Z 2025-08-14T21:48:47.9290370Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9290558Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9290615Z return mod(**inputs) 2025-08-14T21:48:47.9290848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9290945Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9291178Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9291246Z outputs = layer_module( 2025-08-14T21:48:47.9291476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9291539Z outputs = self.rel_attn( 2025-08-14T21:48:47.9291776Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9291840Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9292094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9292213Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9292218Z 2025-08-14T21:48:47.9292310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9292496Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9292555Z return mod(**inputs) 2025-08-14T21:48:47.9292788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9292873Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9293104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9293173Z outputs = layer_module( 2025-08-14T21:48:47.9293402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9293464Z outputs = self.rel_attn( 2025-08-14T21:48:47.9293703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9293797Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9293800Z 2025-08-14T21:48:47.9293900Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9294083Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9294142Z return mod(**inputs) 2025-08-14T21:48:47.9294381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9294457Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9294687Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9294756Z outputs = layer_module( 2025-08-14T21:48:47.9294986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9295054Z outputs = self.rel_attn( 2025-08-14T21:48:47.9295283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9295365Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9295646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9295776Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9295780Z 2025-08-14T21:48:47.9295881Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9296062Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9296121Z return mod(**inputs) 2025-08-14T21:48:47.9296362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9296453Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9296686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9296755Z outputs = layer_module( 2025-08-14T21:48:47.9296986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9297058Z outputs = self.rel_attn( 2025-08-14T21:48:47.9297289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9297369Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9297626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9297727Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9297731Z 2025-08-14T21:48:47.9297824Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9298009Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9298069Z return mod(**inputs) 2025-08-14T21:48:47.9298310Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9298385Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9298614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9298682Z outputs = layer_module( 2025-08-14T21:48:47.9298910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9298981Z outputs = self.rel_attn( 2025-08-14T21:48:47.9299211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9299294Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9299552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9299654Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9299658Z 2025-08-14T21:48:47.9299751Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9299937Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9299997Z return mod(**inputs) 2025-08-14T21:48:47.9300235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9300309Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9300540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9300610Z outputs = layer_module( 2025-08-14T21:48:47.9300839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9301064Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9301307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9301396Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9301638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9301704Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9301933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9302024Z output = self.layer_1(output) 2025-08-14T21:48:47.9302028Z 2025-08-14T21:48:47.9302119Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9302305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9302363Z return mod(**inputs) 2025-08-14T21:48:47.9302599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9302681Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9302910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9302975Z outputs = layer_module( 2025-08-14T21:48:47.9303206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9303394Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9303640Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9303708Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9303940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9304013Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9304243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9304329Z output = self.activation_function(output) 2025-08-14T21:48:47.9304523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9304585Z return self.act(input) 2025-08-14T21:48:47.9304588Z 2025-08-14T21:48:47.9304687Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9304919Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9304989Z return mod(**inputs) 2025-08-14T21:48:47.9305222Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9305298Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9305535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9305594Z outputs = layer_module( 2025-08-14T21:48:47.9305820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9306015Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9306250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9306328Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9306558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9306655Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9306896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9306981Z output = self.layer_2(output) 2025-08-14T21:48:47.9306984Z 2025-08-14T21:48:47.9307086Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9307268Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9307329Z return mod(**inputs) 2025-08-14T21:48:47.9307570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9307665Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9307895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9307963Z outputs = layer_module( 2025-08-14T21:48:47.9308194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9308267Z outputs = self.rel_attn( 2025-08-14T21:48:47.9308496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9308587Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9308590Z 2025-08-14T21:48:47.9308690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9308870Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9308937Z return mod(**inputs) 2025-08-14T21:48:47.9309168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9309242Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9309480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9309542Z outputs = layer_module( 2025-08-14T21:48:47.9309773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9309845Z outputs = self.rel_attn( 2025-08-14T21:48:47.9310076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9310177Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9310180Z 2025-08-14T21:48:47.9310273Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9310458Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9310528Z return mod(**inputs) 2025-08-14T21:48:47.9310759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9310837Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9311073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9311136Z outputs = layer_module( 2025-08-14T21:48:47.9311372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9311434Z outputs = self.rel_attn( 2025-08-14T21:48:47.9311663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9311738Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9311987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9312117Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9312120Z 2025-08-14T21:48:47.9312240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9312421Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9312506Z return mod(**inputs) 2025-08-14T21:48:47.9312737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9312810Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9313047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9313108Z outputs = layer_module( 2025-08-14T21:48:47.9313364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9313426Z outputs = self.rel_attn( 2025-08-14T21:48:47.9313657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9313789Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9313794Z 2025-08-14T21:48:47.9314131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9314319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9314377Z return mod(**inputs) 2025-08-14T21:48:47.9314614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9314696Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9314928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9314989Z outputs = layer_module( 2025-08-14T21:48:47.9315227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9315292Z outputs = self.rel_attn( 2025-08-14T21:48:47.9315531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9315597Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9315846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9315972Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9315975Z 2025-08-14T21:48:47.9316068Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9316254Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9316316Z return mod(**inputs) 2025-08-14T21:48:47.9316549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9316631Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9316863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9316926Z outputs = layer_module( 2025-08-14T21:48:47.9317160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9317221Z outputs = self.rel_attn( 2025-08-14T21:48:47.9317457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9317546Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9317551Z 2025-08-14T21:48:47.9317643Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9317832Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9317890Z return mod(**inputs) 2025-08-14T21:48:47.9318174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9318263Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9318496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9318563Z outputs = layer_module( 2025-08-14T21:48:47.9318796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9318857Z outputs = self.rel_attn( 2025-08-14T21:48:47.9319097Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9319178Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9319436Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9319550Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9319554Z 2025-08-14T21:48:47.9319647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9319838Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9319896Z return mod(**inputs) 2025-08-14T21:48:47.9320137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9320211Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9320444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9320514Z outputs = layer_module( 2025-08-14T21:48:47.9320744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9320805Z outputs = self.rel_attn( 2025-08-14T21:48:47.9321042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9321124Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9321381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9321483Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9321486Z 2025-08-14T21:48:47.9321576Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9321768Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9321828Z return mod(**inputs) 2025-08-14T21:48:47.9322069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9322142Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9322376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9322446Z outputs = layer_module( 2025-08-14T21:48:47.9322677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9322738Z outputs = self.rel_attn( 2025-08-14T21:48:47.9322976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9323054Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9323308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9323411Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9323414Z 2025-08-14T21:48:47.9323507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9323725Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9323803Z return mod(**inputs) 2025-08-14T21:48:47.9324035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9324118Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9324361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9324429Z outputs = layer_module( 2025-08-14T21:48:47.9324658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9324866Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9325116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9325186Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9325427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9325492Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9325722Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9325795Z output = self.layer_1(output) 2025-08-14T21:48:47.9325798Z 2025-08-14T21:48:47.9325891Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9326077Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9326138Z return mod(**inputs) 2025-08-14T21:48:47.9326369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9326451Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9326681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9326743Z outputs = layer_module( 2025-08-14T21:48:47.9326980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9327168Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9327414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9327485Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9327718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9327791Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9328026Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9328118Z output = self.activation_function(output) 2025-08-14T21:48:47.9328313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9328376Z return self.act(input) 2025-08-14T21:48:47.9328379Z 2025-08-14T21:48:47.9328479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9328659Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9328718Z return mod(**inputs) 2025-08-14T21:48:47.9328959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9329033Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9329300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9329363Z outputs = layer_module( 2025-08-14T21:48:47.9329629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9329825Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9330061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9330137Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9330366Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9330454Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9330697Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9330763Z output = self.layer_2(output) 2025-08-14T21:48:47.9330766Z 2025-08-14T21:48:47.9330862Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9331051Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9331109Z return mod(**inputs) 2025-08-14T21:48:47.9331353Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9331427Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9331662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9331733Z outputs = layer_module( 2025-08-14T21:48:47.9331967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9332029Z outputs = self.rel_attn( 2025-08-14T21:48:47.9332273Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9332363Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9332367Z 2025-08-14T21:48:47.9332466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9332648Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9332707Z return mod(**inputs) 2025-08-14T21:48:47.9332951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9333025Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9333268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9333328Z outputs = layer_module( 2025-08-14T21:48:47.9333565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9333636Z outputs = self.rel_attn( 2025-08-14T21:48:47.9333873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9333966Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9333969Z 2025-08-14T21:48:47.9334067Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9334252Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9334317Z return mod(**inputs) 2025-08-14T21:48:47.9334556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9334631Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9334907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9334971Z outputs = layer_module( 2025-08-14T21:48:47.9335357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9335421Z outputs = self.rel_attn( 2025-08-14T21:48:47.9335652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9335726Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9335975Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9336127Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9336130Z 2025-08-14T21:48:47.9336231Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9336416Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9336487Z return mod(**inputs) 2025-08-14T21:48:47.9336726Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9336803Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9337046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9337108Z outputs = layer_module( 2025-08-14T21:48:47.9337342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9337413Z outputs = self.rel_attn( 2025-08-14T21:48:47.9337651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9337784Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9337787Z 2025-08-14T21:48:47.9337884Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9338066Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9338136Z return mod(**inputs) 2025-08-14T21:48:47.9338374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9338458Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9338696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9338756Z outputs = layer_module( 2025-08-14T21:48:47.9339001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9339062Z outputs = self.rel_attn( 2025-08-14T21:48:47.9339297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9339371Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9339624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9339752Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9339754Z 2025-08-14T21:48:47.9339846Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9340032Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9340100Z return mod(**inputs) 2025-08-14T21:48:47.9340336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9340420Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9340674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9340753Z outputs = layer_module( 2025-08-14T21:48:47.9340989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9341070Z outputs = self.rel_attn( 2025-08-14T21:48:47.9341298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9341396Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9341400Z 2025-08-14T21:48:47.9341493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9341678Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9341754Z return mod(**inputs) 2025-08-14T21:48:47.9341989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9342074Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9342307Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9342376Z outputs = layer_module( 2025-08-14T21:48:47.9342609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9342671Z outputs = self.rel_attn( 2025-08-14T21:48:47.9342910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9342975Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9343223Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9343345Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9343348Z 2025-08-14T21:48:47.9343444Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9343634Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9343694Z return mod(**inputs) 2025-08-14T21:48:47.9355988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9356163Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9356462Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9356536Z outputs = layer_module( 2025-08-14T21:48:47.9356795Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9356880Z outputs = self.rel_attn( 2025-08-14T21:48:47.9357119Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9357226Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9357486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9357610Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9357616Z 2025-08-14T21:48:47.9357723Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9357918Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9357990Z return mod(**inputs) 2025-08-14T21:48:47.9358229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9358322Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9358555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9358713Z outputs = layer_module( 2025-08-14T21:48:47.9358958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9359072Z outputs = self.rel_attn( 2025-08-14T21:48:47.9359308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9359404Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9359659Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9359772Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9359799Z 2025-08-14T21:48:47.9359899Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9360087Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9360154Z return mod(**inputs) 2025-08-14T21:48:47.9360392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9360472Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9360711Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9360774Z outputs = layer_module( 2025-08-14T21:48:47.9361012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9361211Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9361458Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9361543Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9361777Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9361858Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9362090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9362158Z output = self.layer_1(output) 2025-08-14T21:48:47.9362162Z 2025-08-14T21:48:47.9362269Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9362456Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9362517Z return mod(**inputs) 2025-08-14T21:48:47.9362754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9362833Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9363076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9363139Z outputs = layer_module( 2025-08-14T21:48:47.9363370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9363578Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9363820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9363899Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9364132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9364203Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9364438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9364550Z output = self.activation_function(output) 2025-08-14T21:48:47.9364750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9364843Z return self.act(input) 2025-08-14T21:48:47.9364846Z 2025-08-14T21:48:47.9364944Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9365140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9365201Z return mod(**inputs) 2025-08-14T21:48:47.9365444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9365548Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9365781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9365850Z outputs = layer_module( 2025-08-14T21:48:47.9366084Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9366278Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9366527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9366596Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9366830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9366901Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9367135Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9367212Z output = self.layer_2(output) 2025-08-14T21:48:47.9367215Z 2025-08-14T21:48:47.9367310Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9367493Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9367560Z return mod(**inputs) 2025-08-14T21:48:47.9367796Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9367878Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9368111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9368175Z outputs = layer_module( 2025-08-14T21:48:47.9368415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9368480Z outputs = self.rel_attn( 2025-08-14T21:48:47.9368713Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9368819Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9368822Z 2025-08-14T21:48:47.9368918Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9369110Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9369169Z return mod(**inputs) 2025-08-14T21:48:47.9369402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9369487Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9369720Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9369792Z outputs = layer_module( 2025-08-14T21:48:47.9370023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9370087Z outputs = self.rel_attn( 2025-08-14T21:48:47.9370365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9370480Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9370483Z 2025-08-14T21:48:47.9370579Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9370769Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9370830Z return mod(**inputs) 2025-08-14T21:48:47.9371070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9371146Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9371393Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9371463Z outputs = layer_module( 2025-08-14T21:48:47.9371695Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9371758Z outputs = self.rel_attn( 2025-08-14T21:48:47.9371998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9372068Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9372323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9372448Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9372451Z 2025-08-14T21:48:47.9372545Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9372733Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9372794Z return mod(**inputs) 2025-08-14T21:48:47.9373036Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9373112Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9373344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9373414Z outputs = layer_module( 2025-08-14T21:48:47.9373644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9373707Z outputs = self.rel_attn( 2025-08-14T21:48:47.9373941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9374068Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9374072Z 2025-08-14T21:48:47.9374170Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9374349Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9374410Z return mod(**inputs) 2025-08-14T21:48:47.9374648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9374723Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9374962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9375022Z outputs = layer_module( 2025-08-14T21:48:47.9375252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9375323Z outputs = self.rel_attn( 2025-08-14T21:48:47.9375555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9375622Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9375910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9376033Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9376052Z 2025-08-14T21:48:47.9376155Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9376336Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9376395Z return mod(**inputs) 2025-08-14T21:48:47.9376638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9376712Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9376969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9377031Z outputs = layer_module( 2025-08-14T21:48:47.9377262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9377333Z outputs = self.rel_attn( 2025-08-14T21:48:47.9377564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9377659Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9377670Z 2025-08-14T21:48:47.9377762Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9377944Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9378012Z return mod(**inputs) 2025-08-14T21:48:47.9378242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9378317Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9378559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9378622Z outputs = layer_module( 2025-08-14T21:48:47.9378861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9378924Z outputs = self.rel_attn( 2025-08-14T21:48:47.9379153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9379223Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9379477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9379596Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9379601Z 2025-08-14T21:48:47.9379693Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9379880Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9379940Z return mod(**inputs) 2025-08-14T21:48:47.9380183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9380259Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9380489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9380558Z outputs = layer_module( 2025-08-14T21:48:47.9380787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9380848Z outputs = self.rel_attn( 2025-08-14T21:48:47.9381085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9381167Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9381464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9381569Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9381588Z 2025-08-14T21:48:47.9381684Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9381875Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9381934Z return mod(**inputs) 2025-08-14T21:48:47.9382172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9382246Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9382476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9382562Z outputs = layer_module( 2025-08-14T21:48:47.9382793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9382857Z outputs = self.rel_attn( 2025-08-14T21:48:47.9383094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9383175Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9383429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9383532Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9383535Z 2025-08-14T21:48:47.9383627Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9383817Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9383876Z return mod(**inputs) 2025-08-14T21:48:47.9384107Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9384193Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9384424Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9384493Z outputs = layer_module( 2025-08-14T21:48:47.9384922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9385122Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9385376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9385451Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9385693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9385761Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9385998Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9386077Z output = self.layer_1(output) 2025-08-14T21:48:47.9386081Z 2025-08-14T21:48:47.9386177Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9386365Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9386427Z return mod(**inputs) 2025-08-14T21:48:47.9386662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9386746Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9386982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9387043Z outputs = layer_module( 2025-08-14T21:48:47.9387350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9387541Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9387814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9387887Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9388121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9388197Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9388426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9388541Z output = self.activation_function(output) 2025-08-14T21:48:47.9388739Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9388808Z return self.act(input) 2025-08-14T21:48:47.9388811Z 2025-08-14T21:48:47.9388916Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9389101Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9389159Z return mod(**inputs) 2025-08-14T21:48:47.9389397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9389472Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9389709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9389771Z outputs = layer_module( 2025-08-14T21:48:47.9390000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9390197Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9390435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9390514Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9390745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9390810Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9391046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9391110Z output = self.layer_2(output) 2025-08-14T21:48:47.9391115Z 2025-08-14T21:48:47.9391209Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9391396Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9391455Z return mod(**inputs) 2025-08-14T21:48:47.9391696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9391770Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9392003Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9392071Z outputs = layer_module( 2025-08-14T21:48:47.9392299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9392362Z outputs = self.rel_attn( 2025-08-14T21:48:47.9392598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9392690Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9392693Z 2025-08-14T21:48:47.9392792Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9393008Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9393086Z return mod(**inputs) 2025-08-14T21:48:47.9393325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9393400Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9393639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9393699Z outputs = layer_module( 2025-08-14T21:48:47.9393929Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9394018Z outputs = self.rel_attn( 2025-08-14T21:48:47.9394250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9394346Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9394359Z 2025-08-14T21:48:47.9394454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9394641Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9394707Z return mod(**inputs) 2025-08-14T21:48:47.9394941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9395015Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9395255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9395318Z outputs = layer_module( 2025-08-14T21:48:47.9395555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9395618Z outputs = self.rel_attn( 2025-08-14T21:48:47.9395849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9395924Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9396172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9396296Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9396299Z 2025-08-14T21:48:47.9396400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9396581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9396646Z return mod(**inputs) 2025-08-14T21:48:47.9396878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9396952Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9397196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9397255Z outputs = layer_module( 2025-08-14T21:48:47.9397489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9397557Z outputs = self.rel_attn( 2025-08-14T21:48:47.9397788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9397919Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9397922Z 2025-08-14T21:48:47.9398016Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9398199Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9398265Z return mod(**inputs) 2025-08-14T21:48:47.9398525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9398608Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9398853Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9398913Z outputs = layer_module( 2025-08-14T21:48:47.9399148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9399210Z outputs = self.rel_attn( 2025-08-14T21:48:47.9399437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9399527Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9399773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9399900Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9399905Z 2025-08-14T21:48:47.9399998Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9400181Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9400247Z return mod(**inputs) 2025-08-14T21:48:47.9400476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9400555Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9400783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9400843Z outputs = layer_module( 2025-08-14T21:48:47.9401079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9401141Z outputs = self.rel_attn( 2025-08-14T21:48:47.9401374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9401472Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9401476Z 2025-08-14T21:48:47.9401570Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9401758Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9401817Z return mod(**inputs) 2025-08-14T21:48:47.9402050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9402130Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9402361Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9402429Z outputs = layer_module( 2025-08-14T21:48:47.9402658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9402719Z outputs = self.rel_attn( 2025-08-14T21:48:47.9402953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9403018Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9403262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9403391Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9403395Z 2025-08-14T21:48:47.9403487Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9403665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9403733Z return mod(**inputs) 2025-08-14T21:48:47.9403963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9404084Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9404319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9404397Z outputs = layer_module( 2025-08-14T21:48:47.9404635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9404695Z outputs = self.rel_attn( 2025-08-14T21:48:47.9404922Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9405010Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9405279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9405390Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9405393Z 2025-08-14T21:48:47.9405488Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9405671Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9405739Z return mod(**inputs) 2025-08-14T21:48:47.9405974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9406054Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9406285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9406345Z outputs = layer_module( 2025-08-14T21:48:47.9406583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9406645Z outputs = self.rel_attn( 2025-08-14T21:48:47.9406875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9406962Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9407214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9407319Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9407323Z 2025-08-14T21:48:47.9407415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9407597Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9407663Z return mod(**inputs) 2025-08-14T21:48:47.9407892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9407974Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9408205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9408269Z outputs = layer_module( 2025-08-14T21:48:47.9408505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9408696Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9408933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9409010Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9409242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9409315Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9409545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9409610Z output = self.layer_1(output) 2025-08-14T21:48:47.9409644Z 2025-08-14T21:48:47.9409748Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9409947Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9410013Z return mod(**inputs) 2025-08-14T21:48:47.9410243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9410316Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9410553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9410630Z outputs = layer_module( 2025-08-14T21:48:47.9410859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9411057Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9411303Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9411379Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9411614Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9411679Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9411918Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9411998Z output = self.activation_function(output) 2025-08-14T21:48:47.9412200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9412263Z return self.act(input) 2025-08-14T21:48:47.9412266Z 2025-08-14T21:48:47.9412360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9412551Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9412610Z return mod(**inputs) 2025-08-14T21:48:47.9412842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9412923Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9413155Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9413220Z outputs = layer_module( 2025-08-14T21:48:47.9413451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9413642Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9413889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9413960Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9414205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9414270Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9414502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9414576Z output = self.layer_2(output) 2025-08-14T21:48:47.9414580Z 2025-08-14T21:48:47.9414675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9414858Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9414927Z return mod(**inputs) 2025-08-14T21:48:47.9415159Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9415272Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9415504Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9415583Z outputs = layer_module( 2025-08-14T21:48:47.9415823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9415888Z outputs = self.rel_attn( 2025-08-14T21:48:47.9416118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9416217Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9416234Z 2025-08-14T21:48:47.9416330Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9416521Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9416580Z return mod(**inputs) 2025-08-14T21:48:47.9416814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9416898Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9417130Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9417199Z outputs = layer_module( 2025-08-14T21:48:47.9417430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9417492Z outputs = self.rel_attn( 2025-08-14T21:48:47.9417729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9417822Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9417825Z 2025-08-14T21:48:47.9417915Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9418108Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9418166Z return mod(**inputs) 2025-08-14T21:48:47.9418408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9418482Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9418715Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9418781Z outputs = layer_module( 2025-08-14T21:48:47.9419010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9419079Z outputs = self.rel_attn( 2025-08-14T21:48:47.9419309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9419374Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9419629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9419751Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9419754Z 2025-08-14T21:48:47.9419847Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9420033Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9420092Z return mod(**inputs) 2025-08-14T21:48:47.9420330Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9420404Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9420635Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9420703Z outputs = layer_module( 2025-08-14T21:48:47.9420960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9421045Z outputs = self.rel_attn( 2025-08-14T21:48:47.9421280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9421400Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9421404Z 2025-08-14T21:48:47.9421505Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9421689Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9421747Z return mod(**inputs) 2025-08-14T21:48:47.9422011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9422085Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9422323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9422383Z outputs = layer_module( 2025-08-14T21:48:47.9422613Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9422684Z outputs = self.rel_attn( 2025-08-14T21:48:47.9422912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9422977Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9423229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9423347Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9423350Z 2025-08-14T21:48:47.9423448Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9423630Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9423688Z return mod(**inputs) 2025-08-14T21:48:47.9423928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9424002Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9424237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9424295Z outputs = layer_module( 2025-08-14T21:48:47.9424523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9424593Z outputs = self.rel_attn( 2025-08-14T21:48:47.9424897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9424993Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9425004Z 2025-08-14T21:48:47.9425101Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9425284Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9425350Z return mod(**inputs) 2025-08-14T21:48:47.9425584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9425659Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9425899Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9425959Z outputs = layer_module( 2025-08-14T21:48:47.9426199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9426262Z outputs = self.rel_attn( 2025-08-14T21:48:47.9426525Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9426600Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9426863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9426977Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9426989Z 2025-08-14T21:48:47.9427082Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9427262Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9427329Z return mod(**inputs) 2025-08-14T21:48:47.9427564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9427662Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9427903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9427965Z outputs = layer_module( 2025-08-14T21:48:47.9428203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9428264Z outputs = self.rel_attn( 2025-08-14T21:48:47.9428491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9428578Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9428825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9428929Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9428941Z 2025-08-14T21:48:47.9429034Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9429215Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9429283Z return mod(**inputs) 2025-08-14T21:48:47.9429515Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9429589Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9429826Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9429885Z outputs = layer_module( 2025-08-14T21:48:47.9430122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9430183Z outputs = self.rel_attn( 2025-08-14T21:48:47.9430415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9430502Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9430756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9430857Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9430862Z 2025-08-14T21:48:47.9430962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9431144Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9431209Z return mod(**inputs) 2025-08-14T21:48:47.9431440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9431514Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9431752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9431811Z outputs = layer_module( 2025-08-14T21:48:47.9432078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9432269Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9432526Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9432602Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9432833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9432898Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9433133Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9433214Z output = self.layer_1(output) 2025-08-14T21:48:47.9433218Z 2025-08-14T21:48:47.9433319Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9433502Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9433559Z return mod(**inputs) 2025-08-14T21:48:47.9433798Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9433873Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9434114Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9434173Z outputs = layer_module( 2025-08-14T21:48:47.9434403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9434600Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9434839Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9434912Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9435147Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9435215Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9435451Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9435533Z output = self.activation_function(output) 2025-08-14T21:48:47.9435729Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9435797Z return self.act(input) 2025-08-14T21:48:47.9435801Z 2025-08-14T21:48:47.9435897Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9436083Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9436141Z return mod(**inputs) 2025-08-14T21:48:47.9436377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9436458Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9436688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9436748Z outputs = layer_module( 2025-08-14T21:48:47.9436985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9437173Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9437417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9437485Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9437745Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9437819Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9438066Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9438138Z output = self.layer_2(output) 2025-08-14T21:48:47.9438141Z 2025-08-14T21:48:47.9438233Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9438413Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9438478Z return mod(**inputs) 2025-08-14T21:48:47.9438708Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9438798Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9439037Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9439097Z outputs = layer_module( 2025-08-14T21:48:47.9439340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9439403Z outputs = self.rel_attn( 2025-08-14T21:48:47.9439637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9439733Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9439736Z 2025-08-14T21:48:47.9439829Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9440019Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9440077Z return mod(**inputs) 2025-08-14T21:48:47.9440315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9440396Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9440637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9440699Z outputs = layer_module( 2025-08-14T21:48:47.9440942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9441003Z outputs = self.rel_attn( 2025-08-14T21:48:47.9441245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9441337Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9441340Z 2025-08-14T21:48:47.9441435Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9441626Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9441684Z return mod(**inputs) 2025-08-14T21:48:47.9441923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9442006Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9442242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9442308Z outputs = layer_module( 2025-08-14T21:48:47.9442544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9442606Z outputs = self.rel_attn( 2025-08-14T21:48:47.9442848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9442914Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9443172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9443332Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9443337Z 2025-08-14T21:48:47.9443453Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9443642Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9443702Z return mod(**inputs) 2025-08-14T21:48:47.9443940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9444022Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9444260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9444344Z outputs = layer_module( 2025-08-14T21:48:47.9444579Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9444642Z outputs = self.rel_attn( 2025-08-14T21:48:47.9444885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9445011Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9445015Z 2025-08-14T21:48:47.9445117Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9445300Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9445360Z return mod(**inputs) 2025-08-14T21:48:47.9445606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9445682Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9445917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9445983Z outputs = layer_module( 2025-08-14T21:48:47.9446219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9446289Z outputs = self.rel_attn( 2025-08-14T21:48:47.9446521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9446586Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9446844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9446962Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9446965Z 2025-08-14T21:48:47.9447068Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9447252Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9447310Z return mod(**inputs) 2025-08-14T21:48:47.9447558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9447633Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9447871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9447940Z outputs = layer_module( 2025-08-14T21:48:47.9448173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9448243Z outputs = self.rel_attn( 2025-08-14T21:48:47.9448477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9448570Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9448574Z 2025-08-14T21:48:47.9448675Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9448886Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9448949Z return mod(**inputs) 2025-08-14T21:48:47.9449191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9449284Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9449523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9449584Z outputs = layer_module( 2025-08-14T21:48:47.9449812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9449896Z outputs = self.rel_attn( 2025-08-14T21:48:47.9450127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9450198Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9450449Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9450564Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9450568Z 2025-08-14T21:48:47.9450665Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9450847Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9450906Z return mod(**inputs) 2025-08-14T21:48:47.9451145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9451219Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9451457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9451516Z outputs = layer_module( 2025-08-14T21:48:47.9451746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9451815Z outputs = self.rel_attn( 2025-08-14T21:48:47.9452046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9452131Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9452380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9452481Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9452484Z 2025-08-14T21:48:47.9452581Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9452761Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9452819Z return mod(**inputs) 2025-08-14T21:48:47.9453058Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9453132Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9453371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9453431Z outputs = layer_module( 2025-08-14T21:48:47.9453661Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9453729Z outputs = self.rel_attn( 2025-08-14T21:48:47.9453959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9454042Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9454299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9454400Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9454403Z 2025-08-14T21:48:47.9454528Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9454723Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9454780Z return mod(**inputs) 2025-08-14T21:48:47.9455017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9455090Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9455325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9455385Z outputs = layer_module( 2025-08-14T21:48:47.9455664Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9455863Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9456106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9456184Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9456416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9456481Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9456718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9456782Z output = self.layer_1(output) 2025-08-14T21:48:47.9456785Z 2025-08-14T21:48:47.9456880Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9457068Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9457127Z return mod(**inputs) 2025-08-14T21:48:47.9457371Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9457444Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9457677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9457747Z outputs = layer_module( 2025-08-14T21:48:47.9457976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9458170Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9458410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9458480Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9458723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9458790Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9459020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9459114Z output = self.activation_function(output) 2025-08-14T21:48:47.9459309Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9459380Z return self.act(input) 2025-08-14T21:48:47.9459383Z 2025-08-14T21:48:47.9459476Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9459655Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9459723Z return mod(**inputs) 2025-08-14T21:48:47.9459952Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9460025Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9460288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9460362Z outputs = layer_module( 2025-08-14T21:48:47.9460599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9460787Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9461025Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9461118Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9461350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9461421Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9461650Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9461716Z output = self.layer_2(output) 2025-08-14T21:48:47.9461719Z 2025-08-14T21:48:47.9461821Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9462002Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9462069Z return mod(**inputs) 2025-08-14T21:48:47.9462298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9462372Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9462609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9462670Z outputs = layer_module( 2025-08-14T21:48:47.9462901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9462975Z outputs = self.rel_attn( 2025-08-14T21:48:47.9463206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9463303Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9463307Z 2025-08-14T21:48:47.9463398Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9463579Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9463647Z return mod(**inputs) 2025-08-14T21:48:47.9463878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9463952Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9464190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9464253Z outputs = layer_module( 2025-08-14T21:48:47.9464488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9464552Z outputs = self.rel_attn( 2025-08-14T21:48:47.9464844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9464956Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9464960Z 2025-08-14T21:48:47.9465053Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9465244Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9465308Z return mod(**inputs) 2025-08-14T21:48:47.9465544Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9465628Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9465900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9465977Z outputs = layer_module( 2025-08-14T21:48:47.9466220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9466283Z outputs = self.rel_attn( 2025-08-14T21:48:47.9466521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9466589Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9466836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9466984Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9466988Z 2025-08-14T21:48:47.9467080Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9467271Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9467336Z return mod(**inputs) 2025-08-14T21:48:47.9467568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9467649Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9467880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9467940Z outputs = layer_module( 2025-08-14T21:48:47.9468177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9468240Z outputs = self.rel_attn( 2025-08-14T21:48:47.9468476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9468600Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9468603Z 2025-08-14T21:48:47.9468696Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9468887Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9468945Z return mod(**inputs) 2025-08-14T21:48:47.9469186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9469259Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9469488Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9469556Z outputs = layer_module( 2025-08-14T21:48:47.9469787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9469848Z outputs = self.rel_attn( 2025-08-14T21:48:47.9470086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9470155Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9470414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9470534Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9470537Z 2025-08-14T21:48:47.9470630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9470818Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9470879Z return mod(**inputs) 2025-08-14T21:48:47.9471112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9471191Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9471446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9471528Z outputs = layer_module( 2025-08-14T21:48:47.9471762Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9471825Z outputs = self.rel_attn( 2025-08-14T21:48:47.9472069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9472162Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9472166Z 2025-08-14T21:48:47.9472267Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9472465Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9472523Z return mod(**inputs) 2025-08-14T21:48:47.9472763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9472840Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9473072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9473142Z outputs = layer_module( 2025-08-14T21:48:47.9473369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9473435Z outputs = self.rel_attn( 2025-08-14T21:48:47.9473667Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9473733Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9473985Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9474098Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9474104Z 2025-08-14T21:48:47.9474203Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9474387Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9474445Z return mod(**inputs) 2025-08-14T21:48:47.9474684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9474758Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9474989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9475057Z outputs = layer_module( 2025-08-14T21:48:47.9475286Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9475353Z outputs = self.rel_attn( 2025-08-14T21:48:47.9475585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9475666Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9475925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9476026Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9476029Z 2025-08-14T21:48:47.9476128Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9476308Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9476365Z return mod(**inputs) 2025-08-14T21:48:47.9476603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9476676Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9476947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9477019Z outputs = layer_module( 2025-08-14T21:48:47.9477268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9477338Z outputs = self.rel_attn( 2025-08-14T21:48:47.9477567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9477644Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9477903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9478019Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9478023Z 2025-08-14T21:48:47.9478121Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9478305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9478364Z return mod(**inputs) 2025-08-14T21:48:47.9478605Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9478679Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9478908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9478975Z outputs = layer_module( 2025-08-14T21:48:47.9479201Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9479400Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9479639Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9479711Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9479951Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9480017Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9480254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9480319Z output = self.layer_1(output) 2025-08-14T21:48:47.9480322Z 2025-08-14T21:48:47.9480415Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9480599Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9480660Z return mod(**inputs) 2025-08-14T21:48:47.9480889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9480968Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9481199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9481266Z outputs = layer_module( 2025-08-14T21:48:47.9481495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9481684Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9481930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9481999Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9482239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9482304Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9482559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9482649Z output = self.activation_function(output) 2025-08-14T21:48:47.9482859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9482922Z return self.act(input) 2025-08-14T21:48:47.9482926Z 2025-08-14T21:48:47.9483025Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9483208Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9483273Z return mod(**inputs) 2025-08-14T21:48:47.9483505Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9483594Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9483829Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9483893Z outputs = layer_module( 2025-08-14T21:48:47.9484122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9484318Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9484558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9484743Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9484980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9485049Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9485285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9485352Z output = self.layer_2(output) 2025-08-14T21:48:47.9485355Z 2025-08-14T21:48:47.9485459Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9485666Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9485729Z return mod(**inputs) 2025-08-14T21:48:47.9485974Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9486050Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9486287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9486358Z outputs = layer_module( 2025-08-14T21:48:47.9486596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9486669Z outputs = self.rel_attn( 2025-08-14T21:48:47.9486908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9487002Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9487006Z 2025-08-14T21:48:47.9487108Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9487297Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9487367Z return mod(**inputs) 2025-08-14T21:48:47.9487606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9487683Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9487931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9487995Z outputs = layer_module( 2025-08-14T21:48:47.9488229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9488362Z outputs = self.rel_attn( 2025-08-14T21:48:47.9488599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9488722Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9488725Z 2025-08-14T21:48:47.9488820Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9489006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9489072Z return mod(**inputs) 2025-08-14T21:48:47.9489312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9489418Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9489656Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9489718Z outputs = layer_module( 2025-08-14T21:48:47.9489962Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9490027Z outputs = self.rel_attn( 2025-08-14T21:48:47.9490268Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9490346Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9490601Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9490731Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9490736Z 2025-08-14T21:48:47.9490830Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9491014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9491082Z return mod(**inputs) 2025-08-14T21:48:47.9491323Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9491406Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9491643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9491705Z outputs = layer_module( 2025-08-14T21:48:47.9491947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9492008Z outputs = self.rel_attn( 2025-08-14T21:48:47.9492243Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9492376Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9492379Z 2025-08-14T21:48:47.9492473Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9492667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9492728Z return mod(**inputs) 2025-08-14T21:48:47.9492965Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9493047Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9493283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9493343Z outputs = layer_module( 2025-08-14T21:48:47.9493582Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9493646Z outputs = self.rel_attn( 2025-08-14T21:48:47.9493887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9493953Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9494234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9494375Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9494379Z 2025-08-14T21:48:47.9494474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9494665Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9494724Z return mod(**inputs) 2025-08-14T21:48:47.9494960Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9495057Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9495295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9495358Z outputs = layer_module( 2025-08-14T21:48:47.9495602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9495664Z outputs = self.rel_attn( 2025-08-14T21:48:47.9495907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9496000Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9496003Z 2025-08-14T21:48:47.9496097Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9496287Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9496348Z return mod(**inputs) 2025-08-14T21:48:47.9496594Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9496670Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9496911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9496981Z outputs = layer_module( 2025-08-14T21:48:47.9497216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9497279Z outputs = self.rel_attn( 2025-08-14T21:48:47.9497521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9497587Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9497848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9497966Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9497969Z 2025-08-14T21:48:47.9498065Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9498260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9498320Z return mod(**inputs) 2025-08-14T21:48:47.9498570Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9498646Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9498883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9498950Z outputs = layer_module( 2025-08-14T21:48:47.9499187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9499251Z outputs = self.rel_attn( 2025-08-14T21:48:47.9499494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9499577Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9499873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9499993Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9499996Z 2025-08-14T21:48:47.9500091Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9500280Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9500341Z return mod(**inputs) 2025-08-14T21:48:47.9500638Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9500732Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9500973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9501041Z outputs = layer_module( 2025-08-14T21:48:47.9501283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9501348Z outputs = self.rel_attn( 2025-08-14T21:48:47.9501591Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9501673Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9501935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9502040Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9502043Z 2025-08-14T21:48:47.9502137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9502341Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9502400Z return mod(**inputs) 2025-08-14T21:48:47.9502633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9502715Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9502947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9503014Z outputs = layer_module( 2025-08-14T21:48:47.9503245Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9503436Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9503682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9503754Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9503992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9504059Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9504292Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9504366Z output = self.layer_1(output) 2025-08-14T21:48:47.9504369Z 2025-08-14T21:48:47.9504462Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9504652Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9504712Z return mod(**inputs) 2025-08-14T21:48:47.9505004Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9505093Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9505325Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9505385Z outputs = layer_module( 2025-08-14T21:48:47.9505651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9505854Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9506098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9506167Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9506401Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9506499Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9506732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9506824Z output = self.activation_function(output) 2025-08-14T21:48:47.9507022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9507087Z return self.act(input) 2025-08-14T21:48:47.9507092Z 2025-08-14T21:48:47.9507193Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9507375Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9507435Z return mod(**inputs) 2025-08-14T21:48:47.9507672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9507745Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9507984Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9508044Z outputs = layer_module( 2025-08-14T21:48:47.9508272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9508469Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9508710Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9508786Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9509016Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9509081Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9509315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9509382Z output = self.layer_2(output) 2025-08-14T21:48:47.9509385Z 2025-08-14T21:48:47.9509479Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9509667Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9509726Z return mod(**inputs) 2025-08-14T21:48:47.9509959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9510034Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9510264Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9510331Z outputs = layer_module( 2025-08-14T21:48:47.9510560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9510626Z outputs = self.rel_attn( 2025-08-14T21:48:47.9510862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 416, in forward 2025-08-14T21:48:47.9510950Z q_head_h = torch.einsum("ibh,hnd->ibnd", h, self.q) 2025-08-14T21:48:47.9510953Z 2025-08-14T21:48:47.9511079Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9511262Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9511336Z return mod(**inputs) 2025-08-14T21:48:47.9511574Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9511647Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9511884Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9511943Z outputs = layer_module( 2025-08-14T21:48:47.9512190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9512259Z outputs = self.rel_attn( 2025-08-14T21:48:47.9512492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 417, in forward 2025-08-14T21:48:47.9512587Z k_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.k) 2025-08-14T21:48:47.9512592Z 2025-08-14T21:48:47.9512694Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9512871Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9512936Z return mod(**inputs) 2025-08-14T21:48:47.9513169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9513243Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9513482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9513544Z outputs = layer_module( 2025-08-14T21:48:47.9513780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9513845Z outputs = self.rel_attn( 2025-08-14T21:48:47.9514076Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9514150Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9514398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 263, in rel_attn_core 2025-08-14T21:48:47.9514517Z ac = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_w_bias, k_head_h) 2025-08-14T21:48:47.9514520Z 2025-08-14T21:48:47.9514622Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9514802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9514871Z return mod(**inputs) 2025-08-14T21:48:47.9515105Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9515187Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9515426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9515487Z outputs = layer_module( 2025-08-14T21:48:47.9515719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9515785Z outputs = self.rel_attn( 2025-08-14T21:48:47.9516013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 422, in forward 2025-08-14T21:48:47.9516141Z k_head_r = torch.einsum("ibh,hnd->ibnd", r.type(self.r.dtype), self.r) 2025-08-14T21:48:47.9516146Z 2025-08-14T21:48:47.9516238Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9516417Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9516483Z return mod(**inputs) 2025-08-14T21:48:47.9516742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9516839Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9517071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9517131Z outputs = layer_module( 2025-08-14T21:48:47.9517367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9517428Z outputs = self.rel_attn( 2025-08-14T21:48:47.9517658Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9517745Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9517995Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 266, in rel_attn_core 2025-08-14T21:48:47.9518120Z bd = torch.einsum("ibnd,jbnd->bnij", q_head + self.r_r_bias, k_head_r) 2025-08-14T21:48:47.9518124Z 2025-08-14T21:48:47.9518218Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9518401Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9518467Z return mod(**inputs) 2025-08-14T21:48:47.9518699Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9518780Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9519011Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9519073Z outputs = layer_module( 2025-08-14T21:48:47.9519313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9519372Z outputs = self.rel_attn( 2025-08-14T21:48:47.9519603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 418, in forward 2025-08-14T21:48:47.9519702Z v_head_h = torch.einsum("ibh,hnd->ibnd", cat, self.v) 2025-08-14T21:48:47.9519706Z 2025-08-14T21:48:47.9519797Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9519984Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9520042Z return mod(**inputs) 2025-08-14T21:48:47.9520272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9520355Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9520590Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9520656Z outputs = layer_module( 2025-08-14T21:48:47.9520891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9520954Z outputs = self.rel_attn( 2025-08-14T21:48:47.9521191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 425, in forward 2025-08-14T21:48:47.9521255Z attn_vec = self.rel_attn_core( 2025-08-14T21:48:47.9521501Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 294, in rel_attn_core 2025-08-14T21:48:47.9521621Z attn_vec = torch.einsum("bnij,jbnd->ibnd", attn_prob, v_head_h) 2025-08-14T21:48:47.9521625Z 2025-08-14T21:48:47.9521718Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9521904Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9521962Z return mod(**inputs) 2025-08-14T21:48:47.9522226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9522309Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9522554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9522621Z outputs = layer_module( 2025-08-14T21:48:47.9522848Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9522908Z outputs = self.rel_attn( 2025-08-14T21:48:47.9523141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9523237Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9523486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9523594Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9523597Z 2025-08-14T21:48:47.9523691Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9523875Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9523934Z return mod(**inputs) 2025-08-14T21:48:47.9524163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9524244Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9524475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9524537Z outputs = layer_module( 2025-08-14T21:48:47.9524771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 494, in forward 2025-08-14T21:48:47.9524832Z outputs = self.rel_attn( 2025-08-14T21:48:47.9525070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 440, in forward 2025-08-14T21:48:47.9525151Z output_h = self.post_attention(h, attn_vec) 2025-08-14T21:48:47.9525398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 304, in post_attention 2025-08-14T21:48:47.9525505Z attn_out = torch.einsum("ibnd,hnd->ibh", attn_vec, self.o) 2025-08-14T21:48:47.9525508Z 2025-08-14T21:48:47.9525600Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9525783Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9525847Z return mod(**inputs) 2025-08-14T21:48:47.9526078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9526159Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9526392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9526456Z outputs = layer_module( 2025-08-14T21:48:47.9526692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9526882Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9527127Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9527198Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9527430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9527502Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9527761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 463, in forward 2025-08-14T21:48:47.9527836Z output = self.layer_1(output) 2025-08-14T21:48:47.9527859Z 2025-08-14T21:48:47.9527954Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9528136Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9528203Z return mod(**inputs) 2025-08-14T21:48:47.9528435Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9528509Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9528747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9528824Z outputs = layer_module( 2025-08-14T21:48:47.9529061Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9529253Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9529494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9529570Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9529801Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9529870Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9530100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 464, in forward 2025-08-14T21:48:47.9530182Z output = self.activation_function(output) 2025-08-14T21:48:47.9530381Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:48:47.9530444Z return self.act(input) 2025-08-14T21:48:47.9530447Z 2025-08-14T21:48:47.9530543Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9530730Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9530791Z return mod(**inputs) 2025-08-14T21:48:47.9531028Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1607, in forward 2025-08-14T21:48:47.9531102Z transformer_outputs = self.transformer( 2025-08-14T21:48:47.9531334Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1368, in forward 2025-08-14T21:48:47.9531401Z outputs = layer_module( 2025-08-14T21:48:47.9531633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 512, in forward 2025-08-14T21:48:47.9531830Z output_h = apply_chunking_to_forward(self.ff_chunk, self.chunk_size_feed_forward, self.seq_len_dim, output_h) 2025-08-14T21:48:47.9532072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:48:47.9532141Z return forward_fn(*input_tensors) 2025-08-14T21:48:47.9532380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 518, in ff_chunk 2025-08-14T21:48:47.9532445Z output_x = self.ff(output_x) 2025-08-14T21:48:47.9532675Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 466, in forward 2025-08-14T21:48:47.9532748Z output = self.layer_2(output) 2025-08-14T21:48:47.9532751Z 2025-08-14T21:48:47.9532846Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9533034Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9533093Z return mod(**inputs) 2025-08-14T21:48:47.9533362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1624, in forward 2025-08-14T21:48:47.9533458Z logits = self.lm_loss(transformer_outputs[0]) 2025-08-14T21:48:47.9533479Z 2025-08-14T21:48:47.9533573Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:48:47.9533760Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:48:47.9533820Z return mod(**inputs) 2025-08-14T21:48:47.9534052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/xlnet/modeling_xlnet.py", line 1630, in forward 2025-08-14T21:48:47.9534177Z loss = loss_fct(logits.view(-1, logits.size(-1)), labels.view(-1)) 2025-08-14T21:48:47.9534195Z 2025-08-14T21:48:59.2572614Z Compilation time (from dynamo_timed): 28.806777594 2025-08-14T21:48:59.2605182Z pass 2025-08-14T21:48:59.2608970Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:48:59.2611138Z TIMING: _recursive_pre_grad_passes:0.01151 _recursive_joint_graph_passes:1.22754 _recursive_post_grad_passes:0.20885 async_compile.wait:0.77353 code_gen:10.20854 inductor_compile:14.3447 backend_compile:23.30442 gc:0.00215 entire_frame_compile:28.80678 total_wall_time:28.80678 2025-08-14T21:48:59.2612600Z STATS: call_* op count: 818 | FakeTensorMode.__torch_dispatch__:56665 | FakeTensor.__torch_dispatch__:16773 | ProxyTorchDispatchMode.__torch_dispatch__:18623 2025-08-14T21:48:59.2616717Z Dynamo produced 1 graphs covering 818 ops with 0 graph breaks (0 unique) 2025-08-14T21:49:03.8704535Z /opt/conda/envs/py_3.9/lib/python3.9/site-packages/llvmlite/binding/ffi.py:175: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81. 2025-08-14T21:49:03.8705623Z from pkg_resources import resource_filename 2025-08-14T21:49:04.4758356Z 2025-08-14T21:49:05.8174669Z loading model: 0it [00:00, ?it/s] 2025-08-14T21:49:05.8176177Z loading model: 0it [00:01, ?it/s] 2025-08-14T21:49:05.8195099Z cpu eval YituTechConvBert 2025-08-14T21:49:06.5698968Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:49:06.7908902Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:49:07.0116576Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:49:17.9003393Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9005188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9007416Z return mod(**inputs) 2025-08-14T21:49:17.9012313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9017283Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9022369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9027022Z hidden_states = self.encoder( 2025-08-14T21:49:17.9033767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9037853Z layer_outputs = layer_module( 2025-08-14T21:49:17.9042068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9043935Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9044376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9044833Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9045457Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9045853Z self_outputs = self.self( 2025-08-14T21:49:17.9046287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9046686Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9046854Z 2025-08-14T21:49:17.9046958Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9047318Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9047639Z return mod(**inputs) 2025-08-14T21:49:17.9047999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9048449Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9048866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9049243Z hidden_states = self.encoder( 2025-08-14T21:49:17.9049618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9050004Z layer_outputs = layer_module( 2025-08-14T21:49:17.9050329Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9050683Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9051077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9051459Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9051825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9052197Z self_outputs = self.self( 2025-08-14T21:49:17.9052556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9052933Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9053060Z 2025-08-14T21:49:17.9053159Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9053494Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9053799Z return mod(**inputs) 2025-08-14T21:49:17.9054139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9054515Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9054891Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9055261Z hidden_states = self.encoder( 2025-08-14T21:49:17.9055622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9055991Z layer_outputs = layer_module( 2025-08-14T21:49:17.9056317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9056652Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9057019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9057394Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9057769Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9058131Z self_outputs = self.self( 2025-08-14T21:49:17.9058491Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9058930Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9059074Z 2025-08-14T21:49:17.9059162Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9059379Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9059598Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9059939Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9060242Z return mod(**inputs) 2025-08-14T21:49:17.9060598Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9060978Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9061390Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9061772Z hidden_states = self.encoder( 2025-08-14T21:49:17.9062189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9062652Z layer_outputs = layer_module( 2025-08-14T21:49:17.9063010Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9063390Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9063830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9064238Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9064611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9065218Z self_outputs = self.self( 2025-08-14T21:49:17.9065633Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9066107Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9066279Z 2025-08-14T21:49:17.9066364Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9066620Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9067007Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9067370Z return mod(**inputs) 2025-08-14T21:49:17.9067774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9068221Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9068669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9069103Z hidden_states = self.encoder( 2025-08-14T21:49:17.9069522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9069961Z layer_outputs = layer_module( 2025-08-14T21:49:17.9070322Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9070708Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9071152Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9071597Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9072038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9072442Z self_outputs = self.self( 2025-08-14T21:49:17.9072825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9073278Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9073767Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9074159Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9074277Z 2025-08-14T21:49:17.9074380Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9074703Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9075006Z return mod(**inputs) 2025-08-14T21:49:17.9075351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9075723Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9076166Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9076547Z hidden_states = self.encoder( 2025-08-14T21:49:17.9076921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9077744Z layer_outputs = layer_module( 2025-08-14T21:49:17.9078106Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9078464Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9078864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9079254Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9079628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9079992Z self_outputs = self.self( 2025-08-14T21:49:17.9080341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9080784Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9081244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9081619Z x = self.pointwise(x) 2025-08-14T21:49:17.9081722Z 2025-08-14T21:49:17.9081824Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9082153Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9082455Z return mod(**inputs) 2025-08-14T21:49:17.9082810Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9083178Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9083546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9083925Z hidden_states = self.encoder( 2025-08-14T21:49:17.9084296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9084931Z layer_outputs = layer_module( 2025-08-14T21:49:17.9085274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9085629Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9086069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9086492Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9086908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9087341Z self_outputs = self.self( 2025-08-14T21:49:17.9087855Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9088368Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9088627Z 2025-08-14T21:49:17.9088738Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9089123Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9089463Z return mod(**inputs) 2025-08-14T21:49:17.9089880Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9090324Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9090794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9091213Z hidden_states = self.encoder( 2025-08-14T21:49:17.9091644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9092056Z layer_outputs = layer_module( 2025-08-14T21:49:17.9092409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9092796Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9093236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9093667Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9094086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9094453Z self_outputs = self.self( 2025-08-14T21:49:17.9094809Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9095227Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9095395Z 2025-08-14T21:49:17.9095493Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9095835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9096146Z return mod(**inputs) 2025-08-14T21:49:17.9096508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9096890Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9097267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9097638Z hidden_states = self.encoder( 2025-08-14T21:49:17.9097999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9098358Z layer_outputs = layer_module( 2025-08-14T21:49:17.9098677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9099002Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9099370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9099739Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9100108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9100460Z self_outputs = self.self( 2025-08-14T21:49:17.9100813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9101225Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9101391Z 2025-08-14T21:49:17.9101472Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9101692Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9101913Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9102263Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9102555Z return mod(**inputs) 2025-08-14T21:49:17.9102898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9103273Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9103643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9104022Z hidden_states = self.encoder( 2025-08-14T21:49:17.9104384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9104800Z layer_outputs = layer_module( 2025-08-14T21:49:17.9105136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9105475Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9105849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9106227Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9106592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9106954Z self_outputs = self.self( 2025-08-14T21:49:17.9107311Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9107717Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9107871Z 2025-08-14T21:49:17.9107967Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9108302Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9108608Z return mod(**inputs) 2025-08-14T21:49:17.9108947Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9109323Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9109694Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9110056Z hidden_states = self.encoder( 2025-08-14T21:49:17.9110406Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9110769Z layer_outputs = layer_module( 2025-08-14T21:49:17.9111086Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9111411Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9111788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9112174Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9112553Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9112976Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9113392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9113769Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9113894Z 2025-08-14T21:49:17.9113995Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9114314Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9114634Z return mod(**inputs) 2025-08-14T21:49:17.9115002Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9115387Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9115757Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9116120Z hidden_states = self.encoder( 2025-08-14T21:49:17.9116475Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9116830Z layer_outputs = layer_module( 2025-08-14T21:49:17.9117165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9117494Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9117864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9118240Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9118616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9118980Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9119368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9119813Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9120238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9120624Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9120754Z 2025-08-14T21:49:17.9120850Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9121188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9121496Z return mod(**inputs) 2025-08-14T21:49:17.9121849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9122224Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9122603Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9122976Z hidden_states = self.encoder( 2025-08-14T21:49:17.9123336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9123716Z layer_outputs = layer_module( 2025-08-14T21:49:17.9124032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9124359Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9124721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9125099Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9125468Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9125828Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9126215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9126660Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9127075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9127479Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9127882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9128214Z return self.act(input) 2025-08-14T21:49:17.9128316Z 2025-08-14T21:49:17.9128416Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9128742Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9129040Z return mod(**inputs) 2025-08-14T21:49:17.9129384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9129752Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9130149Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9130509Z hidden_states = self.encoder( 2025-08-14T21:49:17.9130871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9131227Z layer_outputs = layer_module( 2025-08-14T21:49:17.9131545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9131872Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9132233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9132599Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9132966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9133326Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9133709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9134158Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9134578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9134952Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9135077Z 2025-08-14T21:49:17.9135171Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9135500Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9135799Z return mod(**inputs) 2025-08-14T21:49:17.9136141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9136512Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9136883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9137247Z hidden_states = self.encoder( 2025-08-14T21:49:17.9137599Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9137964Z layer_outputs = layer_module( 2025-08-14T21:49:17.9138280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9138614Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9138977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9139347Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9139716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9140078Z self_outputs = self.self( 2025-08-14T21:49:17.9140434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9140837Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9140994Z 2025-08-14T21:49:17.9141097Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9141430Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9141738Z return mod(**inputs) 2025-08-14T21:49:17.9142094Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9142473Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9142847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9143250Z hidden_states = self.encoder( 2025-08-14T21:49:17.9143671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9144046Z layer_outputs = layer_module( 2025-08-14T21:49:17.9144377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9144718Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9145173Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9145556Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9145946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9146334Z self_outputs = self.self( 2025-08-14T21:49:17.9146696Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9147073Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9147210Z 2025-08-14T21:49:17.9147309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9147646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9147946Z return mod(**inputs) 2025-08-14T21:49:17.9148299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9148682Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9149059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9149427Z hidden_states = self.encoder( 2025-08-14T21:49:17.9149792Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9150163Z layer_outputs = layer_module( 2025-08-14T21:49:17.9150480Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9150814Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9151190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9151572Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9151943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9152315Z self_outputs = self.self( 2025-08-14T21:49:17.9152672Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9153068Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9153210Z 2025-08-14T21:49:17.9153285Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9153484Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9153739Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9154072Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9154398Z return mod(**inputs) 2025-08-14T21:49:17.9154752Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9155134Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9155508Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9155882Z hidden_states = self.encoder( 2025-08-14T21:49:17.9156250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9156635Z layer_outputs = layer_module( 2025-08-14T21:49:17.9156957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9157294Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9157660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9158029Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9158402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9158766Z self_outputs = self.self( 2025-08-14T21:49:17.9159111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9159506Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9159656Z 2025-08-14T21:49:17.9159729Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9159946Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9160267Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9160566Z return mod(**inputs) 2025-08-14T21:49:17.9160909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9161282Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9161643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9162004Z hidden_states = self.encoder( 2025-08-14T21:49:17.9162362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9162717Z layer_outputs = layer_module( 2025-08-14T21:49:17.9163034Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9163364Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9163733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9164097Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9164463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9164823Z self_outputs = self.self( 2025-08-14T21:49:17.9165167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9165614Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9166059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9166428Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9166549Z 2025-08-14T21:49:17.9166680Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9167022Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9167354Z return mod(**inputs) 2025-08-14T21:49:17.9167700Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9168069Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9168441Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9168809Z hidden_states = self.encoder( 2025-08-14T21:49:17.9169182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9169546Z layer_outputs = layer_module( 2025-08-14T21:49:17.9169869Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9170201Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9170564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9170937Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9171312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9171675Z self_outputs = self.self( 2025-08-14T21:49:17.9172017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9172465Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9172911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9173269Z x = self.pointwise(x) 2025-08-14T21:49:17.9173377Z 2025-08-14T21:49:17.9173471Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9173802Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9174101Z return mod(**inputs) 2025-08-14T21:49:17.9174438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9174811Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9175183Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9175550Z hidden_states = self.encoder( 2025-08-14T21:49:17.9175900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9176265Z layer_outputs = layer_module( 2025-08-14T21:49:17.9176584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9176909Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9177277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9177650Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9178017Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9178373Z self_outputs = self.self( 2025-08-14T21:49:17.9178723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9179166Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9179355Z 2025-08-14T21:49:17.9179454Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9179806Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9180133Z return mod(**inputs) 2025-08-14T21:49:17.9180474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9180840Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9181215Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9181586Z hidden_states = self.encoder( 2025-08-14T21:49:17.9181955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9182408Z layer_outputs = layer_module( 2025-08-14T21:49:17.9182723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9183056Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9183417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9183803Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9184184Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9184556Z self_outputs = self.self( 2025-08-14T21:49:17.9185108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9185542Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9185713Z 2025-08-14T21:49:17.9185814Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9186159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9186466Z return mod(**inputs) 2025-08-14T21:49:17.9186824Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9187215Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9187595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9187978Z hidden_states = self.encoder( 2025-08-14T21:49:17.9188356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9188737Z layer_outputs = layer_module( 2025-08-14T21:49:17.9189060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9189403Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9189785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9190167Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9190545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9190919Z self_outputs = self.self( 2025-08-14T21:49:17.9191281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9191703Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9191880Z 2025-08-14T21:49:17.9191957Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9192163Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9192388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9192722Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9193090Z return mod(**inputs) 2025-08-14T21:49:17.9193450Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9193857Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9194238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9194613Z hidden_states = self.encoder( 2025-08-14T21:49:17.9194981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9195348Z layer_outputs = layer_module( 2025-08-14T21:49:17.9195702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9196041Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9196416Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9196802Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9197175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9197535Z self_outputs = self.self( 2025-08-14T21:49:17.9197876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9198278Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9198433Z 2025-08-14T21:49:17.9198528Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9198855Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9199145Z return mod(**inputs) 2025-08-14T21:49:17.9199492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9199866Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9200233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9200606Z hidden_states = self.encoder( 2025-08-14T21:49:17.9200973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9201351Z layer_outputs = layer_module( 2025-08-14T21:49:17.9201669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9202010Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9202396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9202765Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9203132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9203553Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9203967Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9204330Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9204465Z 2025-08-14T21:49:17.9204557Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9204893Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9205215Z return mod(**inputs) 2025-08-14T21:49:17.9205552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9205924Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9206327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9206710Z hidden_states = self.encoder( 2025-08-14T21:49:17.9207060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9207420Z layer_outputs = layer_module( 2025-08-14T21:49:17.9207738Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9208060Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9208427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9208830Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9209206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9209566Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9209969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9210409Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9210812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9211195Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9211331Z 2025-08-14T21:49:17.9211427Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9211763Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9212065Z return mod(**inputs) 2025-08-14T21:49:17.9212421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9212794Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9213163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9213520Z hidden_states = self.encoder( 2025-08-14T21:49:17.9213876Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9214239Z layer_outputs = layer_module( 2025-08-14T21:49:17.9214549Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9214883Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9215253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9215635Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9216001Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9216367Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9216763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9217203Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9217610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9218015Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9218367Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9218675Z return self.act(input) 2025-08-14T21:49:17.9218786Z 2025-08-14T21:49:17.9218881Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9219248Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9219568Z return mod(**inputs) 2025-08-14T21:49:17.9219906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9220282Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9220669Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9221045Z hidden_states = self.encoder( 2025-08-14T21:49:17.9221409Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9221803Z layer_outputs = layer_module( 2025-08-14T21:49:17.9222132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9222470Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9222850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9223238Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9223619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9223980Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9224392Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9224926Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9225373Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9225755Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9225898Z 2025-08-14T21:49:17.9225997Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9226342Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9226650Z return mod(**inputs) 2025-08-14T21:49:17.9227012Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9227405Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9227793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9228169Z hidden_states = self.encoder( 2025-08-14T21:49:17.9228540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9228921Z layer_outputs = layer_module( 2025-08-14T21:49:17.9229244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9229593Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9229972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9230358Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9230737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9231115Z self_outputs = self.self( 2025-08-14T21:49:17.9231484Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9231881Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9232020Z 2025-08-14T21:49:17.9232117Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9232505Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9232816Z return mod(**inputs) 2025-08-14T21:49:17.9233181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9233618Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9233999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9234372Z hidden_states = self.encoder( 2025-08-14T21:49:17.9234732Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9235125Z layer_outputs = layer_module( 2025-08-14T21:49:17.9235453Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9235788Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9236171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9236565Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9236938Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9237296Z self_outputs = self.self( 2025-08-14T21:49:17.9237647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9238022Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9238147Z 2025-08-14T21:49:17.9238249Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9238571Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9238873Z return mod(**inputs) 2025-08-14T21:49:17.9239220Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9239585Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9239959Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9240328Z hidden_states = self.encoder( 2025-08-14T21:49:17.9240693Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9241131Z layer_outputs = layer_module( 2025-08-14T21:49:17.9241447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9241778Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9242146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9242509Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9242878Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9243243Z self_outputs = self.self( 2025-08-14T21:49:17.9243587Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9243972Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9244113Z 2025-08-14T21:49:17.9244188Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9244383Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9244591Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9244923Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9245221Z return mod(**inputs) 2025-08-14T21:49:17.9245606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9245998Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9246370Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9246735Z hidden_states = self.encoder( 2025-08-14T21:49:17.9247087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9247449Z layer_outputs = layer_module( 2025-08-14T21:49:17.9247763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9248098Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9248464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9248835Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9249204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9249561Z self_outputs = self.self( 2025-08-14T21:49:17.9249912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9250303Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9250446Z 2025-08-14T21:49:17.9250525Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9250734Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9251075Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9251388Z return mod(**inputs) 2025-08-14T21:49:17.9251734Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9252122Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9252494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9252854Z hidden_states = self.encoder( 2025-08-14T21:49:17.9253205Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9253575Z layer_outputs = layer_module( 2025-08-14T21:49:17.9253898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9254230Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9254606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9254986Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9255364Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9255730Z self_outputs = self.self( 2025-08-14T21:49:17.9256087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9256543Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9256996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9257447Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9257578Z 2025-08-14T21:49:17.9257674Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9258006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9258301Z return mod(**inputs) 2025-08-14T21:49:17.9258685Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9259711Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9260091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9260461Z hidden_states = self.encoder( 2025-08-14T21:49:17.9260830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9261203Z layer_outputs = layer_module( 2025-08-14T21:49:17.9261531Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9261888Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9262267Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9262652Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9263023Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9263396Z self_outputs = self.self( 2025-08-14T21:49:17.9263755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9264208Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9264651Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9265120Z x = self.pointwise(x) 2025-08-14T21:49:17.9265226Z 2025-08-14T21:49:17.9265331Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9265671Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9265973Z return mod(**inputs) 2025-08-14T21:49:17.9266326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9266711Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9267085Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9267457Z hidden_states = self.encoder( 2025-08-14T21:49:17.9267836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9268201Z layer_outputs = layer_module( 2025-08-14T21:49:17.9268513Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9268841Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9269212Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9269578Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9269944Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9270306Z self_outputs = self.self( 2025-08-14T21:49:17.9270655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9271085Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9271289Z 2025-08-14T21:49:17.9271387Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9271734Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9272029Z return mod(**inputs) 2025-08-14T21:49:17.9272402Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9272791Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9273164Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9273522Z hidden_states = self.encoder( 2025-08-14T21:49:17.9273879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9274240Z layer_outputs = layer_module( 2025-08-14T21:49:17.9274554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9274902Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9275283Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9275669Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9276043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9276408Z self_outputs = self.self( 2025-08-14T21:49:17.9276766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9277180Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9277341Z 2025-08-14T21:49:17.9277437Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9277780Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9278087Z return mod(**inputs) 2025-08-14T21:49:17.9278439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9278874Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9279262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9279641Z hidden_states = self.encoder( 2025-08-14T21:49:17.9280015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9280381Z layer_outputs = layer_module( 2025-08-14T21:49:17.9280701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9281033Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9281400Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9281778Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9282153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9282519Z self_outputs = self.self( 2025-08-14T21:49:17.9282872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9283292Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9283458Z 2025-08-14T21:49:17.9283542Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9283736Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9283962Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9284299Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9284770Z return mod(**inputs) 2025-08-14T21:49:17.9285128Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9285568Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9285972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9286382Z hidden_states = self.encoder( 2025-08-14T21:49:17.9286764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9287138Z layer_outputs = layer_module( 2025-08-14T21:49:17.9287474Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9287793Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9288189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9288570Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9288943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9289297Z self_outputs = self.self( 2025-08-14T21:49:17.9289648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9290047Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9290198Z 2025-08-14T21:49:17.9290294Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9290632Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9290939Z return mod(**inputs) 2025-08-14T21:49:17.9291290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9291665Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9292048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9292423Z hidden_states = self.encoder( 2025-08-14T21:49:17.9292778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9293152Z layer_outputs = layer_module( 2025-08-14T21:49:17.9293473Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9293813Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9294180Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9294561Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9294941Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9295367Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9295783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9296167Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9296295Z 2025-08-14T21:49:17.9296397Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9296724Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9297030Z return mod(**inputs) 2025-08-14T21:49:17.9297384Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9297766Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9298138Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9298511Z hidden_states = self.encoder( 2025-08-14T21:49:17.9298907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9299296Z layer_outputs = layer_module( 2025-08-14T21:49:17.9299611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9299947Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9300320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9300700Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9301079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9301494Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9301903Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9302352Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9302780Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9303166Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9303296Z 2025-08-14T21:49:17.9303400Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9303732Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9304043Z return mod(**inputs) 2025-08-14T21:49:17.9304397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9304823Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9305233Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9305629Z hidden_states = self.encoder( 2025-08-14T21:49:17.9306008Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9306406Z layer_outputs = layer_module( 2025-08-14T21:49:17.9306736Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9307084Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9307461Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9307855Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9308239Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9308613Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9309018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9309475Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9309901Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9310323Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9310670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9310986Z return self.act(input) 2025-08-14T21:49:17.9311088Z 2025-08-14T21:49:17.9311192Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9311515Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9311817Z return mod(**inputs) 2025-08-14T21:49:17.9312198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9312589Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9312955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9313319Z hidden_states = self.encoder( 2025-08-14T21:49:17.9313674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9314032Z layer_outputs = layer_module( 2025-08-14T21:49:17.9314337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9314686Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9315052Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9315423Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9315793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9316156Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9316550Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9316993Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9317415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9317794Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9317922Z 2025-08-14T21:49:17.9318023Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9318348Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9318652Z return mod(**inputs) 2025-08-14T21:49:17.9318999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9319370Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9319743Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9320106Z hidden_states = self.encoder( 2025-08-14T21:49:17.9320465Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9320823Z layer_outputs = layer_module( 2025-08-14T21:49:17.9321143Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9321472Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9321835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9322213Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9322586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9322952Z self_outputs = self.self( 2025-08-14T21:49:17.9323298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9323681Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9323823Z 2025-08-14T21:49:17.9323919Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9324251Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9324542Z return mod(**inputs) 2025-08-14T21:49:17.9324915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9325290Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9325670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9326031Z hidden_states = self.encoder( 2025-08-14T21:49:17.9326386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9326746Z layer_outputs = layer_module( 2025-08-14T21:49:17.9327050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9327410Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9327785Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9328160Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9328527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9328892Z self_outputs = self.self( 2025-08-14T21:49:17.9329244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9329613Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9329747Z 2025-08-14T21:49:17.9329842Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9330173Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9330472Z return mod(**inputs) 2025-08-14T21:49:17.9330807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9331183Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9331555Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9331916Z hidden_states = self.encoder( 2025-08-14T21:49:17.9332274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9332639Z layer_outputs = layer_module( 2025-08-14T21:49:17.9332955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9333278Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9333647Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9334023Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9334391Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9334753Z self_outputs = self.self( 2025-08-14T21:49:17.9335109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9335499Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9335635Z 2025-08-14T21:49:17.9335711Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9335908Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9336128Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9336462Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9336754Z return mod(**inputs) 2025-08-14T21:49:17.9337100Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9337478Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9337877Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9338260Z hidden_states = self.encoder( 2025-08-14T21:49:17.9338617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9338979Z layer_outputs = layer_module( 2025-08-14T21:49:17.9339290Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9339621Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9339999Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9340399Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9340813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9341191Z self_outputs = self.self( 2025-08-14T21:49:17.9341556Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9341959Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9342112Z 2025-08-14T21:49:17.9342187Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9342412Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9342754Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9343059Z return mod(**inputs) 2025-08-14T21:49:17.9343414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9343801Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9344177Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9344554Z hidden_states = self.encoder( 2025-08-14T21:49:17.9344990Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9345382Z layer_outputs = layer_module( 2025-08-14T21:49:17.9345701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9346045Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9346429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9346815Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9347194Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9347569Z self_outputs = self.self( 2025-08-14T21:49:17.9347942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9348404Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9348871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9349257Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9349378Z 2025-08-14T21:49:17.9349480Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9349813Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9350121Z return mod(**inputs) 2025-08-14T21:49:17.9350476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9350860Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9351271Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9351665Z hidden_states = self.encoder( 2025-08-14T21:49:17.9352033Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9352401Z layer_outputs = layer_module( 2025-08-14T21:49:17.9352733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9353075Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9353459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9353855Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9354234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9354612Z self_outputs = self.self( 2025-08-14T21:49:17.9354969Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9355425Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9355887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9356260Z x = self.pointwise(x) 2025-08-14T21:49:17.9356365Z 2025-08-14T21:49:17.9356459Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9356796Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9357098Z return mod(**inputs) 2025-08-14T21:49:17.9357446Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9357828Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9358199Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9358564Z hidden_states = self.encoder( 2025-08-14T21:49:17.9358914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9359278Z layer_outputs = layer_module( 2025-08-14T21:49:17.9359596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9359937Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9360318Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9360687Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9361059Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9361422Z self_outputs = self.self( 2025-08-14T21:49:17.9361764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9362203Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9362391Z 2025-08-14T21:49:17.9362491Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9362809Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9363109Z return mod(**inputs) 2025-08-14T21:49:17.9363452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9363822Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9364211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9364580Z hidden_states = self.encoder( 2025-08-14T21:49:17.9364953Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9365303Z layer_outputs = layer_module( 2025-08-14T21:49:17.9365616Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9365946Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9366312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9366694Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9367065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9367428Z self_outputs = self.self( 2025-08-14T21:49:17.9367778Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9368181Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9368343Z 2025-08-14T21:49:17.9368436Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9368765Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9369057Z return mod(**inputs) 2025-08-14T21:49:17.9369403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9369777Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9370148Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9370504Z hidden_states = self.encoder( 2025-08-14T21:49:17.9370866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9371228Z layer_outputs = layer_module( 2025-08-14T21:49:17.9371546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9371868Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9372235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9372612Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9372972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9373336Z self_outputs = self.self( 2025-08-14T21:49:17.9373689Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9374101Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9374266Z 2025-08-14T21:49:17.9374340Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9374538Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9374755Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9375073Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9375373Z return mod(**inputs) 2025-08-14T21:49:17.9375718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9376096Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9376460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9376824Z hidden_states = self.encoder( 2025-08-14T21:49:17.9377219Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9377621Z layer_outputs = layer_module( 2025-08-14T21:49:17.9377939Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9378277Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9378657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9379031Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9379412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9379805Z self_outputs = self.self( 2025-08-14T21:49:17.9380171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9380582Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9380748Z 2025-08-14T21:49:17.9380844Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9381188Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9381491Z return mod(**inputs) 2025-08-14T21:49:17.9381847Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9382234Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9382619Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9382992Z hidden_states = self.encoder( 2025-08-14T21:49:17.9383362Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9383740Z layer_outputs = layer_module( 2025-08-14T21:49:17.9384067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9384406Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9384936Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9385343Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9385783Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9386218Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9386648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9387038Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9387170Z 2025-08-14T21:49:17.9387272Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9387620Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9387932Z return mod(**inputs) 2025-08-14T21:49:17.9388289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9388666Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9389047Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9389422Z hidden_states = self.encoder( 2025-08-14T21:49:17.9389781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9390154Z layer_outputs = layer_module( 2025-08-14T21:49:17.9390535Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9390873Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9391274Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9391660Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9392043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9392408Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9392820Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9393301Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9393773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9394145Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9394282Z 2025-08-14T21:49:17.9394376Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9394705Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9395002Z return mod(**inputs) 2025-08-14T21:49:17.9395337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9395710Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9396081Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9396441Z hidden_states = self.encoder( 2025-08-14T21:49:17.9396807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9397180Z layer_outputs = layer_module( 2025-08-14T21:49:17.9397506Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9397830Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9398196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9398571Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9398940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9399297Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9399692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9400130Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9400532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9400935Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9401288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9401607Z return self.act(input) 2025-08-14T21:49:17.9401711Z 2025-08-14T21:49:17.9401808Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9402146Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9402459Z return mod(**inputs) 2025-08-14T21:49:17.9402805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9403172Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9403578Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9403944Z hidden_states = self.encoder( 2025-08-14T21:49:17.9404314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9404675Z layer_outputs = layer_module( 2025-08-14T21:49:17.9404989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9405322Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9405682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9406077Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9406444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9406795Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9407191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9407640Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9408055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9408425Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9408557Z 2025-08-14T21:49:17.9408655Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9408993Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9409302Z return mod(**inputs) 2025-08-14T21:49:17.9409648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9410041Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9410287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9410361Z hidden_states = self.encoder( 2025-08-14T21:49:17.9410600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9410669Z layer_outputs = layer_module( 2025-08-14T21:49:17.9410874Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9410947Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9411198Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9411272Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9411521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9411594Z self_outputs = self.self( 2025-08-14T21:49:17.9411842Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9411934Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9411937Z 2025-08-14T21:49:17.9412033Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9412220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9412289Z return mod(**inputs) 2025-08-14T21:49:17.9412536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9412618Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9412883Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9412967Z hidden_states = self.encoder( 2025-08-14T21:49:17.9413242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9413307Z layer_outputs = layer_module( 2025-08-14T21:49:17.9413516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9413594Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9413844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9413944Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9414191Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9414256Z self_outputs = self.self( 2025-08-14T21:49:17.9414514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9414593Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9414596Z 2025-08-14T21:49:17.9414698Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9414884Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9414944Z return mod(**inputs) 2025-08-14T21:49:17.9415200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9415272Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9415522Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9415593Z hidden_states = self.encoder( 2025-08-14T21:49:17.9415843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9415914Z layer_outputs = layer_module( 2025-08-14T21:49:17.9416122Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9416192Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9416447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9416518Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9416773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9416840Z self_outputs = self.self( 2025-08-14T21:49:17.9417087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9417182Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9417186Z 2025-08-14T21:49:17.9417260Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9417333Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9417438Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9417622Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9417691Z return mod(**inputs) 2025-08-14T21:49:17.9417940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9418015Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9418270Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9418333Z hidden_states = self.encoder( 2025-08-14T21:49:17.9418622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9418699Z layer_outputs = layer_module( 2025-08-14T21:49:17.9418933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9419011Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9419263Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9419337Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9419593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9419674Z self_outputs = self.self( 2025-08-14T21:49:17.9419933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9420030Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9420035Z 2025-08-14T21:49:17.9420110Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9420215Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9420401Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9420461Z return mod(**inputs) 2025-08-14T21:49:17.9420717Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9420789Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9421042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9421108Z hidden_states = self.encoder( 2025-08-14T21:49:17.9421357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9421435Z layer_outputs = layer_module( 2025-08-14T21:49:17.9421642Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9421715Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9421972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9422045Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9422299Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9422362Z self_outputs = self.self( 2025-08-14T21:49:17.9422610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9422769Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9423021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9423100Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9423103Z 2025-08-14T21:49:17.9423198Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9423385Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9423456Z return mod(**inputs) 2025-08-14T21:49:17.9423705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9423779Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9424035Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9424099Z hidden_states = self.encoder( 2025-08-14T21:49:17.9424388Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9424455Z layer_outputs = layer_module( 2025-08-14T21:49:17.9424683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9424827Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9425083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9425164Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9425415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9425502Z self_outputs = self.self( 2025-08-14T21:49:17.9425759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9425909Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9426160Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9426236Z x = self.pointwise(x) 2025-08-14T21:49:17.9426240Z 2025-08-14T21:49:17.9426337Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9426530Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9426591Z return mod(**inputs) 2025-08-14T21:49:17.9426841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9426924Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9427172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9427246Z hidden_states = self.encoder( 2025-08-14T21:49:17.9427496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9427560Z layer_outputs = layer_module( 2025-08-14T21:49:17.9427774Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9427854Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9428099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9428176Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9428419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9428488Z self_outputs = self.self( 2025-08-14T21:49:17.9428735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9428875Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9428880Z 2025-08-14T21:49:17.9428980Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9429159Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9429225Z return mod(**inputs) 2025-08-14T21:49:17.9429470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9429541Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9429790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9429853Z hidden_states = self.encoder( 2025-08-14T21:49:17.9430132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9430203Z layer_outputs = layer_module( 2025-08-14T21:49:17.9430421Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9430498Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9430737Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9430807Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9431053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9431138Z self_outputs = self.self( 2025-08-14T21:49:17.9431386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9431494Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9431500Z 2025-08-14T21:49:17.9431593Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9431782Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9431840Z return mod(**inputs) 2025-08-14T21:49:17.9432083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9432162Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9432408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9432480Z hidden_states = self.encoder( 2025-08-14T21:49:17.9432721Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9432786Z layer_outputs = layer_module( 2025-08-14T21:49:17.9433000Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9433072Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9433313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9433391Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9433631Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9433700Z self_outputs = self.self( 2025-08-14T21:49:17.9433945Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9434061Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9434065Z 2025-08-14T21:49:17.9434143Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9434215Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9434317Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9434497Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9434557Z return mod(**inputs) 2025-08-14T21:49:17.9434808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9434881Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9435123Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9435199Z hidden_states = self.encoder( 2025-08-14T21:49:17.9435440Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9435509Z layer_outputs = layer_module( 2025-08-14T21:49:17.9435742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9435828Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9436077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9436149Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9436398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9436461Z self_outputs = self.self( 2025-08-14T21:49:17.9436702Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9436831Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9436835Z 2025-08-14T21:49:17.9436929Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9437114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9437182Z return mod(**inputs) 2025-08-14T21:49:17.9437427Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9437507Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9437750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9437812Z hidden_states = self.encoder( 2025-08-14T21:49:17.9438064Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9438127Z layer_outputs = layer_module( 2025-08-14T21:49:17.9438336Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9438409Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9438657Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9438737Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9438980Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9439099Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9439351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9439429Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9439432Z 2025-08-14T21:49:17.9439530Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9439713Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9439773Z return mod(**inputs) 2025-08-14T21:49:17.9440021Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9440095Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9440345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9440407Z hidden_states = self.encoder( 2025-08-14T21:49:17.9440648Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9440715Z layer_outputs = layer_module( 2025-08-14T21:49:17.9440919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9440988Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9441280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9441359Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9441627Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9441696Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9441973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9442088Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9442332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9442430Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9442434Z 2025-08-14T21:49:17.9442526Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9442707Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9442770Z return mod(**inputs) 2025-08-14T21:49:17.9443015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9443087Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9443333Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9443395Z hidden_states = self.encoder( 2025-08-14T21:49:17.9443645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9443708Z layer_outputs = layer_module( 2025-08-14T21:49:17.9443910Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9443988Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9444232Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9444316Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9444557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9444626Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9444907Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9445017Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9445260Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9445369Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9445568Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9445642Z return self.act(input) 2025-08-14T21:49:17.9445645Z 2025-08-14T21:49:17.9445737Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9445920Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9445987Z return mod(**inputs) 2025-08-14T21:49:17.9446231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9446313Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9446558Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9446623Z hidden_states = self.encoder( 2025-08-14T21:49:17.9446908Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9446974Z layer_outputs = layer_module( 2025-08-14T21:49:17.9447193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9447272Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9447514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9447595Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9447833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9447923Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9448204Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9448328Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9448577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9448651Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9448654Z 2025-08-14T21:49:17.9448746Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9448934Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9448993Z return mod(**inputs) 2025-08-14T21:49:17.9449237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9449320Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9449561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9449631Z hidden_states = self.encoder( 2025-08-14T21:49:17.9449875Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9449939Z layer_outputs = layer_module( 2025-08-14T21:49:17.9450146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9450215Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9450467Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9450540Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9450781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9450852Z self_outputs = self.self( 2025-08-14T21:49:17.9451098Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9451181Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9451185Z 2025-08-14T21:49:17.9451285Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9451465Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9451530Z return mod(**inputs) 2025-08-14T21:49:17.9451771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9451845Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9452093Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9452159Z hidden_states = self.encoder( 2025-08-14T21:49:17.9452398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9452503Z layer_outputs = layer_module( 2025-08-14T21:49:17.9452706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9452802Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9453043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9453116Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9453368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9453432Z self_outputs = self.self( 2025-08-14T21:49:17.9453701Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9453775Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9453779Z 2025-08-14T21:49:17.9453874Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9454060Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9454120Z return mod(**inputs) 2025-08-14T21:49:17.9454360Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9454439Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9454678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9454747Z hidden_states = self.encoder( 2025-08-14T21:49:17.9454988Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9455051Z layer_outputs = layer_module( 2025-08-14T21:49:17.9455259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9455327Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9455581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9455652Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9455909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9455977Z self_outputs = self.self( 2025-08-14T21:49:17.9456218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9456302Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9456312Z 2025-08-14T21:49:17.9456383Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9456453Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9456555Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9456733Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9456792Z return mod(**inputs) 2025-08-14T21:49:17.9457040Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9457111Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9457350Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9457420Z hidden_states = self.encoder( 2025-08-14T21:49:17.9457660Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9457729Z layer_outputs = layer_module( 2025-08-14T21:49:17.9457930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9458030Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9458281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9458370Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9458617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9458679Z self_outputs = self.self( 2025-08-14T21:49:17.9458949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9459080Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9459083Z 2025-08-14T21:49:17.9459158Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9459256Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9459456Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9459519Z return mod(**inputs) 2025-08-14T21:49:17.9459781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9459859Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9460112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9460187Z hidden_states = self.encoder( 2025-08-14T21:49:17.9460438Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9460514Z layer_outputs = layer_module( 2025-08-14T21:49:17.9460723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9460798Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9461060Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9461138Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9461389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9461464Z self_outputs = self.self( 2025-08-14T21:49:17.9461718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9461882Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9462137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9462212Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9462216Z 2025-08-14T21:49:17.9462322Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9462513Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9462586Z return mod(**inputs) 2025-08-14T21:49:17.9462840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9462919Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9463179Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9463248Z hidden_states = self.encoder( 2025-08-14T21:49:17.9463503Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9463579Z layer_outputs = layer_module( 2025-08-14T21:49:17.9463818Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9463899Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9464165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9464239Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9464494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9464557Z self_outputs = self.self( 2025-08-14T21:49:17.9464870Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9465052Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9465300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9465375Z x = self.pointwise(x) 2025-08-14T21:49:17.9465379Z 2025-08-14T21:49:17.9465474Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9465663Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9465731Z return mod(**inputs) 2025-08-14T21:49:17.9465981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9466062Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9466313Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9466380Z hidden_states = self.encoder( 2025-08-14T21:49:17.9466637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9466703Z layer_outputs = layer_module( 2025-08-14T21:49:17.9466912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9466992Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9467238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9467322Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9467569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9467633Z self_outputs = self.self( 2025-08-14T21:49:17.9467887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9468031Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9468035Z 2025-08-14T21:49:17.9468136Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9468325Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9468385Z return mod(**inputs) 2025-08-14T21:49:17.9468641Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9468714Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9468963Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9469034Z hidden_states = self.encoder( 2025-08-14T21:49:17.9469281Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9469355Z layer_outputs = layer_module( 2025-08-14T21:49:17.9469560Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9469666Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9469928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9470020Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9470278Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9470343Z self_outputs = self.self( 2025-08-14T21:49:17.9470593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9470729Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9470733Z 2025-08-14T21:49:17.9470826Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9471014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9471082Z return mod(**inputs) 2025-08-14T21:49:17.9471331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9471413Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9471662Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9471726Z hidden_states = self.encoder( 2025-08-14T21:49:17.9471982Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9472045Z layer_outputs = layer_module( 2025-08-14T21:49:17.9472261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9472333Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9472586Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9472668Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9472917Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9472981Z self_outputs = self.self( 2025-08-14T21:49:17.9473236Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9473352Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9473355Z 2025-08-14T21:49:17.9473437Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9473511Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9473606Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9473797Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9473860Z return mod(**inputs) 2025-08-14T21:49:17.9474110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9474194Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9474442Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9474514Z hidden_states = self.encoder( 2025-08-14T21:49:17.9474763Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9474827Z layer_outputs = layer_module( 2025-08-14T21:49:17.9475042Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9475113Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9475403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9475481Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9475747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9475821Z self_outputs = self.self( 2025-08-14T21:49:17.9476068Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9476172Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9476184Z 2025-08-14T21:49:17.9476278Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9476479Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9476544Z return mod(**inputs) 2025-08-14T21:49:17.9476797Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9476872Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9477131Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9477197Z hidden_states = self.encoder( 2025-08-14T21:49:17.9477452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9477517Z layer_outputs = layer_module( 2025-08-14T21:49:17.9477731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9477809Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9478048Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9478119Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9478374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9478496Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9478750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9478827Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9478831Z 2025-08-14T21:49:17.9478922Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9479116Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9479176Z return mod(**inputs) 2025-08-14T21:49:17.9479430Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9479504Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9479754Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9479827Z hidden_states = self.encoder( 2025-08-14T21:49:17.9480073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9480136Z layer_outputs = layer_module( 2025-08-14T21:49:17.9480357Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9480425Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9480671Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9480747Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9481014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9481092Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9481389Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9481506Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9481744Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9481817Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9481820Z 2025-08-14T21:49:17.9481919Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9482114Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9482174Z return mod(**inputs) 2025-08-14T21:49:17.9482426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9482499Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9482749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9482812Z hidden_states = self.encoder( 2025-08-14T21:49:17.9483055Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9483126Z layer_outputs = layer_module( 2025-08-14T21:49:17.9483326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9483404Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9483645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9483719Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9483968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9484039Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9484314Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9484429Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9484807Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9484922Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9485121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9485187Z return self.act(input) 2025-08-14T21:49:17.9485191Z 2025-08-14T21:49:17.9485290Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9485472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9485545Z return mod(**inputs) 2025-08-14T21:49:17.9485786Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9485862Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9486116Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9486181Z hidden_states = self.encoder( 2025-08-14T21:49:17.9486426Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9486500Z layer_outputs = layer_module( 2025-08-14T21:49:17.9486705Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9486841Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9487099Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9487198Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9487443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9487510Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9487788Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9487934Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9488176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9488259Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9488265Z 2025-08-14T21:49:17.9488360Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9488540Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9488607Z return mod(**inputs) 2025-08-14T21:49:17.9488849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9488930Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9489171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9489238Z hidden_states = self.encoder( 2025-08-14T21:49:17.9489486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9489550Z layer_outputs = layer_module( 2025-08-14T21:49:17.9489760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9489831Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9490073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9490154Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9490394Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9490456Z self_outputs = self.self( 2025-08-14T21:49:17.9490703Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9490787Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9490791Z 2025-08-14T21:49:17.9490894Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9491084Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9491145Z return mod(**inputs) 2025-08-14T21:49:17.9491398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9491472Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9491718Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9491791Z hidden_states = self.encoder( 2025-08-14T21:49:17.9492038Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9492110Z layer_outputs = layer_module( 2025-08-14T21:49:17.9492320Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9492406Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9492688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9492777Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9493024Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9493086Z self_outputs = self.self( 2025-08-14T21:49:17.9493327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9493406Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9493425Z 2025-08-14T21:49:17.9493519Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9493701Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9493769Z return mod(**inputs) 2025-08-14T21:49:17.9494022Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9494105Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9494355Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9494420Z hidden_states = self.encoder( 2025-08-14T21:49:17.9494678Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9494745Z layer_outputs = layer_module( 2025-08-14T21:49:17.9494961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9495033Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9495284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9495367Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9495617Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9495682Z self_outputs = self.self( 2025-08-14T21:49:17.9495942Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9496027Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9496031Z 2025-08-14T21:49:17.9496113Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9496184Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9496279Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9496472Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9496532Z return mod(**inputs) 2025-08-14T21:49:17.9496781Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9496865Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9497118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9497190Z hidden_states = self.encoder( 2025-08-14T21:49:17.9497439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9497503Z layer_outputs = layer_module( 2025-08-14T21:49:17.9497716Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9497789Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9498044Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9498148Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9498399Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9498489Z self_outputs = self.self( 2025-08-14T21:49:17.9498741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9498837Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9498846Z 2025-08-14T21:49:17.9498919Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9499010Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9499220Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9499280Z return mod(**inputs) 2025-08-14T21:49:17.9499534Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9499615Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9499864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9499935Z hidden_states = self.encoder( 2025-08-14T21:49:17.9500185Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9500249Z layer_outputs = layer_module( 2025-08-14T21:49:17.9500460Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9500531Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9500779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9500861Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9501112Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9501184Z self_outputs = self.self( 2025-08-14T21:49:17.9501431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9501582Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9501836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9501906Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9501911Z 2025-08-14T21:49:17.9502014Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9502197Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9502257Z return mod(**inputs) 2025-08-14T21:49:17.9502514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9502590Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9502836Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9502909Z hidden_states = self.encoder( 2025-08-14T21:49:17.9503154Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9503224Z layer_outputs = layer_module( 2025-08-14T21:49:17.9503431Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9503504Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9503760Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9503871Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9504121Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9504210Z self_outputs = self.self( 2025-08-14T21:49:17.9504464Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9504621Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9504931Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9505022Z x = self.pointwise(x) 2025-08-14T21:49:17.9505025Z 2025-08-14T21:49:17.9505131Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9505319Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9505389Z return mod(**inputs) 2025-08-14T21:49:17.9505643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9505720Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9505978Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9506045Z hidden_states = self.encoder( 2025-08-14T21:49:17.9506295Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9506370Z layer_outputs = layer_module( 2025-08-14T21:49:17.9506581Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9506662Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9506914Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9506992Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9507250Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9507315Z self_outputs = self.self( 2025-08-14T21:49:17.9507584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9507725Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9507729Z 2025-08-14T21:49:17.9507824Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9508014Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9508072Z return mod(**inputs) 2025-08-14T21:49:17.9508319Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9508399Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9508644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9508713Z hidden_states = self.encoder( 2025-08-14T21:49:17.9508957Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9509019Z layer_outputs = layer_module( 2025-08-14T21:49:17.9509230Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9509302Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9509554Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9509658Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9509906Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9509993Z self_outputs = self.self( 2025-08-14T21:49:17.9510242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9510349Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9510360Z 2025-08-14T21:49:17.9510451Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9510633Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9510716Z return mod(**inputs) 2025-08-14T21:49:17.9510956Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9511029Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9511277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9511345Z hidden_states = self.encoder( 2025-08-14T21:49:17.9511602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9511667Z layer_outputs = layer_module( 2025-08-14T21:49:17.9511912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9511999Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9512240Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9512312Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9512563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9512625Z self_outputs = self.self( 2025-08-14T21:49:17.9512871Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9512985Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9512988Z 2025-08-14T21:49:17.9513059Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9513135Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9513225Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9513411Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9513469Z return mod(**inputs) 2025-08-14T21:49:17.9513709Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9513790Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9514030Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9514095Z hidden_states = self.encoder( 2025-08-14T21:49:17.9514344Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9514406Z layer_outputs = layer_module( 2025-08-14T21:49:17.9514612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9514680Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9514919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9514998Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9515265Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9515329Z self_outputs = self.self( 2025-08-14T21:49:17.9515596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9515700Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9515703Z 2025-08-14T21:49:17.9515802Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9515981Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9516040Z return mod(**inputs) 2025-08-14T21:49:17.9516288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9516376Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9516637Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9516700Z hidden_states = self.encoder( 2025-08-14T21:49:17.9516948Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9517018Z layer_outputs = layer_module( 2025-08-14T21:49:17.9517221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9517289Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9517541Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9517612Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9517882Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9518005Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9518259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9518345Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9518349Z 2025-08-14T21:49:17.9518452Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9518640Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9518700Z return mod(**inputs) 2025-08-14T21:49:17.9518943Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9519024Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9519266Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9519330Z hidden_states = self.encoder( 2025-08-14T21:49:17.9519585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9519648Z layer_outputs = layer_module( 2025-08-14T21:49:17.9519866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9519934Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9520254Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9520338Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9520575Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9520653Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9520949Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9521078Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9521347Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9521422Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9521425Z 2025-08-14T21:49:17.9521519Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9521709Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9521769Z return mod(**inputs) 2025-08-14T21:49:17.9522032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9522127Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9522369Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9522446Z hidden_states = self.encoder( 2025-08-14T21:49:17.9522690Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9522763Z layer_outputs = layer_module( 2025-08-14T21:49:17.9522966Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9523035Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9523284Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9523361Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9523602Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9523679Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9523955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9524072Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9524315Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9524420Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9524626Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9524690Z return self.act(input) 2025-08-14T21:49:17.9524693Z 2025-08-14T21:49:17.9524793Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9524973Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9525034Z return mod(**inputs) 2025-08-14T21:49:17.9525285Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9525361Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9525604Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9525675Z hidden_states = self.encoder( 2025-08-14T21:49:17.9525919Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9525989Z layer_outputs = layer_module( 2025-08-14T21:49:17.9526190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9526261Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9526511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9526628Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9526872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9526957Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9527227Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9527354Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9527596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9527688Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9527691Z 2025-08-14T21:49:17.9527791Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9527972Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9528041Z return mod(**inputs) 2025-08-14T21:49:17.9528287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9528361Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9528612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9528676Z hidden_states = self.encoder( 2025-08-14T21:49:17.9528927Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9528990Z layer_outputs = layer_module( 2025-08-14T21:49:17.9529195Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9529269Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9529516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9529590Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9529844Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9529907Z self_outputs = self.self( 2025-08-14T21:49:17.9530161Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9530243Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9530246Z 2025-08-14T21:49:17.9530339Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9530529Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9530588Z return mod(**inputs) 2025-08-14T21:49:17.9530837Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9530918Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9531165Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9531237Z hidden_states = self.encoder( 2025-08-14T21:49:17.9531490Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9531555Z layer_outputs = layer_module( 2025-08-14T21:49:17.9531771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9531844Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9532110Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9532185Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9532463Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9532552Z self_outputs = self.self( 2025-08-14T21:49:17.9532793Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9532866Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9532877Z 2025-08-14T21:49:17.9532968Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9533149Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9533233Z return mod(**inputs) 2025-08-14T21:49:17.9533487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9533561Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9533835Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9533899Z hidden_states = self.encoder( 2025-08-14T21:49:17.9534151Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9534213Z layer_outputs = layer_module( 2025-08-14T21:49:17.9534419Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9534494Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9534741Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9534815Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9535075Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9535140Z self_outputs = self.self( 2025-08-14T21:49:17.9535403Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9535489Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9535493Z 2025-08-14T21:49:17.9535567Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9535646Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9535742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9535930Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9535996Z return mod(**inputs) 2025-08-14T21:49:17.9536253Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9536336Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9536593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9536661Z hidden_states = self.encoder( 2025-08-14T21:49:17.9536923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9536987Z layer_outputs = layer_module( 2025-08-14T21:49:17.9537203Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9537275Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9537528Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9537610Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9537865Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9537958Z self_outputs = self.self( 2025-08-14T21:49:17.9538216Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9538332Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9538336Z 2025-08-14T21:49:17.9538422Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9538523Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9538719Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9538791Z return mod(**inputs) 2025-08-14T21:49:17.9539054Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9539158Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9539479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9539561Z hidden_states = self.encoder( 2025-08-14T21:49:17.9539840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9539912Z layer_outputs = layer_module( 2025-08-14T21:49:17.9540144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9540231Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9540516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9540609Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9540898Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9540973Z self_outputs = self.self( 2025-08-14T21:49:17.9541269Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9541445Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9541740Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9541821Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9541825Z 2025-08-14T21:49:17.9541935Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9542156Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9542226Z return mod(**inputs) 2025-08-14T21:49:17.9542527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9542622Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9542928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9543018Z hidden_states = self.encoder( 2025-08-14T21:49:17.9543298Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9543368Z layer_outputs = layer_module( 2025-08-14T21:49:17.9543607Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9543685Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9543979Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9544067Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9544349Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9544473Z self_outputs = self.self( 2025-08-14T21:49:17.9544814Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9545017Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9545312Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9545386Z x = self.pointwise(x) 2025-08-14T21:49:17.9545390Z 2025-08-14T21:49:17.9545507Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9545721Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9545820Z return mod(**inputs) 2025-08-14T21:49:17.9546129Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9546206Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9546459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9546534Z hidden_states = self.encoder( 2025-08-14T21:49:17.9546787Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9546861Z layer_outputs = layer_module( 2025-08-14T21:49:17.9547073Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9547144Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9547408Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9547482Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9547749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9547814Z self_outputs = self.self( 2025-08-14T21:49:17.9548072Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9548227Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9548231Z 2025-08-14T21:49:17.9548327Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9548524Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9548583Z return mod(**inputs) 2025-08-14T21:49:17.9548840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9548924Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9549181Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9549246Z hidden_states = self.encoder( 2025-08-14T21:49:17.9549511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9549575Z layer_outputs = layer_module( 2025-08-14T21:49:17.9549794Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9549865Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9550118Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9550199Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9550452Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9550538Z self_outputs = self.self( 2025-08-14T21:49:17.9550821Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9550952Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9550955Z 2025-08-14T21:49:17.9551059Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9551265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9551330Z return mod(**inputs) 2025-08-14T21:49:17.9551612Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9551714Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9552018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9552091Z hidden_states = self.encoder( 2025-08-14T21:49:17.9552380Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9552474Z layer_outputs = layer_module( 2025-08-14T21:49:17.9552706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9552784Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9553069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9553150Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9553444Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9553516Z self_outputs = self.self( 2025-08-14T21:49:17.9553805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9553949Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9553954Z 2025-08-14T21:49:17.9554037Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9554126Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9554235Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9554448Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9554524Z return mod(**inputs) 2025-08-14T21:49:17.9554812Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9554898Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9555193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9555267Z hidden_states = self.encoder( 2025-08-14T21:49:17.9555565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9555639Z layer_outputs = layer_module( 2025-08-14T21:49:17.9555872Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9555963Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9556211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9556285Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9556561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9556634Z self_outputs = self.self( 2025-08-14T21:49:17.9556968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9557091Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9557110Z 2025-08-14T21:49:17.9557219Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9557442Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9557511Z return mod(**inputs) 2025-08-14T21:49:17.9557804Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9557889Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9558172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9558273Z hidden_states = self.encoder( 2025-08-14T21:49:17.9558561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9558638Z layer_outputs = layer_module( 2025-08-14T21:49:17.9558885Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9558966Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9559259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9559343Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9559628Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9559777Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9560065Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9560161Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9560167Z 2025-08-14T21:49:17.9560277Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9560492Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9560567Z return mod(**inputs) 2025-08-14T21:49:17.9560854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9560937Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9561229Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9561306Z hidden_states = self.encoder( 2025-08-14T21:49:17.9561597Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9561671Z layer_outputs = layer_module( 2025-08-14T21:49:17.9561913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9562000Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9562297Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9562391Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9562677Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9562758Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9563087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9563219Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9563539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9563652Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9563672Z 2025-08-14T21:49:17.9563784Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9564006Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9564075Z return mod(**inputs) 2025-08-14T21:49:17.9564378Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9564477Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9564735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9564825Z hidden_states = self.encoder( 2025-08-14T21:49:17.9565079Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9565148Z layer_outputs = layer_module( 2025-08-14T21:49:17.9565365Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9565439Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9565692Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9565781Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9566031Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9566109Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9566396Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9566509Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9566772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9566879Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9567090Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9567157Z return self.act(input) 2025-08-14T21:49:17.9567160Z 2025-08-14T21:49:17.9567257Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9567453Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9567514Z return mod(**inputs) 2025-08-14T21:49:17.9567770Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9567854Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9568109Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9568183Z hidden_states = self.encoder( 2025-08-14T21:49:17.9568439Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9568504Z layer_outputs = layer_module( 2025-08-14T21:49:17.9568719Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9568792Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9569043Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9569131Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9569377Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9569455Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9570340Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9570497Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9570764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9570841Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9570845Z 2025-08-14T21:49:17.9570951Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9571140Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9571223Z return mod(**inputs) 2025-08-14T21:49:17.9571489Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9571568Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9571830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9571900Z hidden_states = self.encoder( 2025-08-14T21:49:17.9572153Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9572226Z layer_outputs = layer_module( 2025-08-14T21:49:17.9572437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9572509Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9572773Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9572850Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9573113Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9573179Z self_outputs = self.self( 2025-08-14T21:49:17.9573437Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9573531Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9573535Z 2025-08-14T21:49:17.9573630Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9573826Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9573887Z return mod(**inputs) 2025-08-14T21:49:17.9574139Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9574221Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9574477Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9574543Z hidden_states = self.encoder( 2025-08-14T21:49:17.9574805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9574871Z layer_outputs = layer_module( 2025-08-14T21:49:17.9575087Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9575158Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9575410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9575493Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9575747Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9575812Z self_outputs = self.self( 2025-08-14T21:49:17.9576104Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9576202Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9576205Z 2025-08-14T21:49:17.9576309Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9576512Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9576573Z return mod(**inputs) 2025-08-14T21:49:17.9576830Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9576904Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9577174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9577239Z hidden_states = self.encoder( 2025-08-14T21:49:17.9577487Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9577558Z layer_outputs = layer_module( 2025-08-14T21:49:17.9577764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9577835Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9578089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9578162Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9578417Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9578482Z self_outputs = self.self( 2025-08-14T21:49:17.9578728Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9578822Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9578825Z 2025-08-14T21:49:17.9578900Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9578981Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9579077Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9579260Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9579329Z return mod(**inputs) 2025-08-14T21:49:17.9579585Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9579661Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9579923Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9579990Z hidden_states = self.encoder( 2025-08-14T21:49:17.9580252Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9580318Z layer_outputs = layer_module( 2025-08-14T21:49:17.9580529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9580609Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9580861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9580936Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9581196Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9581262Z self_outputs = self.self( 2025-08-14T21:49:17.9581523Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9581635Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9581657Z 2025-08-14T21:49:17.9581735Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9581858Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9582049Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9582118Z return mod(**inputs) 2025-08-14T21:49:17.9582374Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9582450Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9582707Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9582792Z hidden_states = self.encoder( 2025-08-14T21:49:17.9583051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9583127Z layer_outputs = layer_module( 2025-08-14T21:49:17.9583342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9583422Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9583682Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9583757Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9584018Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9584082Z self_outputs = self.self( 2025-08-14T21:49:17.9584342Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9584519Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9585007Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9585105Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9585109Z 2025-08-14T21:49:17.9585218Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9585431Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9585510Z return mod(**inputs) 2025-08-14T21:49:17.9585811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9585902Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9586168Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9586233Z hidden_states = self.encoder( 2025-08-14T21:49:17.9586492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9586557Z layer_outputs = layer_module( 2025-08-14T21:49:17.9586764Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9586842Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9587088Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9587169Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9587418Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9587483Z self_outputs = self.self( 2025-08-14T21:49:17.9587735Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9587934Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9588192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9588282Z x = self.pointwise(x) 2025-08-14T21:49:17.9588286Z 2025-08-14T21:49:17.9588381Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9588573Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9588633Z return mod(**inputs) 2025-08-14T21:49:17.9588879Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9588985Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9589238Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9589312Z hidden_states = self.encoder( 2025-08-14T21:49:17.9589564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9589631Z layer_outputs = layer_module( 2025-08-14T21:49:17.9589846Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9589916Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9590171Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9590244Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9590494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9590564Z self_outputs = self.self( 2025-08-14T21:49:17.9590813Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9590955Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9590967Z 2025-08-14T21:49:17.9591060Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9591247Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9591314Z return mod(**inputs) 2025-08-14T21:49:17.9591563Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9591637Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9591896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9591959Z hidden_states = self.encoder( 2025-08-14T21:49:17.9592218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9592282Z layer_outputs = layer_module( 2025-08-14T21:49:17.9592492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9592569Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9592817Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9592890Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9593146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9593213Z self_outputs = self.self( 2025-08-14T21:49:17.9593470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9593581Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9593623Z 2025-08-14T21:49:17.9593721Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9593929Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9593989Z return mod(**inputs) 2025-08-14T21:49:17.9594244Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9594320Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9594629Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9594719Z hidden_states = self.encoder( 2025-08-14T21:49:17.9594977Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9595197Z layer_outputs = layer_module( 2025-08-14T21:49:17.9595413Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9595483Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9595746Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9595820Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9596077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9596151Z self_outputs = self.self( 2025-08-14T21:49:17.9596410Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9596538Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9596542Z 2025-08-14T21:49:17.9596618Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9596694Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9596798Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9596988Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9597048Z return mod(**inputs) 2025-08-14T21:49:17.9597317Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9597391Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9597652Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9597718Z hidden_states = self.encoder( 2025-08-14T21:49:17.9598019Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9598089Z layer_outputs = layer_module( 2025-08-14T21:49:17.9598296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9598373Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9598622Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9598694Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9598950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9599012Z self_outputs = self.self( 2025-08-14T21:49:17.9599261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9599371Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9599375Z 2025-08-14T21:49:17.9599469Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9599690Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9599767Z return mod(**inputs) 2025-08-14T21:49:17.9600014Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9600096Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9600341Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9600411Z hidden_states = self.encoder( 2025-08-14T21:49:17.9600663Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9600746Z layer_outputs = layer_module( 2025-08-14T21:49:17.9600961Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9601034Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9601280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9601364Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9601611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9601740Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9601989Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9602067Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9602070Z 2025-08-14T21:49:17.9602171Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9602356Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9602425Z return mod(**inputs) 2025-08-14T21:49:17.9602674Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9602748Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9603005Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9603068Z hidden_states = self.encoder( 2025-08-14T21:49:17.9603308Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9603378Z layer_outputs = layer_module( 2025-08-14T21:49:17.9603584Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9603661Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9603912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9603989Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9604249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9604317Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9604588Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9604707Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9604955Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9605038Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9605042Z 2025-08-14T21:49:17.9605137Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9605355Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9605425Z return mod(**inputs) 2025-08-14T21:49:17.9605688Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9605768Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9606015Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9606078Z hidden_states = self.encoder( 2025-08-14T21:49:17.9606332Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9606414Z layer_outputs = layer_module( 2025-08-14T21:49:17.9606621Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9606700Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9606950Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9607038Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9607288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9607356Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9607636Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9607744Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9607996Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9608098Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9608296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9608369Z return self.act(input) 2025-08-14T21:49:17.9608373Z 2025-08-14T21:49:17.9608465Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9608646Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9608713Z return mod(**inputs) 2025-08-14T21:49:17.9608954Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9609037Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9609282Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9609345Z hidden_states = self.encoder( 2025-08-14T21:49:17.9609595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9609659Z layer_outputs = layer_module( 2025-08-14T21:49:17.9609866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9609936Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9610176Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9610256Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9610493Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9610563Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9610849Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9611007Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9611279Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9611370Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9611373Z 2025-08-14T21:49:17.9611466Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9611652Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9611709Z return mod(**inputs) 2025-08-14T21:49:17.9611958Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9612050Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9612289Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9612360Z hidden_states = self.encoder( 2025-08-14T21:49:17.9612606Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9612672Z layer_outputs = layer_module( 2025-08-14T21:49:17.9612886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9612957Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9613210Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9613284Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9613532Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9613605Z self_outputs = self.self( 2025-08-14T21:49:17.9613854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9613945Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9613950Z 2025-08-14T21:49:17.9614045Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9614238Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9614305Z return mod(**inputs) 2025-08-14T21:49:17.9614545Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9614617Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9614864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9614928Z hidden_states = self.encoder( 2025-08-14T21:49:17.9615175Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9615240Z layer_outputs = layer_module( 2025-08-14T21:49:17.9615443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9615521Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9615768Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9615847Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9616095Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9616158Z self_outputs = self.self( 2025-08-14T21:49:17.9616412Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9616486Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9616489Z 2025-08-14T21:49:17.9616638Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9616830Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9616908Z return mod(**inputs) 2025-08-14T21:49:17.9617163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9617237Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9617486Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9617556Z hidden_states = self.encoder( 2025-08-14T21:49:17.9617823Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9617893Z layer_outputs = layer_module( 2025-08-14T21:49:17.9618102Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9618173Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9618429Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9618503Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9618749Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9618822Z self_outputs = self.self( 2025-08-14T21:49:17.9619070Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9619164Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9619168Z 2025-08-14T21:49:17.9619243Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9619316Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9619421Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9619601Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9619662Z return mod(**inputs) 2025-08-14T21:49:17.9619921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9619993Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9620251Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9620314Z hidden_states = self.encoder( 2025-08-14T21:49:17.9620561Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9620635Z layer_outputs = layer_module( 2025-08-14T21:49:17.9620840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9620919Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9621167Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9621240Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9621494Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9621558Z self_outputs = self.self( 2025-08-14T21:49:17.9621808Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9621912Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9621915Z 2025-08-14T21:49:17.9621987Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9622089Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9622305Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9622366Z return mod(**inputs) 2025-08-14T21:49:17.9622644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9622719Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9622973Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9623046Z hidden_states = self.encoder( 2025-08-14T21:49:17.9623300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9623389Z layer_outputs = layer_module( 2025-08-14T21:49:17.9623595Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9623665Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9623928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9624004Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9624261Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9624325Z self_outputs = self.self( 2025-08-14T21:49:17.9624577Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9624733Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9625053Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9625127Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9625139Z 2025-08-14T21:49:17.9625240Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9625426Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9625498Z return mod(**inputs) 2025-08-14T21:49:17.9625750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9625825Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9626083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9626148Z hidden_states = self.encoder( 2025-08-14T21:49:17.9626407Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9626473Z layer_outputs = layer_module( 2025-08-14T21:49:17.9626684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9626764Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9627013Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9627088Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9639912Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9640076Z self_outputs = self.self( 2025-08-14T21:49:17.9640395Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9640583Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9640850Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9641042Z x = self.pointwise(x) 2025-08-14T21:49:17.9641050Z 2025-08-14T21:49:17.9641165Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9641398Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9641476Z return mod(**inputs) 2025-08-14T21:49:17.9641742Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9641831Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9642103Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9642205Z hidden_states = self.encoder( 2025-08-14T21:49:17.9642470Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9642536Z layer_outputs = layer_module( 2025-08-14T21:49:17.9642756Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9642845Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9643111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9643187Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9643443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9643510Z self_outputs = self.self( 2025-08-14T21:49:17.9643772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9643922Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9643926Z 2025-08-14T21:49:17.9644031Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9644234Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9644299Z return mod(**inputs) 2025-08-14T21:49:17.9644562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9644640Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9644897Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9644974Z hidden_states = self.encoder( 2025-08-14T21:49:17.9645235Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9645302Z layer_outputs = layer_module( 2025-08-14T21:49:17.9645521Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9645598Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9645866Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9645941Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9646187Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9646259Z self_outputs = self.self( 2025-08-14T21:49:17.9646514Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9646636Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9646640Z 2025-08-14T21:49:17.9646737Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9646932Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9647035Z return mod(**inputs) 2025-08-14T21:49:17.9647288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9647384Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9647644Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9647712Z hidden_states = self.encoder( 2025-08-14T21:49:17.9647971Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9648067Z layer_outputs = layer_module( 2025-08-14T21:49:17.9648277Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9648356Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9648609Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9648705Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9648946Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9649010Z self_outputs = self.self( 2025-08-14T21:49:17.9649259Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9649375Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9649379Z 2025-08-14T21:49:17.9649456Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9649537Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9649632Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9649823Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9649887Z return mod(**inputs) 2025-08-14T21:49:17.9650132Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9650215Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9650459Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9650522Z hidden_states = self.encoder( 2025-08-14T21:49:17.9650771Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9650835Z layer_outputs = layer_module( 2025-08-14T21:49:17.9651051Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9651120Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9651363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9651443Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9651683Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9651753Z self_outputs = self.self( 2025-08-14T21:49:17.9651997Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9652101Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9652104Z 2025-08-14T21:49:17.9652204Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9652388Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9652446Z return mod(**inputs) 2025-08-14T21:49:17.9652727Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9652802Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9653067Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9653130Z hidden_states = self.encoder( 2025-08-14T21:49:17.9653372Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9653440Z layer_outputs = layer_module( 2025-08-14T21:49:17.9653643Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9653737Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9653981Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9654053Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9654304Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9654425Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9654668Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9654752Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9654756Z 2025-08-14T21:49:17.9654847Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9655039Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9655101Z return mod(**inputs) 2025-08-14T21:49:17.9655345Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9655425Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9655670Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9655743Z hidden_states = self.encoder( 2025-08-14T21:49:17.9655992Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9656056Z layer_outputs = layer_module( 2025-08-14T21:49:17.9656272Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9656344Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9656596Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9656684Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9656935Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9657017Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9657305Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9657424Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9657686Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9657762Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9657766Z 2025-08-14T21:49:17.9657870Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9658062Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9658124Z return mod(**inputs) 2025-08-14T21:49:17.9658414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9658491Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9658759Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9658831Z hidden_states = self.encoder( 2025-08-14T21:49:17.9659080Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9659154Z layer_outputs = layer_module( 2025-08-14T21:49:17.9659363Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9659453Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9659706Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9659781Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9660032Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9660105Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9660386Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9660504Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9660755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9660861Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9661069Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9661135Z return self.act(input) 2025-08-14T21:49:17.9661138Z 2025-08-14T21:49:17.9661243Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9661436Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9661498Z return mod(**inputs) 2025-08-14T21:49:17.9661750Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9661827Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9662083Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9662148Z hidden_states = self.encoder( 2025-08-14T21:49:17.9662398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9662473Z layer_outputs = layer_module( 2025-08-14T21:49:17.9662681Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9662762Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9663009Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9663087Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9663337Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9663406Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9663684Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9663819Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9664071Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9664153Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9664185Z 2025-08-14T21:49:17.9664282Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9664482Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9664552Z return mod(**inputs) 2025-08-14T21:49:17.9664895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9664988Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9665246Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9665334Z hidden_states = self.encoder( 2025-08-14T21:49:17.9665600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9665666Z layer_outputs = layer_module( 2025-08-14T21:49:17.9665892Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9665975Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9666225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9666307Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9666557Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9666625Z self_outputs = self.self( 2025-08-14T21:49:17.9666886Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9666976Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9666979Z 2025-08-14T21:49:17.9667083Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9667273Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9667337Z return mod(**inputs) 2025-08-14T21:49:17.9667592Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9667664Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9667911Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9667984Z hidden_states = self.encoder( 2025-08-14T21:49:17.9668231Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9668304Z layer_outputs = layer_module( 2025-08-14T21:49:17.9668510Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9668582Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9668841Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9668916Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9669169Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9669233Z self_outputs = self.self( 2025-08-14T21:49:17.9669479Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9669562Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9669566Z 2025-08-14T21:49:17.9669661Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9669843Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9669912Z return mod(**inputs) 2025-08-14T21:49:17.9670190Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9670298Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9670552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9670616Z hidden_states = self.encoder( 2025-08-14T21:49:17.9670873Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9670934Z layer_outputs = layer_module( 2025-08-14T21:49:17.9671141Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9671236Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9671482Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9671563Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9671811Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9671875Z self_outputs = self.self( 2025-08-14T21:49:17.9672126Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9672222Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9672225Z 2025-08-14T21:49:17.9672307Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9672378Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9672473Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9672662Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9672723Z return mod(**inputs) 2025-08-14T21:49:17.9672972Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9673055Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9673300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9673373Z hidden_states = self.encoder( 2025-08-14T21:49:17.9673618Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9673683Z layer_outputs = layer_module( 2025-08-14T21:49:17.9673895Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9673968Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9674211Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9674295Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9674539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9674613Z self_outputs = self.self( 2025-08-14T21:49:17.9674860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9674957Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9674961Z 2025-08-14T21:49:17.9675042Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9675136Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9675328Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9675388Z return mod(**inputs) 2025-08-14T21:49:17.9675665Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9675760Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9676020Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9676083Z hidden_states = self.encoder( 2025-08-14T21:49:17.9676331Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9676394Z layer_outputs = layer_module( 2025-08-14T21:49:17.9676600Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9676686Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9676928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9677008Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9677249Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9677321Z self_outputs = self.self( 2025-08-14T21:49:17.9677564Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9677715Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9677970Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9678040Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9678045Z 2025-08-14T21:49:17.9678139Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9678331Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9678391Z return mod(**inputs) 2025-08-14T21:49:17.9678646Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9678722Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9678968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9679039Z hidden_states = self.encoder( 2025-08-14T21:49:17.9679287Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9679355Z layer_outputs = layer_module( 2025-08-14T21:49:17.9679559Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9679632Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9679887Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9679963Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9680208Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9680279Z self_outputs = self.self( 2025-08-14T21:49:17.9680527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9680679Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9680986Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9681062Z x = self.pointwise(x) 2025-08-14T21:49:17.9681066Z 2025-08-14T21:49:17.9681164Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9681360Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9681446Z return mod(**inputs) 2025-08-14T21:49:17.9681698Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9681788Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9682046Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9682110Z hidden_states = self.encoder( 2025-08-14T21:49:17.9682368Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9682452Z layer_outputs = layer_module( 2025-08-14T21:49:17.9682655Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9682733Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9682976Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9683049Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9683296Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9683359Z self_outputs = self.self( 2025-08-14T21:49:17.9683611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9683756Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9683761Z 2025-08-14T21:49:17.9683855Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9684044Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9684105Z return mod(**inputs) 2025-08-14T21:49:17.9684356Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9684440Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9684825Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9684903Z hidden_states = self.encoder( 2025-08-14T21:49:17.9685157Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9685222Z layer_outputs = layer_module( 2025-08-14T21:49:17.9685447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9685520Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9685772Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9685848Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9686108Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9686185Z self_outputs = self.self( 2025-08-14T21:49:17.9686445Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9686566Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9686570Z 2025-08-14T21:49:17.9686682Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9686866Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9686929Z return mod(**inputs) 2025-08-14T21:49:17.9687192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9687333Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9687593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9687684Z hidden_states = self.encoder( 2025-08-14T21:49:17.9687933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9688007Z layer_outputs = layer_module( 2025-08-14T21:49:17.9688214Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9688284Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9688569Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9688639Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9688889Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9688953Z self_outputs = self.self( 2025-08-14T21:49:17.9689200Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9689328Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9689332Z 2025-08-14T21:49:17.9689404Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9689482Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9689576Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9689759Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9689826Z return mod(**inputs) 2025-08-14T21:49:17.9690074Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9690150Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9690414Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9690477Z hidden_states = self.encoder( 2025-08-14T21:49:17.9690724Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9690786Z layer_outputs = layer_module( 2025-08-14T21:49:17.9690987Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9691060Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9691300Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9691373Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9691620Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9691681Z self_outputs = self.self( 2025-08-14T21:49:17.9691928Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9692028Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9692032Z 2025-08-14T21:49:17.9692123Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9692308Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9692366Z return mod(**inputs) 2025-08-14T21:49:17.9692610Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9692682Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9692964Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9693035Z hidden_states = self.encoder( 2025-08-14T21:49:17.9693288Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9693347Z layer_outputs = layer_module( 2025-08-14T21:49:17.9693551Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9693617Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9693862Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9693951Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9694193Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9694319Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9694565Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9694648Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9694651Z 2025-08-14T21:49:17.9694742Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9694922Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9694987Z return mod(**inputs) 2025-08-14T21:49:17.9695226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9695301Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9695548Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9695611Z hidden_states = self.encoder( 2025-08-14T21:49:17.9695859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9695923Z layer_outputs = layer_module( 2025-08-14T21:49:17.9696125Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9696201Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9696443Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9696523Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9696761Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9696830Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9697111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9697224Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9697469Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9697551Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9697554Z 2025-08-14T21:49:17.9697647Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9697835Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9697895Z return mod(**inputs) 2025-08-14T21:49:17.9698137Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9698218Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9698476Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9698564Z hidden_states = self.encoder( 2025-08-14T21:49:17.9698806Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9698884Z layer_outputs = layer_module( 2025-08-14T21:49:17.9699091Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9699159Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9699398Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9699494Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9699730Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9699803Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9700078Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9700186Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9700434Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9700533Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9700733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9700798Z return self.act(input) 2025-08-14T21:49:17.9700801Z 2025-08-14T21:49:17.9700895Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9701083Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9701143Z return mod(**inputs) 2025-08-14T21:49:17.9701397Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9701481Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9701731Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9701801Z hidden_states = self.encoder( 2025-08-14T21:49:17.9702050Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9702113Z layer_outputs = layer_module( 2025-08-14T21:49:17.9702326Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9702398Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9702645Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9702729Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9702968Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9703048Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9703327Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9703451Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9703712Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9703788Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9703791Z 2025-08-14T21:49:17.9703893Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9704076Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9704165Z return mod(**inputs) 2025-08-14T21:49:17.9704422Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9704513Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9704832Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9704906Z hidden_states = self.encoder( 2025-08-14T21:49:17.9705156Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9705226Z layer_outputs = layer_module( 2025-08-14T21:49:17.9705454Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9705525Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9705852Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9705931Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9706189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9706253Z self_outputs = self.self( 2025-08-14T21:49:17.9706502Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 350, in forward 2025-08-14T21:49:17.9706594Z mixed_query_layer = self.query(hidden_states) 2025-08-14T21:49:17.9706598Z 2025-08-14T21:49:17.9706695Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9706891Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9706951Z return mod(**inputs) 2025-08-14T21:49:17.9707206Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9707290Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9707540Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9707604Z hidden_states = self.encoder( 2025-08-14T21:49:17.9707861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9707924Z layer_outputs = layer_module( 2025-08-14T21:49:17.9708136Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9708207Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9708456Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9708537Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9708789Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9708854Z self_outputs = self.self( 2025-08-14T21:49:17.9709111Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 344, in forward 2025-08-14T21:49:17.9709186Z mixed_key_layer = self.key(hidden_states) 2025-08-14T21:49:17.9709189Z 2025-08-14T21:49:17.9709289Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9709473Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9709535Z return mod(**inputs) 2025-08-14T21:49:17.9709790Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9709863Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9710145Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9710228Z hidden_states = self.encoder( 2025-08-14T21:49:17.9710478Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9710549Z layer_outputs = layer_module( 2025-08-14T21:49:17.9710755Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9710824Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9711077Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9711166Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9711420Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9711487Z self_outputs = self.self( 2025-08-14T21:49:17.9711733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 345, in forward 2025-08-14T21:49:17.9711828Z mixed_value_layer = self.value(hidden_states) 2025-08-14T21:49:17.9711831Z 2025-08-14T21:49:17.9711903Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9711980Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9712074Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9712259Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9712326Z return mod(**inputs) 2025-08-14T21:49:17.9712576Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9712649Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9712905Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9712971Z hidden_states = self.encoder( 2025-08-14T21:49:17.9713225Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9713287Z layer_outputs = layer_module( 2025-08-14T21:49:17.9713492Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9713568Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9713815Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9713890Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9714140Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9714205Z self_outputs = self.self( 2025-08-14T21:49:17.9714455Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 366, in forward 2025-08-14T21:49:17.9714552Z conv_out_layer = self.conv_out_layer(hidden_states) 2025-08-14T21:49:17.9714555Z 2025-08-14T21:49:17.9714627Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9714726Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9714911Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9714977Z return mod(**inputs) 2025-08-14T21:49:17.9715221Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9715295Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9715546Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9715647Z hidden_states = self.encoder( 2025-08-14T21:49:17.9715896Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9715986Z layer_outputs = layer_module( 2025-08-14T21:49:17.9716192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9716268Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9716516Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9716607Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9716859Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9716923Z self_outputs = self.self( 2025-08-14T21:49:17.9717174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9717334Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9717583Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 282, in forward 2025-08-14T21:49:17.9717659Z x = self.depthwise(hidden_states) 2025-08-14T21:49:17.9717663Z 2025-08-14T21:49:17.9717759Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9717943Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9718015Z return mod(**inputs) 2025-08-14T21:49:17.9718262Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9718343Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9718593Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9718660Z hidden_states = self.encoder( 2025-08-14T21:49:17.9718915Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9718979Z layer_outputs = layer_module( 2025-08-14T21:49:17.9719182Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9719259Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9719511Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9719592Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9719833Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9719898Z self_outputs = self.self( 2025-08-14T21:49:17.9720144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 347, in forward 2025-08-14T21:49:17.9720287Z mixed_key_conv_attn_layer = self.key_conv_attn_layer(hidden_states.transpose(1, 2)) 2025-08-14T21:49:17.9720536Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 283, in forward 2025-08-14T21:49:17.9720596Z x = self.pointwise(x) 2025-08-14T21:49:17.9720600Z 2025-08-14T21:49:17.9720690Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9720873Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9720933Z return mod(**inputs) 2025-08-14T21:49:17.9721172Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9721279Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9721527Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9721611Z hidden_states = self.encoder( 2025-08-14T21:49:17.9721864Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9721928Z layer_outputs = layer_module( 2025-08-14T21:49:17.9722144Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9722214Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9722496Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9722566Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9722805Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9722877Z self_outputs = self.self( 2025-08-14T21:49:17.9723115Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 360, in forward 2025-08-14T21:49:17.9723253Z conv_attn_layer = torch.multiply(mixed_key_conv_attn_layer, mixed_query_layer) 2025-08-14T21:49:17.9723263Z 2025-08-14T21:49:17.9723355Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9723534Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9723599Z return mod(**inputs) 2025-08-14T21:49:17.9723843Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9723914Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9724163Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9724227Z hidden_states = self.encoder( 2025-08-14T21:49:17.9724472Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9724533Z layer_outputs = layer_module( 2025-08-14T21:49:17.9724766Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9724841Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9725089Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9725162Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9725415Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9725479Z self_outputs = self.self( 2025-08-14T21:49:17.9725733Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 362, in forward 2025-08-14T21:49:17.9725846Z conv_kernel_layer = self.conv_kernel_layer(conv_attn_layer) 2025-08-14T21:49:17.9725849Z 2025-08-14T21:49:17.9725943Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9726134Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9726193Z return mod(**inputs) 2025-08-14T21:49:17.9726447Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9726523Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9726779Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9726849Z hidden_states = self.encoder( 2025-08-14T21:49:17.9727117Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9727194Z layer_outputs = layer_module( 2025-08-14T21:49:17.9727405Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9727474Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9727723Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9727794Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9728056Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9728127Z self_outputs = self.self( 2025-08-14T21:49:17.9728376Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 380, in forward 2025-08-14T21:49:17.9728505Z conv_out_layer = torch.matmul(conv_out_layer, conv_kernel_layer) 2025-08-14T21:49:17.9728510Z 2025-08-14T21:49:17.9728586Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9728662Z cudagraph partition due to non gpu ops 2025-08-14T21:49:17.9728766Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9728957Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9729018Z return mod(**inputs) 2025-08-14T21:49:17.9729280Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9729359Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9729624Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9729689Z hidden_states = self.encoder( 2025-08-14T21:49:17.9729933Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9730008Z layer_outputs = layer_module( 2025-08-14T21:49:17.9730209Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9730279Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9730529Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9730602Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9730856Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 464, in forward 2025-08-14T21:49:17.9730923Z self_outputs = self.self( 2025-08-14T21:49:17.9731174Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 405, in forward 2025-08-14T21:49:17.9731287Z context_layer = torch.cat([context_layer, conv_out], 2) 2025-08-14T21:49:17.9731292Z 2025-08-14T21:49:17.9731388Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9731581Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9731643Z return mod(**inputs) 2025-08-14T21:49:17.9731893Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9731977Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9732226Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9732295Z hidden_states = self.encoder( 2025-08-14T21:49:17.9732552Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9732646Z layer_outputs = layer_module( 2025-08-14T21:49:17.9732860Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9732946Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9733192Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 561, in forward 2025-08-14T21:49:17.9733270Z self_attention_outputs = self.attention( 2025-08-14T21:49:17.9733519Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 471, in forward 2025-08-14T21:49:17.9733670Z attention_output = self.output(self_outputs[0], hidden_states) 2025-08-14T21:49:17.9733921Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 425, in forward 2025-08-14T21:49:17.9733997Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9734002Z 2025-08-14T21:49:17.9734104Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9734289Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9734348Z return mod(**inputs) 2025-08-14T21:49:17.9734611Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9734683Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9734940Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9735006Z hidden_states = self.encoder( 2025-08-14T21:49:17.9735255Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9735325Z layer_outputs = layer_module( 2025-08-14T21:49:17.9735537Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9735615Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9735863Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9735939Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9736189Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9736258Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9736539Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9736668Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9736913Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 513, in forward 2025-08-14T21:49:17.9736992Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9736997Z 2025-08-14T21:49:17.9737088Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9737265Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9737330Z return mod(**inputs) 2025-08-14T21:49:17.9737573Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9737652Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9737900Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9737965Z hidden_states = self.encoder( 2025-08-14T21:49:17.9738218Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9738310Z layer_outputs = layer_module( 2025-08-14T21:49:17.9738520Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9738613Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9738861Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9738943Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9739186Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9739255Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9739562Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 593, in feed_forward_chunk 2025-08-14T21:49:17.9739673Z intermediate_output = self.intermediate(attention_output) 2025-08-14T21:49:17.9739930Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 514, in forward 2025-08-14T21:49:17.9740034Z hidden_states = self.intermediate_act_fn(hidden_states) 2025-08-14T21:49:17.9740234Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/activations.py", line 69, in forward 2025-08-14T21:49:17.9740306Z return self.act(input) 2025-08-14T21:49:17.9740310Z 2025-08-14T21:49:17.9740403Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9740589Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9740657Z return mod(**inputs) 2025-08-14T21:49:17.9740909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 925, in forward 2025-08-14T21:49:17.9740990Z generator_hidden_states = self.convbert( 2025-08-14T21:49:17.9741242Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 853, in forward 2025-08-14T21:49:17.9741308Z hidden_states = self.encoder( 2025-08-14T21:49:17.9741567Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 625, in forward 2025-08-14T21:49:17.9741632Z layer_outputs = layer_module( 2025-08-14T21:49:17.9741840Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/modeling_layers.py", line 94, in __call__ 2025-08-14T21:49:17.9741919Z return super().__call__(*args, **kwargs) 2025-08-14T21:49:17.9742170Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 586, in forward 2025-08-14T21:49:17.9742254Z layer_output = apply_chunking_to_forward( 2025-08-14T21:49:17.9742495Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/pytorch_utils.py", line 251, in apply_chunking_to_forward 2025-08-14T21:49:17.9742567Z return forward_fn(*input_tensors) 2025-08-14T21:49:17.9742854Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 594, in feed_forward_chunk 2025-08-14T21:49:17.9742979Z layer_output = self.output(intermediate_output, attention_output) 2025-08-14T21:49:17.9743237Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 531, in forward 2025-08-14T21:49:17.9743312Z hidden_states = self.dense(hidden_states) 2025-08-14T21:49:17.9743315Z 2025-08-14T21:49:17.9743409Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9743606Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9743667Z return mod(**inputs) 2025-08-14T21:49:17.9743925Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 938, in forward 2025-08-14T21:49:17.9744098Z prediction_scores = self.generator_predictions(generator_sequence_output) 2025-08-14T21:49:17.9744351Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 876, in forward 2025-08-14T21:49:17.9744469Z hidden_states = self.dense(generator_hidden_states) 2025-08-14T21:49:17.9744472Z 2025-08-14T21:49:17.9744566Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9744818Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9744895Z return mod(**inputs) 2025-08-14T21:49:17.9745146Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 939, in forward 2025-08-14T21:49:17.9745307Z prediction_scores = self.generator_lm_head(prediction_scores) 2025-08-14T21:49:17.9745311Z 2025-08-14T21:49:17.9745404Z cudagraph partition due to non gpu ops. Found from : 2025-08-14T21:49:17.9745591Z File "/var/lib/jenkins/workspace/benchmarks/dynamo/huggingface.py", line 532, in forward_pass 2025-08-14T21:49:17.9745659Z return mod(**inputs) 2025-08-14T21:49:17.9745909Z File "/opt/conda/envs/py_3.9/lib/python3.9/site-packages/transformers/models/convbert/modeling_convbert.py", line 945, in forward 2025-08-14T21:49:17.9746073Z loss = loss_fct(prediction_scores.view(-1, self.config.vocab_size), labels.view(-1)) 2025-08-14T21:49:17.9746076Z 2025-08-14T21:49:26.3419234Z Compilation time (from dynamo_timed): 18.285329356 2025-08-14T21:49:26.3473659Z pass 2025-08-14T21:49:26.3476116Z WARNING:common:Trying to call the empty_gpu_cache for device: cpu, which is not in list [cuda, xpu] 2025-08-14T21:49:26.3476927Z TIMING: _recursive_pre_grad_passes:0.00847 _recursive_joint_graph_passes:0.54551 _recursive_post_grad_passes:0.16152 async_compile.wait:0.53949 code_gen:7.9083 inductor_compile:9.97013 backend_compile:14.47951 gc:0.00101 entire_frame_compile:18.28533 total_wall_time:18.28533 2025-08-14T21:49:26.3477792Z STATS: call_* op count: 634 | FakeTensorMode.__torch_dispatch__:23085 | FakeTensor.__torch_dispatch__:7564 | ProxyTorchDispatchMode.__torch_dispatch__:8630 2025-08-14T21:49:26.3478269Z Dynamo produced 1 graphs covering 634 ops with 0 graph breaks (0 unique) 2025-08-14T21:49:27.7007079Z accuracy pass_rate=95.35% 2025-08-14T21:49:27.7014596Z calls_captured gmean=0.00x mean=609.233x 2025-08-14T21:49:27.7016811Z unique_graphs gmean=0.00x mean=1.093x 2025-08-14T21:49:27.7021419Z graph_breaks gmean=0.00x mean=0.140x 2025-08-14T21:49:27.7023443Z unique_graph_breaks gmean=0.00x mean=0.047x 2025-08-14T21:49:27.7028072Z autograd_captures gmean=0.00x mean=0.000x 2025-08-14T21:49:27.7032307Z autograd_compiles gmean=0.00x mean=0.000x 2025-08-14T21:49:27.7036458Z cudagraph_skips gmean=0.00x mean=1.093x 2025-08-14T21:49:27.7040596Z compilation_latency mean=17.766 seconds 2025-08-14T21:49:28.3506448Z + python benchmarks/dynamo/check_accuracy.py --actual /var/lib/jenkins/workspace/test/test-reports/inference_huggingface.csv --expected benchmarks/dynamo/ci_expected_accuracy/dynamic_cpu_inductor_huggingface_inference.csv 2025-08-14T21:49:28.6152061Z AlbertForMaskedLM PASS 2025-08-14T21:49:28.6156349Z AlbertForQuestionAnswering PASS 2025-08-14T21:49:28.6158381Z AllenaiLongformerBase PASS 2025-08-14T21:49:28.6162874Z BartForCausalLM PASS 2025-08-14T21:49:28.6165044Z BartForConditionalGeneration PASS 2025-08-14T21:49:28.6169945Z BertForMaskedLM PASS 2025-08-14T21:49:28.6174412Z BertForQuestionAnswering PASS 2025-08-14T21:49:28.6174791Z BlenderbotForCausalLM XFAIL 2025-08-14T21:49:28.6175038Z BlenderbotSmallForCausalLM PASS 2025-08-14T21:49:28.6180315Z BlenderbotSmallForConditionalGeneration PASS 2025-08-14T21:49:28.6184983Z CamemBert PASS 2025-08-14T21:49:28.6189201Z DebertaV2ForMaskedLM XFAIL 2025-08-14T21:49:28.6193581Z DebertaV2ForQuestionAnswering PASS 2025-08-14T21:49:28.6198540Z DistilBertForMaskedLM PASS 2025-08-14T21:49:28.6202594Z DistilBertForQuestionAnswering PASS 2025-08-14T21:49:28.6204907Z DistillGPT2 PASS 2025-08-14T21:49:28.6209430Z ElectraForCausalLM PASS 2025-08-14T21:49:28.6213664Z ElectraForQuestionAnswering PASS 2025-08-14T21:49:28.6215636Z GPT2ForSequenceClassification PASS 2025-08-14T21:49:28.6219815Z GoogleFnet PASS 2025-08-14T21:49:28.6221963Z LayoutLMForMaskedLM PASS 2025-08-14T21:49:28.6222319Z LayoutLMForSequenceClassification PASS 2025-08-14T21:49:28.6226990Z M2M100ForConditionalGeneration PASS 2025-08-14T21:49:28.6231917Z MBartForCausalLM PASS 2025-08-14T21:49:28.6232334Z MBartForConditionalGeneration PASS 2025-08-14T21:49:28.6232568Z MT5ForConditionalGeneration PASS 2025-08-14T21:49:28.6232763Z MegatronBertForCausalLM PASS 2025-08-14T21:49:28.6233057Z MegatronBertForQuestionAnswering PASS 2025-08-14T21:49:28.6242516Z MobileBertForMaskedLM PASS 2025-08-14T21:49:28.6244466Z MobileBertForQuestionAnswering PASS 2025-08-14T21:49:28.6249884Z OPTForCausalLM PASS 2025-08-14T21:49:28.6254225Z PLBartForCausalLM PASS 2025-08-14T21:49:28.6258699Z PLBartForConditionalGeneration PASS 2025-08-14T21:49:28.6260723Z PegasusForCausalLM PASS 2025-08-14T21:49:28.6261002Z PegasusForConditionalGeneration PASS 2025-08-14T21:49:28.6261216Z RobertaForCausalLM PASS 2025-08-14T21:49:28.6261417Z RobertaForQuestionAnswering PASS 2025-08-14T21:49:28.6261616Z T5ForConditionalGeneration PASS 2025-08-14T21:49:28.6269059Z T5Small PASS 2025-08-14T21:49:28.6273428Z TrOCRForCausalLM PASS 2025-08-14T21:49:28.6279726Z XGLMForCausalLM PASS 2025-08-14T21:49:28.6283980Z XLNetLMHeadModel PASS 2025-08-14T21:49:28.6288340Z YituTechConvBert PASS 2025-08-14T21:49:28.6692686Z + python benchmarks/dynamo/check_graph_breaks.py --actual /var/lib/jenkins/workspace/test/test-reports/inference_huggingface.csv --expected benchmarks/dynamo/ci_expected_accuracy/dynamic_cpu_inductor_huggingface_inference.csv 2025-08-14T21:49:28.9273792Z AlbertForMaskedLM PASS 2025-08-14T21:49:28.9275661Z AlbertForQuestionAnswering PASS 2025-08-14T21:49:28.9281508Z AllenaiLongformerBase PASS 2025-08-14T21:49:28.9285749Z BartForCausalLM PASS 2025-08-14T21:49:28.9289037Z BartForConditionalGeneration PASS 2025-08-14T21:49:28.9291127Z BertForMaskedLM PASS 2025-08-14T21:49:28.9291499Z BertForQuestionAnswering PASS 2025-08-14T21:49:28.9295783Z BlenderbotForCausalLM PASS 2025-08-14T21:49:28.9300547Z BlenderbotSmallForCausalLM PASS 2025-08-14T21:49:28.9300993Z BlenderbotSmallForConditionalGeneration PASS 2025-08-14T21:49:28.9301269Z CamemBert PASS 2025-08-14T21:49:28.9301470Z DebertaV2ForMaskedLM PASS 2025-08-14T21:49:28.9301690Z DebertaV2ForQuestionAnswering PASS 2025-08-14T21:49:28.9312800Z DistilBertForMaskedLM PASS 2025-08-14T21:49:28.9314755Z DistilBertForQuestionAnswering PASS 2025-08-14T21:49:28.9320155Z DistillGPT2 PASS 2025-08-14T21:49:28.9322178Z ElectraForCausalLM PASS 2025-08-14T21:49:28.9322534Z ElectraForQuestionAnswering PASS 2025-08-14T21:49:28.9327022Z GPT2ForSequenceClassification PASS 2025-08-14T21:49:28.9331516Z GoogleFnet PASS 2025-08-14T21:49:28.9336125Z LayoutLMForMaskedLM PASS 2025-08-14T21:49:28.9340148Z LayoutLMForSequenceClassification PASS 2025-08-14T21:49:28.9340421Z M2M100ForConditionalGeneration PASS 2025-08-14T21:49:28.9340628Z MBartForCausalLM PASS 2025-08-14T21:49:28.9345433Z MBartForConditionalGeneration PASS 2025-08-14T21:49:28.9345700Z MT5ForConditionalGeneration PASS 2025-08-14T21:49:28.9348179Z MegatronBertForCausalLM PASS 2025-08-14T21:49:28.9355399Z MegatronBertForQuestionAnswering PASS 2025-08-14T21:49:28.9357601Z MobileBertForMaskedLM PASS 2025-08-14T21:49:28.9362131Z MobileBertForQuestionAnswering PASS 2025-08-14T21:49:28.9363828Z OPTForCausalLM PASS 2025-08-14T21:49:28.9364113Z PLBartForCausalLM PASS 2025-08-14T21:49:28.9372256Z PLBartForConditionalGeneration PASS 2025-08-14T21:49:28.9374178Z PegasusForCausalLM PASS 2025-08-14T21:49:28.9374532Z PegasusForConditionalGeneration PASS 2025-08-14T21:49:28.9379350Z RobertaForCausalLM PASS 2025-08-14T21:49:28.9379765Z RobertaForQuestionAnswering PASS 2025-08-14T21:49:28.9384006Z T5ForConditionalGeneration PASS 2025-08-14T21:49:28.9384404Z T5Small PASS 2025-08-14T21:49:28.9389973Z TrOCRForCausalLM PASS 2025-08-14T21:49:28.9398404Z XGLMForCausalLM PASS_BUT_FLAKY 2025-08-14T21:49:28.9400650Z XLNetLMHeadModel PASS 2025-08-14T21:49:28.9405234Z YituTechConvBert PASS 2025-08-14T21:49:28.9807978Z + sccache_epilogue 2025-08-14T21:49:28.9810150Z + echo '::group::Sccache Compilation Log' 2025-08-14T21:49:28.9815814Z ##[group]Sccache Compilation Log 2025-08-14T21:49:28.9819592Z + echo '=================== sccache compilation log ===================' 2025-08-14T21:49:28.9820033Z =================== sccache compilation log =================== 2025-08-14T21:49:28.9820512Z + python /var/lib/jenkins/workspace/.ci/pytorch/print_sccache_log.py /var/lib/jenkins/sccache_error.log 2025-08-14T21:49:29.0015327Z + echo '=========== If your build fails, please take a look at the log above for possible reasons ===========' 2025-08-14T21:49:29.0017215Z =========== If your build fails, please take a look at the log above for possible reasons =========== 2025-08-14T21:49:29.0017686Z + sccache --show-stats 2025-08-14T21:49:29.0050117Z Compile requests 381 2025-08-14T21:49:29.0051651Z Compile requests executed 0 2025-08-14T21:49:29.0056650Z Cache hits 0 2025-08-14T21:49:29.0059000Z Cache misses 0 2025-08-14T21:49:29.0059271Z Cache hits rate - 2025-08-14T21:49:29.0059470Z Cache timeouts 0 2025-08-14T21:49:29.0059655Z Cache read errors 0 2025-08-14T21:49:29.0059828Z Forced recaches 0 2025-08-14T21:49:29.0060007Z Cache write errors 0 2025-08-14T21:49:29.0060197Z Cache errors 0 2025-08-14T21:49:29.0060370Z Compilations 0 2025-08-14T21:49:29.0060560Z Compilation failures 0 2025-08-14T21:49:29.0060752Z Non-cacheable compilations 0 2025-08-14T21:49:29.0060941Z Non-cacheable calls 41 2025-08-14T21:49:29.0061127Z Non-compilation calls 340 2025-08-14T21:49:29.0061316Z Unsupported compiler calls 0 2025-08-14T21:49:29.0061506Z Average cache write 0.000 s 2025-08-14T21:49:29.0061693Z Average compiler 0.000 s 2025-08-14T21:49:29.0061884Z Average cache read hit 0.000 s 2025-08-14T21:49:29.0062077Z Failed distributed compilations 0 2025-08-14T21:49:29.0062200Z 2025-08-14T21:49:29.0062266Z Non-cacheable reasons: 2025-08-14T21:49:29.0062434Z -E 41 2025-08-14T21:49:29.0062554Z 2025-08-14T21:49:29.0062706Z Cache location s3, name: ossci-compiler-cache-circleci-v2, prefix: / 2025-08-14T21:49:29.0062973Z Version (client) 0.10.0 2025-08-14T21:49:29.0063156Z + sccache --stop-server 2025-08-14T21:49:29.0067383Z Stopping sccache server... 2025-08-14T21:49:29.0073534Z Compile requests 381 2025-08-14T21:49:29.0075225Z Compile requests executed 0 2025-08-14T21:49:29.0075455Z Cache hits 0 2025-08-14T21:49:29.0075696Z Cache misses 0 2025-08-14T21:49:29.0075884Z Cache hits rate - 2025-08-14T21:49:29.0076059Z Cache timeouts 0 2025-08-14T21:49:29.0076238Z Cache read errors 0 2025-08-14T21:49:29.0076416Z Forced recaches 0 2025-08-14T21:49:29.0076595Z Cache write errors 0 2025-08-14T21:49:29.0076764Z Cache errors 0 2025-08-14T21:49:29.0076943Z Compilations 0 2025-08-14T21:49:29.0077129Z Compilation failures 0 2025-08-14T21:49:29.0077367Z Non-cacheable compilations 0 2025-08-14T21:49:29.0077556Z Non-cacheable calls 41 2025-08-14T21:49:29.0077741Z Non-compilation calls 340 2025-08-14T21:49:29.0077920Z Unsupported compiler calls 0 2025-08-14T21:49:29.0078116Z Average cache write 0.000 s 2025-08-14T21:49:29.0078315Z Average compiler 0.000 s 2025-08-14T21:49:29.0078503Z Average cache read hit 0.000 s 2025-08-14T21:49:29.0078700Z Failed distributed compilations 0 2025-08-14T21:49:29.0078834Z 2025-08-14T21:49:29.0078900Z Non-cacheable reasons: 2025-08-14T21:49:29.0079069Z -E 41 2025-08-14T21:49:29.0079186Z 2025-08-14T21:49:29.0079337Z Cache location s3, name: ossci-compiler-cache-circleci-v2, prefix: / 2025-08-14T21:49:29.0079606Z Version (client) 0.10.0 2025-08-14T21:49:29.0079840Z + echo ::endgroup:: 2025-08-14T21:49:29.0080307Z ##[endgroup] 2025-08-14T21:49:29.0080463Z + cleanup_workspace 2025-08-14T21:49:29.0080745Z + echo 'sudo may print the following warning message that can be ignored. The chown command will still run.' 2025-08-14T21:49:29.0081165Z sudo may print the following warning message that can be ignored. The chown command will still run. 2025-08-14T21:49:29.0081518Z + echo ' sudo: setrlimit(RLIMIT_STACK): Operation not permitted' 2025-08-14T21:49:29.0081782Z sudo: setrlimit(RLIMIT_STACK): Operation not permitted 2025-08-14T21:49:29.0082099Z + echo 'For more details refer to https://github.com/sudo-project/sudo/issues/42' 2025-08-14T21:49:29.0082429Z For more details refer to https://github.com/sudo-project/sudo/issues/42 2025-08-14T21:49:29.0082695Z + sudo chown -R 1000 /var/lib/jenkins/workspace 2025-08-14T21:49:29.3998951Z ##[group]Run pytorch/test-infra/.github/actions/upload-benchmark-results@main 2025-08-14T21:49:29.3999238Z with: 2025-08-14T21:49:29.3999408Z benchmark-results-dir: test/test-reports 2025-08-14T21:49:29.3999616Z dry-run: false 2025-08-14T21:49:29.3999760Z schema-version: v3 2025-08-14T21:49:29.4000107Z github-token: *** 2025-08-14T21:49:29.4000258Z env: 2025-08-14T21:49:29.4000392Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:29.4000676Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:29.4000966Z ##[endgroup] 2025-08-14T21:49:29.4024348Z ##[group]Run set -eux 2025-08-14T21:49:29.4024550Z set -eux 2025-08-14T21:49:29.4024886Z python3 -mpip install boto3==1.35.33 psutil==7.0.0 pynvml==12.0.0 2025-08-14T21:49:29.4025144Z  2025-08-14T21:49:29.4025288Z DEVICE_NAME="" 2025-08-14T21:49:29.4025447Z DEVICE_TYPE="" 2025-08-14T21:49:29.4025603Z  2025-08-14T21:49:29.4025788Z if command -v nvidia-smi; then 2025-08-14T21:49:29.4026057Z  # NB: I'm using PyTorch here to get the device name, however, it needs to 2025-08-14T21:49:29.4026375Z  # install the correct version of PyTorch manually for now. Any PyTorch 2025-08-14T21:49:29.4026683Z  # version is fine, I just use 2.7.1 to satify PYPIDEP linter 2025-08-14T21:49:29.4026933Z  python3 -mpip install torch==2.7.1 2025-08-14T21:49:29.4027137Z elif command -v rocminfo; then 2025-08-14T21:49:29.4027384Z  # NB: Installing torch on ROCm runner with pip here causes CI to fail 2025-08-14T21:49:29.4027756Z  # with a memoryview is too large error only on MI300 runners. Is pip 2025-08-14T21:49:29.4028066Z  # version on ROCm runner there too old? As a workaround, let's use the 2025-08-14T21:49:29.4028334Z  # GPU device name coming from rocminfo instead 2025-08-14T21:49:29.4028549Z  DEVICE_NAME=rocm 2025-08-14T21:49:29.4028831Z  DEVICE_TYPE=$(rocminfo | grep "Marketing Name" | tail -n1 | awk -F':' '{print $2}' | xargs) 2025-08-14T21:49:29.4029111Z fi 2025-08-14T21:49:29.4029239Z  2025-08-14T21:49:29.4029454Z echo "DEVICE_NAME=$DEVICE_NAME" >> $GITHUB_ENV 2025-08-14T21:49:29.4029688Z echo "DEVICE_TYPE=$DEVICE_TYPE" >> $GITHUB_ENV 2025-08-14T21:49:29.4037109Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:29.4037325Z env: 2025-08-14T21:49:29.4037477Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:29.4037759Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:29.4038047Z ##[endgroup] 2025-08-14T21:49:29.4064481Z + python3 -mpip install boto3==1.35.33 psutil==7.0.0 pynvml==12.0.0 2025-08-14T21:49:29.5714438Z Defaulting to user installation because normal site-packages is not writeable 2025-08-14T21:49:30.2690285Z Collecting boto3==1.35.33 2025-08-14T21:49:30.2838371Z Downloading boto3-1.35.33-py3-none-any.whl (139 kB) 2025-08-14T21:49:30.4846214Z Collecting psutil==7.0.0 2025-08-14T21:49:30.4874930Z Downloading psutil-7.0.0-cp36-abi3-manylinux_2_12_x86_64.manylinux2010_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (277 kB) 2025-08-14T21:49:30.5110236Z Collecting pynvml==12.0.0 2025-08-14T21:49:30.5141327Z Downloading pynvml-12.0.0-py3-none-any.whl (26 kB) 2025-08-14T21:49:30.5211390Z Requirement already satisfied: jmespath<2.0.0,>=0.7.1 in /usr/lib/python3.9/site-packages (from boto3==1.35.33) (0.10.0) 2025-08-14T21:49:30.5489956Z Collecting s3transfer<0.11.0,>=0.10.0 2025-08-14T21:49:30.5523168Z Downloading s3transfer-0.10.4-py3-none-any.whl (83 kB) 2025-08-14T21:49:31.3016373Z Collecting botocore<1.36.0,>=1.35.33 2025-08-14T21:49:31.3047929Z Downloading botocore-1.35.99-py3-none-any.whl (13.3 MB) 2025-08-14T21:49:31.4099762Z Collecting nvidia-ml-py<13.0.0a0,>=12.0.0 2025-08-14T21:49:31.4129537Z Downloading nvidia_ml_py-12.575.51-py3-none-any.whl (47 kB) 2025-08-14T21:49:31.4203894Z Requirement already satisfied: python-dateutil<3.0.0,>=2.1 in /usr/lib/python3.9/site-packages (from botocore<1.36.0,>=1.35.33->boto3==1.35.33) (2.8.1) 2025-08-14T21:49:31.4210839Z Requirement already satisfied: urllib3<1.27,>=1.25.4 in /usr/lib/python3.9/site-packages (from botocore<1.36.0,>=1.35.33->boto3==1.35.33) (1.25.10) 2025-08-14T21:49:31.5753239Z Requirement already satisfied: six>=1.5 in /usr/lib/python3.9/site-packages (from python-dateutil<3.0.0,>=2.1->botocore<1.36.0,>=1.35.33->boto3==1.35.33) (1.15.0) 2025-08-14T21:49:31.6671811Z Installing collected packages: botocore, s3transfer, nvidia-ml-py, pynvml, psutil, boto3 2025-08-14T21:49:31.9926046Z Attempting uninstall: nvidia-ml-py 2025-08-14T21:49:31.9929349Z Found existing installation: nvidia-ml-py 11.525.84 2025-08-14T21:49:31.9936401Z Uninstalling nvidia-ml-py-11.525.84: 2025-08-14T21:49:32.0052341Z Successfully uninstalled nvidia-ml-py-11.525.84 2025-08-14T21:49:32.0540730Z Attempting uninstall: psutil 2025-08-14T21:49:32.0542643Z Found existing installation: psutil 5.9.8 2025-08-14T21:49:32.0583299Z Uninstalling psutil-5.9.8: 2025-08-14T21:49:32.0587449Z Successfully uninstalled psutil-5.9.8 2025-08-14T21:49:32.1856307Z Successfully installed boto3-1.35.33 botocore-1.35.99 nvidia-ml-py-12.575.51 psutil-7.0.0 pynvml-12.0.0 s3transfer-0.10.4 2025-08-14T21:49:32.2871403Z + DEVICE_NAME= 2025-08-14T21:49:32.2876023Z + DEVICE_TYPE= 2025-08-14T21:49:32.2880092Z + command -v nvidia-smi 2025-08-14T21:49:32.2883644Z + command -v rocminfo 2025-08-14T21:49:32.2887758Z + echo DEVICE_NAME= 2025-08-14T21:49:32.2891115Z + echo DEVICE_TYPE= 2025-08-14T21:49:32.2915087Z ##[group]Run set -eux 2025-08-14T21:49:32.2915273Z set -eux 2025-08-14T21:49:32.2915416Z  2025-08-14T21:49:32.2915564Z if [[ -z "${GITHUB_TOKEN}" ]]; then 2025-08-14T21:49:32.2915777Z  echo "Missing github-token input" 2025-08-14T21:49:32.2915963Z  exit 1 2025-08-14T21:49:32.2916107Z fi 2025-08-14T21:49:32.2921892Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:32.2922126Z env: 2025-08-14T21:49:32.2922270Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:32.2922548Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:32.2922897Z DEVICE_NAME: 2025-08-14T21:49:32.2923039Z DEVICE_TYPE: 2025-08-14T21:49:32.2923382Z GITHUB_TOKEN: *** 2025-08-14T21:49:32.2923535Z ##[endgroup] 2025-08-14T21:49:32.2947933Z + [[ -z *** ]] 2025-08-14T21:49:32.2989741Z ##[group]Run pytorch/test-infra/.github/actions/get-workflow-job-id@main 2025-08-14T21:49:32.2990009Z with: 2025-08-14T21:49:32.2990293Z github-token: *** 2025-08-14T21:49:32.2990434Z env: 2025-08-14T21:49:32.2990576Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:32.2990858Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:32.2991139Z DEVICE_NAME: 2025-08-14T21:49:32.2991284Z DEVICE_TYPE: 2025-08-14T21:49:32.2991440Z ##[endgroup] 2025-08-14T21:49:32.3014813Z ##[group]Run set -eux 2025-08-14T21:49:32.3014992Z set -eux 2025-08-14T21:49:32.3015137Z  2025-08-14T21:49:32.3015420Z python3 "${GITHUB_ACTION_PATH}/../../scripts/get_workflow_job_id.py" "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2025-08-14T21:49:32.3019469Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:32.3019698Z env: 2025-08-14T21:49:32.3019846Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:32.3020118Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:32.3020403Z DEVICE_NAME: 2025-08-14T21:49:32.3020557Z DEVICE_TYPE: 2025-08-14T21:49:32.3020826Z GITHUB_TOKEN: *** 2025-08-14T21:49:32.3020967Z ##[endgroup] 2025-08-14T21:49:32.3041698Z + python3 /home/ec2-user/actions-runner/_work/_actions/pytorch/test-infra/main/.github/actions/get-workflow-job-id/../../scripts/get_workflow_job_id.py 16976255153 i-0819c8fa835cec089 2025-08-14T21:49:33.3214525Z setting job-id=48128039107 2025-08-14T21:49:33.3218697Z setting job-name=linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:49:33.3310332Z ##[group]Run set -eux 2025-08-14T21:49:33.3310529Z set -eux 2025-08-14T21:49:33.3310673Z  2025-08-14T21:49:33.3310909Z python3 "${GITHUB_ACTION_PATH}/../../scripts/benchmarks/gather_metadata.py" \ 2025-08-14T21:49:33.3311198Z  --schema-version "${SCHEMA_VERSION}" \ 2025-08-14T21:49:33.3311406Z  --repo "${REPO}" \ 2025-08-14T21:49:33.3311594Z  --head-branch "${HEAD_BRANCH}" \ 2025-08-14T21:49:33.3311788Z  --head-sha "${HEAD_SHA}" \ 2025-08-14T21:49:33.3311991Z  --workflow-id "${WORKFLOW_RUN_ID}" \ 2025-08-14T21:49:33.3312202Z  --run-attempt "${RUN_ATTEMPT}" \ 2025-08-14T21:49:33.3312395Z  --job-id "${JOB_ID}" \ 2025-08-14T21:49:33.3312571Z  --job-name "${JOB_NAME}" 2025-08-14T21:49:33.3316663Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:33.3316884Z env: 2025-08-14T21:49:33.3317024Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:33.3317302Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:33.3317593Z DEVICE_NAME: 2025-08-14T21:49:33.3317732Z DEVICE_TYPE: 2025-08-14T21:49:33.3317874Z SCHEMA_VERSION: v3 2025-08-14T21:49:33.3318032Z REPO: pytorch/pytorch 2025-08-14T21:49:33.3318191Z HEAD_BRANCH: refs/heads/main 2025-08-14T21:49:33.3318383Z HEAD_SHA: 1fc683cf17c8c673044538d10266c00f92987be2 2025-08-14T21:49:33.3318582Z WORKFLOW_RUN_ID: 16976255153 2025-08-14T21:49:33.3318804Z RUN_ATTEMPT: 1 2025-08-14T21:49:33.3318941Z JOB_ID: 48128039107 2025-08-14T21:49:33.3319257Z JOB_NAME: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:49:33.3319588Z ##[endgroup] 2025-08-14T21:49:33.3344166Z + python3 /home/ec2-user/actions-runner/_work/_actions/pytorch/test-infra/main/.github/actions/upload-benchmark-results/../../scripts/benchmarks/gather_metadata.py --schema-version v3 --repo pytorch/pytorch --head-branch refs/heads/main --head-sha 1fc683cf17c8c673044538d10266c00f92987be2 --workflow-id 16976255153 --run-attempt 1 --job-id 48128039107 --job-name 'linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx)' 2025-08-14T21:49:33.3593405Z ##[group]Run set -eux 2025-08-14T21:49:33.3593586Z set -eux 2025-08-14T21:49:33.3593730Z  2025-08-14T21:49:33.3593958Z python3 "${GITHUB_ACTION_PATH}/../../scripts/benchmarks/gather_runners_info.py" 2025-08-14T21:49:33.3597877Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:33.3598098Z env: 2025-08-14T21:49:33.3598238Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:33.3598510Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:33.3598807Z DEVICE_NAME: 2025-08-14T21:49:33.3598953Z DEVICE_TYPE: 2025-08-14T21:49:33.3599088Z ##[endgroup] 2025-08-14T21:49:33.3619599Z + python3 /home/ec2-user/actions-runner/_work/_actions/pytorch/test-infra/main/.github/actions/upload-benchmark-results/../../scripts/benchmarks/gather_runners_info.py 2025-08-14T21:49:33.3912201Z INFO:root:Fail to import torch to get the device name 2025-08-14T21:49:33.4007425Z ##[group]Run set -eux 2025-08-14T21:49:33.4007609Z set -eux 2025-08-14T21:49:33.4007756Z  2025-08-14T21:49:33.4007915Z # TODO (huydhn): Implement this part 2025-08-14T21:49:33.4008148Z echo "dependencies={}" >> "${GITHUB_OUTPUT}" 2025-08-14T21:49:33.4011590Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:33.4011806Z env: 2025-08-14T21:49:33.4011950Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:33.4012225Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:33.4012597Z DEVICE_NAME: 2025-08-14T21:49:33.4012739Z DEVICE_TYPE: 2025-08-14T21:49:33.4012879Z ##[endgroup] 2025-08-14T21:49:33.4032613Z + echo 'dependencies={}' 2025-08-14T21:49:33.4058723Z ##[group]Run set -eux 2025-08-14T21:49:33.4058919Z set -eux 2025-08-14T21:49:33.4059069Z  2025-08-14T21:49:33.4059233Z if [[ ! -d "${BENCHMARK_RESULTS_DIR}" ]]; then 2025-08-14T21:49:33.4059494Z  echo "${BENCHMARK_RESULTS_DIR} does not exist, skipping" 2025-08-14T21:49:33.4059772Z  # We don't want the job to fail if the directory doesn't exist 2025-08-14T21:49:33.4059993Z  exit 0 2025-08-14T21:49:33.4060126Z fi 2025-08-14T21:49:33.4060259Z  2025-08-14T21:49:33.4060409Z if [[ "${DRY_RUN}" == "true" ]]; then 2025-08-14T21:49:33.4060683Z  python3 "${GITHUB_ACTION_PATH}/../../scripts/upload_benchmark_results.py" \ 2025-08-14T21:49:33.4061002Z  --benchmark-results-dir "${BENCHMARK_RESULTS_DIR}" \ 2025-08-14T21:49:33.4061254Z  --metadata "${BENCHMARK_METADATA}" \ 2025-08-14T21:49:33.4061463Z  --runners "${RUNNER_INFO}" \ 2025-08-14T21:49:33.4061665Z  --dependencies "${DEPENDENCIES}" \ 2025-08-14T21:49:33.4061863Z  --dry-run 2025-08-14T21:49:33.4062020Z else 2025-08-14T21:49:33.4062237Z  python3 "${GITHUB_ACTION_PATH}/../../scripts/upload_benchmark_results.py" \ 2025-08-14T21:49:33.4062540Z  --benchmark-results-dir "${BENCHMARK_RESULTS_DIR}" \ 2025-08-14T21:49:33.4062780Z  --metadata "${BENCHMARK_METADATA}" \ 2025-08-14T21:49:33.4062977Z  --runners "${RUNNER_INFO}" \ 2025-08-14T21:49:33.4063235Z  --dependencies "${DEPENDENCIES}" 2025-08-14T21:49:33.4063425Z fi 2025-08-14T21:49:33.4066919Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:33.4067144Z env: 2025-08-14T21:49:33.4067289Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:33.4067564Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:33.4067848Z DEVICE_NAME: 2025-08-14T21:49:33.4067993Z DEVICE_TYPE: 2025-08-14T21:49:33.4068151Z BENCHMARK_RESULTS_DIR: test/test-reports 2025-08-14T21:49:33.4068374Z DRY_RUN: false 2025-08-14T21:49:33.4069127Z BENCHMARK_METADATA: {"timestamp": 1755208173, "schema_version": "v3", "name": "linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx)", "repo": "pytorch/pytorch", "head_branch": "refs/heads/main", "head_sha": "1fc683cf17c8c673044538d10266c00f92987be2", "workflow_id": 16976255153, "run_attempt": 1, "job_id": 48128039107} 2025-08-14T21:49:33.4070073Z RUNNER_INFO: [{"cpu_info": "x86_64", "cpu_count": 32, "avail_mem_in_gb": 123, "extra_info": {"hostname": "ip-10-0-18-145.ec2.internal"}, "name": "", "type": ""}] 2025-08-14T21:49:33.4070419Z DEPENDENCIES: {} 2025-08-14T21:49:33.4070616Z ##[endgroup] 2025-08-14T21:49:33.4091400Z + [[ ! -d test/test-reports ]] 2025-08-14T21:49:33.4093118Z + [[ false == \t\r\u\e ]] 2025-08-14T21:49:33.4094721Z + python3 /home/ec2-user/actions-runner/_work/_actions/pytorch/test-infra/main/.github/actions/upload-benchmark-results/../../scripts/upload_benchmark_results.py --benchmark-results-dir test/test-reports --metadata '{"timestamp": 1755208173, "schema_version": "v3", "name": "linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx)", "repo": "pytorch/pytorch", "head_branch": "refs/heads/main", "head_sha": "1fc683cf17c8c673044538d10266c00f92987be2", "workflow_id": 16976255153, "run_attempt": 1, "job_id": 48128039107}' --runners '[{"cpu_info": "x86_64", "cpu_count": 32, "avail_mem_in_gb": 123, "extra_info": {"hostname": "ip-10-0-18-145.ec2.internal"}, "name": "", "type": ""}]' --dependencies '{}' 2025-08-14T21:49:33.5161628Z INFO:root:Upload test/test-reports/inference_huggingface.json to s3://ossci-benchmarks/v3/pytorch/pytorch/16976255153/48128039107/inference_huggingface.json 2025-08-14T21:49:33.5424749Z INFO:botocore.credentials:Found credentials from IAM Role: gh-ci-github-action-runners-runner-role 2025-08-14T21:49:33.7291137Z ##[group]Run cat test/**/*_toprint.log || true 2025-08-14T21:49:33.7291397Z cat test/**/*_toprint.log || true 2025-08-14T21:49:33.7295252Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:33.7295479Z env: 2025-08-14T21:49:33.7295625Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:33.7295893Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:33.7296182Z DEVICE_NAME: 2025-08-14T21:49:33.7296330Z DEVICE_TYPE: 2025-08-14T21:49:33.7296484Z ##[endgroup] 2025-08-14T21:49:33.7362861Z cat: 'test/**/*_toprint.log': No such file or directory 2025-08-14T21:49:33.7397828Z ##[group]Run kill "$MONITOR_SCRIPT_PID" 2025-08-14T21:49:33.7398060Z kill "$MONITOR_SCRIPT_PID" 2025-08-14T21:49:33.7401460Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:33.7401686Z env: 2025-08-14T21:49:33.7401831Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:33.7402098Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:33.7402391Z DEVICE_NAME: 2025-08-14T21:49:33.7402549Z DEVICE_TYPE: 2025-08-14T21:49:33.7402699Z MONITOR_SCRIPT_PID: 47830 2025-08-14T21:49:33.7402860Z ##[endgroup] 2025-08-14T21:49:33.7489654Z Prepare all required actions 2025-08-14T21:49:33.7489958Z Getting action download info 2025-08-14T21:49:33.8983404Z Download action repository 'seemethere/upload-artifact-s3@v5' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2025-08-14T21:49:34.1167989Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-08-14T21:49:34.6134988Z ##[group]Run ./.github/actions/upload-test-artifacts 2025-08-14T21:49:34.6135209Z with: 2025-08-14T21:49:34.6135462Z file-suffix: test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107 2025-08-14T21:49:34.6135748Z s3-bucket: gha-artifacts 2025-08-14T21:49:34.6135916Z env: 2025-08-14T21:49:34.6136060Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:34.6136326Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:34.6136674Z DEVICE_NAME: 2025-08-14T21:49:34.6136819Z DEVICE_TYPE: 2025-08-14T21:49:34.6136954Z ##[endgroup] 2025-08-14T21:49:34.6164463Z ##[group]Run # Remove any previous test jsons if they exist 2025-08-14T21:49:34.6164752Z # Remove any previous test jsons if they exist 2025-08-14T21:49:34.6164978Z rm -f test-jsons-*.zip 2025-08-14T21:49:34.6165234Z zip -r "test-jsons-${FILE_SUFFIX}.zip" test/test-reports -i '*.json' 2025-08-14T21:49:34.6168981Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:34.6169203Z env: 2025-08-14T21:49:34.6169347Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:34.6169615Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:34.6169901Z DEVICE_NAME: 2025-08-14T21:49:34.6170050Z DEVICE_TYPE: 2025-08-14T21:49:34.6170302Z FILE_SUFFIX: test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107 2025-08-14T21:49:34.6170577Z ##[endgroup] 2025-08-14T21:49:34.6372577Z adding: test/test-reports/inference_huggingface.json (deflated 99%) 2025-08-14T21:49:34.6403810Z ##[group]Run # Remove any previous test reports if they exist 2025-08-14T21:49:34.6404101Z # Remove any previous test reports if they exist 2025-08-14T21:49:34.6404330Z rm -f test-reports-*.zip 2025-08-14T21:49:34.6404609Z zip -r "test-reports-${FILE_SUFFIX}.zip" test/test-reports -i '*.xml' -i '*.csv' 2025-08-14T21:49:34.6408159Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:34.6408380Z env: 2025-08-14T21:49:34.6408526Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:34.6408796Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:34.6409083Z DEVICE_NAME: 2025-08-14T21:49:34.6409229Z DEVICE_TYPE: 2025-08-14T21:49:34.6409480Z FILE_SUFFIX: test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107 2025-08-14T21:49:34.6409755Z ##[endgroup] 2025-08-14T21:49:34.6458446Z adding: test/test-reports/inference_huggingface.csv (deflated 69%) 2025-08-14T21:49:34.6460703Z adding: test/test-reports/inference_huggingface_graph_breaks.csv (deflated 85%) 2025-08-14T21:49:34.6461123Z adding: test/test-reports/inference_huggingface_graph_break_deduped.csv (deflated 64%) 2025-08-14T21:49:34.6488386Z ##[group]Run # Remove any previous usage logs if they exist 2025-08-14T21:49:34.6488684Z # Remove any previous usage logs if they exist 2025-08-14T21:49:34.6488910Z rm -f logs-*.zip 2025-08-14T21:49:34.6489123Z zip "logs-${FILE_SUFFIX}.zip" 'usage_log.txt' || true 2025-08-14T21:49:34.6489427Z zip -r "logs-${FILE_SUFFIX}.zip" test/test-reports -i '*.log' || true 2025-08-14T21:49:34.6492868Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:34.6493086Z env: 2025-08-14T21:49:34.6493224Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:34.6493494Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:34.6493792Z DEVICE_NAME: 2025-08-14T21:49:34.6493928Z DEVICE_TYPE: 2025-08-14T21:49:34.6494272Z FILE_SUFFIX: test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107 2025-08-14T21:49:34.6494552Z ##[endgroup] 2025-08-14T21:49:34.6552556Z adding: usage_log.txt (deflated 96%) 2025-08-14T21:49:34.6563593Z 2025-08-14T21:49:34.6566097Z zip error: Nothing to do! (logs-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip) 2025-08-14T21:49:34.6590387Z ##[group]Run # Remove any previous debugging artifacts if they exist 2025-08-14T21:49:34.6590702Z # Remove any previous debugging artifacts if they exist 2025-08-14T21:49:34.6590943Z rm -f debug-*.zip 2025-08-14T21:49:34.6591116Z if [ -d 'test/debug' ]; then 2025-08-14T21:49:34.6591334Z  zip -r "debug-${FILE_SUFFIX}.zip" test/debug 2025-08-14T21:49:34.6591540Z fi 2025-08-14T21:49:34.6594883Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:34.6595156Z env: 2025-08-14T21:49:34.6595304Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:34.6595592Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:34.6595881Z DEVICE_NAME: 2025-08-14T21:49:34.6596030Z DEVICE_TYPE: 2025-08-14T21:49:34.6596287Z FILE_SUFFIX: test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107 2025-08-14T21:49:34.6596570Z ##[endgroup] 2025-08-14T21:49:34.6668288Z ##[group]Run seemethere/upload-artifact-s3@v5 2025-08-14T21:49:34.6668492Z with: 2025-08-14T21:49:34.6668644Z s3-bucket: gha-artifacts 2025-08-14T21:49:34.6668847Z s3-prefix: pytorch/pytorch/16976255153/1/artifact 2025-08-14T21:49:34.6669046Z retention-days: 14 2025-08-14T21:49:34.6669207Z if-no-files-found: warn 2025-08-14T21:49:34.6669375Z path: test-jsons-*.zip 2025-08-14T21:49:34.6669525Z name: artifact 2025-08-14T21:49:34.6669673Z region: us-east-1 2025-08-14T21:49:34.6669814Z env: 2025-08-14T21:49:34.6669949Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:34.6670228Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:34.6670514Z DEVICE_NAME: 2025-08-14T21:49:34.6670656Z DEVICE_TYPE: 2025-08-14T21:49:34.6670790Z ##[endgroup] 2025-08-14T21:49:34.9383834Z NOTE: s3-prefix specified, ignoring name parameter 2025-08-14T21:49:34.9384161Z With the provided path, there will be 1 file uploaded 2025-08-14T21:49:34.9384460Z Uploading to s3 prefix: pytorch/pytorch/16976255153/1/artifact 2025-08-14T21:49:34.9412674Z Starting upload of test-jsons-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip 2025-08-14T21:49:35.0609146Z Finished upload of test-jsons-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip 2025-08-14T21:49:35.0791755Z ##[group]Run seemethere/upload-artifact-s3@v5 2025-08-14T21:49:35.0791965Z with: 2025-08-14T21:49:35.0792113Z s3-bucket: gha-artifacts 2025-08-14T21:49:35.0792331Z s3-prefix: pytorch/pytorch/16976255153/1/artifact 2025-08-14T21:49:35.0792539Z retention-days: 14 2025-08-14T21:49:35.0792703Z if-no-files-found: error 2025-08-14T21:49:35.0792873Z path: test-reports-*.zip 2025-08-14T21:49:35.0793030Z name: artifact 2025-08-14T21:49:35.0793166Z region: us-east-1 2025-08-14T21:49:35.0793309Z env: 2025-08-14T21:49:35.0793445Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:35.0793714Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:35.0793994Z DEVICE_NAME: 2025-08-14T21:49:35.0794133Z DEVICE_TYPE: 2025-08-14T21:49:35.0794270Z ##[endgroup] 2025-08-14T21:49:35.3159379Z NOTE: s3-prefix specified, ignoring name parameter 2025-08-14T21:49:35.3160816Z With the provided path, there will be 1 file uploaded 2025-08-14T21:49:35.3161231Z Uploading to s3 prefix: pytorch/pytorch/16976255153/1/artifact 2025-08-14T21:49:35.3186348Z Starting upload of test-reports-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip 2025-08-14T21:49:35.4230323Z Finished upload of test-reports-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip 2025-08-14T21:49:35.4387437Z ##[group]Run seemethere/upload-artifact-s3@v5 2025-08-14T21:49:35.4387649Z with: 2025-08-14T21:49:35.4387801Z s3-bucket: gha-artifacts 2025-08-14T21:49:35.4388006Z s3-prefix: pytorch/pytorch/16976255153/1/artifact 2025-08-14T21:49:35.4388208Z retention-days: 14 2025-08-14T21:49:35.4388428Z if-no-files-found: ignore 2025-08-14T21:49:35.4388599Z path: logs-*.zip 2025-08-14T21:49:35.4388737Z name: artifact 2025-08-14T21:49:35.4388881Z region: us-east-1 2025-08-14T21:49:35.4389026Z env: 2025-08-14T21:49:35.4389156Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:35.4389431Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:35.4389717Z DEVICE_NAME: 2025-08-14T21:49:35.4389861Z DEVICE_TYPE: 2025-08-14T21:49:35.4389996Z ##[endgroup] 2025-08-14T21:49:35.6715190Z NOTE: s3-prefix specified, ignoring name parameter 2025-08-14T21:49:35.6716930Z With the provided path, there will be 1 file uploaded 2025-08-14T21:49:35.6717361Z Uploading to s3 prefix: pytorch/pytorch/16976255153/1/artifact 2025-08-14T21:49:35.6745146Z Starting upload of logs-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip 2025-08-14T21:49:35.8486562Z Finished upload of logs-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip 2025-08-14T21:49:35.8646832Z ##[group]Run seemethere/upload-artifact-s3@v5 2025-08-14T21:49:35.8647035Z with: 2025-08-14T21:49:35.8647187Z s3-bucket: gha-artifacts 2025-08-14T21:49:35.8647392Z s3-prefix: pytorch/pytorch/16976255153/1/artifact 2025-08-14T21:49:35.8647592Z retention-days: 14 2025-08-14T21:49:35.8647751Z if-no-files-found: ignore 2025-08-14T21:49:35.8647919Z path: debug-*.zip 2025-08-14T21:49:35.8648056Z name: artifact 2025-08-14T21:49:35.8648200Z region: us-east-1 2025-08-14T21:49:35.8648343Z env: 2025-08-14T21:49:35.8648482Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:35.8648770Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:35.8649055Z DEVICE_NAME: 2025-08-14T21:49:35.8649198Z DEVICE_TYPE: 2025-08-14T21:49:35.8649331Z ##[endgroup] 2025-08-14T21:49:36.0965548Z No files were found with the provided path: debug-*.zip. No artifacts will be uploaded. 2025-08-14T21:49:36.1135885Z ##[group]Run # shellcheck disable=SC2156 2025-08-14T21:49:36.1136126Z # shellcheck disable=SC2156 2025-08-14T21:49:36.1136459Z find . -iname "core.[1-9]*" -exec docker exec "${DOCKER_CONTAINER_ID}" sh -c "gdb python {} -ex 'bt' -ex 'q'" \; 2025-08-14T21:49:36.1141538Z shell: /usr/bin/bash -e {0} 2025-08-14T21:49:36.1141726Z env: 2025-08-14T21:49:36.1141872Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:36.1142145Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:36.1142435Z DEVICE_NAME: 2025-08-14T21:49:36.1142589Z DEVICE_TYPE: 2025-08-14T21:49:36.1142731Z ##[endgroup] 2025-08-14T21:49:36.2830610Z Prepare all required actions 2025-08-14T21:49:36.2830867Z Getting action download info 2025-08-14T21:49:36.3842998Z ##[group]Run ./.github/actions/upload-utilization-stats 2025-08-14T21:49:36.3843220Z with: 2025-08-14T21:49:36.3843367Z job_id: 48128039107 2025-08-14T21:49:36.3843696Z job_name: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:49:36.3844035Z workflow_name: inductor 2025-08-14T21:49:36.3844202Z workflow_run_id: 16976255153 2025-08-14T21:49:36.3844369Z workflow_attempt: 1 2025-08-14T21:49:36.3844510Z env: 2025-08-14T21:49:36.3844646Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:36.3844918Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:36.3845203Z DEVICE_NAME: 2025-08-14T21:49:36.3845337Z DEVICE_TYPE: 2025-08-14T21:49:36.3845476Z ##[endgroup] 2025-08-14T21:49:36.3869846Z ##[group]Run echo "workflow_id: 16976255153" 2025-08-14T21:49:36.3870104Z echo "workflow_id: 16976255153" 2025-08-14T21:49:36.3870300Z echo "workflow_attempt: 1" 2025-08-14T21:49:36.3870487Z echo "workflow_Name: inductor" 2025-08-14T21:49:36.3870673Z echo "job_id: 48128039107" 2025-08-14T21:49:36.3871041Z echo "job_name: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx)" 2025-08-14T21:49:36.3871457Z echo "artifact_prefix: " 2025-08-14T21:49:36.3871645Z python3 --version 2025-08-14T21:49:36.3875860Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:36.3876083Z env: 2025-08-14T21:49:36.3876227Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:36.3876497Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:36.3876784Z DEVICE_NAME: 2025-08-14T21:49:36.3876931Z DEVICE_TYPE: 2025-08-14T21:49:36.3877128Z ##[endgroup] 2025-08-14T21:49:36.3897540Z workflow_id: 16976255153 2025-08-14T21:49:36.3899656Z workflow_attempt: 1 2025-08-14T21:49:36.3900013Z workflow_Name: inductor 2025-08-14T21:49:36.3900285Z job_id: 48128039107 2025-08-14T21:49:36.3900699Z job_name: linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx) 2025-08-14T21:49:36.3901050Z artifact_prefix: 2025-08-14T21:49:36.3909984Z Python 3.9.23 2025-08-14T21:49:36.3949156Z ##[group]Run nick-fields/retry@v3.0.0 2025-08-14T21:49:36.3949345Z with: 2025-08-14T21:49:36.3949484Z shell: bash 2025-08-14T21:49:36.3949628Z timeout_minutes: 5 2025-08-14T21:49:36.3949772Z max_attempts: 5 2025-08-14T21:49:36.3949929Z retry_wait_seconds: 30 2025-08-14T21:49:36.3950247Z command: set -eu python3 -m pip install python-dateutil==2.8.2 boto3==1.35.42 pandas==2.1.3 dataclasses_json==0.6.7 2025-08-14T21:49:36.3950575Z polling_interval_seconds: 1 2025-08-14T21:49:36.3950742Z warning_on_retry: true 2025-08-14T21:49:36.3950908Z continue_on_error: false 2025-08-14T21:49:36.3951075Z env: 2025-08-14T21:49:36.3951221Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:36.3951496Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:36.3951773Z DEVICE_NAME: 2025-08-14T21:49:36.3951915Z DEVICE_TYPE: 2025-08-14T21:49:36.3952057Z ##[endgroup] 2025-08-14T21:49:36.6351587Z Defaulting to user installation because normal site-packages is not writeable 2025-08-14T21:49:36.6920412Z Collecting python-dateutil==2.8.2 2025-08-14T21:49:36.7061838Z Downloading python_dateutil-2.8.2-py2.py3-none-any.whl (247 kB) 2025-08-14T21:49:37.3375191Z Collecting boto3==1.35.42 2025-08-14T21:49:37.3417148Z Downloading boto3-1.35.42-py3-none-any.whl (139 kB) 2025-08-14T21:49:37.6732315Z Collecting pandas==2.1.3 2025-08-14T21:49:37.6769765Z Downloading pandas-2.1.3-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (12.3 MB) 2025-08-14T21:49:37.7872211Z Requirement already satisfied: dataclasses_json==0.6.7 in /home/ec2-user/.local/lib/python3.9/site-packages (0.6.7) 2025-08-14T21:49:37.7883312Z Requirement already satisfied: six>=1.5 in /usr/lib/python3.9/site-packages (from python-dateutil==2.8.2) (1.15.0) 2025-08-14T21:49:37.7914859Z Requirement already satisfied: botocore<1.36.0,>=1.35.42 in /home/ec2-user/.local/lib/python3.9/site-packages (from boto3==1.35.42) (1.35.99) 2025-08-14T21:49:37.7919063Z Requirement already satisfied: jmespath<2.0.0,>=0.7.1 in /usr/lib/python3.9/site-packages (from boto3==1.35.42) (0.10.0) 2025-08-14T21:49:37.7922987Z Requirement already satisfied: s3transfer<0.11.0,>=0.10.0 in /home/ec2-user/.local/lib/python3.9/site-packages (from boto3==1.35.42) (0.10.4) 2025-08-14T21:49:37.8332503Z Requirement already satisfied: pytz>=2020.1 in /usr/lib/python3.9/site-packages (from pandas==2.1.3) (2022.7.1) 2025-08-14T21:49:37.8557948Z Collecting tzdata>=2022.1 2025-08-14T21:49:37.8589656Z Downloading tzdata-2025.2-py2.py3-none-any.whl (347 kB) 2025-08-14T21:49:38.3983973Z Collecting numpy<2,>=1.22.4 2025-08-14T21:49:38.4016809Z Downloading numpy-1.26.4-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (18.2 MB) 2025-08-14T21:49:38.5004877Z Requirement already satisfied: marshmallow<4.0.0,>=3.18.0 in /home/ec2-user/.local/lib/python3.9/site-packages (from dataclasses_json==0.6.7) (3.26.1) 2025-08-14T21:49:38.5005708Z Requirement already satisfied: typing-inspect<1,>=0.4.0 in /home/ec2-user/.local/lib/python3.9/site-packages (from dataclasses_json==0.6.7) (0.9.0) 2025-08-14T21:49:38.5066614Z Requirement already satisfied: urllib3<1.27,>=1.25.4 in /usr/lib/python3.9/site-packages (from botocore<1.36.0,>=1.35.42->boto3==1.35.42) (1.25.10) 2025-08-14T21:49:38.5130387Z Requirement already satisfied: packaging>=17.0 in /home/ec2-user/.local/lib/python3.9/site-packages (from marshmallow<4.0.0,>=3.18.0->dataclasses_json==0.6.7) (25.0) 2025-08-14T21:49:38.5200947Z Requirement already satisfied: typing-extensions>=3.7.4 in /home/ec2-user/.local/lib/python3.9/site-packages (from typing-inspect<1,>=0.4.0->dataclasses_json==0.6.7) (4.14.1) 2025-08-14T21:49:38.5205523Z Requirement already satisfied: mypy-extensions>=0.3.0 in /home/ec2-user/.local/lib/python3.9/site-packages (from typing-inspect<1,>=0.4.0->dataclasses_json==0.6.7) (1.1.0) 2025-08-14T21:49:38.6286446Z Installing collected packages: python-dateutil, tzdata, numpy, pandas, boto3 2025-08-14T21:49:42.1947841Z Attempting uninstall: boto3 2025-08-14T21:49:42.1949589Z Found existing installation: boto3 1.35.33 2025-08-14T21:49:42.2010039Z Uninstalling boto3-1.35.33: 2025-08-14T21:49:42.2018149Z Successfully uninstalled boto3-1.35.33 2025-08-14T21:49:42.2443105Z Successfully installed boto3-1.35.42 numpy-1.26.4 pandas-2.1.3 python-dateutil-2.8.2 tzdata-2025.2 2025-08-14T21:49:42.4599704Z Command completed after 1 attempt(s). 2025-08-14T21:49:42.4665495Z ##[group]Run python3 -m tools.stats.upload_utilization_stats.upload_utilization_stats \ 2025-08-14T21:49:42.4665903Z python3 -m tools.stats.upload_utilization_stats.upload_utilization_stats \ 2025-08-14T21:49:42.4666216Z  --workflow-run-id "16976255153" \ 2025-08-14T21:49:42.4666426Z  --workflow-name "inductor" \ 2025-08-14T21:49:42.4666629Z  --workflow-run-attempt "1" \ 2025-08-14T21:49:42.4666817Z  --job-id "48128039107" \ 2025-08-14T21:49:42.4667176Z  --job-name "linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx)" \ 2025-08-14T21:49:42.4667545Z  --local-path "" \ 2025-08-14T21:49:42.4667725Z  --artifact-prefix "" 2025-08-14T21:49:42.4672960Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:42.4673187Z env: 2025-08-14T21:49:42.4673335Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:42.4673614Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:42.4673895Z DEVICE_NAME: 2025-08-14T21:49:42.4674046Z DEVICE_TYPE: 2025-08-14T21:49:42.4674196Z ##[endgroup] 2025-08-14T21:49:43.2135634Z repo: pytorch/pytorch 2025-08-14T21:49:43.2136848Z Search for test log in s3 bucket: ossci-utilization 2025-08-14T21:49:43.2137356Z Downloading logs-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip 2025-08-14T21:49:43.2137955Z extracting usage_log.txt from zip file logs-test-dynamic_cpu_inductor_huggingface-1-1-linux.8xlarge.amx_48128039107.zip 2025-08-14T21:49:43.2138458Z Converted Log Model: UtilizationMetadata: 2025-08-14T21:49:43.2139800Z UtilizationMetadata(level='metadata', workflow_id='16976255153', job_id='48128039107', workflow_name='inductor', job_name='linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx)', usage_collect_interval=1.0, data_model_version=1.5, start_at=1755206897, gpu_count=0, cpu_count=32, gpu_type=None, error=None) 2025-08-14T21:49:43.2140698Z [Db Segments] detected pytest cmd: 10, generated segments: 10 2025-08-14T21:49:43.2140944Z [db model] Peek db timeseries 2025-08-14T21:49:43.2141126Z :{ 2025-08-14T21:49:43.2141272Z "created_at": 1755208183, 2025-08-14T21:49:43.2141449Z "type": "utilization", 2025-08-14T21:49:43.2141605Z "tags": [ 2025-08-14T21:49:43.2141743Z "record" 2025-08-14T21:49:43.2141880Z ], 2025-08-14T21:49:43.2142011Z "time_stamp": 1755206897, 2025-08-14T21:49:43.2142182Z "repo": "pytorch/pytorch", 2025-08-14T21:49:43.2142545Z "workflow_id": 16976255153, 2025-08-14T21:49:43.2142708Z "run_attempt": 1, 2025-08-14T21:49:43.2142865Z "job_id": 48128039107, 2025-08-14T21:49:43.2143033Z "workflow_name": "inductor", 2025-08-14T21:49:43.2143381Z "job_name": "linux-jammy-cpu-py3.9-gcc11-inductor / test (dynamic_cpu_inductor_huggingface, 1, 1, linux.8xlarge.amx)", 2025-08-14T21:49:43.2143721Z "json_data": "{}" 2025-08-14T21:49:43.2143871Z } 2025-08-14T21:49:43.2144161Z Writing 1 documents to S3 ossci-utilization/util_metadata/v_1.5/pytorch/pytorch/16976255153/1/48128039107/metadata 2025-08-14T21:49:43.2145430Z Done! Finish writing document to S3 ossci-utilization/util_metadata/v_1.5/pytorch/pytorch/16976255153/1/48128039107/metadata 2025-08-14T21:49:43.2145934Z Writing 255 documents to S3 ossci-utilization/util_timeseries/v_1.5/pytorch/pytorch/16976255153/1/48128039107/time_series 2025-08-14T21:49:43.2146450Z Done! Finish writing document to S3 ossci-utilization/util_timeseries/v_1.5/pytorch/pytorch/16976255153/1/48128039107/time_series 2025-08-14T21:49:43.3026594Z ##[group]Run pytorch/test-infra/.github/actions/teardown-linux@main 2025-08-14T21:49:43.3026856Z with: 2025-08-14T21:49:43.3026997Z env: 2025-08-14T21:49:43.3027133Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:43.3027417Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:43.3027705Z DEVICE_NAME: 2025-08-14T21:49:43.3027842Z DEVICE_TYPE: 2025-08-14T21:49:43.3027983Z ##[endgroup] 2025-08-14T21:49:43.3049890Z ##[group]Run set -eou pipefail 2025-08-14T21:49:43.3050104Z set -eou pipefail 2025-08-14T21:49:43.3050281Z  2025-08-14T21:49:43.3050497Z echo "Holding runner for 2 hours until all ssh sessions have logged out" 2025-08-14T21:49:43.3050767Z for _ in $(seq 1440); do 2025-08-14T21:49:43.3050973Z  # Break if no ssh session exists anymore 2025-08-14T21:49:43.3051177Z  if [ "$(who)" = "" ]; then 2025-08-14T21:49:43.3051348Z  break 2025-08-14T21:49:43.3051528Z  fi 2025-08-14T21:49:43.3051666Z  echo "." 2025-08-14T21:49:43.3051817Z  sleep 5 2025-08-14T21:49:43.3051961Z done 2025-08-14T21:49:43.3056325Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:43.3056543Z env: 2025-08-14T21:49:43.3056686Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:43.3056960Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:43.3057241Z DEVICE_NAME: 2025-08-14T21:49:43.3057383Z DEVICE_TYPE: 2025-08-14T21:49:43.3057527Z ##[endgroup] 2025-08-14T21:49:43.3077622Z Holding runner for 2 hours until all ssh sessions have logged out 2025-08-14T21:49:43.3176009Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2025-08-14T21:49:43.3176536Z # ignore expansion of "docker ps -q" since it could be empty 2025-08-14T21:49:43.3176792Z # shellcheck disable=SC2046 2025-08-14T21:49:43.3177019Z docker stop $(docker ps -q) || true 2025-08-14T21:49:43.3177222Z # Prune all of the docker images 2025-08-14T21:49:43.3177425Z docker system prune -af 2025-08-14T21:49:43.3180914Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:49:43.3181128Z env: 2025-08-14T21:49:43.3181275Z GIT_DEFAULT_BRANCH: main 2025-08-14T21:49:43.3181551Z DOCKER_CONTAINER_ID: 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:43.3181836Z DEVICE_NAME: 2025-08-14T21:49:43.3181982Z DEVICE_TYPE: 2025-08-14T21:49:43.3182130Z ##[endgroup] 2025-08-14T21:49:54.3428712Z 4dd890d366a3 2025-08-14T21:49:54.6128879Z Deleted Containers: 2025-08-14T21:49:54.6129287Z 4dd890d366a3b8e8bea253d8d281baebc94267ef08a0525f01f6487a0edf5681 2025-08-14T21:49:54.6129581Z 2025-08-14T21:50:01.1925773Z Deleted Images: 2025-08-14T21:50:01.1930427Z untagged: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-py3.9-gcc11-inductor-benchmarks-bfa89110622ba7202628e9faac705f183070defe 2025-08-14T21:50:01.1931763Z untagged: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image@sha256:4236794baba289041d240d08fd393bbd57497c3012e5e0ccd9fd98f61ebf35c6 2025-08-14T21:50:01.1932263Z deleted: sha256:0899ae453036ee7a91795ea95b1db61000579eeb74b140edab5976919ee64bbe 2025-08-14T21:50:01.1932634Z deleted: sha256:aa7b544271e9ba3105dabd1afb12e315887018f3471e03135c1d50e64cc550c4 2025-08-14T21:50:01.1932998Z deleted: sha256:4c685831817cc2fc6dfdfda1726df1f402222d8cdccc40daad3198cf8b17e3f4 2025-08-14T21:50:01.1937591Z deleted: sha256:cedf3fb09a62e68c6d7e22cedbce12e77166a50649d0269200ee0efce8a57b88 2025-08-14T21:50:01.1939528Z deleted: sha256:1b3a9a237b4153f8f523a85cead9d36e29717eb57182e2f75069788681627d95 2025-08-14T21:50:01.1940086Z deleted: sha256:67bd313103dfbe7fe0172e6f4f7ee420fad9743a64a1cc1cd20bc22250d3602c 2025-08-14T21:50:01.1940585Z deleted: sha256:b17820137ada46a2a726c67aa08cce73d2ead7c95db08575cf5e69bedb4b600d 2025-08-14T21:50:01.1941432Z deleted: sha256:b16c9bc40cc1cf924638323aece4168d6332cfae212dad2a431a584a44fe967c 2025-08-14T21:50:01.1941877Z deleted: sha256:ab35ed781133eb4aaa1b2478aea73fb80dc71bceffbe474b55e1a60fc6c5ffbe 2025-08-14T21:50:01.1942241Z deleted: sha256:b9d0b0720dd9c0bcb4f174ae6770a7c2fe540c6983872180f3a5e18300434cdb 2025-08-14T21:50:01.1942584Z deleted: sha256:f5d1a4f32d90030cc174d73b579758d28f95c992a8cf21360e5addee99dea169 2025-08-14T21:50:01.1942938Z deleted: sha256:4af408141f8591f4b69cef9b425b6caa3c4cbc62ced38b5d08f3150f0c8ff449 2025-08-14T21:50:01.1943289Z deleted: sha256:e0019e5c461051e54a9af37ae22b49cfd2c2e5366da57a20304f6ef89171a3b3 2025-08-14T21:50:01.1943638Z deleted: sha256:542f999b2cfc965b97861645356840864e9946fa2fa40f1f5c4c45684e91c239 2025-08-14T21:50:01.1943978Z deleted: sha256:633629aa3d4ae6472e222a1c0b2ceb729b0d84ccb48e12d52ba2d2987c9063e1 2025-08-14T21:50:01.1944337Z deleted: sha256:ea645aba1ba54baac43713f3df7f1b89dd119764a747273897eb2931fea42856 2025-08-14T21:50:01.1944696Z deleted: sha256:1f50e367efff88c7182b9dc3ff618c1cf7bd34edf2f31805e268c50fac02a627 2025-08-14T21:50:01.1945156Z deleted: sha256:aff22d7ae43d842befa617e2e5f9878d09a82b67c362b0c44a40a4c88be92120 2025-08-14T21:50:01.1945511Z deleted: sha256:4275d4addb77b473ed40194e42918cf2aeb484d1d8e25cf54d374392643a095c 2025-08-14T21:50:01.1945862Z deleted: sha256:66471f6c8dc869455ff193909110d824b5d65f7383877a7d0face6331b21fff3 2025-08-14T21:50:01.1946213Z deleted: sha256:8cfd2d55570494ff2b993725f5eb13d0440a5698fa905823ca1677d2d16febb8 2025-08-14T21:50:01.1946559Z deleted: sha256:5c8cf8b9c4a76f679994decc8800bc6eefd258a8dc6293a714d5e100fea3a1bc 2025-08-14T21:50:01.1946923Z deleted: sha256:1acc162c6b9de62d13ce7fd33bb9b134458f7e7dbe996e5442e0047ec8f70c80 2025-08-14T21:50:01.1947287Z deleted: sha256:044bab98f3bceb1948c626ce6bdd19d3ec8f9c5ad42a4f635dd685a7ae9c9024 2025-08-14T21:50:01.1947646Z deleted: sha256:2acb11a9448f13c2c2d29c4d0d4013e046862bd019cf5ec9fe04bdf35299f1dd 2025-08-14T21:50:01.1947989Z deleted: sha256:8e7b56334416233f301944000dec16952e13bb69296cc80e1031bfecaf6e7f9d 2025-08-14T21:50:01.1948347Z deleted: sha256:4a4d1ec727c43389a601aefccdaeff6b3bf54c0daefb12e0c2098c3e18b383ba 2025-08-14T21:50:01.1948698Z deleted: sha256:8b9ca4276331196a2f03c2fa3a87422d2042cf06011b49368c2335be7da829c1 2025-08-14T21:50:01.1949036Z deleted: sha256:5076357fd3cc8b06ed54a0f692362a38f1ebafa4843c0b0bf8021f9021d2e583 2025-08-14T21:50:01.1949387Z deleted: sha256:f9451fa0842798e2a67c059fda5124cafb401801bb8c40d03ae736ff3ef5ed20 2025-08-14T21:50:01.1949733Z deleted: sha256:52b716f02091d6af6b79e7b2e1f5bbd7391235993d415c7a852d6752220c8b65 2025-08-14T21:50:01.1950075Z deleted: sha256:748225161c361d3779c96eb7ae5ea0c33d35311f9445c371d62616b98e3426e8 2025-08-14T21:50:01.1950414Z deleted: sha256:5eeda1478a46d8d58267e8917422eb0a182a40c8bdfb4bfe0869923f8114c770 2025-08-14T21:50:01.1950766Z deleted: sha256:66d4cebb04304f556dd191b425a876f7dbbcde8c3c647af4ef47c10804e51f5a 2025-08-14T21:50:01.1951112Z deleted: sha256:0b526447174d22890be2bc866228e40989483b1102a0430b4ab3ad16dc6c7787 2025-08-14T21:50:01.1951507Z deleted: sha256:1aa31d55f8f9bb51f1eb702ba7d46ceda8290ed90e8e8cf299bb8a9179bf2ae2 2025-08-14T21:50:01.1951860Z deleted: sha256:dd1f47c8dc7518f303a91fc8aae81a512caff53987d5a89a378bb24c1c6d7707 2025-08-14T21:50:01.1952218Z deleted: sha256:d60f9527fcb284e73795a37d4f536badd451a2eade4c9314ebe549d31efcc876 2025-08-14T21:50:01.1952566Z deleted: sha256:f23ad0355704751b0f71a8900169354e3bf23a7b3f5fa2cd9b2478a561bfbb45 2025-08-14T21:50:01.1952913Z deleted: sha256:10e7acf6460743fcad0c1fff0bbd01158fbeb88151621c1e15ae5994f1c8ef55 2025-08-14T21:50:01.1953266Z deleted: sha256:f674e3067e97f1407f4cd55202d4c0c8641f02811550e65a00a875fc19354b75 2025-08-14T21:50:01.1953636Z deleted: sha256:8a9c75c896425ccd25101f0cf39316bec7779111954f44df726842bf583e907b 2025-08-14T21:50:01.1953984Z deleted: sha256:9730d30edfcaa135287479d80f1720b39c6f728228df6d0eb7f095e917cc16b6 2025-08-14T21:50:01.1954322Z deleted: sha256:2787e13cf97e870ca65312526c3000163ebf3da20fe59e5f5d53b1aeb4fb424b 2025-08-14T21:50:01.1954740Z deleted: sha256:d61197909174795bd69f8d5f534f1b086065d36b7aa6c5a50744eca6f8d6b12b 2025-08-14T21:50:01.1955100Z deleted: sha256:ecdfbb81e95b2ae2c8e9ab4ca72ba8564095caabb0512a47da8f866923f71bff 2025-08-14T21:50:01.1955451Z deleted: sha256:cd2d7c644df243742a0c0349af0d37570c06fdd1711ddc367e79514757a6d5cc 2025-08-14T21:50:01.1955804Z deleted: sha256:6703ab1ced70b30a87660c0dd778fe95fb90b04ed8461c2a331272aa54eb3499 2025-08-14T21:50:01.1956160Z deleted: sha256:b7088ce49d7df1d6fb18eee5fc5664e637c5649c89e581d972c76a83f60d0a62 2025-08-14T21:50:01.1956518Z deleted: sha256:d0d2786658af9907d8c4ecfa84fa9e2bd07131257264395b804deef744a5c39c 2025-08-14T21:50:01.1956866Z deleted: sha256:d46baf72d8e570e6004c6f95131cea6ede27eb01c213d8c1e8b263ab95fdfe95 2025-08-14T21:50:01.1957222Z deleted: sha256:0219ea0bd0e38d169ed596ed80807b0f70b609ec5f886d671c249d10575dff2c 2025-08-14T21:50:01.1957572Z deleted: sha256:77d1a1f15cf8ae85a4c5495d800378c307967004360814810fd13b07a74aee5e 2025-08-14T21:50:01.1957908Z deleted: sha256:47c77d89ce8782a94a6f5435b1611a76b47f830153ba4b462d3e08dcbdaa40f7 2025-08-14T21:50:01.1958261Z deleted: sha256:d5120b2e61fb0ccc32a2ad02fc0b2b908bc69f1f174268bde3d26d79ce46f046 2025-08-14T21:50:01.1958616Z deleted: sha256:65626052fd7e03a8e90c72072a54f0eaa43788cfcb0835ffb98b700be89b0567 2025-08-14T21:50:01.1958963Z deleted: sha256:05c09c0832c35f0128e0258b1d3069d7bb4b94ce58239faba5d585e49c34e904 2025-08-14T21:50:01.1959303Z deleted: sha256:2d6749fb2c30585eebb1d97e99318434ec34e0f7a4414e552fd4a44175f86839 2025-08-14T21:50:01.1959657Z deleted: sha256:2d65e2932810021e5b3cfedd89cfd851dd47fce63fbe5dc6959e59f3d8a98499 2025-08-14T21:50:01.1960022Z deleted: sha256:b2e71ddacad35b6caa3a77429bab51b654f6acaccc9e9263f1cb43edb8c53ac3 2025-08-14T21:50:01.1960375Z deleted: sha256:632a43100a629c40972b4da95fbbb581f29fe8b073a96386c72931d27ffbbefa 2025-08-14T21:50:01.1960718Z deleted: sha256:11964e5f5833fdf2bcc61c52f33d5aebf9b5504c6792baf58beb96b90398d10a 2025-08-14T21:50:01.1961067Z deleted: sha256:f0c1cb4c9e4655464b9b62b6589ac5005c2392213765ab4175bd61e3f6462643 2025-08-14T21:50:01.1961424Z deleted: sha256:5113aaee4b4d5ee45b58bcee467ac314112b02e4c4e5e9c3cc7a236dd308e9de 2025-08-14T21:50:01.1961773Z deleted: sha256:9cdc88c7b7fe728e15c72d0e8eef813ace31905b4b317a0a23f1334b6a22e604 2025-08-14T21:50:01.1962122Z deleted: sha256:8056a3da01752a91095e2d0afd80b625172f0915f22f7d998b9b926b9462dc5f 2025-08-14T21:50:01.1962462Z deleted: sha256:8a99968112e0edd39c242f3452b05d167911724468fdd9b18d11a8f5fa9c3ac8 2025-08-14T21:50:01.1962815Z deleted: sha256:6f70653bcfea9c1dd39aba76713adac0ac8f6f4c202387ff86a3ffe45d2079f2 2025-08-14T21:50:01.1963164Z deleted: sha256:9a0ed45f26188ecbfcf7658f46e29922b441969b2aded64d1d6b287b6de2e49c 2025-08-14T21:50:01.1963508Z deleted: sha256:f84c75780b110e68f7593fe9592456387118761b365a954a105aee72016adeac 2025-08-14T21:50:01.1963850Z deleted: sha256:1a5a81f8cbb945eee96e25ee8b4958d7140bb6751b86bc2e4a6aa9e18a16846c 2025-08-14T21:50:01.1964194Z deleted: sha256:7e072dc6aa8c1831ddc97ba8229235081976cb8036c06ee1320b33606e03f9a4 2025-08-14T21:50:01.1964542Z deleted: sha256:369af3627df8ecb48c51ea4fd3267e561b2f6821075ddce314e9485494447f16 2025-08-14T21:50:01.1964912Z deleted: sha256:4d49b99f2eee0f82788e33a9c771f75b1411b0b70ce47771fc1b3bc160f23961 2025-08-14T21:50:01.1965265Z deleted: sha256:fe04dcb9c711f36f9ed1df5b2d0854d30dc5abaa6e6cd493b85d4c2e2d2c3e1b 2025-08-14T21:50:01.1965611Z deleted: sha256:4800771a0435c52d6e480540ffa8a65ecc51fdc82a91302c1a373e6021bc37ca 2025-08-14T21:50:01.1965955Z deleted: sha256:90a2bf02e851326fc70d05470553ed33e578342d6e06bfa0cfaf331c4079b7e4 2025-08-14T21:50:01.1966159Z 2025-08-14T21:50:01.1966244Z Total reclaimed space: 51.8GB 2025-08-14T21:50:01.2032662Z Post job cleanup. 2025-08-14T21:50:01.2072576Z Post job cleanup. 2025-08-14T21:50:01.2803617Z [command]/usr/bin/git version 2025-08-14T21:50:01.2834119Z git version 2.47.1 2025-08-14T21:50:01.2862656Z Copying '/home/ec2-user/.gitconfig' to '/home/ec2-user/actions-runner/_work/_temp/b64fa881-bb4f-4916-a724-2527e9cd7671/.gitconfig' 2025-08-14T21:50:01.2880544Z Temporarily overriding HOME='/home/ec2-user/actions-runner/_work/_temp/b64fa881-bb4f-4916-a724-2527e9cd7671' before making global git config changes 2025-08-14T21:50:01.2881263Z Adding repository directory to the temporary git global config as a safe directory 2025-08-14T21:50:01.2885807Z [command]/usr/bin/git config --global --add safe.directory /home/ec2-user/actions-runner/_work/pytorch/pytorch 2025-08-14T21:50:01.2939754Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-08-14T21:50:01.2969793Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-08-14T21:50:01.3266884Z Entering 'android/libs/fbjni' 2025-08-14T21:50:01.3318443Z Entering 'third_party/FP16' 2025-08-14T21:50:01.3370255Z Entering 'third_party/FXdiv' 2025-08-14T21:50:01.3423105Z Entering 'third_party/NNPACK' 2025-08-14T21:50:01.3474261Z Entering 'third_party/NVTX' 2025-08-14T21:50:01.3532807Z Entering 'third_party/VulkanMemoryAllocator' 2025-08-14T21:50:01.3583323Z Entering 'third_party/XNNPACK' 2025-08-14T21:50:01.3648189Z Entering 'third_party/aiter' 2025-08-14T21:50:01.3696992Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-08-14T21:50:01.3757552Z Entering 'third_party/benchmark' 2025-08-14T21:50:01.3812236Z Entering 'third_party/composable_kernel' 2025-08-14T21:50:01.3869441Z Entering 'third_party/cpp-httplib' 2025-08-14T21:50:01.3924363Z Entering 'third_party/cpuinfo' 2025-08-14T21:50:01.3976205Z Entering 'third_party/cudnn_frontend' 2025-08-14T21:50:01.4031703Z Entering 'third_party/cutlass' 2025-08-14T21:50:01.4087754Z Entering 'third_party/fbgemm' 2025-08-14T21:50:01.4142355Z Entering 'third_party/fbgemm/external/asmjit' 2025-08-14T21:50:01.4194717Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-08-14T21:50:01.4251099Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-08-14T21:50:01.4303704Z Entering 'third_party/fbgemm/external/cutlass' 2025-08-14T21:50:01.4362142Z Entering 'third_party/fbgemm/external/googletest' 2025-08-14T21:50:01.4413691Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-08-14T21:50:01.4461614Z Entering 'third_party/fbgemm/external/json' 2025-08-14T21:50:01.4526384Z Entering 'third_party/flash-attention' 2025-08-14T21:50:01.4577760Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-08-14T21:50:01.4631762Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-08-14T21:50:01.4689568Z Entering 'third_party/flatbuffers' 2025-08-14T21:50:01.4744134Z Entering 'third_party/fmt' 2025-08-14T21:50:01.4797535Z Entering 'third_party/gemmlowp/gemmlowp' 2025-08-14T21:50:01.4853295Z Entering 'third_party/gloo' 2025-08-14T21:50:01.4905485Z Entering 'third_party/googletest' 2025-08-14T21:50:01.4957680Z Entering 'third_party/ideep' 2025-08-14T21:50:01.5009751Z Entering 'third_party/ideep/mkl-dnn' 2025-08-14T21:50:01.5065855Z Entering 'third_party/ittapi' 2025-08-14T21:50:01.5119233Z Entering 'third_party/kineto' 2025-08-14T21:50:01.5168460Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-08-14T21:50:01.5217336Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-08-14T21:50:01.5268926Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-08-14T21:50:01.5322281Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-08-14T21:50:01.5372815Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-08-14T21:50:01.5427315Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-08-14T21:50:01.5478695Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-08-14T21:50:01.5531953Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-08-14T21:50:01.5581266Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-08-14T21:50:01.5633117Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-08-14T21:50:01.5686501Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-08-14T21:50:01.5738451Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-08-14T21:50:01.5795124Z Entering 'third_party/kleidiai' 2025-08-14T21:50:01.5847653Z Entering 'third_party/mimalloc' 2025-08-14T21:50:01.5900583Z Entering 'third_party/nlohmann' 2025-08-14T21:50:01.5954748Z Entering 'third_party/onnx' 2025-08-14T21:50:01.6017450Z Entering 'third_party/onnx/third_party/pybind11' 2025-08-14T21:50:01.6071769Z Entering 'third_party/opentelemetry-cpp' 2025-08-14T21:50:01.6120836Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-08-14T21:50:01.6171796Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-08-14T21:50:01.6223702Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-08-14T21:50:01.6274914Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-08-14T21:50:01.6325768Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-08-14T21:50:01.6375220Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-08-14T21:50:01.6427376Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-08-14T21:50:01.6475902Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-08-14T21:50:01.6529151Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-08-14T21:50:01.6581923Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-08-14T21:50:01.6649722Z Entering 'third_party/pocketfft' 2025-08-14T21:50:01.6700107Z Entering 'third_party/protobuf' 2025-08-14T21:50:01.6753722Z Entering 'third_party/protobuf/third_party/benchmark' 2025-08-14T21:50:01.6807293Z Entering 'third_party/protobuf/third_party/googletest' 2025-08-14T21:50:01.6860519Z Entering 'third_party/psimd' 2025-08-14T21:50:01.6913735Z Entering 'third_party/pthreadpool' 2025-08-14T21:50:01.6965423Z Entering 'third_party/pybind11' 2025-08-14T21:50:01.7017866Z Entering 'third_party/python-peachpy' 2025-08-14T21:50:01.7070434Z Entering 'third_party/sleef' 2025-08-14T21:50:01.7120900Z Entering 'third_party/tensorpipe' 2025-08-14T21:50:01.7174130Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-08-14T21:50:01.7225208Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-08-14T21:50:01.7275936Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-08-14T21:50:01.7328622Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-08-14T21:50:01.7376825Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-08-14T21:50:01.7452879Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-08-14T21:50:01.7474257Z http.https://github.com/.extraheader 2025-08-14T21:50:01.7481718Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-08-14T21:50:01.7515509Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-08-14T21:50:01.7816472Z Entering 'android/libs/fbjni' 2025-08-14T21:50:01.7847927Z http.https://github.com/.extraheader 2025-08-14T21:50:01.7880233Z Entering 'third_party/FP16' 2025-08-14T21:50:01.7914836Z http.https://github.com/.extraheader 2025-08-14T21:50:01.7951013Z Entering 'third_party/FXdiv' 2025-08-14T21:50:01.7981627Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8019916Z Entering 'third_party/NNPACK' 2025-08-14T21:50:01.8051631Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8081904Z Entering 'third_party/NVTX' 2025-08-14T21:50:01.8119319Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8154101Z Entering 'third_party/VulkanMemoryAllocator' 2025-08-14T21:50:01.8186005Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8219180Z Entering 'third_party/XNNPACK' 2025-08-14T21:50:01.8250481Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8297136Z Entering 'third_party/aiter' 2025-08-14T21:50:01.8328816Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8363846Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-08-14T21:50:01.8396307Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8439890Z Entering 'third_party/benchmark' 2025-08-14T21:50:01.8472148Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8504263Z Entering 'third_party/composable_kernel' 2025-08-14T21:50:01.8538631Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8577252Z Entering 'third_party/cpp-httplib' 2025-08-14T21:50:01.8611875Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8647348Z Entering 'third_party/cpuinfo' 2025-08-14T21:50:01.8677512Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8713888Z Entering 'third_party/cudnn_frontend' 2025-08-14T21:50:01.8745456Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8777980Z Entering 'third_party/cutlass' 2025-08-14T21:50:01.8811935Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8853644Z Entering 'third_party/fbgemm' 2025-08-14T21:50:01.8883273Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8919520Z Entering 'third_party/fbgemm/external/asmjit' 2025-08-14T21:50:01.8951857Z http.https://github.com/.extraheader 2025-08-14T21:50:01.8983672Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-08-14T21:50:01.9016037Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9058046Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-08-14T21:50:01.9091303Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9129854Z Entering 'third_party/fbgemm/external/cutlass' 2025-08-14T21:50:01.9164647Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9205997Z Entering 'third_party/fbgemm/external/googletest' 2025-08-14T21:50:01.9237544Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9270279Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-08-14T21:50:01.9306171Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9336729Z Entering 'third_party/fbgemm/external/json' 2025-08-14T21:50:01.9369415Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9406277Z Entering 'third_party/flash-attention' 2025-08-14T21:50:01.9438748Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9472739Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-08-14T21:50:01.9503751Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9545616Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-08-14T21:50:01.9578051Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9621176Z Entering 'third_party/flatbuffers' 2025-08-14T21:50:01.9654197Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9693871Z Entering 'third_party/fmt' 2025-08-14T21:50:01.9724654Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9760031Z Entering 'third_party/gemmlowp/gemmlowp' 2025-08-14T21:50:01.9794283Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9827825Z Entering 'third_party/gloo' 2025-08-14T21:50:01.9862631Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9897457Z Entering 'third_party/googletest' 2025-08-14T21:50:01.9929221Z http.https://github.com/.extraheader 2025-08-14T21:50:01.9964911Z Entering 'third_party/ideep' 2025-08-14T21:50:01.9999492Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0032158Z Entering 'third_party/ideep/mkl-dnn' 2025-08-14T21:50:02.0064261Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0103898Z Entering 'third_party/ittapi' 2025-08-14T21:50:02.0139208Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0173596Z Entering 'third_party/kineto' 2025-08-14T21:50:02.0206989Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0240911Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-08-14T21:50:02.0270575Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0306820Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-08-14T21:50:02.0338062Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0374776Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-08-14T21:50:02.0406678Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0443431Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-08-14T21:50:02.0475596Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0510233Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-08-14T21:50:02.0542166Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0574631Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-08-14T21:50:02.0608398Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0645398Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-08-14T21:50:02.0676348Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0714158Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-08-14T21:50:02.0745127Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0777268Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-08-14T21:50:02.0812983Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0848171Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-08-14T21:50:02.0878184Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0919903Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-08-14T21:50:02.0951955Z http.https://github.com/.extraheader 2025-08-14T21:50:02.0984385Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-08-14T21:50:02.1016581Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1053107Z Entering 'third_party/kleidiai' 2025-08-14T21:50:02.1083288Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1121896Z Entering 'third_party/mimalloc' 2025-08-14T21:50:02.1154718Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1189889Z Entering 'third_party/nlohmann' 2025-08-14T21:50:02.1222223Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1258692Z Entering 'third_party/onnx' 2025-08-14T21:50:02.1290441Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1336697Z Entering 'third_party/onnx/third_party/pybind11' 2025-08-14T21:50:02.1369297Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1405183Z Entering 'third_party/opentelemetry-cpp' 2025-08-14T21:50:02.1438102Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1472781Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-08-14T21:50:02.1504998Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1539153Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-08-14T21:50:02.1570100Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1605301Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-08-14T21:50:02.1636753Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1669496Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-08-14T21:50:02.1703376Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1737956Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-08-14T21:50:02.1769888Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1805737Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-08-14T21:50:02.1837615Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1868770Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-08-14T21:50:02.1901354Z http.https://github.com/.extraheader 2025-08-14T21:50:02.1936443Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-08-14T21:50:02.1967289Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2004183Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-08-14T21:50:02.2036278Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2073744Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-08-14T21:50:02.2106107Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2158998Z Entering 'third_party/pocketfft' 2025-08-14T21:50:02.2192963Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2226832Z Entering 'third_party/protobuf' 2025-08-14T21:50:02.2259271Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2294814Z Entering 'third_party/protobuf/third_party/benchmark' 2025-08-14T21:50:02.2327171Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2360849Z Entering 'third_party/protobuf/third_party/googletest' 2025-08-14T21:50:02.2393134Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2430749Z Entering 'third_party/psimd' 2025-08-14T21:50:02.2463652Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2499482Z Entering 'third_party/pthreadpool' 2025-08-14T21:50:02.2530349Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2565243Z Entering 'third_party/pybind11' 2025-08-14T21:50:02.2595670Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2630853Z Entering 'third_party/python-peachpy' 2025-08-14T21:50:02.2663237Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2698797Z Entering 'third_party/sleef' 2025-08-14T21:50:02.2730061Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2765140Z Entering 'third_party/tensorpipe' 2025-08-14T21:50:02.2798341Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2831288Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-08-14T21:50:02.2862405Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2900263Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-08-14T21:50:02.2931064Z http.https://github.com/.extraheader 2025-08-14T21:50:02.2965575Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-08-14T21:50:02.2997720Z http.https://github.com/.extraheader 2025-08-14T21:50:02.3031027Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-08-14T21:50:02.3060650Z http.https://github.com/.extraheader 2025-08-14T21:50:02.3094393Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-08-14T21:50:02.3124892Z http.https://github.com/.extraheader 2025-08-14T21:50:02.3260576Z A job completed hook has been configured by the self-hosted runner administrator 2025-08-14T21:50:02.3285118Z ##[group]Run '/home/ec2-user/runner-scripts/after_job.sh' 2025-08-14T21:50:02.3288156Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-08-14T21:50:02.3288388Z ##[endgroup] 2025-08-14T21:50:02.3376266Z [!ALERT!] Swap in detected! [!ALERT!] 2025-08-14T21:50:10.0507424Z [!ALERT!] Swap out detected [!ALERT!] 2025-08-14T21:50:23.2216037Z Cleaning up orphan processes